Sciweavers

41 search results - page 4 / 9
» Corpus Based Unsupervised Labeling of Documents
DL
1999
Springer
15 years 1 months ago
SOMLib: A Digital Library System Based on Neural Networks
Digital Libraries have gained tremendous interest with numerous research projects addressing the wealth of challenges in this field. While computational intelligence systems are ...
Andreas Rauber, Dieter Merkl
EMNLP
2004
14 years 11 months ago
Trained Named Entity Recognition using Distributional Clusters
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...
Dayne Freitag
KDD
2005
ACM
15 years 10 months ago
On the use of linear programming for unsupervised text classification
We propose a new algorithm for dimensionality reduction and unsupervised text classification. We use mixture models as underlying process of generating corpus and utilize a novel,...
Mark Sandler
CIKM
2004
Springer
15 years 1 months ago
InfoAnalyzer: a computer-aided tool for building enterprise taxonomies
In this paper we study the problem of collecting training samples for building enterprise taxonomies. We develop a computer-aided tool named InfoAnalyzer, which can effectively as...
Li Zhang, Shixia Liu, Yue Pan, Liping Yang
MICAI
2007
Springer
15 years 3 months ago
Variants of Tree Kernels for XML Documents
In this paper, we discuss tree kernels that can be applied for the classification of XML documents based on their DOM trees. DOM trees are ordered trees, in which every node might...
Peter Geibel, Helmar Gust, Kai-Uwe Kühnberger