Sciweavers

538 search results - page 1 / 108
ICDM
2003
IEEE
13 years 9 months ago
Mining Relevant Text from Unlabelled Documents
Automatic classification of documents is an important area of research with many applications in the fields of document searching, forensics and others. Methods to perform class...
Daniel Barbará, Carlotta Domeniconi, Ning K...
AAAI
1998
13 years 5 months ago
Learning to Classify Text from Labeled and Unlabeled Documents
In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper sh...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...
IJCAI
2003
13 years 5 months ago
Learning to Classify Texts Using Positive and Unlabeled Data
In traditional text classification, a classifier is built using labeled training documents of every class. This paper studies a different problem. Given a set P of documents of a ...
Xiaoli Li, Bing Liu
AUSDM
2008
Springer
13 years 6 months ago
Categorical Proportional Difference: A Feature Selection Method for Text Categorization
Supervised text categorization is a machine learning task where a predefined category label is automatically assigned to a previously unlabelled document based upon characteristic...
Mondelle Simeon, Robert J. Hilderman
ML
2000
ACM
13 years 4 months ago
Text Classification from Labeled and Unlabeled Documents using EM
This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents. ...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...