Sciweavers

32 search results - page 2 / 7
» Combining Labeled and Unlabeled Data for MultiClass Text Cat...
Sort
View
CIKM
2009
Springer
13 years 11 months ago
Combining labeled and unlabeled data with word-class distribution learning
We describe a novel simple and highly scalable semi-supervised method called Word-Class Distribution Learning (WCDL), and apply it the task of information extraction (IE) by utili...
Yanjun Qi, Ronan Collobert, Pavel Kuksa, Koray Kav...
SIGIR
2010
ACM
13 years 8 months ago
Combining coregularization and consensus-based self-training for multilingual text categorization
We investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...
Massih-Reza Amini, Cyril Goutte, Nicolas Usunier
AAAI
1998
13 years 6 months ago
Learning to Classify Text from Labeled and Unlabeled Documents
In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper sh...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...
IJCAI
2003
13 years 6 months ago
Learning to Classify Texts Using Positive and Unlabeled Data
In traditional text classification, a classifier is built using labeled training documents of every class. This paper studies a different problem. Given a set P of documents of a ...
Xiaoli Li, Bing Liu
ML
2000
ACM
124views Machine Learning» more  ML 2000»
13 years 4 months ago
Text Classification from Labeled and Unlabeled Documents using EM
This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents. ...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...