Sciweavers

AWIC
2007
Springer

Improving Text Classification by Web Corpora

13 years 10 months ago
Improving Text Classification by Web Corpora
A major difficulty of supervised approaches for text classification is that they require a great number of training instances in order to construct an accurate classifier. This paper proposes a semi-supervised method that is specially suited to work with very few training examples. It considers the automatic extraction of unlabeled examples from the Web as well as an iterative integration of unlabeled examples into the training process. Preliminary results indicate that our proposal can significantly improve the classification accuracy in scenarios where there are less than ten training examples available per class.
Rafael Guzmán-Cabrera, Manuel Montes-y-G&oa
Added 07 Jun 2010
Updated 07 Jun 2010
Type Conference
Year 2007
Where AWIC
Authors Rafael Guzmán-Cabrera, Manuel Montes-y-Gómez, Paolo Rosso, Luis Villaseñor Pineda
Comments (0)