Sciweavers

» An EM Based Training Algorithm for Cross-Language Text Categ...
IJIT
2004
13 years 6 months ago
Combining ILP with Semi-supervised Learning for Web Page Categorization
This paper presents a semi-supervised learning algorithm called Iterative-Cross Training (ICT) to solve Web page classification problems. We apply inductive logic programming ...
Nuanwan Soonthornphisaj, Boonserm Kijsirikul
ML
2000
ACM
13 years 4 months ago
Text Classification from Labeled and Unlabeled Documents using EM
This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents. ...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...
DASFAA
2004
IEEE
13 years 8 months ago
Semi-supervised Text Classification Using Partitioned EM
Text classification using a small labeled set and a large unlabeled set is seen as a promising technique to reduce the labor-intensive and time-consuming effort of labeling traini...
Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu
RIAO
2004
13 years 6 months ago
Multilingual document clusters discovery
The Cross-Language Information Retrieval community has produced search engines over multilingual corpora, as well as multilingual text categorization systems. In this paper, we focus on th...
Benoît Mathieu, Romaric Besançon, Chr...
ECAI
2008
Springer
13 years 6 months ago
Author Identification Using a Tensor Space Representation
Author identification is a text categorization task with applications in intelligence, criminal law, computer forensics, etc. Usually, in such cases there is a shortage of training t...
Spyridon Plakias, Efstathios Stamatatos