Sciweavers

85 search results - page 1 / 17
» Improving Text Classification by Web Corpora
Sort
View
AWIC
2007
Springer
13 years 11 months ago
Improving Text Classification by Web Corpora
A major difficulty of supervised approaches for text classification is that they require a great number of training instances in order to construct an accurate classifier. This pap...
Rafael Guzmán-Cabrera, Manuel Montes-y-G&oa...
ISIWI
2000
13 years 6 months ago
Aiding Web Searches by Statistical Classification Tools
We describe an infrastructure for the collection and management of large amounts of text, and discuss the possibility of information extraction and visualisation from text corpora...
Gerhard Heyer, Uwe Quasthoff, Christian Wolff
WWW
2004
ACM
14 years 5 months ago
Liveclassifier: creating hierarchical text classifiers through web corpora
Many Web information services utilize techniques of information extraction (IE) to collect important facts from the Web. To create more advanced services, one possible method is t...
Chien-Chung Huang, Shui-Lung Chuang, Lee-Feng Chie...
ICML
2005
IEEE
14 years 5 months ago
Hierarchical Dirichlet model for document classification
The proliferation of text documents on the web as well as within institutions necessitates their convenient organization to enable efficient retrieval of information. Although tex...
Sriharsha Veeramachaneni, Diego Sona, Paolo Avesan...
ICML
1999
IEEE
13 years 9 months ago
Feature Engineering for Text Classification
Most research in text classification to date has used a “bag of words” representation in which each feature corresponds to a single word. This paper examines some alternative ...
Sam Scott, Stan Matwin