Sciweavers

115 search results - page 7 / 23
» Training Data Cleaning for Text Classification
Sort
View
KDD
2004
ACM
114views Data Mining» more  KDD 2004»
16 years 1 days ago
Mining reference tables for automatic text segmentation
Automatically segmenting unstructured text strings into structured records is necessary for importing the information contained in legacy sources and text collections into a data ...
Eugene Agichtein, Venkatesh Ganti
125
Voted
KDD
2009
ACM
262views Data Mining» more  KDD 2009»
16 years 6 days ago
Sentiment analysis of blogs by combining lexical knowledge with text classification
The explosion of user-generated content on the Web has led to new opportunities and significant challenges for companies, that are increasingly concerned about monitoring the disc...
Prem Melville, Wojciech Gryc, Richard D. Lawrence
KDD
2005
ACM
118views Data Mining» more  KDD 2005»
16 years 1 days ago
On the use of linear programming for unsupervised text classification
We propose a new algorithm for dimensionality reduction and unsupervised text classification. We use mixture models as underlying process of generating corpus and utilize a novel,...
Mark Sandler
90
Voted
SIGKDD
2002
92views more  SIGKDD 2002»
14 years 11 months ago
A Machine Learning Approach for the Curation of Biomedical Literature - KDD Cup 2002 (Task 1)
In this paper, we present an automated text classification system for the classification of biomedical papers. This classification is based on whether there is experimental eviden...
S. Sathiya Keerthi, Chong Jin Ong, Keng Boon Siah,...
COLING
2008
15 years 1 months ago
Automatic Seed Word Selection for Unsupervised Sentiment Classification of Chinese Text
We describe and evaluate a new method of automatic seed word selection for unsupervised sentiment classification of product reviews in Chinese. The whole method is unsupervised an...
Taras Zagibalov, John Carroll