Sciweavers

115 search results - page 10 / 23
» Training Data Cleaning for Text Classification
Sort
View
DMIN
2009
195views Data Mining» more  DMIN 2009»
14 years 9 months ago
Improved k-NN Algorithm for Text Classification
- Over the last twenty years, text classification has become one of the key techniques for organizing electronic information such as text and web documents. The k-Nearest Neighbor ...
Muhammed Miah
ADVIS
2004
Springer
15 years 5 months ago
Multiple Sets of Rules for Text Categorization
An important issue in text mining is how to make use of multiple pieces knowledge discovered to improve future decisions. In this paper, we propose a new approach to combining mult...
Yaxin Bi, Terry J. Anderson, Sally I. McClean
KDD
2006
ACM
165views Data Mining» more  KDD 2006»
16 years 2 days ago
Training linear SVMs in linear time
Linear Support Vector Machines (SVMs) have become one of the most prominent machine learning techniques for highdimensional sparse data commonly encountered in applications like t...
Thorsten Joachims
INTERSPEECH
2010
14 years 6 months ago
Topic and style-adapted language modeling for Thai broadcast news ASR
The amount of available Thai broadcast news transcribed text for training a language model is still very limited, comparing to other major languages. Since the construction of a b...
Markpong Jongtaveesataporn, Sadaoki Furui
NIPS
2008
15 years 1 months ago
Semi-supervised Learning with Weakly-Related Unlabeled Data: Towards Better Text Categorization
The cluster assumption is exploited by most semi-supervised learning (SSL) methods. However, if the unlabeled data is merely weakly related to the target classes, it becomes quest...
Liu Yang, Rong Jin, Rahul Sukthankar