Sciweavers

115 search results - page 9 / 23
» Training Data Cleaning for Text Classification
Sort
View
MICAI
2009
Springer
15 years 4 months ago
Using Nearest Neighbor Information to Improve Cross-Language Text Classification
Cross-language text classification (CLTC) aims to take advantage of existing training data from one language to construct a classifier for another language. In addition to the expe...
Adelina Escobar-Acevedo, Manuel Montes-y-Gó...
AINA
2004
IEEE
15 years 3 months ago
Online Training of SVMs for Real-time Intrusion Detection
Abstract-- As intrusion detection essentially can be formulated as a binary classification problem, it thus can be solved by an effective classification technique-Support Vector Ma...
Zonghua Zhang, Hong Shen
106
Voted
KDD
2002
ACM
126views Data Mining» more  KDD 2002»
16 years 2 days ago
Integrating feature and instance selection for text classification
Instance selection and feature selection are two orthogonal methods for reducing the amount and complexity of data. Feature selection aims at the reduction of redundant features i...
Dimitris Fragoudis, Dimitris Meretakis, Spiros Lik...
89
Voted
KDD
2008
ACM
128views Data Mining» more  KDD 2008»
16 years 2 days ago
Scaling up text classification for large file systems
: We combine the speed and scalability of information retrieval with the generally superior classification accuracy offered by machine learning, yielding a two-phase text classifie...
George Forman, Shyamsundar Rajaram
SIGMOD
2001
ACM
145views Database» more  SIGMOD 2001»
15 years 11 months ago
Automatic Segmentation of Text into Structured Records
In this paper we present a method for automatically segmenting unformatted text records into structured elements. Several useful data sources today are human-generated as continuo...
Vinayak R. Borkar, Kaustubh Deshmukh, Sunita Saraw...