Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

14

SIGIR
1999
ACM

favoriteEmaildiscussreport

115views Information Technology» more SIGIR 1999»

A Re-Examination of Text Categorization Methods

13 years 8 months ago

A Re-Examination of Text Categorization Methods

Download boston.lti.cs.cmu.edu

This paper reports a controlled study with statistical signi cance tests on ve text categorization methods: the Support Vector Machines (SVM), a k-Nearest Neighbor (kNN) classi er, a neural network (NNet) approach, the Linear Leastsquares Fit (LLSF) mapping and a Naive Bayes (NB) classier. We focus on the robustness of these methods in dealing with a skewed category distribution, and their performance as function of the training-set category frequency. Our results show that SVM, kNN and LLSF signi cantly outperform NNet and NB when the number of positive training instances per category are small (less than ten), and that all the methods perform comparably when the categories are su ciently common (over 300 instances).

Yiming Yang, Xin Liu

Real-time Traffic

Information Management | SIGIR 1999 | Signi Cance Tests | Skewed Category Distribution | Training-set Category Frequency |

claim paper

Related Content

» A Framework of Feature Selection Methods for Text Categorization

» Text Classification Methodologies Applied to MicroText in Military Chat

» Categorical Proportional Difference A Feature Selection Method for Text Categorization

» Semisupervised Collaborative Text Classification

» Hierarchical Text Categorization Using Neural Networks

» A sparse version of the ridge logistic regression for largescale text categorization

» On a new model for automatic text categorization based on Vector Space Model

» On the strength of hyperclique patterns for text categorization

» An efficient feature ranking measure for text categorization

Post Info
More Details (n/a)

Added	03 Aug 2010
Updated	03 Aug 2010
Type	Conference
Year	1999
Where	SIGIR
Authors	Yiming Yang, Xin Liu

Comments (0)