Sciweavers

ISNN
2007
Springer

A Probabilistic Approach to Feature Selection for Multi-class Text Categorization

Abstract. In this paper, we propose a probabilistic approach to feature selection for multi-class text categorization. Specifically, we regard the document class and the occurrence of each feature as events, calculate the probability of occurrence of each feature using the law of total probability, and use the resulting values as a ranking criterion. Experiments on the Reuters-2000 collection show that the proposed method can yield better performance than information gain and χ-square, two well-known feature selection methods.
Ke Wu, Bao-Liang Lu, Masao Uchiyama, Hitoshi Isahara
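The abstract does not give the exact formula, so the following is only a minimal sketch of one possible reading of the ranking idea: score each term t by P(t) = Σ_c P(c) · P(t | c), where P(t | c) is estimated as the fraction of documents in class c that contain t. The function name `rank_features`, the uniform default class prior, and the toy documents are assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter, defaultdict


def rank_features(docs, labels, class_prior=None):
    """Rank terms by P(t) = sum_c P(c) * P(t | c).

    docs: list of token lists; labels: one class label per document.
    class_prior: optional dict mapping class -> P(c). Defaults to a uniform
    prior, which is an assumption of this sketch, not the paper's choice.
    """
    docs_per_class = Counter(labels)
    # doc_freq[c][t] = number of documents of class c that contain term t
    doc_freq = defaultdict(Counter)
    for tokens, label in zip(docs, labels):
        doc_freq[label].update(set(tokens))

    classes = sorted(docs_per_class)
    if class_prior is None:
        class_prior = {c: 1.0 / len(classes) for c in classes}

    vocab = set()
    for c in classes:
        vocab.update(doc_freq[c])

    scores = {
        t: sum(class_prior[c] * doc_freq[c][t] / docs_per_class[c] for c in classes)
        for t in vocab
    }
    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy data (hypothetical, not from the Reuters collection).
docs = [["oil", "price"], ["oil", "export"], ["wheat", "price"]]
labels = ["crude", "crude", "grain"]
ranked = rank_features(docs, labels)
top_terms = [term for term, _ in ranked[:2]]
```

With empirical class priors this score reduces to the overall document frequency of the term, so the choice of prior is what distinguishes the ranking in this sketch; the paper itself should be consulted for the actual criterion.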
Added 08 Jun 2010
Updated 08 Jun 2010
Type Conference
Year 2007
Where ISNN
Authors Ke Wu, Bao-Liang Lu, Masao Uchiyama, Hitoshi Isahara