Sciweavers

ECAI
2006
Springer
15 years 8 months ago
Text Sampling and Re-Sampling for Imbalanced Authorship Identification Cases
Authorship identification can be seen as a single-label multi-class text categorization problem. Very often, there are extremely few training texts at least for some of the candida...
Efstathios Stamatatos
119
Voted
ISNN
2007
Springer
15 years 11 months ago
A Probabilistic Approach to Feature Selection for Multi-class Text Categorization
Abstract. In this paper, we propose a probabilistic approach to feature selection for multi-class text categorization. Specifically, we regard document class and occurrence of eac...
Ke Wu, Bao-Liang Lu, Masao Uchiyama, Hitoshi Isaha...