Sciweavers

11 search results - page 1 / 3
» Summarization as Feature Selection for Document Categorizati...
Sort
View
TAL
2010
Springer
13 years 2 months ago
Summarization as Feature Selection for Document Categorization on Small Datasets
Abstract. Most common feature selection techniques for document categorization are supervised and require lots of training data in order to accurately capture the descriptive and d...
Emmanuel Anguiano-Hernández, Luis Villase&n...
ADMA
2006
Springer
139views Data Mining» more  ADMA 2006»
13 years 10 months ago
Semantic Scoring Based on Small-World Phenomenon for Feature Selection in Text Mining
This paper proposes an effective scoring scheme for feature selection in Text Mining, using characteristics of Small-World Phenomenon on the semantic networks of documents. Our foc...
Chong Huang, YongHong Tian, Tiejun Huang, Wen Gao
ICML
2004
IEEE
14 years 5 months ago
Text categorization with many redundant features: using aggressive feature selection to make SVMs competitive with C4.5
Text categorization algorithms usually represent documents as bags of words and consequently have to deal with huge numbers of features. Most previous studies found that the major...
Evgeniy Gabrilovich, Shaul Markovitch
AI
2009
Springer
13 years 11 months ago
An Empirical Study of Category Skew on Feature Selection for Text Categorization
In this paper, we present an empirical comparison of the effects of category skew on six feature selection methods. The methods were evaluated on 36 datasets generated from the 20...
Mondelle Simeon, Robert J. Hilderman
IFIP12
2004
13 years 6 months ago
Impact on Performance of Hypertext Classification of Selective Rich HTML Capture
: Hypertext categorization is the automatic classification of web documents into predefined classes. It poses new challenges for automatic categorization because of the rich inform...
Houda Benbrahim, Max Bramer