Sciweavers

» Metric Learning for Text Documents
ADC
2003
Springer
15 years 2 months ago
Document Classification via Structure Synopses
Information available on the Internet is frequently supplied simply as plain ASCII text, structured according to orthographic and semantic conventions. Traditional document classi...
Liping Ma, John Shepherd, Anh Nguyen
HIS
2003
15 years 17 days ago
Evolving Better Stoplists for Document Clustering and Web Intelligence
Text classification, document clustering and similar document analysis tasks are currently the subject of significant global research, since such areas underpin web intelligence,...
Mark P. Sinka, David Corne
ICML
2003
IEEE
15 years 4 months ago
An Evaluation on Feature Selection for Text Clustering
Feature selection methods have been successfully applied to text categorization but seldom applied to text clustering due to the unavailability of class label information. In this...
Tao Liu, Shengping Liu, Zheng Chen, Wei-Ying Ma
ICML
2004
IEEE
16 years 5 hours ago
Text categorization with many redundant features: using aggressive feature selection to make SVMs competitive with C4.5
Text categorization algorithms usually represent documents as bags of words and consequently have to deal with huge numbers of features. Most previous studies found that the major...
Evgeniy Gabrilovich, Shaul Markovitch
IFIP12
2004
15 years 17 days ago
Impact on Performance of Hypertext Classification of Selective Rich HTML Capture
Hypertext categorization is the automatic classification of web documents into predefined classes. It poses new challenges for automatic categorization because of the rich inform...
Houda Benbrahim, Max Bramer