Sciweavers

» Document Classification Using Multiword Features
TREC
2004
14 years 11 months ago
Feature Generation, Feature Selection, Classifiers, and Conceptual Drift for Biomedical Document Triage
We approached the problem of classifying papers for the TREC 2004 Genomics Track triage task as a four step process: feature generation, feature selection, classifier training, an...
Aaron M. Cohen, Ravi Teja Bhupatiraju, William R. ...
DEXAW
2007
IEEE
15 years 3 months ago
Classifying XML Documents by Using Genre Features
The categorization of documents is traditionally topic-based. This paper presents a complementary analysis of research and experiments on genre to show that encouraging results ca...
Malcolm Clark, Stuart N. K. Watt
SIGIR
2002
ACM
14 years 9 months ago
Automatic classification in product catalogs
In this paper, we present the AutoCat system for product classification. AutoCat uses a vector space model, modified to consider product attributes unavailable in traditional docu...
Ben Wolin
DMIN
2006
14 years 11 months ago
Effect of Document Representation on the Performance of Medical Document Classification
Text classification in the medical domain is a real world problem with wide applicability. This paper investigates extensively the effect of text representation approaches on the p...
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...
CIKM
2009
Springer
15 years 1 months ago
Improving binary classification on text problems using differential word features
We describe an efficient technique to weigh word-based features in binary classification tasks and show that it significantly improves classification accuracy on a range of proble...
Justin Martineau, Tim Finin, Anupam Joshi, Shamit ...