Sciweavers

506 search results - page 12 / 102
» Feature Selection for the Classification of Large Document C...
Sort
View
ELPUB
2006
ACM
15 years 3 months ago
Automated Building of OAI Compliant Repository from Legacy Collection
In this paper, we report on our experience with the creation of an automated, human-assisted process to extract metadata from documents in a large (>100,000), dynamically growi...
Jianfeng Tang, Kurt Maly, Steven J. Zeil, Mohammad...
IFIP12
2004
14 years 11 months ago
Impact on Performance of Hypertext Classification of Selective Rich HTML Capture
: Hypertext categorization is the automatic classification of web documents into predefined classes. It poses new challenges for automatic categorization because of the rich inform...
Houda Benbrahim, Max Bramer
IJCNN
2007
IEEE
15 years 4 months ago
Random Feature Subset Selection for Analysis of Data with Missing Features
Abstract - We discuss an ensemble-of-classifiers based algorithm for the missing feature problem. The proposed approach is inspired in part by the random subspace method, and in pa...
Joseph DePasquale, Robi Polikar
DOCENG
2006
ACM
15 years 3 months ago
NEWPAR: an automatic feature selection and weighting schema for category ranking
Category ranking provides a way to classify plain text documents into a pre-determined set of categories. This work proposes to have a look at typical document collections and ana...
Fernando Ruiz-Rico, José Luis Vicedo Gonz&a...
ECIR
2003
Springer
14 years 11 months ago
Hierarchical Classification of HTML Documents with WebClassII
This paper describes a new method for the classification of a HTML document into a hierarchy of categories. The hierarchy of categories is involved in all phases of automated docum...
Michelangelo Ceci, Donato Malerba