Sciweavers

506 search results - page 17 / 102
» Feature Selection for the Classification of Large Document C...
Sort
View
SIGIR
2005
ACM
15 years 3 months ago
A phonotactic-semantic paradigm for automatic spoken document classification
We demonstrate a phonotactic-semantic paradigm for spoken document categorization. In this framework, we define a set of acoustic words instead of lexical words to represent acous...
Bin Ma, Haizhou Li
ICPR
2008
IEEE
15 years 11 months ago
Combining content and structure similarity for XML document classification using composite SVM kernels
Combination of structure and content features is necessary for effective retrieval and classification of XML documents. Composite kernels provide a way for fusion of content and s...
Pabitra Mitra, Saptarshi Ghosh
PAKDD
2000
ACM
128views Data Mining» more  PAKDD 2000»
15 years 1 months ago
A Comparative Study of Classification Based Personal E-mail Filtering
This paper addresses personal E-mail filtering by casting it in the framework of text classification. Modeled as semi-structured documents, Email messages consist of a set of field...
Yanlei Diao, Hongjun Lu, Dekai Wu
CRV
2005
IEEE
201views Robotics» more  CRV 2005»
15 years 3 months ago
Minimum Bayes Error Features for Visual Recognition by Sequential Feature Selection and Extraction
The extraction of optimal features, in a classification sense, is still quite challenging in the context of large-scale classification problems (such as visual recognition), inv...
Gustavo Carneiro, Nuno Vasconcelos
WEBI
2005
Springer
15 years 3 months ago
A Semi-Supervised Document Clustering Algorithm Based on EM
Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the cat...
Leonardo Rigutini, Marco Maggini