Sciweavers

168 search results - page 6 / 34
» Document Classification Using Multiword Features
ISIWI
2000
14 years 11 months ago
Automatic Document Classification - A thorough Evaluation of various Methods
(Automatic) document classification is generally defined as content-based assignment of one or more predefined categories to documents. Usually, machine learning, statistical patt...
Christoph Goller, J. Löning, T. Will, W. Wolf...
ICPR
2008
IEEE
15 years 10 months ago
Combining content and structure similarity for XML document classification using composite SVM kernels
Combination of structure and content features is necessary for effective retrieval and classification of XML documents. Composite kernels provide a way for fusion of content and s...
Pabitra Mitra, Saptarshi Ghosh
KDD
2004
ACM
15 years 10 months ago
Boosting for Text Classification with Semantic Features
Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic...
Stephan Bloehdorn, Andreas Hotho
JIIS
2006
14 years 9 months ago
Using KCCA for Japanese-English cross-language information retrieval and document classification
Kernel Canonical Correlation Analysis (KCCA) is a method of correlating linear relationships between two variables in a kernel-defined feature space. A machine learning algorithm b...
Yaoyong Li, John Shawe-Taylor
ECIR
2003
Springer
14 years 11 months ago
Hierarchical Classification of HTML Documents with WebClassII
This paper describes a new method for the classification of an HTML document into a hierarchy of categories. The hierarchy of categories is involved in all phases of automated docum...
Michelangelo Ceci, Donato Malerba