Sciweavers

168 search results - page 4 / 34
» Document Classification Using Multiword Features
IPM
2002
14 years 9 months ago
A feature mining based approach for the classification of text documents into disjoint classes
This paper proposes a new approach for classifying text documents into two disjoint classes. The new approach is based on extracting patterns, in the form of two logical expressio...
Salvador Nieto Sánchez, Evangelos Triantaph...
COLING
2002
14 years 9 months ago
Text Categorization using Feature Projections
This paper proposes a new approach for text categorization, based on a feature projection technique. In our approach, training data are represented as the projections of training ...
Youngjoong Ko, Jungyun Seo
ECIR
2004
Springer
14 years 11 months ago
Complex Linguistic Features for Text Classification: A Comprehensive Study
Previous research on advanced representations for document retrieval has shown that statistical state-of-the-art models are not improved by a variety of different ling...
Alessandro Moschitti, Roberto Basili
ADC
2003
Springer
15 years 1 month ago
Document Classification via Structure Synopses
Information available on the Internet is frequently supplied simply as plain ASCII text, structured according to orthographic and semantic conventions. Traditional document classi...
Liping Ma, John Shepherd, Anh Nguyen
ICDAR
2003
IEEE
15 years 2 months ago
Document page similarity based on layout visual saliency: Application to query by example and document classification
In this paper we propose to define a measure of visual similarity to compare different pages in a corpus. This measure is based on the analysis of the visual layout saliency of th...
Véronique Eglin, Stéphane Bres