13 years 1 month ago
Using top n Recognition Candidates to Categorize On-line Handwritten Documents
The traditional weighting schemes used in text categorization for the vector space model (VSM) cannot exploit information intrinsic to texts obtained through on-line handwriting r...
Sebastián Peña Saldarriaga, Emmanuel...
13 years 1 month ago
Trash article detection using categorization techniques
We explore techniques for detecting news articles containing invalid information, using the help of text categorization technology. The information that exists on the World Wide W...
Christos Bouras, Vassilis Tsogkas, Vassilis Poulop...
13 years 2 months ago
A comparative study on two large-scale hierarchical text classification tasks' solutions
Patent classification is a large scale hierarchical text classification (LSHTC) task. Though comprehensive comparisons, either learning algorithms or feature selection strategies...
Jian Zhang, Hai Zhao, Bao-Liang Lu
JIIS 2002
13 years 3 months ago
Hidden Markov Models for Text Categorization in Multi-Page Documents
In the traditional setting, text categorization is formulated as a concept learning problem where each instance is a single isolated document. However, this perspective is not appr...
Paolo Frasconi, Giovanni Soda, Alessandro Vullo
13 years 4 months ago
Text Categorization using Feature Projections
This paper proposes a new approach for text categorization, based on a feature projection technique. In our approach, training data are represented as the projections of training ...
Youngjoong Ko, Jungyun Seo
BMCBI 2005
13 years 4 months ago
Data-poor categorization and passage retrieval for Gene Ontology Annotation in Swiss-Prot
Background: In the context of the BioCreative competition, where training data were very sparse, we investigated two complementary tasks: 1) given a Swiss-Prot triplet, containing...
Frédéric Ehrler, Antoine Geissbü...
ISCI 2007
13 years 4 months ago
On the strength of hyperclique patterns for text categorization
The use of association patterns for text categorization has attracted great interest and a variety of useful methods have been developed. However, the key characteristics of patte...
Tieyun Qian, Hui Xiong, Yuanzhen Wang, Enhong Chen
ESWA 2007
13 years 4 months ago
A novel feature selection algorithm for text categorization
With the development of the web, large numbers of documents are available on the Internet. Digital libraries, news sources and inner data of companies surge more and more. Automat...
Wenqian Shang, Houkuan Huang, Haibin Zhu, Yongmin ...
JIPS 2008
13 years 4 months ago
Inverted Index based Modified Version of KNN for Text Categorization
This research proposes a new strategy where documents are encoded into string vectors and modified version of KNN to be adaptable to string vectors for text categorization. Tradi...
Taeho Jo
13 years 4 months ago
Sequential patterns for text categorization
Text categorization is a well-known task based essentially on statistical approaches using neural networks, Support Vector Machines and other machine learning algorithms. Texts are...
Simon Jaillet, Anne Laurent, Maguelonne Teisseire