Sciweavers

536 search results - page 16 / 108
» Feature Engineering for Text Classification
Sort
View
KDD
2002
ACM
179views Data Mining» more  KDD 2002»
15 years 10 months ago
Combining clustering and co-training to enhance text classification using unlabelled data
In this paper, we present a new co-training strategy that makes use of unlabelled data. It trains two predictors in parallel, with each predictor labelling the unlabelled data for...
Bhavani Raskutti, Herman L. Ferrá, Adam Kow...
AVSS
2009
IEEE
14 years 7 months ago
A Classification Architecture Based on Connected Components for Text Detection in Unconstrained Environments
The paper presents a method for efficient text detection in unconstrained environments, based on image features derived from connected components and on a classification architect...
Luca Zini, Augusto Destrero, Francesca Odone
CORR
2002
Springer
93views Education» more  CORR 2002»
14 years 9 months ago
Ellogon: A New Text Engineering Platform
This paper presents Ellogon, a multi-lingual, cross-platform, general-purpose text engineering environment. Ellogon was designed in order to aid both researchers in natural langua...
Georgios Petasis, Vangelis Karkaletsis, Georgios P...
EMNLP
2010
14 years 7 months ago
Cross Language Text Classification by Model Translation and Semi-Supervised Learning
In this paper, we introduce a method that automatically builds text classifiers in a new language by training on already labeled data in another language. Our method transfers the...
Lei Shi, Rada Mihalcea, Mingjun Tian
CIARP
2006
Springer
15 years 1 months ago
Oscillating Feature Subset Search Algorithm for Text Categorization
Abstract. A major characteristic of text document categorization problems is the extremely high dimensionality of text data. In this paper we explore the usability of the Oscillati...
Jana Novovicová, Petr Somol, Pavel Pudil