Sciweavers

CASCON
2006
Exploring a new space of features for document classification: figure clustering
Automatic document classification is an important step in organizing and mining documents. Information in documents is often conveyed using both text and images that complement ea...
Nawei Chen, Hagit Shatkay, Dorothea Blostein
ICDAR
2009
IEEE
Document Content Extraction Using Automatically Discovered Features
We report an automatic feature discovery method that achieves results comparable to a manually chosen, larger feature set on a document image content extraction problem: the locat...
Sui-Yu Wang, Henry S. Baird, Chang An
KDD
2005
ACM
On the use of linear programming for unsupervised text classification
We propose a new algorithm for dimensionality reduction and unsupervised text classification. We use mixture models as the underlying process for generating the corpus and utilize a novel,...
Mark Sandler
ICPR
2004
IEEE
Serialized Unsupervised Classifier for Adaptative Color Image Segmentation: Application to Digitized Ancient Manuscripts
This paper presents an adaptative algorithm for the segmentation of color images suited for document image analysis. The algorithm is based on a serialization of the k-means algor...
Frank Le Bourgeois, Hubert Emptoz, Yann Leydier