Sciweavers

167 search results - page 1 / 34
DRR
2010
13 years 6 months ago
Time and space optimization of document content classifiers
Scaling up document-image classifiers to handle an unlimited variety of document and image types poses serious challenges to conventional trainable classifier technologies. Highly...
Dawei Yin, Henry S. Baird, Chang An
SAC
2004
ACM
13 years 9 months ago
An optimized approach for KNN text categorization using P-trees
The importance of text mining stems from the availability of huge volumes of text databases holding a wealth of valuable information that needs to be mined. Text categorization is...
Imad Rahal, William Perrizo
ICDAR
2009
IEEE
13 years 2 months ago
Document Content Extraction Using Automatically Discovered Features
We report an automatic feature discovery method that achieves results comparable to a manually chosen, larger feature set on a document image content extraction problem: the locat...
Sui-Yu Wang, Henry S. Baird, Chang An
CASCON
2006
13 years 5 months ago
Exploring a new space of features for document classification: figure clustering
Automatic document classification is an important step in organizing and mining documents. Information in documents is often conveyed using both text and images that complement ea...
Nawei Chen, Hagit Shatkay, Dorothea Blostein
PR
2006
13 years 4 months ago
Document zone content classification and its performance evaluation
This paper describes an algorithm for the determination of zone content type of a given zone within a document image. We take a statistical based approach and represent each zone ...
Yalin Wang, Ihsin T. Phillips, Robert M. Haralick