Sciweavers

48 search results - page 3 / 10
» Rule-based word clustering for document metadata extraction
WEBI
2005
Springer
13 years 11 months ago
Metadata Propagation in the Web Using Co-Citations
Given the large heterogeneity of the World Wide Web, using metadata on the search engine side seems to be a useful track for information retrieval. Though, because a manual quali...
Camille Prime-Claverie, Michel Beigbeder, Thierry ...
ESANN
2007
13 years 7 months ago
Kernel PCA based clustering for inducing features in text categorization
We study dimensionality reduction or feature selection in the text document categorization problem. We focus on the first step in building text categorization systems, that is the cho...
Zsolt Minier, Lehel Csató
ICDE
2007
IEEE
14 years 7 months ago
Organizing Hidden-Web Databases by Clustering Visible Web Documents
In this paper we address the problem of organizing hidden-Web databases. Given a heterogeneous set of Web forms that serve as entry points to hidden-Web databases, our goal is to ...
Luciano Barbosa, Juliana Freire, Altigran Soares d...
DIAL
2004
IEEE
13 years 9 months ago
Forensic Handwritten Document Retrieval System
Document storage and retrieval capabilities of the CEDAR-FOX forensic handwritten document examination system are described. The system is designed for automated and semi-automate...
Sargur N. Srihari, Zhixin Shi
ICDAR
2003
IEEE
13 years 11 months ago
Features for Word Spotting in Historical Manuscripts
For the transition from traditional to digital libraries, the large number of handwritten manuscripts that exist poses a great challenge. Easy access to such collections requires a...
Toni M. Rath, R. Manmatha