Sciweavers

572 search results - page 51 / 115
» Winnowing-based text clustering
ICDAR
2009
IEEE
15 years 6 months ago
Text Lines and Snippets Extraction for 19th Century Handwriting Documents Layout Analysis
In this paper we propose a new approach to improving electronic editions of human science corpora by efficiently estimating the structure of manuscript pages. In any handwriti...
Vincent Malleron, Véronique Eglin, Hubert E...
WIA
2001
Springer
15 years 4 months ago
Finite-State Transducer Cascade to Extract Proper Names in Texts
This article describes a finite-state cascade for extracting person names from French texts. We extract these proper names in order to categorize and cluster texts with...
Nathalie Friburger, Denis Maurel
ICDAR
1997
IEEE
15 years 4 months ago
Enhancing Degraded Document Images via Bitmap Clustering and Averaging
Proper display and accurate recognition of document images are often hampered by degradations caused by poor scanning or transmission conditions. We propose a method to enhance su...
John D. Hobby, Tin Kam Ho
NAACL
2007
15 years 1 month ago
Clustered Sub-Matrix Singular Value Decomposition
This paper presents an alternative algorithm based on the singular value decomposition (SVD) that creates vector representation for linguistic units with reduced dimensionality. T...
Fang Huang, Yorick Wilks
ACL
1998
15 years 1 month ago
Automatic Retrieval and Clustering of Similar Words
Bootstrapping semantics from text is one of the greatest challenges in natural language learning. We first define a word similarity measure based on the distributional pattern of ...
Dekang Lin