Sciweavers

298 search results - page 28 / 60
» An information-theoretic measure for document similarity
ICDAR
2009
IEEE
15 years 6 months ago
Keyword Spotting in Document Images through Word Shape Coding
With large databases of document images available, a method for users to find keywords in documents will be useful. One approach is to perform Optical Character Recognition (OCR) ...
Shuyong Bai, Linlin Li, Chew Lim Tan
KES
2004
Springer
15 years 5 months ago
Knowledge Extraction from Semi-structured Data Based on Fuzzy Techniques
In this work we propose a fuzzy technique to compare XML documents belonging to a semi-structured flow and sharing a common vocabulary of tags. Our approach is based on t...
Paolo Ceravolo, Maria Cristina Nocerino, Marco Viv...
IDA
2010
Springer
14 years 10 months ago
Selecting the Links in BisoNets Generated from Document Collections
According to Koestler, the notion of a bisociation denotes a connection between pieces of information from habitually separated domains or categories. In this paper, we consider a ...
Marc Segond, Christian Borgelt
ICML
2005
ACM
16 years 19 days ago
Multi-way distributional clustering via pairwise interactions
We present a novel unsupervised learning scheme that simultaneously clusters variables of several types (e.g., documents, words and authors) based on pairwise interactions between...
Ron Bekkerman, Ran El-Yaniv, Andrew McCallum
ICDAR
1997
IEEE
15 years 4 months ago
Shape Matrices as a Mixed Shape Factor for Off-line Signature Verification
Shape matrices have been used as a representation of planar shapes like industrial parts or printed characters. In this paper, we investigate the use of shape matrices as a mixed ...
Robert Sabourin, Jean-Pierre Drouhard, Etienne Sum...