Sciweavers

8795 search results - page 85 / 1759
» Measuring Generality of Documents
Sort
View
121
Voted
DOCENG
2006
ACM
15 years 9 months ago
Evaluating invariances in document layout functions
With the development of variable-data-driven digital presses where each document printed is potentially unique there is a need for pre-press optimization to identify material that...
Alexander J. Macdonald, David F. Brailsford, John ...
98
Voted
DEXAW
2007
IEEE
94views Database» more  DEXAW 2007»
15 years 10 months ago
A system for summary-document similarity in notary domain
In this paper we propose a methodology to perform a comparison between a legal document and its related handwritten summary. We thus describe the algorithms that verify when a hum...
Carmine Cesarano, Antonino Mazzeo, Antonio Picarie...
126
Voted
KDD
2004
ACM
195views Data Mining» more  KDD 2004»
16 years 4 months ago
Improved robustness of signature-based near-replica detection via lexicon randomization
Detection of near duplicate documents is an important problem in many data mining and information filtering applications. When faced with massive quantities of data, traditional d...
Aleksander Kolcz, Abdur Chowdhury, Joshua Alspecto...
127
Voted
KDD
2005
ACM
166views Data Mining» more  KDD 2005»
16 years 4 months ago
A general model for clustering binary data
Clustering is the problem of identifying the distribution of patterns and intrinsic correlations in large data sets by partitioning the data points into similarity classes. This p...
Tao Li
127
Voted
ICSE
1999
IEEE-ACM
15 years 8 months ago
Generalizing Perspective-Based Inspection to Handle Object-Oriented Development Artifacts
The value of software inspection for uncovering defects early in the development lifecycle has been well documented. Of the various types of inspection methods published to date, ...
Oliver Laitenberger, Colin Atkinson