Sciweavers

8795 search results - page 46 / 1759
» Measuring Generality of Documents
Sort
View
134
Voted
SIGIR
2004
ACM
15 years 9 months ago
Constructing a text corpus for inexact duplicate detection
As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. The goal of this work i...
Jack G. Conrad, Cindy P. Schriber
SIGIR
2003
ACM
15 years 8 months ago
Retrieval and novelty detection at the sentence level
Previous research in novelty detection has focused on the task of finding novel material, given a set or stream of documents on a certain topic. This study investigates the more ...
James Allan, Courtney Wade, Alvaro Bolivar
102
Voted
SIGMOD
2010
ACM
166views Database» more  SIGMOD 2010»
15 years 1 months ago
Efficient two-sided error-tolerant search
We consider fast two-sided error-tolerant search that is robust against errors both on the query side (type alogrithm, find documents with algorithm) as well as on the document si...
Hannah Bast, Marjan Celikik
ICDAR
2011
IEEE
14 years 3 months ago
Evaluating the Rarity of Handwriting Formations
—Identifying unusual or unique characteristics of an observed sample in useful in forensics in general and handwriting analysis in particular. Rarity is formulated as the probabi...
Sargur N. Srihari
SIGIR
2004
ACM
15 years 9 months ago
Evaluation of filtering current news search results
We describe an evaluation of result set filtering techniques for providing ultra-high precision in the task of presenting related news for general web queries. In this task, the n...
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury...