Search Sciweavers | Sciweavers

241 search results - page 10 / 49

» Detecting Co-Derivative Documents in Large Text Collections

167

click to vote

DRR
2008

143views Document Analysis» more DRR 2008»

Segmentation-based retrieval of document images from diverse collections

15 years 7 months ago

Download www.cse.lehigh.edu

We describe a methodology for retrieving document images from large extremely diverse collections. First we perform content extraction, that is the location and measurement of reg...

Michael A. Moll, Henry S. Baird

claim paper

Read More »

146

click to vote

DGO
2006

134views Education» more DGO 2006»

Next steps in near-duplicate detection for eRulemaking

15 years 7 months ago

Download www.cs.cmu.edu

Large volume public comment campaigns and web portals that encourage the public to customize form letters produce many near-duplicate documents, which increases processing and sto...

Hui Yang, Jamie Callan, Stuart W. Shulman

claim paper

Read More »

148

click to vote

QSIC
2007
IEEE

165views Software Engineering» more QSIC 2007»

Automatic Quality Assessment of SRS Text by Means of a Decision-Tree-Based Text Classifier

16 years 2 days ago

Download users.encs.concordia.ca

The success of a software project is largely dependent upon the quality of the Software Requirements Specification (SRS) document, which serves as a medium to communicate user req...

Ishrar Hussain, Olga Ormandjieva, Leila Kosseim

claim paper

Read More »

198

click to vote

SIGIR
2008
ACM

176views Information Technology» more SIGIR 2008»

SpotSigs: robust and efficient near duplicate detection in large web collections

15 years 5 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

claim paper

Read More »

132

click to vote

COLING
2000

88views Computational Linguistics» more COLING 2000»

Experiments in Automated Lexicon Building for Text Searching

15 years 7 months ago

Download www.lirmm.fr

This paper describes experiments in the automatic construction of lexicons that would be useful in searching large document collections for text fragments that address a specific ...

Barry Schiffman, Kathleen McKeown

claim paper

Read More »

« Prev « First page 10 / 49 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers