Search Sciweavers | Sciweavers

174 search results - page 1 / 35

» On Finding Duplication and Near-Duplication in Large Softwar...

click to vote

WCRE
1995
IEEE

105views Software Engineering» more WCRE 1995»

On Finding Duplication and Near-Duplication in Large Software Systems

13 years 8 months ago

Download plg.uwaterloo.ca

This paper describes how a program called dup can be used to locate instances of duplication or nearduplication in a software system. D u p reports both textually identical sectio...

Brenda S. Baker

claim paper

Read More »

click to vote

ICPR
2010
IEEE

233views Computer Vision» more ICPR 2010»

Beyond "Near Duplicates": Learning Hash Codes for Efficient Similar-Image Retrieval

13 years 2 months ago

Download www.mangolassi.org

Finding similar images in a large database is an important, but often computationally expensive, task. In this paper, we present a two-tier similar-image retrieval system with the...

Shumeet Baluja, Michele Covell

claim paper

Read More »

click to vote

ICPR
2010
IEEE

203views Computer Vision» more ICPR 2010»

Beyond "Near-Duplicates": Learning Hash Codes for Efficient Similar-Image Retrieval

13 years 8 months ago

Download static.googleusercontent.com

Finding similar images in a large database is an important, but often computationally expensive, task. In this paper, we present a two-tier similar-image retrieval system with the...

Shumeet Baluja, Michele Covell

claim paper

Read More »

click to vote

SIGIR
2008
ACM

176views Information Technology» more SIGIR 2008»

SpotSigs: robust and efficient near duplicate detection in large web collections

13 years 4 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

claim paper

Read More »

click to vote

ICCS
2009
Springer

107views Applied Computing» more ICCS 2009»

Frequent Itemset Mining for Clustering Near Duplicate Web Documents

13 years 11 months ago

Download www.mendeley.com

A vast amount of documents in the Web have duplicates, which is a challenge for developing eﬃcient methods that would compute clusters of similar documents. In this paper we use ...

Dmitry I. Ignatov, Sergei O. Kuznetsov

claim paper

Read More »

« Prev « First page 1 / 35 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers