Search Sciweavers | Sciweavers

19

WWW
2008
ACM

214views Internet Technology» more WWW 2008»

14 years 6 months ago

Efficient similarity joins for near duplicate detection

Download www2008.org

With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...

Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...

claim paper

Read More »

10

click to vote

P2P
2010
IEEE

202views Communications» more P2P 2010»

Optimizing Near Duplicate Detection for P2P Networks

13 years 3 months ago

Download www.l3s.de

—In this paper, we propose a probabilistic algorithm for detecting near duplicate text, audio, and video resources efﬁciently and effectively in large-scale P2P systems. To thi...

Odysseas Papapetrou, Sukriti Ramesh, Stefan Siersd...

claim paper

Read More »

22

click to vote

SIGIR
2008
ACM

176views Information Technology» more SIGIR 2008»

SpotSigs: robust and efficient near duplicate detection in large web collections

13 years 5 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

claim paper

Read More »

33

click to vote

GIS
2010
ACM

312views Automated Reasoning» more GIS 2010»

Detecting nearly duplicated records in location datasets

13 years 3 months ago

Download research.microsoft.com

The quality of a local search engine, such as Google and Bing Maps, heavily relies on its geographic datasets. Typically, these datasets are obtained from multiple sources, e.g., ...

Yu Zheng, Xixuan Fen, Xing Xie, Shuang Peng, James...

claim paper

Read More »

22

click to vote

MMM
2008
Springer

156views Multimedia» more MMM 2008»

Cross-Lingual Retrieval of Identical News Events by Near-Duplicate Video Segment Detection

13 years 11 months ago

Download www.murase.nuie.nagoya-u.ac.jp

Recently, for reusing large quantities of accumulated news video, technology for news topic searching and tracking has become necessary. Moreover, since we need to understand a cer...

Akira Ogawa, Tomokazu Takahashi, Ichiro Ide, Hiros...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers