Search Sciweavers | Sciweavers

43 search results - page 5 / 9

» Efficient similarity joins for near duplicate detection

132

click to vote

ICMCS
2006
IEEE

188views Multimedia» more ICMCS 2006»

Large-Scale Duplicate Detection for Web Image Search

15 years 8 months ago

Download www.cecs.uci.edu

Finding visually identical images in large image collections is important for many applications such as intelligence propriety protection and search result presentation. Several a...

Bin Wang, Zhiwei Li, Mingjing Li, Wei-Ying Ma

claim paper

Read More »

113

click to vote

SIGIR
2010
ACM

169views Information Technology» more SIGIR 2010»

Efficient partial-duplicate detection based on sequence matching

14 years 8 months ago

Download homepage.fudan.edu.cn

With the ever-increasing growth of the Internet, numerous copies of documents become serious problem for search engine, opinion mining and many other web applications. Since parti...

Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang

claim paper

Read More »

107

click to vote

DEXA
2004
Springer

136views Database» more DEXA 2004»

PC-Filter: A Robust Filtering Technique for Duplicate Record Detection in Large Databases

15 years 7 months ago

Download eprints.usq.edu.au

: In this paper, we will propose PC-Filter (PC stands for Partition Comparison), a robust data filter for approximately duplicate record detection in large databases. PC-Filter dis...

Ji Zhang, Tok Wang Ling, Robert M. Bruckner, Han L...

claim paper

Read More »

129

click to vote

ICAIL
2007
ACM

147views Artificial Intelligence» more ICAIL 2007»

Essential deduplication functions for transactional databases in law firms

15 years 5 months ago

Download www.conradweb.org

As massive document repositories and knowledge management systems continue to expand, in proprietary environments as well as on the Web, the need for duplicate detection becomes i...

Jack G. Conrad, Edward L. Raymond

claim paper

Read More »

click to vote

INFOCOM
2010
IEEE

158views Communications» more INFOCOM 2010»

14 years 12 months ago

Efficient Similarity Estimation for Systems Exploiting Data Redundancy

Download www.cs.cmu.edu

Many modern systems exploit data redundancy to improve efficiency. These systems split data into chunks, generate identifiers for each of them, and compare the identifiers among ot...

Kanat Tangwongsan, Himabindu Pucha, David G. Ander...

claim paper

Read More »

« Prev « First page 5 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers