Sciweavers

9 search results - page 1 / 2
» Multi-component similarity method for web product duplicate ...
Sort
View
WWW
2008
ACM
14 years 5 months ago
Efficient similarity joins for near duplicate detection
With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...
Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...
ICMCS
2006
IEEE
188views Multimedia» more  ICMCS 2006»
13 years 10 months ago
Large-Scale Duplicate Detection for Web Image Search
Finding visually identical images in large image collections is important for many applications such as intelligence propriety protection and search result presentation. Several a...
Bin Wang, Zhiwei Li, Mingjing Li, Wei-Ying Ma
SIGIR
2010
ACM
12 years 11 months ago
Efficient partial-duplicate detection based on sequence matching
With the ever-increasing growth of the Internet, numerous copies of documents become serious problem for search engine, opinion mining and many other web applications. Since parti...
Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang
BIRD
2007
Springer
13 years 8 months ago
An Evaluation of Text Retrieval Methods for Similarity Search of Multi-dimensional NMR-Spectra
Abstract. Searching and mining nuclear magnetic resonance (NMR)spectra of naturally occurring substances is an important task to investigate new potentially useful chemical compoun...
Alexander Hinneburg, Andrea Porzel, Karina Wolfram
ICAIL
2007
ACM
13 years 8 months ago
Essential deduplication functions for transactional databases in law firms
As massive document repositories and knowledge management systems continue to expand, in proprietary environments as well as on the Web, the need for duplicate detection becomes i...
Jack G. Conrad, Edward L. Raymond