Sciweavers

SIGIR
2008
ACM
13 years 4 months ago
SpotSigs: robust and efficient near duplicate detection in large web collections
Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...
Martin Theobald, Jonathan Siddharth, Andreas Paepc...
ICMCS
2006
IEEE
13 years 10 months ago
Large-Scale Duplicate Detection for Web Image Search
Finding visually identical images in large image collections is important for many applications such as intellectual property protection and search result presentation. Several a...
Bin Wang, Zhiwei Li, Mingjing Li, Wei-Ying Ma
MM
2009
ACM
13 years 9 months ago
MyFinder: near-duplicate detection for large image collections
The explosive growth of multimedia data poses serious challenges to data storage, management and search. Efficient near-duplicate detection is one of the required technologies for...
Xin Yang, Qiang Zhu, Kwang-Ting Cheng
SIGIR
2010
ACM
12 years 11 months ago
Efficient partial-duplicate detection based on sequence matching
With the ever-increasing growth of the Internet, numerous copies of documents have become a serious problem for search engines, opinion mining, and many other web applications. Since parti...
Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang
ICAIL
2007
ACM
13 years 8 months ago
Essential deduplication functions for transactional databases in law firms
As massive document repositories and knowledge management systems continue to expand, in proprietary environments as well as on the Web, the need for duplicate detection becomes i...
Jack G. Conrad, Edward L. Raymond