Search Sciweavers | Sciweavers

26 search results - page 2 / 6

» Partial duplicate detection for large book collections

CIKM
2003
Springer

130views Information Technology» more CIKM 2003»

Online duplicate document detection: signature reliability in a dynamic retrieval environment

15 years 7 months ago

Download www.conradweb.org

As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. Few users wish to retri...

Jack G. Conrad, Xi S. Guo, Cindy P. Schriber

MM
2009
ACM

249views Multimedia» more MM 2009»

MyFinder: near-duplicate detection for large image collections

15 years 6 months ago

Download www.uweb.ucsb.edu

The explosive growth of multimedia data poses serious challenges to data storage, management and search. Efficient near-duplicate detection is one of the required technologies for...

Xin Yang, Qiang Zhu, Kwang-Ting Cheng

WCRE
1999
IEEE

108views Software Engineering» more WCRE 1999»

Partial Redesign of Java Software Systems Based on Clone Analysis

15 years 6 months ago

Download www.swen.uwaterloo.ca

Code duplication, plausibly caused by copying source code and slightly modifying it, is often observed in large systems. Clone detection and documentation have been investigated b...

Magdalena Balazinska, Ettore Merlo, Michel Dagenai...

COLING
2010

108views Computational Linguistics» more COLING 2010»

Large Scale Parallel Document Mining for Machine Translation

14 years 8 months ago

Download static.googleusercontent.com

A distributed system is described that reliably mines parallel text from large corpora. The approach can be regarded as cross-language near-duplicate detection, enabled by an init...

Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe ...

ICAIL
2007
ACM

147views Artificial Intelligence» more ICAIL 2007»

Essential deduplication functions for transactional databases in law firms

15 years 5 months ago

Download www.conradweb.org

As massive document repositories and knowledge management systems continue to expand, in proprietary environments as well as on the Web, the need for duplicate detection becomes i...

Jack G. Conrad, Edward L. Raymond
