Sciweavers

260 search results - page 6 / 52
» Industry-scale duplicate detection
Sort
View
91
Voted
LREC
2008
130views Education» more  LREC 2008»
14 years 11 months ago
Detecting Co-Derivative Documents in Large Text Collections
We have analyzed the SPEX algorithm by Bernstein and Zobel (2004) for detecting co-derivative documents using duplicate n-grams. Although we totally agree with the claim that not ...
Jan Pomikálek, Pavel Rychlý
DATE
2005
IEEE
101views Hardware» more  DATE 2005»
15 years 3 months ago
Compiler-Directed Instruction Duplication for Soft Error Detection
In this work, we experiment with complier-directed instruction duplication to detect soft errors in VLIW datapaths . In the proposed approach, the compiler determines the instruct...
Jie S. Hu, Feihui Li, Vijay Degalahal, Mahmut T. K...
AAAI
2007
14 years 12 months ago
Parallel Structured Duplicate Detection
We describe a novel approach to parallelizing graph search using structured duplicate detection. Structured duplicate detection was originally developed as an approach to external...
Rong Zhou, Eric A. Hansen
HT
2010
ACM
14 years 7 months ago
Citation based plagiarism detection: a new approach to identify plagiarized work language independently
This paper describes a new approach towards detecting plagiarism and scientific documents that have been read but not cited. In contrast to existing approaches, which analyze docu...
Bela Gipp, Jöran Beel
CEAS
2007
Springer
15 years 3 months ago
Filtering Image Spam with Near-Duplicate Detection
A new trend in email spam is the emergence of image spam. Although current anti-spam technologies are quite successful in filtering text-based spam emails, the new image spams ar...
Zhe Wang, William K. Josephson, Qin Lv, Moses Char...