Sciweavers

ECAI
2008
Springer

Pedigree Tracking in the Face of Ancillary Content

13 years 6 months ago
Pedigree Tracking in the Face of Ancillary Content
The accurate tracking and retrieval of content pedigree is a quickly growing requirement as our abilities to create information assets increases exponentially. Plagiarism detection, accurate accreditation, and classification tasks all rely on the ability to determine where content is being used and where it originated. We present an approach to document pedigree tracking that is based on an efficient disk-based data structure and the use of two contrasting collections of historical text. These collections enable content of two types (or degrees of importance) to be defined and accounted for when locating documents with overlapping content. This approach is resilient in the face of substantial ancillary content and paraphrasing, two common sources of error in existing content tracking techniques.
Eugene Creswick, Emi Fujioka, Terrance Goan
Added 19 Oct 2010
Updated 19 Oct 2010
Type Conference
Year 2008
Where ECAI
Authors Eugene Creswick, Emi Fujioka, Terrance Goan
Comments (0)