Sciweavers

241 search results - page 20 / 49
» Detecting Co-Derivative Documents in Large Text Collections
Sort
View
CICLING
2010
Springer
15 years 1 months ago
Word Length n-Grams for Text Re-use Detection
Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...
Alberto Barrón-Cedeño, Chiara Basile...
ECCV
2008
Springer
15 years 11 months ago
Signature-Based Document Image Retrieval
As the most pervasive method of individual identification and document authentication, signatures present convincing evidence and provide an important form of indexing for effectiv...
Guangyu Zhu, Yefeng Zheng, David S. Doermann
CORIA
2006
14 years 11 months ago
On Combining Text and MeSH Searches to Improve the Retrieval of MEDLINE documents
The MEDLINE database is the world largest repository of bio-medical abstracts. It is a central information entry point for most biologists despite the growing availability of full-...
Fabrice Camous, Stephen Blott, Alan F. Smeaton
SDM
2009
SIAM
140views Data Mining» more  SDM 2009»
15 years 6 months ago
Straightforward Feature Selection for Scalable Latent Semantic Indexing.
Latent Semantic Indexing (LSI) has been validated to be effective on many small scale text collections. However, little evidence has shown its effectiveness on unsampled large sca...
Jun Yan, Shuicheng Yan, Ning Liu, Zheng Chen
DEXAW
1999
IEEE
91views Database» more  DEXAW 1999»
15 years 2 months ago
Document Analysis Techniques for the Infinite Memory Multifunction Machine
A system that saves a digital copy of every document that users copy, print, or fax, without asking the user, has recently been proposed. Referred to as the Infinite Memory Multif...
Jonathan J. Hull, Dar-Shyang Lee, John F. Cullen, ...