Sciweavers

82 search results - page 1 / 17
SIGIR
2004
ACM
13 years 10 months ago
A search engine for imaged documents in PDF files
Large quantities of documents on the Internet and in digital libraries are simply scanned and archived in image format, many of which are packed in PDF files. The word search tool pr...
Yue Lu, Li Zhang, Chew Lim Tan
DOCENG
2009
ACM
13 years 11 months ago
Object-level document analysis of PDF files
The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...
Tamir Hassan
DOCENG
2003
ACM
13 years 9 months ago
Two diet plans for fat PDF
As Adobe's Portable Document Format has exploded in popularity so too has the number of PDF generators, and predictably the quality of generated PDF varies considerably. This pa...
Thomas A. Phelps, Robert Wilensky
DIAL
2004
IEEE
13 years 8 months ago
Xed: A New Tool for eXtracting Hidden Structures from Electronic Documents
PDF has become a very common format for exchanging printable documents. Further, it can be easily generated from the major document formats, which make a huge number of PDF documents...
Karim Hadjar, Maurizio Rigamonti, Denis Lalanne, R...
BTW
2009
Springer
13 years 11 months ago
Retrieving Metadata for Your Local Scholarly Papers
We present a novel approach to retrieve metadata for scholarly papers stored locally as PDF files. A fingerprint is produced from the PDF fulltext to query an online metadata repo...
David Aumüller