Sciweavers

170 search results - page 14 / 34
» Text Retrieval from Document Images based on N-Gram Algorith...
AUSAI
2001
Springer
15 years 3 months ago
Fast Text Classification Using Sequential Sampling Processes
A central problem in information retrieval is the automated classification of text documents. While many existing methods achieve good levels of performance, they generally require...
Michael D. Lee
DAS
2010
Springer
14 years 9 months ago
Page frame detection for double page document images
Scanning two book pages at the same time helps to accelerate the scanning process but on the other hand introduces several difficulties if the user needs to have one page per imag...
Nikolaos Stamatopoulos, Basilios Gatos, Thodoris G...
WWW
2007
ACM
16 years 10 days ago
Deriving knowledge from figures for digital libraries
Figures in digital documents contain important information. Current digital libraries do not summarize and index information available within figures for document retrieval. We pr...
Xiaonan Lu, James Ze Wang, Prasenjit Mitra, C. Lee...
DOCENG
2010
ACM
15 years 23 days ago
FormCracker: interactive web-based form filling
Filling out document forms distributed by email or hosted on the Web is still problematic and usually requires a printer and scanner. Users commonly download and print forms, fill...
Laurent Denoue, John Adcock, Scott Carter, Patrick...
WWW
2007
ACM
16 years 10 days ago
Query-driven indexing for peer-to-peer text retrieval
We describe a query-driven indexing framework for scalable text retrieval over structured P2P networks. To cope with the bandwidth consumption problem that has been identified as ...
Gleb Skobeltsyn, Toan Luu, Karl Aberer, Martin Raj...