Sciweavers

316 search results - page 13 / 64
» Imaged Document Text Retrieval Without OCR
Sort
View
ICDAR
2011
IEEE
13 years 11 months ago
Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning
—Reading text from photographs is a challenging problem that has received a signicant amount of attention. Two key components of most systems are (i) text detection from images a...
Adam Coates, Blake Carpenter, Carl Case, Sanjeev S...
ICDAR
2011
IEEE
13 years 11 months ago
BLSTM Neural Network Based Word Retrieval for Hindi Documents
—Retrieval from Hindi document image collections is a challenging task. This is partly due to the complexity of the script, which has more than 800 unique ligatures. In addition,...
Raman Jain, Volkmar Frinken, C. V. Jawahar, Raghav...
ECIR
2009
Springer
15 years 9 months ago
Revisiting N-Gram Based Models for Retrieval in Degraded Large Collections
The traditional retrieval models based on term matching are not effective in collections of degraded documents (output of OCR or ASR systems for instance). This paper presents a n...
Javier Parapar, Ana Freire, Alvaro Barreiro
106
Voted
SIGIR
2008
ACM
14 years 11 months ago
Classifiers without borders: incorporating fielded text from neighboring web pages
Accurate web page classification often depends crucially on information gained from neighboring pages in the local web graph. Prior work has exploited the class labels of nearby p...
Xiaoguang Qi, Brian D. Davison
ICPR
2004
IEEE
16 years 26 days ago
Morphological Tagging Approach in Document Analysis of Invoices
In this paper a morphological tagging approach for document image invoice analysis is described. Tokens close by their morphology and confirmed in their location within different ...
Abdel Belaïd, Yolande Belaïd