Sciweavers

» Document Processing with LinkIT
EMNLP
2010
14 years 8 months ago
Evaluating Models of Latent Document Semantics in the Presence of OCR Errors
Models of latent document semantics such as the mixture of multinomials model and Latent Dirichlet Allocation have received substantial attention for their ability to discover top...
Daniel David Walker, William B. Lund, Eric K. Ring...
DAS
2010
Springer
15 years 1 months ago
Text extraction from graphical document images using sparse representation
A novel text extraction method from graphical document images is presented in this paper. Graphical document images containing text and graphics components are considered as two-d...
Thai V. Hoang, Salvatore Tabbone
DASFAA
2009
IEEE
15 years 1 months ago
Implementing and Optimizing Fine-Granular Lock Management for XML Document Trees
Fine-grained lock protocols, with lock modes and lock granules adjusted to the various XML processing models, allow for highly concurrent transaction processing on XML tre...
Sebastian Bächle, Theo Härder, Michael P...
ICIP
2001
IEEE
15 years 11 months ago
Restoration of images scanned from thick bound documents
Perspective distortion always occurs while scanning thick, bound documents. This distortion mainly causes two sources of degradation for the scanned grayscale image: i) shade alo...
Zheng Zhang 0003, Chew Lim Tan
ICIP
1999
IEEE
15 years 11 months ago
Color Documents on the Web with DJVU
We present a new image compression technique called "DjVu" that is specifically geared towards the compression of scanned documents in color at high resolution. With DjVu, a ma...
Bill Riemers, Léon Bottou, Pascal Vincent, ...