Sciweavers

1261 search results - page 166 / 253
» Extracting Text from PostScript
Sort
View
EACL
2006
ACL Anthology
14 years 11 months ago
The GOD model
GOD (General Ontology Discovery) is an unsupervised system to extract semantic relations among domain specific entities and concepts from texts. Operationally, it acts as a search...
Alfio Massimiliano Gliozzo
PAAMS
2010
Springer
14 years 7 months ago
A Case Study on Grammatical-Based Representation for Regular Expression Evolution
Abstract. Regular expressions, or simply regex, have been widely used as a powerful pattern matching and text extractor tool through decades. Although they provide a powerful and f...
Antonio González-Pardo, David F. Barrero, D...
NAACL
2003
14 years 11 months ago
A Generative Probabilistic OCR Model for NLP Applications
In this paper, we introduce a generative probabilistic optical character recognition (OCR) model that describes an end-to-end process in the noisy channel framework, progressing f...
Okan Kolak, William J. Byrne, Philip Resnik
ICIP
1999
IEEE
15 years 11 months ago
Automatic Caption Localization in Compressed Video
?We present a method to automatically localize captions in JPEG compressed images and the I-frames of MPEG compressed videos. Caption text regions are segmented from background ima...
Yu Zhong, HongJiang Zhang, Anil K. Jain
RIAO
2000
14 years 11 months ago
Combining linguistic and spatial information for document analysis
We present a framework to analyze color documents of complex layout. In addition, no assumption is made on the layout. Our framework combines in a content-driven bottom-up approac...
Marco Aiello, Christof Monz, Leon Todoran