We report on the design and implementation of a system which automates the process of capturing structured documents from the optically recognized form of printed materials. The sy...
Logical entity recognition in heterogeneous collections of document page images remains a challenging problem since the performance of traditional supervised methods degrade drama...
The Stack algorithm, which is a best-first search algorithm widely used in speech recognition, is modified for application to the problem of recognizing machine printed text in th...
Scanned document images are nowadays becoming available in increasingly higher resolutions. Meanwhile, the variations in image quality within typical document collections increase...
Iuliu Konya Konya, Christoph Seibert, Stefan Eicke...
Abstract. This paper presents how Self-Organizing Maps and especially Kohonen maps can be applied to digital images of ancient collections in the perspective of valorization and di...