There is a significant need to recognise the text in images on web pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper presents a...
In this paper a methodology for feature selection in unsupervised learning is proposed. It makes use of a multiobjective genetic algorithm where the minimization of the number of ...
A new system is presented for general symbol segmentation, which is applicable for segmentation of any connected string of symbols, including characters and line diagrams. Using a...
This paper addresses the problem of to what extent linear transformation can alleviate nonlinear distortion. We investigate a technique of global affine transformation (GAT) corre...
We turn to the viewpoint of users of a DAU system. Out of the view of users we sketch a picture of “Document Analysis and Understanding” (DAU), only a simple division of DAU i...