Document binarization has been an active research area for many years. There are many difficulties associated with satisfactory binarization of document images and especially in cases o...
Numerous approaches to detecting duplicate documents, including textual, structural, and feature-based ones, have been investigated. Considering document images are usually stored and transm...
In this article, we propose a special type of decision tree, called a decision cascade, for binarizing document images. Such images are produced by cameras, resulting in varying de...
With large databases of document images available, a method for users to find keywords in documents would be useful. One approach is to perform Optical Character Recognition (OCR) ...
We explore connections between digital libraries and interactive document image analysis. Digital libraries can provide useful data and metadata for research in automated document...