This paper describes features and methods for document image comparison and classification at the spatial layout level. The methods are useful for visual similarity based document...
Jianying Hu, Ramanujan S. Kashi, Gordon T. Wilfong
Testing often accounts for more than 50% of the required effort during system development. However, testing is often not well integrated with other development phases. One reason ...
How to merge and organise query results retrieved from different resources is one of the key issues in distributed information retrieval. Some previous research and experiments su...
In this paper, we show how we can learn to select good words for a document title. We view the problem of selecting good title words for a document as a variant of an Information ...
In this paper, we propose a method of text retrieval from document images using a similarity measure based on an N-Gram algorithm. We directly extract image features instead of us...