Sciweavers

DAS
2006
Springer

Retrieval from Document Image Collections

Abstract. This paper presents a system for retrieving relevant documents from large document image collections. We achieve effective search and retrieval from a large collection of printed document images by matching image features at the word level. Words are represented using profile-based and shape-based features. A novel DTW-based partial matching scheme handles morphological variants of words, which is useful for grouping similar words together during indexing. The system supports cross-lingual search using OM-Trans transliteration and a dictionary-based approach. System-level issues for retrieval (e.g., scalability and effective delivery) are also addressed in this paper.
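
The word-level matching mentioned in the abstract can be illustrated with a minimal sketch of classic dynamic time warping (DTW) over per-column feature vectors of two word images. This is the textbook algorithm, not the paper's novel partial matching scheme. The function name `dtw_distance`, the `open_end` option (letting the query match only a prefix of the candidate, as one simple way to tolerate differing suffixes), and the length normalization are all assumptions made for illustration.

```python
import math


def dtw_distance(query, word, open_end=False):
    """Textbook DTW between two sequences of equal-dimension feature vectors.

    query, word: lists of feature vectors (e.g. one profile vector per image column).
    open_end:    hypothetical option; if True, the query may align with any
                 prefix of `word` instead of the whole sequence.
    Returns the accumulated cost normalized by the combined aligned lengths.
    """
    n, m = len(query), len(word)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")

    # D[i][j] = minimal cost of aligning query[:i] with word[:j]
    D = [[math.inf] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = math.dist(query[i - 1], word[j - 1])  # Euclidean distance
            D[i][j] = cost + min(D[i - 1][j],      # query advances alone
                                 D[i][j - 1],      # word advances alone
                                 D[i - 1][j - 1])  # both advance

    if open_end:
        # Best alignment of the full query against any prefix of `word`.
        return min(D[n][j] / (n + j) for j in range(1, m + 1))
    return D[n][m] / (n + m)
```

With `open_end=False`, a stretched or compressed rendering of the same word still scores close to zero because the warping path absorbs repeated columns. With `open_end=True`, a short query can score well against a longer word that begins with the same columns, which loosely mimics matching a stem against an inflected form.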
A. Balasubramanian, Million Meshesha, C. V. Jawahar
Added 22 Aug 2010
Updated 22 Aug 2010
Type Conference
Year 2006
Where DAS
Authors A. Balasubramanian, Million Meshesha, C. V. Jawahar