Structured document retrieval makes use of document components as the basis of the retrieval process, rather than complete documents. The inherent relationships between these comp...
Jane Reid, Mounia Lalmas, Karen Finesilver, Morten...
In this paper, we propose a word shape recognition method for retrieving image-based documents. Document images are segmented at the word level first. Then the proposed method det...
This paper describes a document retrieval system called CAIRN that uses a case-based reasoning set using a large lexicon to automatically generate a case index to that document se...
There is a growing need to access historical Ottoman documents stored in large archives and therefore managing tools for automatic searching, indexing and transcription of these d...
This paper reports a document retrieval technique that retrieves machine-printed Latin-based document images through word shape coding. Adopting the idea of image annotation, a wo...