The problem of joint modeling the text and image components of multimedia documents is studied. The text component is represented as a sample from a hidden topic model, learned wi...
Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Cov...
In this paper we describe an image based document retrieval system which runs on camera enabled mobile devices. "Mobile Retriever" aims to seamlessly link physical and di...
Large collections of scanned documents (books and journals) are now available in Digital Libraries. The most common method for retrieving relevant information from these collectio...
The use of local features in computer vision has shown to be promising. Local features have several advantages including invariance to image transformations, independence of the ba...
—A method for locating mathematical expressions in document images without the use of optical character recognition is presented. An index of document regions is produced from re...