Extracting titles from a PDF's full text is an important task in information retrieval to identify PDFs. Existing approaches apply complicated and expensive (in terms of calculating...
This research-in-progress paper presents a new approach called Link Proximity Analysis (LPA) for identifying related web pages based on link analysis. In contrast to current techni...
This poster will outline the Digitisation Strategy Toolkit created as part of the LIFE-SHARE project. The toolkit is based on the lifecycle model created by the LIFE project and ex...
Beccy Shipman, Matthew Herring, Ned Potter, Bo Mid...
The PUMA project fosters the Open Access movement and aims at better support of the researcher's publication work. PUMA stands for an integrated solution, where the upload o...
Abstract. Today, digital libraries increasingly have to rely on semantic techniques during the workflows of metadata generation, search and navigational access. But, due to the st...
Abstract. Digital Library support for textual and certain types of nontextual documents has significantly advanced in recent years. While Digital Library support implies many a...
Abstract. We consider how the construction of multi-structured documents implies the definition of structuration vocabularies. In a multi-user context, the growth of these vocabula...
Abstract. Searching for entities is an emerging task in Information Retrieval for which the goal is finding well-defined entities instead of documents matching the query terms. In t...
Bodo Billerbeck, Gianluca Demartini, Claudiu S. Fi...
We present results of the INEX 2009 Interactive Track which focussed on how users behave in interactive search systems. Three types of working tasks based on a collection of book m...
Thomas Beckers, Norbert Fuhr, Nils Pharo, Ragnar N...