Harvesting for Full-Text Retrieval

15 years 10 months ago

Download www.cis.strath.ac.uk

Abstract. We propose an approach to Distributed Information Retrieval based on the periodic and incremental centralisation of full-text indices of widely dispersed and autonomously managed content sources. Inspired by the success of the Open Archive Initiative’s protocol for metadata harvesting, the approach occupies middle ground between: (i) the crawling of content, and (ii) the distribution of retrieval. As in crawling, some data moves towards the retrieval process, but it is statistics about the content rather than content itself. As in distributed retrieval, some processing is distributed along with the data, but it is indexing rather than retrieval itself. We show that the approach retains the good properties of centralised retrieval without renouncing to cost-eﬀective resource pooling. We discuss the requirements associated with the approach and identify two strategies to deploy it on top of the OAI infrastructure.

Fabio Simeoni, Murat Yakici, Steve Neely, Fabio Cr

Real-time Traffic

ICADL 2005 | Information Retrieval | Open Archive Initiative | Retrieval Process |

claim paper

» Fast Incremental Indexing for FullText Information Retrieval

» Supporting FullText Information Retrieval with a Persistent Object Store

» Is a Morphologically Complex Language Really that Complex in FullText Retrieval

» TileBars Visualization of Term Distribution Information in Full Text Information Access

» Tree patterns with Full Text Search

» Towards Topic Driven Access to Full Text Documents

» On Building a FullText Digital Library of Historical Documents

» FullText and Structural XML Indexing on BTree

Post Info
More Details (n/a)

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ICADL
Authors	Fabio Simeoni, Murat Yakici, Steve Neely, Fabio Crestani

Comments (0)

Sciweavers

Harvesting for Full-Text Retrieval

ICADL 2005 | Information Retrieval | Open Archive Initiative | Retrieval Process |

Explore & Download

Productivity Tools

Sciweavers