An Incremental Threshold Method for Continuous Text Search Queries

11 years 6 months ago
An Incremental Threshold Method for Continuous Text Search Queries
Abstract—A text filtering system monitors a stream of incoming documents, to identify those that match the interest profiles of its users. The user interests are registered at a server as continuous text search queries. The server constantly maintains for each query a ranked result list, comprising the recent documents (drawn from a sliding window) with the highest similarity to the query. Such a system underlies many text monitoring applications that need to cope with heavy document traffic, such as news and email monitoring. In this paper, we propose the first solution for processing continuous text queries efficiently. Our objective is to support a large number of user queries while sustaining high document arrival rates. Our solution indexes the streamed documents with a structure based on the principles of the inverted file, and processes document arrival and expiration events with an incremental threshold-based method. Using a stream of real documents, we experimentally v...
Kyriakos Mouratidis, HweeHwa Pang
Added 19 May 2010
Updated 19 May 2010
Type Conference
Year 2009
Where ICDE
Authors Kyriakos Mouratidis, HweeHwa Pang
Comments (0)