Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multimedia collections in the near future. This paper...
In this paper we report our research on building WebSail { an intelligent web search engine that is able to perform real-time adaptive learning. WebSail learns from the user'...
Zhixiang Chen, Xiannong Meng, Binhai Zhu, Richard ...
This paper describes research to enhance the integration between digital models and the services provided by the document management systems of digital libraries. Processing techn...
The Web is a valuable source of language speci c resources but the process of collecting, organizing and utilizing these resources is di cult. We describe CorpusBuilder, an approa...
EuroGOV is a multilingual web corpus that was created to serve as the document collection for WebCLEF, the CLEF 2005 web retrieval task. EuroGOV is a collection of web pages crawl...