In this paper we give an overview of a multi-ontology disambiguation method that aims to discover the intended meaning of words in unstructured web contexts. It receives an ambiguo...
We show that several previously proposed passage-based document ranking principles, along with some new ones, can be derived from the same probabilistic model. We use lan...
The objective of the emis project is the conception and realization of a multilingual information system on European media law with the following functionalities: search by words, ...
This paper proposes a chunking strategy to detect unknown words in Chinese word segmentation. First, a raw sentence is pre-segmented into a sequence of word atoms using a maximum...
In this paper we explore the use of parsimonious language models for web retrieval. These models are smaller and thus more efficient than standard language models, and are therefor...