This paper proposes a method for automatically inserting commas into Japanese texts. In Japanese sentences, commas play an important role in explicitly separating the constituents...
In this paper we compare two methods for the automatic identification of geographical articles in encyclopedic resources such as Wikipedia. The methods are a WordNet-based method ...
An empirical study has been conducted investigating the relationship between the performance of a generative language model in terms of perplexity and the corresponding informatio...
Leif Azzopardi, Mark Girolami, Keith van Rijsberge...
Capitalizing on the intuitive underlying assumptions of Language Modelling for Ad-Hoc Retrieval we present a novel approach that is capable of injecting the user’s context of th...
Leif Azzopardi, Mark Girolami, Cornelis Joost van ...
In this paper, we propose a new application of Bayesian language model based on Pitman-Yor process for information retrieval. This model is a generalization of the Dirichlet distr...