This paper describes how to design a digital book for a new science called `Knowledge Science'. We prepare several types of navigation facilities for browsing the book. Speci...
Performance of n-gram language models depends to a large extent on the amount of training text material available for building the models and the degree to which this text matches...
Accentological corpus provides a researcher an opportunity to study word stress and stress variation, which are very important for the Russian language. Moreover, Accentological c...
Abstract. One major goal of text mining is to provide automatic methods to help humans grasp the key ideas in ever-increasing text corpora. To this effect, we propose a statistica...
This paper studies the impact of written language variations and the way it affects the capitalization task over time. A discriminative approach, based on maximum entropy models, ...