An unsupervised part-of-speech (POS) tagging system that relies on graph clustering methods is described. Unlike in current state-of-the-art approaches, the kind and number of dif...
The Natural Language Toolkit is a suite of program modules, data sets and tutorials supporting research and teaching in computational linguistics and natural language processing. ...
In this paper we investigate how to automatically determine whether two document collections are written from different perspectives. By a perspective we mean a point of view, for examp...
The interpretation of temporal expressions in text is an important constituent of many practical natural language processing tasks, including question-answering, information...
We report initial results on the relatively novel task of automatic classification of author personality. Using a corpus of personal weblogs, or 'blogs', we investigate the a...