We introduce a new set of tools for working with web-scale N-gram data. These tools lower the barrier for working with web-scale text, and create a new platform for acquiring larg...
Dekang Lin, Kenneth Ward Church, Heng Ji, Satoshi ...
This paper describes our ongoing work on linking Korean word senses with the concepts of an ontology. We have few Korean wordnets which are linked to upper-level ontologies, altho...
This paper presents the work that has been carried out to annotate semantic roles in the Basque Dependency Treebank (BDT) (Aldezabal et al., 2009). In this paper we will present t...
This paper describes the development of tools for a semi-automated process for validation of treebank annotation at various levels. Consistency in treebank annotation is a must fo...
The paper presents an innovative approach to extract Slovene definition candidates from domain-specific corpora using morphosyntactic patterns, automatic terminology recognition a...