In this paper we describe the current state of a new Japanese lexical resource: the Hinoki treebank. The treebank is built from dictionary definitions, examples and news text, and ...
Detecting idioms in a sentence is important to sentence understanding. This paper discusses the linguistic knowledge for idiom detection. The challenges are that idioms can be ambi...
In a 12-month project we have developed a new, register-diverse, 55-million-word bilingual corpus--the New Corpus for Ireland (NCI)--to support the creation of a new English-to-Iri...
Users need more sophisticated tools to handle the growing number of image-based documents available in databases. In this paper, we present a system devoted to the editing and bro...
In this paper the transcription and evaluation of the corpus DIMEx100 for Mexican Spanish is presented. First we describe the corpus and explain the linguistic and computational mo...
Luis Alberto Pineda, Hayde Castellanos, Javier Cu&...