Sciweavers

LREC
2008
108views Education» more  LREC 2008»
13 years 7 months ago
Czech MWE Database
In this paper we deal with a recently developed large Czech MWE database containing at the moment 160 000 MWEs (treated as lexical units). It was compiled from various resources s...
Karel Pala, Lukás Svoboda, Pavel Smerk
LREC
2008
129views Education» more  LREC 2008»
13 years 7 months ago
Named Entity WordNet
This paper presents the automatic extension of Princeton WordNet with Named Entities (NEs). This new resource is called Named Entity WordNet. Our method maps the noun is-a hierarc...
Antonio Toral, Rafael Muñoz, Monica Monachi...
LREC
2008
125views Education» more  LREC 2008»
13 years 7 months ago
Verb-Noun Collocation SyntLex Dictionary: Corpus-Based Approach
The project presented here is a part of a long term research program aiming at a full lexicon grammar for Polish (SyntLex). The main concern of this project is computer-assisted a...
Grazyna Vetulani, Zygmunt Vetulani, Tomasz Obr&eci...
LREC
2008
125views Education» more  LREC 2008»
13 years 7 months ago
Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic)
Linguists have long been producing grammatical decriptions of yet undescribed languages. This is a time-consuming process, which has already adapted to improved technology for rec...
Harald Hammarström, Christina Thornell, Malin...
LREC
2008
100views Education» more  LREC 2008»
13 years 7 months ago
Evaluating the Relationship between Linguistic and Geographic Distances using a 3D Visualization
In this paper we discuss how linguistic and geographic distances can be related using a 3D visualization. We will convert linguistic data for locations along the German-Dutch bord...
Folkert de Vriend, Jan Pieter Kunst, Louis ten Bos...
LREC
2008
94views Education» more  LREC 2008»
13 years 7 months ago
Strengthening the Estonian Language Technology
The paper will give an overview of developments in Estonia in the field of Human Language Technologies. Despite of the fact that Estonian is one of the smallest official languages...
Einar Meister, Jaak Vilo
LREC
2008
139views Education» more  LREC 2008»
13 years 7 months ago
Identification of Comparable Argument-Head Relations in Parallel Corpora
We present the machine learning framework that we are developing, in order to support explorative search for non-trivial linguistic configurations in low-density languages (langua...
Kathrin Spreyer, Jonas Kuhn, Bettina Schrader
LREC
2008
126views Education» more  LREC 2008»
13 years 7 months ago
Morphosyntactic Resources for Automatic Speech Recognition
Texts generated by automatic speech recognition (ASR) systems have some specificities, related to the idiosyncrasies of oral productions or the principles of ASR systems, that mak...
Stéphane Huet, Guillaume Gravier, Pascale S...
LREC
2008
123views Education» more  LREC 2008»
13 years 7 months ago
Designing and Evaluating a Russian Tagset
This paper reports the principles behind designing a tagset to cover Russian morphosyntactic phenomena, modifications of the core tagset, and its evaluation. The tagset and associ...
Serge Sharoff, Mikhail Kopotev, Tomaz Erjavec, Ann...
LREC
2008
87views Education» more  LREC 2008»
13 years 7 months ago
Is this NE tagger getting old?
This paper focuses on the influence of changing the text time frame on the performance of a named entity tagger. We followed a twofold approach to investigate this subject: on the...
Cristina Mota, Ralph Grishman