Sciweavers

8 search results - page 1 / 2
» From D-Coi to SoNaR: a reference corpus for Dutch
Sort
View
LREC
2008
131views Education» more  LREC 2008»
13 years 6 months ago
From D-Coi to SoNaR: a reference corpus for Dutch
The computational linguistics community in The Netherlands and Belgium has long recognized the dire need for a major reference corpus of written Dutch. In part to answer this need...
Nelleke Oostdijk, Martin Reynaert, Paola Monachesi...
LREC
2010
147views Education» more  LREC 2010»
13 years 6 months ago
Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch
This paper reports on the annotation of a corpus of 1 million words with four semantic annotation layers, including named entities, coreference relations, semantic roles and spati...
Ineke Schuurman, Véronique Hoste, Paola Mon...
LREC
2010
168views Education» more  LREC 2010»
13 years 6 months ago
Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus
In The Low Countries, a major reference corpus for written Dutch is currently being built. In this paper, we discuss the interplay between data acquisition and data processing dur...
Martin Reynaert, Nelleke Oostdijk, Orphée D...
LREC
2010
209views Education» more  LREC 2010»
13 years 6 months ago
Towards a Balanced Named Entity Corpus for Dutch
This paper introduces a new named entity corpus for Dutch. State-of-the-art named entity recognition systems require a substantial annotated corpus to be trained on. Such corpora ...
Bart Desmet, Véronique Hoste
LREC
2010
200views Education» more  LREC 2010»
13 years 6 months ago
The D-TUNA Corpus: A Dutch Dataset for the Evaluation of Referring Expression Generation Algorithms
In this paper, we present the D-TUNA corpus, which is the first semantically annotated corpus of referring expressions in Dutch. Its primary function is to evaluate and improve th...
Ruud Koolen, Emiel Krahmer