Sciweavers

CLIN
2000
13 years 6 months ago
Syntactic Annotation for the Spoken Dutch Corpus Project (CGN)
Of the ten million words of contemporary standard Dutch in the Spoken Dutch Corpus (Corpus Gesproken Nederlands, CGN), a selection of one million words of natural spoken language ...
Heleen Hoekstra, Michael Moortgat, Ineke Schuurman...
EACL
2003
ACL Anthology
13 years 6 months ago
Learning to Identify Fragmented Words in Spoken Discourse
Disfluent speech adds to the difficulty of processing spoken language utterances. In this paper we concentrate on identifying one disfluency phenomenon: fragmented words. Our d...
Piroska Lendvai
CLIN
2003
13 years 6 months ago
A Memory-Based Shallow Parser for Spoken Dutch
We describe the development of a Dutch memory-based shallow parser. The availability of large treebanks for Dutch, such as the one provided by the Spoken Dutch Corpus, allows memo...
Sander Canisius, Antal van den Bosch
CLIN
2003
13 years 6 months ago
Reduction of Dutch Sentences for Automatic Subtitling
We compare machine learning approaches for sentence length reduction for automatic generation of subtitles for deaf and hearing-impaired people with a method which relies on hand-...
Erik F. Tjong Kim Sang, Walter Daelemans, Anja H&o...