Sciweavers

CLIN
2001
13 years 6 months ago
Multi-feature Error Detection in Spoken Dialogue Systems
The present paper evaluates the role selected features and feature combinations play for error detection in spoken dialogue systems. We investigate the relevance of various, readi...
Piroska Lendvai, Antal van den Bosch, Emiel Krahme...
CLIN
2001
13 years 6 months ago
The Alpino Dependency Treebank
In this paper we present the Alpino Dependency Treebank and the tools that we have developed to facilitate the annotation process. Annotation typically starts with parsing a sente...
Leonoor van der Beek, Gosse Bouma, Rob Malouf, Ger...
CLIN
2001
13 years 6 months ago
Memory-Based Phoneme-to-Grapheme Conversion
In this paper, we describe a method to enhance the readability of out-of-vocabulary items (OOVs) in the textual output in a large vocabulary continuous speech recognition system. ...
Bart Decadt, Jacques Duchateau, Walter Daelemans, ...
CLIN
2001
13 years 6 months ago
Applying Monte Carlo Techniques to Language Identification
Two major stages in language identification systems can be identified: the language modeling stage, where the distinctive features of languages are determined and stored in...
Arjen Poutsma
CLIN
2001
13 years 6 months ago
A Named Entity Recognition System for Dutch
We describe a Named Entity Recognition system for Dutch that combines gazetteers, handcrafted rules, and machine learning on the basis of seed material. We used gazetteers and a c...
Fien De Meulder, Walter Daelemans, Véroniqu...
CLIN
2001
13 years 6 months ago
Creating a Dutch Information Retrieval Test Corpus
This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch te...
Djoerd Hiemstra, David van Leeuwen
CLIN
2001
13 years 6 months ago
Accurate Stemming of Dutch for Text Classification
This paper investigates the use of stemming for classification of Dutch (email) texts. We introduce a stemmer, which combines dictionary lookup (implemented efficiently as a finit...
Tanja Gaustad, Gosse Bouma
CLIN
2001
13 years 6 months ago
Corpus-based Acquisition of Collocational Prepositional Phrases
Collocational prepositional phrases like ten koste van (at the expense of), met het oog op (with an eye on), and onder het mom van (under the pretext of) are patterns of the form ...
Gosse Bouma, Begoña Villada
CLIN
2001
13 years 6 months ago
Tagging the Dutch PAROLE Corpus
We discuss the annotation with part of speech and lemma of the Dutch PAROLE Internet Corpus. The PAROLE PoS tagger is a combination of statistical taggers. It includes the Markov ...
Jesse de Does, John van der Voort van der Kleij
CLIN
2001
13 years 6 months ago
Performance Grammar: a Declarative Definition
In this paper we present a definition of Performance Grammar (PG), a psycholinguistically motivated syntax formalism, in declarative terms. PG aims not only at describing and expl...
Gerard Kempen, Karin Harbusch