For a very long time, it has been considered that the only way of automatically extracting similar groups of words from a text collection for which no semantic information exists ...
Of the ten million words of contemporary standard Dutch in the Spoken Dutch Corpus (Corpus Gesproken Nederlands, CGN), a selection of one million words of natural spoken language ...
Heleen Hoekstra, Michael Moortgat, Ineke Schuurman...
A language-independent framework for syntactic finlte-state parsing is discussed. The article presents a framework, a formalism, a compiler and a parser for grammars written in th...
Kimmo Koskenniemi, Pasi Tapanainen, Atro Voutilain...
This paper investigates syntactic and sub-lexical features in Turkish discriminative language models (DLMs). DLM is a featurebased language modeling approach. It reranks the ASR o...
Ebru Arisoy, Murat Saraclar, Brian Roark, Izhak Sh...
In this paper, a possible worlds framework for representing general belief change operators is presented. In common with many approaches, an agent’s set of beliefs are specifie...