Sciweavers

COLING
2010

Acquisition of Unknown Word Paradigms for Large-Scale Grammars

12 years 11 months ago
Acquisition of Unknown Word Paradigms for Large-Scale Grammars
Unknown words are a major issue for large-scale grammars of natural language. We propose a machine learning based algorithm for acquiring lexical entries for all forms in the paradigm of a given unknown word. The main advantages of our method are the usage of word paradigms to obtain valuable morphological knowledge, the consideration of different contexts which the unknown word and all members of its paradigm occur in and the employment of a full-blown syntactic parser and the grammar we want to improve to analyse these contexts and provide elaborate syntactic constraints. We test our algorithm on a large-scale grammar of Dutch and show that its application leads to an improved parsing accuracy.
Kostadin Cholakov, Gertjan van Noord
Added 13 May 2011
Updated 13 May 2011
Type Journal
Year 2010
Where COLING
Authors Kostadin Cholakov, Gertjan van Noord
Comments (0)