LREC
2008

Induction of Treebank-Aligned Lexical Resources

We describe the induction, from unannotated corpora, of lexical resources that are aligned with treebank grammars, so that features in the lexical resource correspond systematically to features in a treebank syntactic resource. We first describe a methodology, based on parsing technology, for augmenting a treebank database with linguistic features. A PCFG containing these features is then created from the augmented treebank. Next, we use a procedure based on the inside-outside algorithm to learn lexical resources aligned with the treebank PCFG from large unannotated corpora. The method has been applied in creating a feature-annotated English treebank based on the
Tejaswini Deoskar, Mats Rooth
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where LREC
Authors Tejaswini Deoskar, Mats Rooth