Sciweavers

ACL
2004

Experiments in parallel-text based grammar induction

13 years 5 months ago
Experiments in parallel-text based grammar induction
This paper discusses the use of statistical word alignment over multiple parallel texts for the identification of string spans that cannot be constituents in one of the languages. This information is exploited in monolingual PCFG grammar induction for that language, within an augmented version of the inside-outside algorithm. Besides the aligned corpus, no other resources are required. We discuss an implemented system and present experimental results with an evaluation against the Penn Treebank.
Jonas Kuhn
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2004
Where ACL
Authors Jonas Kuhn
Comments (0)