Sciweavers

NLE
2010

Interlingual annotation of parallel text corpora: a new framework for annotation and evaluation

13 years 2 months ago
Interlingual annotation of parallel text corpora: a new framework for annotation and evaluation
This paper focuses on an important step in the creation of a system of meaning representation and the development of semantically-annotated parallel corpora, for use in applications such as machine translation, question answering, text summarization, and information retrieval. The work described below constitutes the first effort of any kind to annotate multiple translations of foreign-language texts with interlingual content. Three levels of representation are introduced: deep syntactic dependencies (IL0), intermediate semantic representations (IL1), and a normalized representation that unifies conversives, non-literal language, and paraphrase (IL2). The resulting annotated, multilingually-induced, parallel corpora will be useful as an empirical basis for a wide range of research, including the development and evaluation of interlingual NLP systems and paraphrase-extraction systems as well as a host of other research and development efforts in theoretical and applied linguistics,...
Bonnie J. Dorr, Rebecca J. Passonneau, David Farwe
Added 29 Jan 2011
Updated 29 Jan 2011
Type Journal
Year 2010
Where NLE
Authors Bonnie J. Dorr, Rebecca J. Passonneau, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard H. Hovy, Lori S. Levin, Keith J. Miller, Teruko Mitamura, Owen Rambow, Advaith Siddharthan
Comments (0)