INLG
2010
Springer

Extracting Parallel Fragments from Comparable Corpora for Data-to-text Generation

Building NLG systems, in particular statistical ones, requires parallel data (paired inputs and outputs) which do not generally occur naturally. In this paper, we investigate the idea of automatically extracting parallel resources for data-to-text generation from comparable corpora obtained from the Web. We describe our comparable corpus of data and texts relating to British hills and the techniques for extracting paired input/output fragments we have developed so far.
Anja Belz, Eric Kow
Added: 13 Feb 2011
Updated: 13 Feb 2011
Type: Journal
Year: 2010
Where: INLG
Authors: Anja Belz, Eric Kow