Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

103

ACL
2011

favoriteEmaildiscussreport

187views Computational Linguistics» more ACL 2011»

Collecting Highly Parallel Data for Paraphrase Evaluation

14 years 5 months ago

Collecting Highly Parallel Data for Paraphrase Evaluation

Download www.cs.utexas.edu

A lack of standard datasets and evaluation metrics has prevented the ﬁeld of paraphrasing from making the kind of rapid progress enjoyed by the machine translation community over the last 15 years. We address both problems by presenting a novel data collection framework that produces highly parallel text data relatively inexpensively and on a large scale. The highly parallel nature of this data allows us to use simple n-gram comparisons to measure both the semantic adequacy and lexical dissimilarity of paraphrase candidates. In addition to being simple and efﬁcient to compute, experiments show that these metrics correlate highly with human judgments.

David Chen, William B. Dolan

Real-time Traffic

ACL 2011 | Computational Linguistics | Evaluation Metrics | Human Judgments | Translation Community |

claim paper

Related Content

» Extracting Paraphrases of Technical Terms from Noisy Parallel Software Corpora

» Linguistic Steganography Using Automatically Generated Paraphrases

» Aligning Needles in a Haystack Paraphrase Acquisition Across the Web

» Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora

» PEM A Paraphrase Evaluation Metric Exploiting Parallel Texts

» Learning Sentential Paraphrases from Bilingual Parallel Corpora for TexttoText Generation

» Tracing garbage collection on highly parallel platforms

» TACOExploiting Cluster Networks for HighLevel Collective Operations

» Collecting Sensor Data for HighPerformance Computing A Casestudy

Post Info
More Details (n/a)

Added	23 Aug 2011
Updated	23 Aug 2011
Type	Journal
Year	2011
Where	ACL
Authors	David Chen, William B. Dolan

Comments (0)