Paragraph Acquisition and Selection for List Question Using Amazon's Mechanical Turk

15 years 5 months ago

Download www.lsv.uni-saarland.de

Creating more fine-grained annotated data than previously relevent document sets is important for evaluating individual components in automatic question answering systems. In this paper, we describe using the Amazon's Mechanical Turk (AMT) to judge whether paragraphs in relevant documents answer corresponding list questions in TREC QA track 2004. Based on AMT results, we build a collection of 1300 gold-standard supporting paragraphs for list questions. Our online experiments suggested that recruiting more people per task assures better annotation quality. In order to learning true labels from AMT annotations, we investigated the influence of annotation accuracy and number of labels per HIT on the performance of those approaches. Experimental studies show that the Naive Bayesian model and EM-based GLAD model can generate results highly agreeing with gold-standard annotations, and dominate significantly over the majority voting method for true label learning. We also suggested sett...

Fang Xu, Dietrich Klakow

Real-time Traffic

Annotation Quality | Better Online Annotation | Education | List Questions | LREC 2010 |

claim paper

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Fang Xu, Dietrich Klakow

Sciweavers

Paragraph Acquisition and Selection for List Question Using Amazon's Mechanical Turk

Annotation Quality | Better Online Annotation | Education | List Questions | LREC 2010 |

Explore & Download

Productivity Tools

Sciweavers