Collecting a Why-Question Corpus for Development and Evaluation of an Automatic QA-System

13 years 5 months ago

Download aclweb.org

Question answering research has only recently started to spread from short factoid questions to more complex ones. One significant challenge is the evaluation: manual evaluation is a difficult, time-consuming process and not applicable within efficient development of systems. Automatic evaluation requires a corpus of questions and answers, a definition of what is a correct answer, and a way to compare the correct answers to automatic answers produced by a system. For this purpose we present a Wikipedia-based corpus of Whyquestions and corresponding answers and articles. The corpus was built by a novel method: paid participants were contacted through a Web-interface, a procedure which allowed dynamic, fast and inexpensive development of data collection methods. Each question in the corpus has several corresponding, partly overlapping answers, which is an asset when estimating the correctness of answers. In addition, the corpus contains information related to the corpus collection proce...

Joanna Mrozinski, Edward W. D. Whittaker, Sadaoki

Real-time Traffic

ACL 2008 | Computational Linguistics | Correct Answers | Question Answering Research | Short Factoid Questions |

claim paper

» Automatic question answering using the web Beyond the Factoid

» A Corpus for Studying Full Answer Justification

» Testing the Reasoning for Question Answering Validation

» Corpus and Evaluation Measures for Automatic Plagiarism Detection

» TimeEfficient Creation of an Accurate Sentence Fusion Corpus

» Building a Greek corpus for Textual Entailment

» Design and Data Collection for Spoken Polish Dialogs Database

» Test Collections for Spoken Document Retrieval from Lecture Audio Data

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	ACL
Authors	Joanna Mrozinski, Edward W. D. Whittaker, Sadaoki Furui

Comments (0)

Sciweavers

Collecting a Why-Question Corpus for Development and Evaluation of an Automatic QA-System

ACL 2008 | Computational Linguistics | Correct Answers | Question Answering Research | Short Factoid Questions |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers