
Example Selection for Bootstrapping Statistical Parsers

This paper investigates bootstrapping for statistical parsers to reduce their reliance on manually annotated training data. We consider both a mostly-unsupervised approach, co-training, in which two parsers are iteratively re-trained on each other’s output; and a semi-supervised approach, corrected co-training, in which a human corrects each parser’s output before adding it to the training data. The selection of labeled training examples is an integral part of both frameworks. We propose several selection methods based on the criteria of minimizing errors in the data and maximizing training utility. We show that incorporating the utility criterion into the selection method results in better parsers for both frameworks.
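The abstract describes selecting teacher-labeled examples by two criteria: minimizing labeling errors and maximizing training utility. A minimal sketch of one such selection round, assuming hypothetical parser objects with `parse` and `score` methods (the class and score values below are illustrative stand-ins, not the paper's actual parsers or scoring functions):

```python
# Sketch of one co-training selection round. StubParser is a toy
# stand-in: real co-training would use two statistical parsers.

class StubParser:
    """Toy parser: returns a dummy parse and a fixed per-sentence confidence."""
    def __init__(self, conf):
        self.conf = conf  # maps sentence -> confidence in [0, 1]

    def parse(self, sentence):
        # Dummy "parse": tag every word with a placeholder label.
        return [(word, "X") for word in sentence.split()]

    def score(self, sentence, tree):
        return self.conf.get(sentence, 0.0)

def select_examples(teacher, student, unlabeled, top_k=2):
    """Pick teacher-labeled sentences most useful for retraining the student.

    Mirrors the abstract's two criteria: high teacher confidence
    (minimize errors in the added data) and low student confidence
    (maximize training utility, since the student learns most from
    examples it currently handles poorly).
    """
    scored = []
    for sentence in unlabeled:
        tree = teacher.parse(sentence)  # teacher labels the example
        utility = teacher.score(sentence, tree) - student.score(sentence, tree)
        scored.append((utility, sentence, tree))
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(sentence, tree) for _, sentence, tree in scored[:top_k]]
```

In corrected co-training, a human would fix each selected parse before it enters the training data; in pure co-training, the selected parses are added as-is and the roles of teacher and student are then swapped.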
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2003
Where NAACL
Authors Mark Steedman, Rebecca Hwa, Stephen Clark, Miles Osborne, Anoop Sarkar, Julia Hockenmaier, Paul Ruhlen, Steven Baker, Jeremiah Crim