CORR
1999
Springer

Supervised Grammar Induction Using Training Data with Limited Constituent Information

Corpus-based grammar induction generally relies on hand-parsed training data to learn the structure of the language. Unfortunately, building large annotated corpora is prohibitively expensive. This work aims to improve the induction strategy when there are few labels in the training data. We show that the most informative linguistic constituents are the higher nodes in the parse trees, typically denoting complex noun phrases and sentential clauses. They account for only 20% of all constituents. For inducing grammars from sparsely labeled training data (e.g., only higher-level constituent labels), we propose an adaptation strategy that produces grammars which parse almost as well as grammars induced from fully labeled corpora. Our results suggest that for a partial parser to replace human annotators, it must be able to automatically extract higher-level constituents rather than base noun phrases.

Rebecca Hwa

Related Content

» Natural Language Grammar Induction Using a Constituent-Context Model

» Unsupervised grammar induction using history based approach

» Evolving natural language grammars without supervision

» Inducing Tree-Substitution Grammars

» Formalization of Link Farm Structure Using Graph Grammar

» Improving supervised learning performance by using fuzzy clustering method to select train...

» Using the Web to Reduce Data Sparseness in Pattern-Based Information Extraction

» Weakly supervised learning using proportion-based information An application to fisheries a...

» Exploiting active-learning strategies for annotating prosodic events with limited labeled d...

Post Info

Added	22 Dec 2010
Updated	22 Dec 2010
Type	Journal
Year	1999
Where	CORR
Authors	Rebecca Hwa