Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

12

EMNLP
2006

favoriteEmaildiscussreport

84views Natural Language Processing» more EMNLP 2006»

Better Informed Training of Latent Syntactic Features

13 years 6 months ago

Better Informed Training of Latent Syntactic Features

Download www.clsp.jhu.edu

We study unsupervised methods for learning refinements of the nonterminals in a treebank. Following Matsuzaki et al. (2005) and Prescher (2005), we may for example split NP without supervision into NP[0] and NP[1], which behave differently. We first propose to learn a PCFG that adds such features to nonterminals in such a way that they respect patterns of linguistic feature passing: each node's nonterminal features are either identical to, or independent of, those of its parent. This linguistic constraint reduces runtime and the number of parameters to be learned. However, it did not yield improvements when training on the Penn Treebank. An orthogonal strategy was more successful: to improve the performance of the EM learner by treebank preprocessing and by annealing methods that split nonterminals selectively. Using these methods, we can maintain high parsing accuracy while dramatically reducing the model size.

Markus Dreyer, Jason Eisner

Real-time Traffic

Constraint Reduces Runtime | EMNLP 2006 | EMNLP 2007 | Example Split Np | Unsupervised Methods |

claim paper

Related Content

» Two Languages are Better than One for Syntactic Parsing

» Building a better probabilistic model of images by factorization

» Transmembrane helix prediction using amino acid property features and latent semantic anal...

» Syntactic features in question answering

» Address standardization with latent semantic association

» An analysis of open information extraction based on semantic role labeling

» A discriminative method for protein remote homology detection and fold recognition combini...

» Sparse MultiScale Grammars for Discriminative Latent Variable Parsing

» Sentence Simplification for Semantic Role Labeling

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2006
Where	EMNLP
Authors	Markus Dreyer, Jason Eisner

Comments (0)