NAACL
2004


Name Tagging with Word Clusters and Discriminative Training


Download www.aclweb.org

We present a technique for augmenting annotated training data with hierarchical word clusters that are automatically derived from a large unannotated corpus. Cluster membership is encoded in features that are incorporated in a discriminatively trained tagging model. Active learning is used to select training examples. We evaluate the technique for named-entity tagging. Compared with a state-of-the-art HMM-based name finder, the presented technique requires only 13% as much annotated data to achieve the same level of performance. Given a large annotated training set of 1,000,000 words, the technique achieves a 25% reduction in error over the state-of-the-art HMM trained on the same material.
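As a rough illustration of how cluster membership might be encoded as features, the sketch below assumes each word maps to a bit string giving its path in a binary cluster hierarchy, and emits one feature per path prefix so that words sharing a coarse cluster also share features. The cluster table, bit strings, prefix lengths, and feature names are all hypothetical choices made for this example; they are not taken from the paper.

```python
from typing import Dict, List, Sequence

# Hypothetical cluster table: word -> bit-string path in a binary cluster
# hierarchy. In practice this would be derived from a large unannotated corpus.
CLUSTER_PATHS: Dict[str, str] = {
    "boston": "1011001110",
    "chicago": "1011001111",
    "monday": "0110100011",
}

# Assumed prefix lengths; shorter prefixes correspond to coarser clusters.
PREFIX_LENGTHS: Sequence[int] = (4, 8)


def cluster_features(
    word: str,
    paths: Dict[str, str] = CLUSTER_PATHS,
    prefix_lengths: Sequence[int] = PREFIX_LENGTHS,
) -> List[str]:
    """Return string-valued features describing the word's cluster membership."""
    path = paths.get(word.lower())
    if path is None:
        # Word was never seen during clustering.
        return ["cluster=UNK"]
    feats = []
    for n in prefix_lengths:
        if len(path) >= n:
            feats.append(f"cluster[:{n}]={path[:n]}")
    # The full path identifies the finest-grained cluster.
    feats.append(f"cluster={path}")
    return feats


if __name__ == "__main__":
    for w in ("Boston", "Chicago", "Monday", "Zurich"):
        print(w, cluster_features(w))
```

In this toy table, "Boston" and "Chicago" share both prefix features and differ only in the full-path feature, which is the kind of generalization a discriminatively trained tagger could exploit when one of the two words is rare in the annotated data.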

Scott Miller, Jethran Guinness, Alex Zamanian



Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	NAACL
Authors	Scott Miller, Jethran Guinness, Alex Zamanian
