Simple Type-Level Unsupervised POS Tagging

Part-of-speech (POS) tag distributions are known to exhibit sparsity: a word is likely to take a single predominant tag in a corpus. Recent research has demonstrated that incorporating this sparsity constraint improves tagging accuracy. However, in existing systems, this expansion comes with a steep increase in model complexity. This paper proposes a simple and effective tagging method that directly models tag sparsity and other distributional properties of valid POS tag assignments. In addition, this formulation results in a dramatic reduction in the number of model parameters, thereby enabling unusually rapid training. Our experiments consistently demonstrate that this model architecture yields substantial performance gains over more complex tagging counterparts. On several languages, we report performance exceeding that of more complex state-of-the-art systems.
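The sparsity property mentioned in the abstract can be made concrete with a small sketch. The Python below is not the authors' model; it only measures, on a hand-made toy corpus, how many tokens would be tagged correctly if every word type were forced to take its single most frequent tag. The function name `one_tag_per_word_accuracy` and the toy data are assumptions introduced for illustration.

```python
from collections import Counter, defaultdict


def one_tag_per_word_accuracy(tagged_tokens):
    """Token accuracy when each word type always takes its most frequent tag.

    tagged_tokens: iterable of (word, tag) pairs from a gold-tagged corpus.
    Illustrative helper only; not part of the paper's method.
    """
    tag_counts = defaultdict(Counter)
    for word, tag in tagged_tokens:
        tag_counts[word.lower()][tag] += 1

    total = sum(sum(c.values()) for c in tag_counts.values())
    if total == 0:
        raise ValueError("corpus is empty")

    # Tokens covered by the predominant tag of their word type.
    covered = sum(max(c.values()) for c in tag_counts.values())
    return covered / total


# Toy, hand-made corpus (hypothetical data, not taken from the paper).
toy_corpus = [
    ("the", "DET"), ("dog", "NOUN"), ("can", "VERB"), ("run", "VERB"),
    ("the", "DET"), ("can", "NOUN"), ("is", "VERB"), ("red", "ADJ"),
    ("the", "DET"), ("dog", "NOUN"), ("can", "VERB"), ("bark", "VERB"),
]

accuracy = one_tag_per_word_accuracy(toy_corpus)
print(f"{accuracy:.3f}")
```

In this toy corpus only "can" is ambiguous, so a one-tag-per-word assignment loses a single token out of twelve. A value close to 1 on a real corpus would be consistent with the sparsity the abstract describes.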

Yoong Keok Lee, Aria Haghighi, Regina Barzilay

Effective Tagging Method | EMNLP 2010 | Natural Language Processing | Single Predominant Tag | Sparsity |

» Crouching Dirichlet Hidden Markov Model Unsupervised POS Tagging with Context Local Tag Ge...

» Unsupervised Lexical Acquisition for Part of Speech Tagging

» Latent-Descriptor Clustering for Unsupervised POS Induction

Post Info

Added	11 Feb 2011
Updated	11 Feb 2011
Type	Journal
Year	2010
Where	EMNLP
Authors	Yoong Keok Lee, Aria Haghighi, Regina Barzilay
