Topic Models for Word Sense Disambiguation and Token-Based Idiom Detection

15 years 2 months ago

Download www.coli.uni-saarland.de

This paper presents a probabilistic model for sense disambiguation which chooses the best sense based on the conditional probability of sense paraphrases given a context. We use a topic model to decompose this conditional probability into two conditional probabilities with latent variables. We propose three different instantiations of the model for solving sense disambiguation problems with different degrees of resource availability. The proposed models are tested on three different tasks: coarse-grained word sense disambiguation, fine-grained word sense disambiguation, and detection of literal vs. nonliteral usages of potentially idiomatic expressions. In all three cases, we outperform state-of-the-art systems either quantitatively or statistically significantly.

Linlin Li, Benjamin Roth, Caroline Sporleder

Real-time Traffic

ACL 2010 | Computational Linguistics | Conditional Probability | Sense Disambiguation | Word Sense Disambiguation |

claim paper

Post Info
More Details (n/a)

Added	10 Feb 2011
Updated	10 Feb 2011
Type	Journal
Year	2010
Where	ACL
Authors	Linlin Li, Benjamin Roth, Caroline Sporleder

Comments (0)

Sciweavers

Topic Models for Word Sense Disambiguation and Token-Based Idiom Detection

ACL 2010 | Computational Linguistics | Conditional Probability | Sense Disambiguation | Word Sense Disambiguation |

Explore & Download

Productivity Tools

Sciweavers