Sciweavers

Share
TAL
2010
Springer

Robust Semi-supervised and Ensemble-Based Methods in Word Sense Disambiguation

9 years 5 months ago
Robust Semi-supervised and Ensemble-Based Methods in Word Sense Disambiguation
Mihalcea [1] discusses self-training and co-training in the context of word sense disambiguation and shows that parameter optimization on individual words was important to obtain good results. Using smoothed co-training of a naive Bayes classifier she obtains a 9.8% error reduction on Senseval-2 data with a fixed parameter setting. In this paper we test a semi-supervised learning algorithm with no parameters, namely tri-training [2]. We also test the random subspace method [3] for building committees out of stable learners. Both techniques lead to significant error reductions with different learning algorithms, but improvements do not accumulate. Our best error reduction is 7.4%, and our best absolute average over Senseval-2 data, though not directly comparable, is 12% higher than the results reported in Mihalcea [1].
Anders Søgaard, Anders Johannsen
Added 30 Jan 2011
Updated 30 Jan 2011
Type Journal
Year 2010
Where TAL
Authors Anders Søgaard, Anders Johannsen
Comments (0)
books