Sciweavers

EMNLP
2008

Part-of-Speech Tagging for English-Spanish Code-Switched Text

Code-switching is an interesting linguistic phenomenon commonly observed in highly bilingual communities. It consists of mixing languages within the same conversational event. This paper presents results on Part-of-Speech tagging of Spanish-English code-switched discourse. We explore different approaches to exploiting existing resources for both languages, ranging from simple heuristics, to language identification, to machine learning. The best results are achieved by training a machine learning algorithm with features that combine the output of an English and a Spanish Part-of-Speech tagger.
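The best-performing idea in the abstract, combining the output of two monolingual taggers into features for a learner, can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the abstract does not name the learning algorithm, the feature set, or the tagsets, so the toy tags (`DET`, `NOUN`, `VERB`, `X`), the invented training examples, the `combine_features` helper, and the memorizing `TagPairClassifier` are all hypothetical stand-ins rather than the authors' method.

```python
from collections import Counter, defaultdict


def combine_features(word, en_tag, es_tag):
    """Build one feature tuple from the two monolingual taggers' outputs.

    en_tag and es_tag are assumed to come from an English and a Spanish
    tagger run over the same token (hypothetical inputs).
    """
    return (word.lower(), en_tag, es_tag)


class TagPairClassifier:
    """Toy learner: memorizes the most frequent gold tag per feature tuple.

    This stands in for the unspecified machine learning algorithm.
    """

    def __init__(self):
        # (word, en_tag, es_tag) -> counts of gold tags
        self.by_full = defaultdict(Counter)
        # (en_tag, es_tag) -> counts of gold tags, used as a backoff
        self.by_tags = defaultdict(Counter)

    def fit(self, examples):
        for word, en_tag, es_tag, gold in examples:
            feats = combine_features(word, en_tag, es_tag)
            self.by_full[feats][gold] += 1
            self.by_tags[feats[1:]][gold] += 1
        return self

    def predict(self, word, en_tag, es_tag):
        feats = combine_features(word, en_tag, es_tag)
        if feats in self.by_full:
            return self.by_full[feats].most_common(1)[0][0]
        if feats[1:] in self.by_tags:
            return self.by_tags[feats[1:]].most_common(1)[0][0]
        # Arbitrary fallback for unseen tag pairs: trust the English tagger.
        return en_tag


# Invented toy data: (word, English tagger output, Spanish tagger output, gold tag).
# "X" marks a token the monolingual tagger could not analyze.
train = [
    ("the", "DET", "X", "DET"),
    ("casa", "X", "NOUN", "NOUN"),
    ("house", "NOUN", "X", "NOUN"),
    ("come", "VERB", "VERB", "VERB"),
    ("la", "X", "DET", "DET"),
]

model = TagPairClassifier().fit(train)
```

With this toy model, `model.predict("perro", "X", "NOUN")` backs off to the tag pair seen for "casa", so an unseen Spanish noun is still resolved from the two taggers' agreement pattern. A real system would replace the memorizing learner with a trained classifier and likely add features such as tagger confidence or a language identification label.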
Thamar Solorio, Yang Liu
Added 29 Oct 2010
Updated 29 Oct 2010
Type: Conference
Year: 2008
Where: EMNLP
Authors: Thamar Solorio, Yang Liu