ACL
2010

Simultaneous Tokenization and Part-Of-Speech Tagging for Arabic without a Morphological Analyzer

We describe an approach to simultaneous tokenization and part-of-speech tagging that is based on separating the closed and open-class items, and focusing on the likelihood of the possible stems of the open-class words. By encoding some basic linguistic information, the machine learning task is simplified, while achieving state-of-the-art tokenization results and competitive POS results, although with a reduced tag set and some evaluation difficulties.
Seth Kulick
Added 10 Feb 2011
Updated 10 Feb 2011
Type Journal
Year 2010
Where ACL
Authors Seth Kulick