Trigram morphosyntactic tagger for Polish

Abstract. We introduce an implementation of a plain trigram part-of-speech tagger which appears to work well on Polish texts. The tagger currently achieves a 9.4% error rate, which makes it significantly better than our previous stochastic disambiguator. Since the trigram model for Polish behaves similarly to the one for Czech, we hope to reach the Czech state-of-the-art error rate when the quality of the training data improves.
Lukasz Debowski
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2004
Where IIS
Authors Lukasz Debowski