Sciweavers

ANLP
1994

Tagging and Morphological Disambiguation of Turkish Text

13 years 5 months ago
Tagging and Morphological Disambiguation of Turkish Text
Automatic text tagging is an important component in higher level analysis of text corpora, and its output can be used in many natural language processing applications. In languages like Turkish or Finnish, with agglutinative morphology, morphological disambiguation is a very crucial process in tagging, as the structures of many lexical forms are morphologically ambiguous. This paper describes a POS tagger for Turkish text based on a full-scale two-level specification of Turkish morphology that is based on a lexicon of about 24,000 root words. This is augmented with a multiword and idiomatic construct recognizer, and most importantly morphological disambiguator based on local neighborhood constraints, heuristics and limited amount of statistical information. The tagger also has functionality for statistics compilation and fine tuning of the morphological analyzer, such as logging erroneous morphological parses, commonly used roots, etc. Preliminary results indicate that the tagger can ...
Kemal Oflazer, Ilker Kuruöz
Added 02 Nov 2010
Updated 02 Nov 2010
Type Conference
Year 1994
Where ANLP
Authors Kemal Oflazer, Ilker Kuruöz
Comments (0)