Sciweavers

FSMNLP
2005
Springer

German Compound Analysis with wfsc

13 years 10 months ago
German Compound Analysis with wfsc
Compounding is a very productive process in German to form complex nouns and adjectives which represent about 7% of the words of a newspaper text. Unlike English, German compounds do not contain spaces or other word boundaries, and the automatic analysis is often ambiguous. A (non-weighted) finite-state morphological analyzer provides all potential segmentations for a compound without any filtering or prioritization of the results. The paper presents an experiment in analyzing German compounds with the Xerox Weighted Finite-State Compiler (wfsc). The model is based on weights for compound segments and gives priority (a) to compounds with the minimal number of segments and (b) to compound segments with the highest frequency in a training list. The results with this rather simple model will show the advantage of using weighted finite-state transducers over simple FSTs. 1 Compound Construction A very productive word formation process in German is compounding, which combines words to bu...
Anne Schiller
Added 27 Jun 2010
Updated 27 Jun 2010
Type Conference
Year 2005
Where FSMNLP
Authors Anne Schiller
Comments (0)