Sciweavers

11 search results - page 1 / 3
» Automatic Acquisition of a Slovak Lexicon from a Raw Corpus
Sort
View
TSD
2005
Springer
15 years 4 months ago
Automatic Acquisition of a Slovak Lexicon from a Raw Corpus
This paper presents an automatic methodology we used in an experiment to acquire a morphological lexicon for the Slovak language, and the lexicon we obtained. This methodology exte...
Benoît Sagot
IJCNLP
2005
Springer
15 years 4 months ago
Automatic Acquisition of Basic Katakana Lexicon from a Given Corpus
Abstract. Katakana, Japanese phonogram mainly used for loan words, is a troublemaker in Japanese word segmentation. Since Katakana words are heavily domaindependent and there are m...
Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohas...
80
Voted
FINTAL
2006
15 years 2 months ago
Morphological Lexicon Extraction from Raw Text Data
The tool extract enables the automatic extraction of lemma-paradigm pairs from raw text data. The tool uses search patterns that consist of regular expressions and propositional lo...
Markus Forsberg, Harald Hammarström, Aarne Ra...
ACL
1996
15 years 5 days ago
Unsupervised Learning of Word-Category Guessing Rules
Words unknown to the lexicon present a substantial problem to part-of-speech tagging. In this paper we present a technique for fully unsupervised statistical acquisition of rules ...
Andrei Mikheev
COLING
1994
15 years 5 days ago
A Corpus-Based Learning Technique for Building A Self-Extensible Parser
IIuman intervention and/or training corpora tagged with various kinds of information were often assumed in many natural language acquisition models. This assumption is a major sou...
Rey-Long Liu, Von-Wun Soo