Sciweavers

11 search results - page 1 / 3
TSD
2005
Springer
13 years 10 months ago
Automatic Acquisition of a Slovak Lexicon from a Raw Corpus
This paper presents an automatic methodology we used in an experiment to acquire a morphological lexicon for the Slovak language, and the lexicon we obtained. This methodology exte...
Benoît Sagot
IJCNLP
2005
Springer
13 years 10 months ago
Automatic Acquisition of Basic Katakana Lexicon from a Given Corpus
Katakana, Japanese phonogram mainly used for loan words, is a troublemaker in Japanese word segmentation. Since Katakana words are heavily domain-dependent and there are m...
Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohas...
FINTAL
2006
13 years 8 months ago
Morphological Lexicon Extraction from Raw Text Data
The tool extract enables the automatic extraction of lemma-paradigm pairs from raw text data. The tool uses search patterns that consist of regular expressions and propositional lo...
Markus Forsberg, Harald Hammarström, Aarne Ra...
ACL
1996
13 years 6 months ago
Unsupervised Learning of Word-Category Guessing Rules
Words unknown to the lexicon present a substantial problem to part-of-speech tagging. In this paper we present a technique for fully unsupervised statistical acquisition of rules ...
Andrei Mikheev
COLING
1994
13 years 6 months ago
A Corpus-Based Learning Technique for Building A Self-Extensible Parser
Human intervention and/or training corpora tagged with various kinds of information were often assumed in many natural language acquisition models. This assumption is a major sou...
Rey-Long Liu, Von-Wun Soo