Sciweavers

11 search results - page 1 / 3
» Automatic Acquisition of a Slovak Lexicon from a Raw Corpus
Sort
View
84
Voted
TSD
2005
Springer
15 years 5 months ago
Automatic Acquisition of a Slovak Lexicon from a Raw Corpus
This paper presents an automatic methodology we used in an experiment to acquire a morphological lexicon for the Slovak language, and the lexicon we obtained. This methodology exte...
Benoît Sagot
IJCNLP
2005
Springer
15 years 5 months ago
Automatic Acquisition of Basic Katakana Lexicon from a Given Corpus
Abstract. Katakana, Japanese phonogram mainly used for loan words, is a troublemaker in Japanese word segmentation. Since Katakana words are heavily domaindependent and there are m...
Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohas...
89
Voted
FINTAL
2006
15 years 3 months ago
Morphological Lexicon Extraction from Raw Text Data
The tool extract enables the automatic extraction of lemma-paradigm pairs from raw text data. The tool uses search patterns that consist of regular expressions and propositional lo...
Markus Forsberg, Harald Hammarström, Aarne Ra...
ACL
1996
15 years 1 months ago
Unsupervised Learning of Word-Category Guessing Rules
Words unknown to the lexicon present a substantial problem to part-of-speech tagging. In this paper we present a technique for fully unsupervised statistical acquisition of rules ...
Andrei Mikheev
81
Voted
COLING
1994
15 years 1 months ago
A Corpus-Based Learning Technique for Building A Self-Extensible Parser
IIuman intervention and/or training corpora tagged with various kinds of information were often assumed in many natural language acquisition models. This assumption is a major sou...
Rey-Long Liu, Von-Wun Soo