Sciweavers

3 search results - page 1 / 1
» Mostly-Unsupervised Statistical Segmentation of Japanese Kan...
Sort
View
CORR
2002
Springer
90views Education» more  CORR 2002»
14 years 9 months ago
Mostly-Unsupervised Statistical Segmentation of Japanese Kanji Sequences
Given the lack of word delimiters in written Japanese, word segmentation is generally considered a crucial first step in processing Japanese texts. Typical Japanese segmentation a...
Rie Kubota Ando, Lillian Lee
COLING
1996
14 years 11 months ago
The Automatic Extraction of Open Compounds from Text Corpora
This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages, such as Thai, Japanese and Korea that exhibit un...
Virach Sornlertlamvanich, Hozumi Tanaka