Sciweavers

25 search results - page 1 / 5
» A compression-based algorithm for Chinese word segmentation
ACL
1994
13 years 10 months ago
A Stochastic Finite-State Word-Segmentation Algorithm for Chinese
We present a stochastic finite-state model for segmenting Chinese text into dictionary entries and productively derived words, and providing pronunciations for these words; the me...
Richard Sproat, Chilin Shih, William Gale, Nancy C...
ACL
2009
13 years 7 months ago
An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging
In this paper, we present a discriminative word-character hybrid model for joint Chinese word segmentation and POS tagging. Our word-character hybrid model offers high performance...
Canasai Kruengkrai, Kiyotaka Uchimoto, Jun'ichi Ka...
ACL
2006
13 years 10 months ago
Discriminative Pruning of Language Models for Chinese Word Segmentation
This paper presents a discriminative pruning method of n-gram language model for Chinese word segmentation. To reduce the size of the language model that is used in a Chinese word...
Jianfeng Li, Haifeng Wang, Dengjun Ren, Guohua Li
IJCNLP
2005
Springer
14 years 2 months ago
A Chunking Strategy Towards Unknown Word Detection in Chinese Word Segmentation
This paper proposes a chunking strategy to detect unknown words in Chinese word segmentation. First, a raw sentence is pre-segmented into a sequence of word atoms using a maximum...
Guodong Zhou
ACL
1997
13 years 10 months ago
A Trainable Rule-based Algorithm for Word Segmentation
This paper presents a trainable rule-based algorithm for performing word segmentation. The algorithm provides a simple, language-independent alternative to large-scale lexical-bas...
David D. Palmer