Sciweavers

25 search results - page 1 / 5
» A compression-based algorithm for Chinese word segmentation
Sort
View
ACL
1994
13 years 5 months ago
A Stochastic Finite-State Word-Segmentation Algorithm for Chinese
We present a stochastic finite-state model for segmenting Chinese text into dictionary entries and productively derived words, and providing pronunciations for these words; the me...
Richard Sproat, Chilin Shih, William Gale, Nancy C...
ACL
2009
13 years 1 months ago
An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging
In this paper, we present a discriminative word-character hybrid model for joint Chinese word segmentation and POS tagging. Our word-character hybrid model offers high performance...
Canasai Kruengkrai, Kiyotaka Uchimoto, Jun'ichi Ka...
ACL
2006
13 years 5 months ago
Discriminative Pruning of Language Models for Chinese Word Segmentation
This paper presents a discriminative pruning method of n-gram language model for Chinese word segmentation. To reduce the size of the language model that is used in a Chinese word...
Jianfeng Li, Haifeng Wang, Dengjun Ren, Guohua Li
IJCNLP
2005
Springer
13 years 9 months ago
A Chunking Strategy Towards Unknown Word Detection in Chinese Word Segmentation
This paper proposes a chunking strategy to detect unknown words in Chinese word segmentation. First, a raw sentence is pre-segmented into a sequence of word atoms 1 using a maximum...
Guodong Zhou
ACL
1997
13 years 5 months ago
A Trainable Rule-based Algorithm for Word Segmentation
This paper presents a trainable rule-based algorithm for performing word segmentation. The algorithm provides a simple, language-independent alternative to large-scale lexicai-bas...
David D. Palmer