Sciweavers

265 search results - page 2 / 53
» Statistical-Based Approach to Word Segmentation
IJCNLP
2005
Springer
13 years 11 months ago
A Chunking Strategy Towards Unknown Word Detection in Chinese Word Segmentation
This paper proposes a chunking strategy to detect unknown words in Chinese word segmentation. First, a raw sentence is pre-segmented into a sequence of word atoms using a maximum...
Guodong Zhou
LREC
2010
13 years 6 months ago
Adapting Chinese Word Segmentation for Machine Translation Based on Short Units
In Chinese texts, words composed of single or multiple characters are not separated by spaces, unlike most western languages. Therefore Chinese word segmentation is considered an ...
Yiou Wang, Kiyotaka Uchimoto, Jun'ichi Kazama, Can...
COLING
2010
13 years 10 days ago
Word-based and Character-based Word Segmentation Models: Comparison and Combination
We present a theoretical and empirical comparative analysis of the two dominant categories of approaches in Chinese word segmentation: word-based models and character-based models...
Weiwei Sun
NAACL
2010
13 years 3 months ago
Is Arabic Part of Speech Tagging Feasible Without Word Segmentation?
In this paper, we compare two novel methods for part of speech tagging of Arabic without the use of gold standard word segmentation but with the full POS tagset of the Penn Arabic...
Emad Mohamed, Sandra Kübler
LREC
2008
13 years 6 months ago
Word Segmentation of Vietnamese Texts: a Comparison of Approaches
We present in this paper a comparison between three segmentation systems for the Vietnamese language. Indeed, the majority of Vietnamese words is built by semantic composition fro...
Quang Thang Dinh, Hong Phuong Le, Thi Minh Huyen N...