Sciweavers

31 search results - page 1 / 7
» Enhancing Chinese Word Segmentation Using Unlabeled Data
Sort
View
ACL
2008
13 years 6 months ago
Semi-Supervised Sequential Labeling and Segmentation Using Giga-Word Scale Unlabeled Data
This paper provides evidence that the use of more unlabeled data in semi-supervised learning can improve the performance of Natural Language Processing (NLP) tasks, such as part-o...
Jun Suzuki, Hideki Isozaki
COLING
2010
12 years 12 months ago
Unsupervised phonemic Chinese word segmentation using Adaptor Grammars
Adaptor grammars are a framework for expressing and performing inference over a variety of non-parametric linguistic models. These models currently provide state-of-the-art perfor...
Mark Johnson, Katherine Demuth
ACL
2012
11 years 7 months ago
Enhancing Statistical Machine Translation with Character Alignment
The dominant practice of statistical machine translation (SMT) uses the same Chinese word segmentation specification in both alignment and translation rule induction steps in buil...
Ning Xi, Guangchao Tang, Xinyu Dai, Shujian Huang,...
EMNLP
2010
13 years 2 months ago
Enhancing Domain Portability of Chinese Segmentation Model Using Chi-Square Statistics and Bootstrapping
Almost all Chinese language processing tasks involve word segmentation of the language input as their first steps, thus robust and reliable segmentation techniques are always requ...
Baobao Chang, Dongxu Han