Search Sciweavers | Sciweavers

COLING
2008

163 views, Computational Linguistics

Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation

13 years 6 months ago

Words in Chinese text are not naturally separated by delimiters, which poses a challenge to standard machine translation (MT) systems. In MT, the widely used approach is to apply ...

Jia Xu, Jianfeng Gao, Kristina Toutanova, Hermann ...

ACL
1994

231 views, Computational Linguistics

A Stochastic Finite-State Word-Segmentation Algorithm for Chinese

13 years 5 months ago

Download www.aclweb.org

We present a stochastic finite-state model for segmenting Chinese text into dictionary entries and productively derived words, and providing pronunciations for these words; the me...

Richard Sproat, Chilin Shih, William Gale, Nancy C...

WWW
2007
ACM

117 views, Internet Technology

A search-based Chinese word segmentation method

14 years 5 months ago

Download www2007.org

In this paper, we propose a novel Chinese word segmentation method which leverages the huge deposit of Web documents and search technology. It simultaneously solves ambiguous phra...

Xin-Jing Wang, Yong Qin, Wen Liu

ACL
2004

89 views, Computational Linguistics

Adaptive Chinese Word Segmentation

13 years 5 months ago

Download acl.ldc.upenn.edu

This paper presents a Chinese word segmentation system which can adapt to different domains and standards. We first present a statistical framework where domain-specific words are...

Jianfeng Gao, Andi Wu, Cheng-Ning Huang, Hongqiao ...

ACL
2009

147 views, Computational Linguistics

Automatic Adaptation of Annotation Standards: Chinese Word Segmentation and POS Tagging - A Case Study

13 years 2 months ago

Download mtgroup.ict.ac.cn

Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence labeling there exist multiple corpora with different a...

Wenbin Jiang, Liang Huang, Qun Liu
