Search Sciweavers | Sciweavers

133 search results - page 1 / 27

» Joint Tokenization and Translation

click to vote

COLING
2010

104views Computational Linguistics» more COLING 2010»

Joint Tokenization and Translation

13 years 4 months ago

Download nlp.ict.ac.cn

As tokenization is usually ambiguous for many natural languages such as Chinese and Korean, tokenization errors might potentially introduce translation mistakes for translation sy...

Xinyan Xiao, Yang Liu, Young-Sook Hwang, Qun Liu, ...

claim paper

Read More »

click to vote

ACL
2010

126views Computational Linguistics» more ACL 2010»

On Jointly Recognizing and Aligning Bilingual Named Entities

13 years 7 months ago

Download aclweb.org

We observe that (1) how a given named entity (NE) is translated (i.e., either semantically or phonetically) depends greatly on its associated entity type, and (2) entities within ...

Yufeng Chen, Chengqing Zong, Keh-Yih Su

claim paper

Read More »

click to vote

EMNLP
2009

133views Natural Language Processing» more EMNLP 2009»

Unsupervised Tokenization for Machine Translation

13 years 7 months ago

Download www.cs.rochester.edu

Training a statistical machine translation starts with tokenizing a parallel corpus. Some languages such as Chinese do not incorporate spacing in their writing system, which creat...

Tagyoung Chung, Daniel Gildea

claim paper

Read More »

click to vote

CLEF
2004
Springer

77views Information Technology» more CLEF 2004»

Effective Translation, Tokenization and Combination for Cross-Lingual Retrieval

14 years 2 months ago