Sciweavers

144 search results - page 9 / 29
» Improved Source-Channel Models for Chinese Word Segmentation
Sort
View
NAACL
2007
15 years 1 months ago
Hybrid Models for Semantic Classification of Chinese Unknown Words
This paper addresses the problem of classifying Chinese unknown words into fine-grained semantic categories defined in a Chinese thesaurus. We describe three novel knowledge-based...
Xiaofei Lu
110
Voted
ACL
2010
14 years 9 months ago
Jointly Optimizing a Two-Step Conditional Random Field Model for Machine Transliteration and Its Fast Decoding Algorithm
This paper presents a joint optimization method of a two-step conditional random field (CRF) model for machine transliteration and a fast decoding algorithm for the proposed metho...
Dong Yang, Paul R. Dixon, Sadaoki Furui
98
Voted
EMNLP
2004
15 years 1 months ago
A New Approach for English-Chinese Named Entity Alignment
Traditional word alignment approaches cannot come up with satisfactory results for Named Entities. In this paper, we propose a novel approach using a maximum entropy model for nam...
Donghui Feng, Yajuan Lü, Ming Zhou
89
Voted
IRAL
2003
ACM
15 years 4 months ago
Issues in pre- and post-translation document expansion: untranslatable cognates and missegmented words
Query expansion by pseudo-relevance feedback is a well-established technique in both mono- and cross- lingual information retrieval, enriching and disambiguating the typically ter...
Gina-Anne Levow
109
Voted
NAACL
2010
14 years 9 months ago
Joint Parsing and Alignment with Weakly Synchronized Grammars
Syntactic machine translation systems extract rules from bilingual, word-aligned, syntactically parsed text, but current systems for parsing and word alignment are at best cascade...
David Burkett, John Blitzer, Dan Klein