Sciweavers

144 search results - page 9 / 29
» Improved Source-Channel Models for Chinese Word Segmentation
Sort
View
NAACL
2007
14 years 11 months ago
Hybrid Models for Semantic Classification of Chinese Unknown Words
This paper addresses the problem of classifying Chinese unknown words into fine-grained semantic categories defined in a Chinese thesaurus. We describe three novel knowledge-based...
Xiaofei Lu
ACL
2010
14 years 7 months ago
Jointly Optimizing a Two-Step Conditional Random Field Model for Machine Transliteration and Its Fast Decoding Algorithm
This paper presents a joint optimization method of a two-step conditional random field (CRF) model for machine transliteration and a fast decoding algorithm for the proposed metho...
Dong Yang, Paul R. Dixon, Sadaoki Furui
EMNLP
2004
14 years 11 months ago
A New Approach for English-Chinese Named Entity Alignment
Traditional word alignment approaches cannot come up with satisfactory results for Named Entities. In this paper, we propose a novel approach using a maximum entropy model for nam...
Donghui Feng, Yajuan Lü, Ming Zhou
76
Voted
IRAL
2003
ACM
15 years 2 months ago
Issues in pre- and post-translation document expansion: untranslatable cognates and missegmented words
Query expansion by pseudo-relevance feedback is a well-established technique in both mono- and cross- lingual information retrieval, enriching and disambiguating the typically ter...
Gina-Anne Levow
NAACL
2010
14 years 7 months ago
Joint Parsing and Alignment with Weakly Synchronized Grammars
Syntactic machine translation systems extract rules from bilingual, word-aligned, syntactically parsed text, but current systems for parsing and word alignment are at best cascade...
David Burkett, John Blitzer, Dan Klein