Sciweavers

64 search results - page 5 / 13
» Estimation of English and non-English Language Use on the WW...
Sort
View
COLING
1996
14 years 11 months ago
Bilingual Knowledge Acquisition from Korean-English Parallel Corpus Using Alignment
This paper snggests a method to align Korean-English parallel corpus. '1?he structural dissimilarity between Korean and Indo-European languages requires more flexible measure...
Jung H. Shin, Young S. Han, Key-Sun Choi
COLING
2010
14 years 4 months ago
Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets
An unsupervised discriminative training procedure is proposed for estimating a language model (LM) for machine translation (MT). An English-to-English synchronous context-free gra...
Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, Jason E...
COLING
2010
14 years 4 months ago
Urdu and Hindi: Translation and sharing of linguistic resources
Hindi and Urdu share a common phonology, morphology and grammar but are written in different scripts. In addition, the vocabularies have also diverged significantly especially in ...
Karthik Visweswariah, Vijil Chenthamarakshan, Nand...
AAAI
2000
14 years 11 months ago
Estimating Word Translation Probabilities from Unrelated Monolingual Corpora Using the EM Algorithm
Selecting the right word translation among several options in the lexicon is a core problem for machine translation. We present a novel approach to this problem that can be traine...
Philipp Koehn, Kevin Knight
LREC
2010
172views Education» more  LREC 2010»
14 years 11 months ago
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9
CzEng 0.9 is the third release of a large parallel corpus of Czech and English. For the current release, CzEng was extended by significant amount of texts from various types of so...
Ondrej Bojar, Adam Liska, Zdenek Zabokrtský