We present a grand challenge to build a corpus that will include all of the world's languages, in a consistent structure that permits large-scale cross-linguistic processing,...
: The performance of a statistical machine translation (SMT) system heavily depends on the quantity and quality of the bilingual language resource. However, the pervious work mainl...
In this paper we propose the method of multimodal corpus analysis to collect enough empirical data for modeling the behavior of embodied conversational agents. This is a prerequisi...
This article presents an original lexical unit extraction system for Chinese. The method is based on an incremental process driven by an association score featuring a minimal reso...
Current statistical machine translation (SMT) systems are trained on sentencealigned and word-aligned parallel text collected from various sources. Translation model parameters ar...
Spyros Matsoukas, Antti-Veikko I. Rosti, Bing Zhan...