Performance of n-gram language models depends to a large extent on the amount of training text material available for building the models and the degree to which this text matches...
In current phrase-based Statistical Machine Translation systems, more training data is generally better than less. However, a larger data set eventually introduces a larger model ...
Currently there are several approaches to machine translation (MT) based on different paradigms; e.g., phrasal, hierarchical and syntax-based. These three approaches yield similar...
Antti-Veikko I. Rosti, Necip Fazil Ayan, Bing Xian...
When aligning texts in very different languages such as Korean and English, structural features beyond word or phrase give useful intbrmation. In this paper, we present a method f...
We describe the lexical knowledge base system (LKB) which has been designed and implemented as part of the ACQUILEX project1 to allow the representation of multilinguM syntactic a...