Sciweavers

ACL
2012
11 years 7 months ago
Prediction of Learning Curves in Machine Translation
Parallel data in the domain of interest is the key resource when training a statistical machine translation (SMT) system for a specific purpose. Since ad-hoc manual translation c...
Prasanth Kolachina, Nicola Cancedda, Marc Dymetman...
ACL
2012
11 years 7 months ago
Akamon: An Open Source Toolkit for Tree/Forest-Based Statistical Machine Translation
We describe Akamon, an open source toolkit for tree and forest-based statistical machine translation (Liu et al., 2006; Mi et al., 2008; Mi and Huang, 2008). Akamon implements all...
Xianchao Wu, Takuya Matsuzaki, Jun-ichi Tsujii
ACL
2012
11 years 7 months ago
Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT
With a few exceptions, discriminative training in statistical machine translation (SMT) has been content with tuning weights for large feature sets on small development data. Evid...
Patrick Simianer, Stefan Riezler, Chris Dyer
ACL
2012
11 years 7 months ago
Fast and Scalable Decoding with Language Model Look-Ahead for Phrase-based Statistical Machine Translation
In this work we present two extensions to the well-known dynamic programming beam search in phrase-based statistical machine translation (SMT), aiming at increased efficiency of ...
Joern Wuebker, Hermann Ney, Richard Zens
ACL
2012
11 years 7 months ago
Private Access to Phrase Tables for Statistical Machine Translation
Some Statistical Machine Translation systems never see the light because the owner of the appropriate training data cannot release them, and the potential user of the system canno...
Nicola Cancedda
ACL
2012
11 years 7 months ago
Enhancing Statistical Machine Translation with Character Alignment
The dominant practice of statistical machine translation (SMT) uses the same Chinese word segmentation specification in both alignment and translation rule induction steps in buil...
Ning Xi, Guangchao Tang, Xinyu Dai, Shujian Huang,...
ACL
2012
11 years 7 months ago
Improving the IBM Alignment Models Using Variational Bayes
Bayesian approaches have been shown to reduce the amount of overfitting that occurs when running the EM algorithm, by placing prior probabilities on the model parameters. We appl...
Darcey Riley, Daniel Gildea
ACL
2012
11 years 7 months ago
Mixing Multiple Translation Models in Statistical Machine Translation
Statistical machine translation is often faced with the problem of combining training data from many diverse sources into a single translation model which then has to translate se...
Majid Razmara, George Foster, Baskaran Sankaran, A...
ACL
2012
11 years 7 months ago
Combining Word-Level and Character-Level Models for Machine Translation Between Closely-Related Languages
We propose several techniques for improving statistical machine translation between closely-related languages with scarce resources. We use character-level translation trained on ...
Preslav Nakov, Jörg Tiedemann
ACL
2012
11 years 7 months ago
A Ranking-based Approach to Word Reordering for Statistical Machine Translation
Long distance word reordering is a major challenge in statistical machine translation research. Previous work has shown using source syntactic trees is an effective way to tackle ...
Nan Yang, Mu Li, Dongdong Zhang, Nenghai Yu