ACL
2012
13 years 8 months ago
Akamon: An Open Source Toolkit for Tree/Forest-Based Statistical Machine Translation
We describe Akamon, an open source toolkit for tree and forest-based statistical machine translation (Liu et al., 2006; Mi et al., 2008; Mi and Huang, 2008). Akamon implements all...
Xianchao Wu, Takuya Matsuzaki, Jun-ichi Tsujii
ACL
2012
13 years 8 months ago
Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT
With a few exceptions, discriminative training in statistical machine translation (SMT) has been content with tuning weights for large feature sets on small development data. Evid...
Patrick Simianer, Stefan Riezler, Chris Dyer
ACL
2012
13 years 8 months ago
Large-Scale Syntactic Language Modeling with Treelets
We propose a simple generative, syntactic language model that conditions on overlapping windows of tree context (or treelets) in the same way that n-gram language models condition...
Adam Pauls, Dan Klein
ACL
2012
13 years 8 months ago
DOMCAT: A Bilingual Concordancer for Domain-Specific Computer Assisted Translation
In this paper, we propose a web-based bilingual concordancer, DOMCAT, for domain-specific computer assisted translation. Given a multi-word expression as a query, the system in...
Ming-Hong Bai, Yu-Ming Hsieh, Keh-Jiann Chen, Jaso...
ACL
2012
13 years 8 months ago
Text Segmentation by Language Using Minimum Description Length
The problem addressed in this paper is to segment a given multilingual document into segments for each language and then identify the language of each segment. The problem was mot...
Hiroshi Yamaguchi, Kumiko Tanaka-Ishii