Sciweavers

Share
ACL
2012

Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT

9 years 10 months ago
Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT
With a few exceptions, discriminative training in statistical machine translation (SMT) has been content with tuning weights for large feature sets on small development data. Evidence from machine learning indicates that increasing the training sample size results in better prediction. The goal of this paper is to show that this common wisdom can also be brought to bear upon SMT. We deploy local features for SCFG-based SMT that can be read off from rules at runtime, and present a learning algorithm that applies 1/ 2 regularization for joint feature selection over distributed stochastic learning processes. We present ex
Patrick Simianer, Stefan Riezler, Chris Dyer
Added 29 Sep 2012
Updated 29 Sep 2012
Type Journal
Year 2012
Where ACL
Authors Patrick Simianer, Stefan Riezler, Chris Dyer
Comments (0)
books