Sciweavers

223 search results - page 2 / 45
» An Efficient Computation of the Multiple-Bernoulli Language ...
Sort
View
ACL
2009
13 years 3 months ago
Source-Language Entailment Modeling for Translating Unknown Terms
This paper addresses the task of handling unknown terms in SMT. We propose using source-language monolingual models and resources to paraphrase the source text prior to translatio...
Shachar Mirkin, Lucia Specia, Nicola Cancedda, Ido...
ACL
2009
13 years 3 months ago
Bayesian Unsupervised Word Segmentation with Nested Pitman-Yor Language Modeling
In this paper, we propose a new Bayesian model for fully unsupervised word segmentation and an efficient blocked Gibbs sampler combined with dynamic programming for inference. Our...
Daichi Mochihashi, Takeshi Yamada, Naonori Ueda
ACL
2009
13 years 3 months ago
A Succinct N-gram Language Model
Efficient processing of tera-scale text data is an important research topic. This paper proposes lossless compression of Ngram language models based on LOUDS, a succinct data stru...
Taro Watanabe, Hajime Tsukada, Hideki Isozaki
FPL
2009
Springer
103views Hardware» more  FPL 2009»
13 years 3 months ago
Customizable domain-specific computing
In this article, we introduce the ongoing research in modeling and mapping for heterogeneous, customizable, parallel systems, as part of the effort in the newly established Center...
Jason Cong
EMNLP
2009
13 years 3 months ago
Stream-based Randomised Language Models for SMT
Randomised techniques allow very big language models to be represented succinctly. However, being batch-based they are unsuitable for modelling an unbounded stream of language whi...
Abby Levenberg, Miles Osborne