Sciweavers

237 search results - page 30 / 48
» acl 2008
Sort
View
ACL
2008
15 years 1 months ago
Lexicalized Phonotactic Word Segmentation
This paper presents a new unsupervised algorithm (WordEnds) for inferring word boundaries from transcribed adult conversations. Phone ngrams before and after observed pauses are u...
Margaret M. Fleck
ACL
2008
15 years 1 months ago
Hypertagging: Supertagging for Surface Realization with CCG
In lexicalized grammatical formalisms, it is possible to separate lexical category assignment from the combinatory processes that make use of such categories, such as parsing and ...
Dominic Espinosa, Michael White, Dennis Mehay
ACL
2008
15 years 1 months ago
Beyond Log-Linear Models: Boosted Minimum Error Rate Training for N-best Re-ranking
Current re-ranking algorithms for machine translation rely on log-linear models, which have the potential problem of underfitting the training data. We present BoostedMERT, a nove...
Kevin Duh, Katrin Kirchhoff
ACL
2008
15 years 1 months ago
Exploiting N-best Hypotheses for SMT Self-Enhancement
Word and n-gram posterior probabilities estimated on N-best hypotheses have been used to improve the performance of statistical machine translation (SMT) in a rescoring framework....
Boxing Chen, Min Zhang, AiTi Aw, Haizhou Li
49
Voted
ACL
2008
15 years 1 months ago
Smoothing a Tera-word Language Model
Frequency counts from very large corpora, such as the Web 1T dataset, have recently become available for language modeling. Omission of low frequency n-gram counts is a practical ...
Deniz Yuret