Sciweavers

LREC
2008
14 years 11 months ago
A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation
In this paper, we discuss lemma identification in Japanese morphological analysis, which is crucial for a proper formulation of morphological analysis that benefits not only NLP r...
Yasuharu Den, Junpei Nakamura, Toshinobu Ogiso, Hi...
ICASSP
2011
IEEE
14 years 1 month ago
Unsupervised determination of efficient Korean LVCSR units using a Bayesian Dirichlet process model
Korean is an agglutinative language that does not have explicit word boundaries. It is also a highly inflective language that exhibits severe coarticulation effects. These charac...
Sakriani Sakti, Andrew M. Finch, Ryosuke Isotani, ...
COLING
2000
14 years 11 months ago
Identifying Terms by their Family and Friends
Multi-word terms are traditionally identified using statistical techniques or, more recently, using hybrid techniques combining statistics with shallow linguistic information. Al...
Diana Maynard, Sophia Ananiadou
MT
2010
14 years 8 months ago
Improve syntax-based translation using deep syntactic structures
This paper introduces deep syntactic structures to syntax-based Statistical Machine Translation (SMT). We use a Head-driven Phrase Structure Grammar (HPSG) parser to obtain the de...
Xianchao Wu, Takuya Matsuzaki, Jun-ichi Tsujii
ACL
2010
14 years 7 months ago
Training Phrase Translation Models with Leaving-One-Out
Several attempts have been made to learn phrase translation probabilities for phrase-based statistical machine translation that go beyond pure counting of phrases in word-aligned t...
Joern Wuebker, Arne Mauser, Hermann Ney