The paper presents an approach for constructing a weighted bilingual dictionary of inflectional forms using as input data a traditional bilingual dictionary, and not parallel corp...
Confusion networks are a simple representation of multiple speech recognition or translation hypotheses in a machine translation system. A typical operation on a confusion network...
We propose a novel HMM-based framework to accurately transliterate unseen named entities. The framework leverages features in letter alignment and letter n-gram pairs learned from ...
Bing Zhao, Nguyen Bach, Ian R. Lane, Stephan Vogel
In the spoken language machine translation project Verbmobil, the semantic formalism Language for Underspecified Discourse representation structures (LUD) is used. LUD describes a...
Natural language processing (NLP) programs are confronted with various difficulties in processing HTML and XML documents, and have the potential to produce better results if linguistic i...
Hideo Watanabe, Katashi Nagao, Michael C. McCord, ...