We propose a language-independent method for the automatic extraction of transliteration pairs from parallel corpora. In contrast to previous work, our method uses no form of supe...
This paper proposes an efficient client-server-based query translation approach to allowing more feasible implementation of cross-language information retrieval (CLIR) services in ...
Compared with the written word, few experts pay more attention to the spoken word because of the difficulty of obtaining spoken corpora. In order to develop and improve the spoken...
Yuqiang Zhang, Yu Zou, Wei He, Min Hou, Yonglin Te...
We describe a new information fusion approach to integrate facts extracted from cross-media objects (videos and texts) into a coherent common representation including multi-level ...
Adam Lee, Marissa Passantino, Heng Ji, Guojun Qi, ...
A serious bottleneck in the development of trainable text summarization systems is the shortage of training data. Constructing such data is a very tedious task, especially because...