Sciweavers

SIGIR
2004
ACM

Translating unknown queries with web corpora for cross-language information retrieval

13 years 10 months ago
Translating unknown queries with web corpora for cross-language information retrieval
It is crucial for cross-language information retrieval (CLIR) systems to deal with the translation of unknown queries1 due to that real queries might be short. The purpose of this paper is to investigate the feasibility of exploiting the Web as the corpus source to translate unknown queries for CLIR. We propose an online translation approach to determine effective translations for unknown query terms via mining of bilingual search-result pages obtained from Web search engines. This approach can alleviate the problem of the lack of large bilingual corpora, translate many unknown query terms, provide flexible query specifications, and extract semantically-close translations to benefit CLIR tasks – especially for cross-language Web search. Categories and Subject Descriptors H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval. General Terms Algorithms, Experimentation, Performance. Keywords Query Translation, Cross-Language Information Retrieval, CrossLanguage We...
Pu-Jen Cheng, Jei-Wen Teng, Ruey-Cheng Chen, Jenq-
Added 30 Jun 2010
Updated 30 Jun 2010
Type Conference
Year 2004
Where SIGIR
Authors Pu-Jen Cheng, Jei-Wen Teng, Ruey-Cheng Chen, Jenq-Haur Wang, Wen-Hsiang Lu, Lee-Feng Chien
Comments (0)