Recherche bilingue et multilingue d'information

13 years 1 months ago
Recherche bilingue et multilingue d'information
In order to search within corpora written in two or more languages, the simplest and most effective approach is to translate the submitted request into the required language(s). To achieve this goal, we based our IR model on translation tools freely available on the Web. When comparing the retrieval effectiveness of manually and automatically translated requests, we found that human-based translation outperformed machine-based approaches. However, when we analyzed the query-by-query performance, we found query performances based on machine-based translations to vary a great deal. The question that then arises is whether or not we can predict the retrieval performance of a translated query and as a result we may thus select only the best translation(s). To respond to this, we designed and evaluated a predictive system based on the logistic regression, and used it to select the top most appropriate machine-based translations. An evaluation of this approach shows retrieval performance is ...
Jacques Savoy, Pierre-Yves Berger
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2004
Authors Jacques Savoy, Pierre-Yves Berger
Comments (0)