

ECIR
2008
Springer



A Wikipedia-Based Multilingual Retrieval Model



Download www.uni-weimar.de

This paper introduces CL-ESA, a new multilingual retrieval model for the analysis of cross-language similarity. The retrieval model exploits the multilingual alignment of Wikipedia: given a document $d$ written in language $L$, we construct a concept vector $\mathbf{d}$ for $d$, where each dimension $i$ of $\mathbf{d}$ quantifies the similarity of $d$ with respect to a document $d_i$ chosen from the "$L$-subset" of Wikipedia. Likewise, for a second document $d'$ written in language $L'$, $L \neq L'$, we construct a concept vector $\mathbf{d}'$, using from the $L'$-subset of Wikipedia the topic-aligned counterparts $d'_i$ of our previously chosen documents. Since the two concept vectors $\mathbf{d}$ and $\mathbf{d}'$ are collection-relative representations of $d$ and $d'$, they are language-independent; that is, their similarity can be computed directly, for instance with the cosine similarity measure. We present results of an extensive analysis that demonstrates the power of this new retrieval model: for a query document $d$ the topically most similar documents from...
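
The construction described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the whitespace tokenization, the raw term-frequency weighting, and the three-document toy index collections are all assumptions made for illustration, standing in for the topic-aligned Wikipedia subsets. Each document is mapped to a concept vector of similarities against the index documents of its own language, and the two concept vectors are then compared with the cosine measure.

```python
import math
from collections import Counter


def term_vector(text):
    """Sparse term-frequency vector (whitespace tokens; an assumed weighting)."""
    return Counter(text.lower().split())


def cosine(u, v):
    """Cosine similarity of two sparse vectors given as dicts."""
    dot = sum(weight * v.get(key, 0.0) for key, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def concept_vector(doc, index_docs):
    """Dimension i holds the similarity of doc to the i-th index document."""
    doc_terms = term_vector(doc)
    return {i: cosine(doc_terms, term_vector(d_i))
            for i, d_i in enumerate(index_docs)}


def cl_esa_similarity(doc, index_docs, doc_prime, index_docs_prime):
    """Compare two documents in different languages via their concept vectors.

    index_docs[i] and index_docs_prime[i] must describe the same topic.
    """
    if len(index_docs) != len(index_docs_prime):
        raise ValueError("index collections must be aligned one-to-one")
    return cosine(concept_vector(doc, index_docs),
                  concept_vector(doc_prime, index_docs_prime))


# Hypothetical toy index collections, aligned by topic (English / German).
index_en = ["dog animal pet bark",
            "car engine road drive",
            "music song guitar band"]
index_de = ["hund tier haustier bellen",
            "auto motor strasse fahren",
            "musik lied gitarre band"]

doc_en = "my dog is a loyal pet"
doc_de_dog = "der hund ist ein tier"
doc_de_car = "das auto hat einen motor"

sim_same_topic = cl_esa_similarity(doc_en, index_en, doc_de_dog, index_de)
sim_other_topic = cl_esa_similarity(doc_en, index_en, doc_de_car, index_de)
print(sim_same_topic, sim_other_topic)
```

In this toy setting the English and German dog documents share no terms, yet their concept vectors point to the same index topic, so they score higher than the dog and car pair. With a realistic index collection the concept vectors would have many nonzero dimensions, and a tf-idf style weighting would likely be a better choice than raw term counts.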

Martin Potthast, Benno Stein, Maik Anderka


Concept Vectors | Document | ECIR 2008 | Information Technology | Retrieval Model


Related Content

» Combining Wikipedia-Based Concept Models for Cross-Language Retrieval

» CLEF 2005 Multilingual Retrieval by Combining Multiple Multilingual Ranked Lists

» Dealing with MultiLingual Information Access Grid Experiments at TrebleCLEF

» ITC-irst at CLEF 2003 Monolingual Bilingual and Multilingual Information Retrieval

» Mining multilingual topics from wikipedia

» Translation Resources Merging Strategies and Relevance Feedback for Cross-Language Informat...

» A study of learning a merge model for multilingual information retrieval

» UB at CLEF2004 Cross Language Information Retrieval Using Statistical Language Models

» GRISP A Massive Multilingual Terminological Database for Scientific and Technical Domains

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	ECIR
Authors	Martin Potthast, Benno Stein, Maik Anderka
