Sciweavers

25 search results - page 2 / 5
» Mining multilingual topics from wikipedia
Sort
View
WSDM
2009
ACM
148views Data Mining» more  WSDM 2009»
13 years 11 months ago
Information arbitrage across multi-lingual Wikipedia
The rapid globalization of Wikipedia is generating a parallel, multi-lingual corpus of unprecedented scale. Pages for the same topic in many different languages emerge both as a r...
Eytan Adar, Michael Skinner, Daniel S. Weld
ECIR
2010
Springer
13 years 6 months ago
Extracting Multilingual Topics from Unaligned Comparable Corpora
Topic models have been studied extensively in the context of monolingual corpora. Though there are some attempts to mine topical structure from cross-lingual corpora, they require ...
Jagadeesh Jagarlamudi, Hal Daumé III
ECIR
2008
Springer
13 years 6 months ago
A Wikipedia-Based Multilingual Retrieval Model
This paper introduces CL-ESA, a new multilingual retrieval model for the analysis of cross-language similarity. The retrieval model exploits the multilingual alignment of Wikipedia...
Martin Potthast, Benno Stein, Maik Anderka
WSDM
2009
ACM
188views Data Mining» more  WSDM 2009»
13 years 11 months ago
Is Wikipedia link structure different?
In this paper, we investigate the difference between Wikipedia and Web link structure with respect to their value as indicators of the relevance of a page for a given topic of re...
Jaap Kamps, Marijn Koolen
WEBI
2009
Springer
13 years 11 months ago
Mining a Multilingual Geographical Gazetteer from the Web
Geographical gazetteers are necessary in a wide variety of applications. In the past, the construction of such gazetteers has been a tedious, manual process and only recently have...
Adrian Popescu, Gregory Grefenstette, Houda Bouamo...