Sciweavers

64 search results - page 2 / 13
» Estimation of English and non-English Language Use on the WW...
Sort
View
NAACL
2010
13 years 3 months ago
Improving the Multilingual User Experience of Wikipedia Using Cross-Language Name Search
Although Wikipedia has emerged as a powerful collaborative Encyclopedia on the Web, it is only partially multilingual as most of the content is in English and a small number of ot...
Raghavendra Udupa, Mitesh M. Khapra
TREC
2007
13 years 6 months ago
DUTIR at TREC 2007 Blog Track
This paper describes DUTIR at TREC 2007 Blog Track. In data preprocessing, a non English language list created from the corpus was used to remove the non English blogs, blog templ...
Rui Song, Qin Tang, Daming Shi 0002, Hongfei Lin, ...
CIKM
2008
Springer
13 years 7 months ago
Cross-lingual query classification: a preliminary study
The non-English Web is growing at breakneck speed, but available language processing tools are mostly English based. Taxonomies are a case in point: while there are plenty of comm...
Xuerui Wang, Andrei Z. Broder, Evgeniy Gabrilovich...
CIKM
2008
Springer
13 years 7 months ago
Experiments with English-Persian text retrieval
As the number of non-English documents is increasing dramatically on the web nowadays, the study and design of information retrieval systems for these languages is very important....
Abolfazl AleAhmad, Hadi Amiri, Masoud Rahgozar, Fa...
CLEF
2011
Springer
12 years 5 months ago
A Language-Independent Approach to Identify the Named Entities in Under-Resourced Languages and Clustering Multilingual Document
Abstract. This paper presents a language-independent Multilingual Document Clustering (MDC) approach on comparable corpora. Named entites (NEs) such as persons, locations, organiza...
N. Kiran Kumar, G. S. K. Santosh, Vasudeva Varma