Sciweavers

50 search results - page 3 / 10
» A Stemming Algorithm for the Farsi Language
Sort
View
ACSC
2005
IEEE
15 years 3 months ago
Stemming Indonesian
Stemming words to (usually) remove suffixes has applications in text search, machine translation, document summarisation, and text classification. For example, English stemming r...
Jelita Asian, Hugh E. Williams, Seyed M. M. Tahagh...
WWW
2001
ACM
15 years 10 months ago
Indexing the Indonesian Web: Language Identification and Miscellaneous Issues
Information retrieval tools and search engines have mainly been leveraging research results and technologies developed for the English language. In this paper we report the issues...
Stéphane Bressan, Vinsensius Berlian Vega S...
CIKM
2008
Springer
14 years 11 months ago
Experiments with English-Persian text retrieval
As the number of non-English documents is increasing dramatically on the web nowadays, the study and design of information retrieval systems for these languages is very important....
Abolfazl AleAhmad, Hadi Amiri, Masoud Rahgozar, Fa...
WWW
2004
ACM
15 years 10 months ago
Experiments with persian text compression for web
The increasing importance of Unicode for text encoding implies a possible doubling of data storage space and data transmission time, with a corresponding need for data compression...
Farhad Oroumchian, Ehsan Darrudi, Fattane Taghiyar...
SIGIR
2009
ACM
15 years 4 months ago
Addressing morphological variation in alphabetic languages
The selection of indexing terms for representing documents is a key decision that limits how effective subsequent retrieval can be. Often stemming algorithms are used to normaliz...
Paul McNamee, Charles K. Nicholas, James Mayfield