Sciweavers

IR
2007
13 years 4 months ago
Searching strategies for the Bulgarian language
This paper reports on the underlying IR problems encountered when indexing and searching with the Bulgarian language. For this language we propose a general light stemmer and demon...
Jacques Savoy
CLIN
2001
13 years 6 months ago
Accurate Stemming of Dutch for Text Classification
This paper investigates the use of stemming for classification of Dutch (email) texts. We introduce a stemmer, which combines dictionary lookup (implemented efficiently as a finit...
Tanja Gaustad, Gosse Bouma
ACL
2003
13 years 6 months ago
Unsupervised Learning of Arabic Stemming Using a Parallel Corpus
This paper presents an unsupervised learning approach to building a non-English (Arabic) stemmer. The stemming model is based on statistical machine translation and it uses an Eng...
Monica Rogati, J. Scott McCarley, Yiming Yang
ITCC
2005
IEEE
13 years 10 months ago
A Stemming Algorithm for the Farsi Language
In this paper, we report on the design and implementation of a stemmer for the Farsi language. The results of our evaluation on a small Farsi document collection shows a signific...
Kazem Taghva, Russell Beckley, Mohammad Sadeh