Sciweavers

34 search results - page 2 / 7
» A Word Stemming Algorithm for the Spanish Language
Sort
View
ACSC
2005
IEEE
13 years 11 months ago
Stemming Indonesian
Stemming words to (usually) remove suffixes has applications in text search, machine translation, document summarisation, and text classification. For example, English stemming r...
Jelita Asian, Hugh E. Williams, Seyed M. M. Tahagh...
SIGIR
2003
ACM
13 years 10 months ago
Single n-gram stemming
Stemming can improve retrieval accuracy, but stemmers are language-specific. Character n-gram tokenization achieves many of the benefits of stemming in a language independent way,...
James Mayfield, Paul McNamee
SPIRE
1998
Springer
13 years 9 months ago
An Experiment Stemming Non-Traditional Text
Stemming is a technique which aims to extract common suffixes of words. Thus, words which are literally differhave a common stem, may be abstracted by their common stem. The under...
Mario A. Nascimento, Adriano C. R. da Cunha
WWW
2001
ACM
14 years 6 months ago
Indexing the Indonesian Web: Language Identification and Miscellaneous Issues
Information retrieval tools and search engines have mainly been leveraging research results and technologies developed for the English language. In this paper we report the issues...
Stéphane Bressan, Vinsensius Berlian Vega S...
LREC
2010
182views Education» more  LREC 2010»
13 years 6 months ago
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus
This article presents a new freely available trilingual corpus (Catalan, Spanish, English) that contains large portions of the Wikipedia and has been automatically enriched with l...
Samuel Reese, Gemma Boleda, Montse Cuadros, Llu&ia...