Sciweavers

77 search results - page 3 / 16
» Improved Modeling of Out-Of-Vocabulary Words Using Morpholog...
Sort
View
SIGIR
2002
ACM
13 years 4 months ago
Improving stemming for Arabic information retrieval: light stemming and co-occurrence analysis
Arabic, a highly inflected language, requires good stemming for effective information retrieval, yet no standard approach to stemming has emerged. We developed several light stemm...
Leah S. Larkey, Lisa Ballesteros, Margaret E. Conn...
COLING
2002
13 years 5 months ago
Morphological Analysis of the Spontaneous Speech Corpus
This paper describes a project tagging a spontaneous speech corpus with morphological information such as word segmentation and parts-ofspeech. We use a morphological analysis sys...
Kiyotaka Uchimoto, Chikashi Nobata, Atsushi Yamada...
CICLING
2007
Springer
13 years 11 months ago
Morphological Disambiguation of Turkish Text with Perceptron Algorithm
Abstract. This paper describes the application of the perceptron algorithm to the morphological disambiguation of Turkish text. Turkish has a productive derivational morphology. Du...
Hasim Sak, Tunga Güngör, Murat Saraclar
ACL
2008
13 years 6 months ago
Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation
In statistical language modeling, one technique to reduce the problematic effects of data sparsity is to partition the vocabulary into equivalence classes. In this paper we invest...
Jakob Uszkoreit, Thorsten Brants
ACL
2007
13 years 6 months ago
Automatic Part-of-Speech Tagging for Bengali: An Approach for Morphologically Rich Languages in a Poor Resource Scenario
This paper describes our work on building Part-of-Speech (POS) tagger for Bengali. We have use Hidden Markov Model (HMM) and Maximum Entropy (ME) based stochastic taggers. Bengali...
Sandipan Dandapat, Sudeshna Sarkar, Anupam Basu