Sciweavers

660 search results - page 7 / 132
» words 2003
IJCAI
2003
14 years 11 months ago
Improving Word Sense Disambiguation in Lexical Chaining
Previous algorithms to compute lexical chains suffer either from a lack of accuracy in word sense disambiguation (WSD) or from computational inefficiency. In this paper, we presen...
Michel Galley, Kathleen McKeown
CLIN
2003
14 years 11 months ago
Methods for the Extraction of Hungarian Multi-Word Lexemes
This paper describes an experiment on extracting Hungarian multi-word lexemes from a corpus, using statistical methods. Corpus preparation—the addition of POS tags and stems—w...
Balázs Kis, Begoña Villada, Gosse Bo...
ISICT
2003
14 years 11 months ago
Spam filters: Bayes vs. chi-squared; letters vs. words
We compare two statistical methods for identifying spam or junk electronic mail. Spam filters are classifiers which determine whether an email is junk or not. The proliferation ...
Cormac O'Brien, Carl Vogel
ACL
2003
14 years 11 months ago
Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency
We present a language-independent and unsupervised algorithm for the segmentation of words into morphs. The algorithm is based on a new generative probabilistic model, which makes...
Mathias Creutz
NAACL
2003
14 years 11 months ago
Word Alignment with Cohesion Constraint
We present a syntax-based constraint for word alignment, known as the cohesion constraint. It requires disjoint English phrases to be mapped to non-overlapping intervals in the Fr...
Dekang Lin, Colin Cherry