Sciweavers

2929 search results - page 471 / 586
» Models of English Text
Sort
View
ACL
2003
15 years 6 months ago
Minimum Error Rate Training in Statistical Machine Translation
Often, the training procedure for statistical machine translation models is based on maximum likelihood or related criteria. A general problem of this approach is that there is on...
Franz Josef Och
NIPS
2004
15 years 6 months ago
An Application of Boosting to Graph Classification
This paper presents an application of Boosting for classifying labeled graphs, general structures for modeling a number of real-world data, such as chemical compounds, natural lan...
Taku Kudo, Eisaku Maeda, Yuji Matsumoto
119
Voted
TREC
2004
15 years 6 months ago
Columbia University in the Novelty Track at TREC 2004
Our system for the Novelty Track at TREC 2004 looks beyond sentence boundaries as well as within sentences to identify novel, nonduplicative passages. It tries to identify text sp...
Barry Schiffman, Kathleen McKeown
129
Voted
SDM
2003
SIAM
125views Data Mining» more  SDM 2003»
15 years 6 months ago
Scalable, Balanced Model-based Clustering
This paper presents a general framework for adapting any generative (model-based) clustering algorithm to provide balanced solutions, i.e., clusters of comparable sizes. Partition...
Shi Zhong, Joydeep Ghosh
149
Voted
AAAI
1998
15 years 6 months ago
Knowledge Lean Word-Sense Disambiguation
We present a corpus{based approach to word{sense disambiguation that only requires information that can be automatically extracted from untagged text. We use unsupervised techniqu...
Ted Pedersen, Rebecca F. Bruce