Sciweavers

268 search results - page 43 / 54
» Improving IBM Word Alignment Model 1
ANLP
1994
15 years 1 month ago
Exploiting Sophisticated Representations for Document Retrieval
The use of NLP techniques for document classification has not produced significant improvements in performance within the standard term weighting statistical assignment paradigm (...
Steven Finch
WWW
2007
ACM
16 years 11 days ago
A new suffix tree similarity measure for document clustering
In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree ...
Hung Chim, Xiaotie Deng
EMNLP
2008
15 years 1 month ago
Bayesian Unsupervised Topic Segmentation
This paper describes a novel Bayesian approach to unsupervised topic segmentation. Unsupervised systems for this task are driven by lexical cohesion: the tendency of well-formed se...
Jacob Eisenstein, Regina Barzilay
QOSA
2010
Springer
15 years 3 months ago
Validating Model-Driven Performance Predictions on Random Software Systems
Software performance prediction methods are typically validated by taking an appropriate software system, performing both performance predictions and performance measurem...
Vlastimil Babka, Petr Tuma, Lubomír Bulej
MA
2010
Springer
14 years 10 months ago
On Monte Carlo methods for Bayesian multivariate regression models with heavy-tailed errors
We consider Bayesian analysis of data from multivariate linear regression models whose errors have a distribution that is a scale mixture of normals. Such models are used to analy...
Vivekananda Roy, James P. Hobert