Sciweavers

319 search results - page 56 / 64
» Distributional Features for Text Categorization
MMM
2007
Springer
15 years 6 months ago
Discovering User Information Goals with Semantic Website Media Modeling
In this work we present an approach to capture the total semantics in multimedia-multimodal web pages. Our research improves upon the state-of-the-art with two key features: (1) cap...
Bibek Dev Bhattarai, Mike Wong, Rahul Singh
SIGIR
2004
ACM
15 years 5 months ago
GaP: a factor model for discrete data
We present a probabilistic model for a document corpus that combines many of the desirable features of previous models. The model is called “GaP” for Gamma-Poisson, the distri...
John F. Canny
EMNLP
2008
15 years 1 months ago
Bayesian Unsupervised Topic Segmentation
This paper describes a novel Bayesian approach to unsupervised topic segmentation. Unsupervised systems for this task are driven by lexical cohesion: the tendency of well-formed se...
Jacob Eisenstein, Regina Barzilay
LREC
2008
15 years 1 months ago
Yet another Platform for Extracting Knowledge from Corpora
The research field of "extracting knowledge bases from text collections" seems to be mature: its target and its working hypotheses are clear. In this paper we propose a ...
Francesca Fallucchi, Fabio Massimo Zanzotto
ACL
2006
15 years 1 months ago
Modelling Lexical Redundancy for Machine Translation
Certain distinctions made in the lexicon of one language may be redundant when translating into another language. We quantify redundancy among source types by the similarity of th...
David Talbot, Miles Osborne