Sciweavers

579 search results - page 11 / 116
» Modeling word burstiness using the Dirichlet distribution
ICDM
2007
IEEE
184 views, Data Mining
15 years 6 months ago
Bayesian Folding-In with Dirichlet Kernels for PLSI
Probabilistic latent semantic indexing (PLSI) represents documents of a collection as mixture proportions of latent topics, which are learned from the collection by an expectation...
Alexander Hinneburg, Hans-Henning Gabriel, Andr&eg...
SAC
2009
ACM
15 years 6 months ago
Applying latent dirichlet allocation to group discovery in large graphs
This paper introduces LDA-G, a scalable Bayesian approach to finding latent group structures in large real-world graph data. Existing Bayesian approaches for group discovery (suc...
Keith Henderson, Tina Eliassi-Rad
AAAI
2010
15 years 1 month ago
Bayesian Matrix Factorization with Side Information and Dirichlet Process Mixtures
Matrix factorization is a fundamental technique in machine learning that is applicable to collaborative filtering, information retrieval and many other areas. In collaborative fil...
Ian Porteous, Arthur Asuncion, Max Welling
COLING
2010
14 years 6 months ago
Finding the Storyteller: Automatic Spoiler Tagging using Linguistic Cues
Given a movie comment, does it contain a spoiler? A spoiler is a comment that, when disclosed, would ruin a surprise or reveal an important plot detail. We study automatic methods...
Sheng Guo, Naren Ramakrishnan
NIPS
2004
15 years 1 month ago
A Probabilistic Model for Online Document Clustering with Application to Novelty Detection
In this paper we propose a probabilistic model for online document clustering. We use a non-parametric Dirichlet process prior to model the growing number of clusters, and use a pri...
Jian Zhang 0003, Zoubin Ghahramani, Yiming Yang