Sciweavers

70 search results - page 5 / 14
» Latent Dirichlet Allocation for Automatic Document Categoriz...
EMNLP
2008
15 years 3 months ago
HTM: A Topic Model for Hypertexts
Previously topic models such as PLSI (Probabilistic Latent Semantic Indexing) and LDA (Latent Dirichlet Allocation) were developed for modeling the contents of plain texts. Recent...
Congkai Sun, Bin Gao, Zhenfu Cao, Hang Li
EMNLP
2010
14 years 12 months ago
Evaluating Models of Latent Document Semantics in the Presence of OCR Errors
Models of latent document semantics such as the mixture of multinomials model and Latent Dirichlet Allocation have received substantial attention for their ability to discover top...
Daniel David Walker, William B. Lund, Eric K. Ring...
LWA
2004
15 years 3 months ago
Dirichlet Enhanced Latent Semantic Analysis
This paper describes nonparametric Bayesian treatments for analyzing records containing occurrences of items. The introduced model retains the strength of previous approaches that...
Kai Yu, Shipeng Yu, Volker Tresp
COLING
2010
14 years 8 months ago
Finding the Storyteller: Automatic Spoiler Tagging using Linguistic Cues
Given a movie comment, does it contain a spoiler? A spoiler is a comment that, when disclosed, would ruin a surprise or reveal an important plot detail. We study automatic methods...
Sheng Guo, Naren Ramakrishnan
ICML
2010
IEEE
15 years 2 months ago
Spherical Topic Models
We introduce the Spherical Admixture Model (SAM), a Bayesian topic model for arbitrary ℓ2-normalized data. SAM maintains the same hierarchical structure as Latent Dirichlet Allocat...
Joseph Reisinger, Austin Waters, Bryan Silverthorn...