Sciweavers

70 search results - page 9 / 14
» Latent Dirichlet Allocation for Automatic Document Categoriz...
Sort
View
86
Voted
CIKM
2011
Springer
13 years 9 months ago
Towards noise-resilient document modeling
We introduce a generative probabilistic document model based on latent Dirichlet allocation (LDA), to deal with textual errors in the document collection. Our model is inspired by...
Tao Yang, Dongwon Lee
ACL
2012
13 years 1 days ago
Authorship Attribution with Author-aware Topic Models
Authorship attribution deals with identifying the authors of anonymous texts. Building on our earlier finding that the Latent Dirichlet Allocation (LDA) topic model can be used t...
Yanir Seroussi, Fabian Bohnert, Ingrid Zukerman
SIGIR
2003
ACM
15 years 2 months ago
Modeling annotated data
We consider the problem of modeling annotated data—data with multiple types where the instance of one type (such as a caption) serves as a description of the other type (such as...
David M. Blei, Michael I. Jordan
EMNLP
2007
14 years 11 months ago
A Topic Model for Word Sense Disambiguation
We develop latent Dirichlet allocation with WORDNET (LDAWN), an unsupervised probabilistic topic model that includes word sense as a hidden variable. We develop a probabilistic po...
Jordan L. Boyd-Graber, David M. Blei, Xiaojin Zhu
CIKM
2008
Springer
14 years 11 months ago
Modeling hidden topics on document manifold
Topic modeling has been a key problem for document analysis. One of the canonical approaches for topic modeling is Probabilistic Latent Semantic Indexing, which maximizes the join...
Deng Cai, Qiaozhu Mei, Jiawei Han, Chengxiang Zhai