Sciweavers

ICML
2006
IEEE
14 years 5 months ago
Pachinko allocation: DAG-structured mixture models of topic correlations
Latent Dirichlet allocation (LDA) and other related topic models are increasingly popular tools for summarization and manifold discovery in discrete data. However, LDA does not ca...
Wei Li, Andrew McCallum
ICML
2009
IEEE
14 years 5 months ago
Incorporating domain knowledge into topic modeling via Dirichlet Forest priors
Users of topic modeling methods often have knowledge about the composition of words that should have high or low probability in various topics. We incorporate such domain knowledg...
David Andrzejewski, Xiaojin Zhu, Mark Craven
ICDE
2009
IEEE
130views Database» more  ICDE 2009»
14 years 6 months ago
A Latent Topic Model for Complete Entity Resolution
In bibliographies like DBLP and Citeseer, there are three kinds of entity-name problems that need to be solved. First, multiple entities share one name, which is called the name sh...
Liangcai Shu, Bo Long, Weiyi Meng