Sciweavers

ICDM
2008
IEEE

Collective Latent Dirichlet Allocation

13 years 10 months ago
Collective Latent Dirichlet Allocation
In this paper, we propose a new variant of Latent Dirichlet Allocation(LDA): Collective LDA (C-LDA), for multiple corpora modeling. C-LDA combines multiple corpora during learning such that it can transfer knowledge from one corpus to another; meanwhile it keeps a discriminative node which represents the corpus ID to constrain the learned topics in each corpus. Compared with LDA locally applied to the target corpus, C-LDA results in refined topicword distribution, while compared with applying LDA globally and straightforwardly to the combined corpus, C-LDA keeps each topic only for one corpus. We demonstrate that C-LDA has improved performance with these advantages by experiments on several benchmark document data sets .
Zhiyong Shen, Jun Sun, Yi-Dong Shen
Added 30 May 2010
Updated 30 May 2010
Type Conference
Year 2008
Where ICDM
Authors Zhiyong Shen, Jun Sun, Yi-Dong Shen
Comments (0)