Sciweavers

MSR
2011
ACM
14 years 2 months ago
Modeling the evolution of topics in source code histories
Studying the evolution of topics (collections of co-occurring words) in a software project is an emerging technique to automatically shed light on how the project is changing over...
Stephen W. Thomas, Bram Adams, Ahmed E. Hassan, Do...
IRAL
2000
ACM
15 years 4 months ago
Content-based language models for spoken document retrieval
Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multimedia collections in the near future. This paper...
Hsin-Min Wang, Berlin Chen
PKDD
2010
Springer
14 years 10 months ago
Topic Models Conditioned on Relations
Latent Dirichlet allocation is a fully generative statistical language model that has been proven to be successful in capturing both the content and the topics of a corpus of docum...
Mirwaes Wahabzada, Zhao Xu, Kristian Kersting
ICML
2006
IEEE
16 years 13 days ago
Dynamic topic models
A family of probabilistic time series models is developed to analyze the time evolution of topics in large document collections. The approach is to use state space models on the n...
David M. Blei, John D. Lafferty
ICDM
2007
IEEE
15 years 6 months ago
Sparse Word Graphs: A Scalable Algorithm for Capturing Word Correlations in Topic Models
Statistical topic models such as the Latent Dirichlet Allocation (LDA) have emerged as an attractive framework to model, visualize and summarize large document collections in a co...
Ramesh Nallapati, Amr Ahmed, William W. Cohen, Eri...