Sciweavers

48 search results - page 2 / 10
» Gaussian LDA for Topic Models with Word Embeddings
Sort
View
ICDM
2007
IEEE
173views Data Mining» more  ICDM 2007»
15 years 4 months ago
Sparse Word Graphs: A Scalable Algorithm for Capturing Word Correlations in Topic Models
Statistical topic models such as the Latent Dirichlet Allocation (LDA) have emerged as an attractive framework to model, visualize and summarize large document collections in a co...
Ramesh Nallapati, Amr Ahmed, William W. Cohen, Eri...
WIAMIS
2009
IEEE
15 years 5 months ago
Automatic topic detection strategy for information retrieval in spoken document
This paper suggests an alternative solution for the task of spoken document retrieval (SDR). The proposed system runs retrieval on multi-level transcriptions (word and phone) prod...
Shan Jin, Hemant Misra, Thomas Sikora, Joemon M. J...
ECIR
2010
Springer
14 years 12 months ago
Extracting Multilingual Topics from Unaligned Comparable Corpora
Topic models have been studied extensively in the context of monolingual corpora. Though there are some attempts to mine topical structure from cross-lingual corpora, they require ...
Jagadeesh Jagarlamudi, Hal Daumé III
KDD
2010
ACM
435views Data Mining» more  KDD 2010»
15 years 2 months ago
Topic models with power-law using Pitman-Yor process
One of the important approaches for Knowledge discovery and Data mining is to estimate unobserved variables because latent variables can indicate hidden and specific properties o...
Issei Sato, Hiroshi Nakagawa
AIRS
2009
Springer
15 years 5 months ago
A Latent Dirichlet Framework for Relevance Modeling
Relevance-based language models operate by estimating the probabilities of observing words in documents relevant (or pseudo relevant) to a topic. However, these models assume that ...
Viet Ha-Thuc, Padmini Srinivasan