Sciweavers

TKDE
2010
224views more  TKDE 2010»
13 years 2 months ago
Probabilistic Topic Models for Learning Terminological Ontologies
—Probabilistic topic models were originally developed and utilised for document modeling and topic extraction in Information Retrieval. In this paper we describe a new approach f...
Wang Wei, Payam M. Barnaghi, Andrzej Bargiela
PKDD
2010
Springer
154views Data Mining» more  PKDD 2010»
13 years 2 months ago
Topic Models Conditioned on Relations
Latent Dirichlet allocation is a fully generative statistical language model that has been proven to be successful in capturing both the content and the topics of a corpus of docum...
Mirwaes Wahabzada, Zhao Xu, Kristian Kersting
SIGIR
2002
ACM
13 years 4 months ago
Cross-lingual relevance models
We propose a formal model of Cross-Language Information Retrieval that does not rely on either query translation or document translation. Our approach leverages recent advances in...
Victor Lavrenko, Martin Choquette, W. Bruce Croft
ICML
2010
IEEE
13 years 5 months ago
Conditional Topic Random Fields
Generative topic models such as LDA are limited by their inability to utilize nontrivial input features to enhance their performance, and many topic models assume that topic assig...
Jun Zhu, Eric P. Xing
DEXAW
2010
IEEE
202views Database» more  DEXAW 2010»
13 years 5 months ago
Identifying Sentence-Level Semantic Content Units with Topic Models
Abstract--Statistical approaches to document content modeling typically focus either on broad topics or on discourselevel subtopics of a text. We present an analysis of the perform...
Leonhard Hennig, Thomas Strecker, Sascha Narr, Ern...
UAI
2008
13 years 6 months ago
Latent Topic Models for Hypertext
Latent topic models have been successfully applied as an unsupervised topic discovery technique in large document collections. With the proliferation of hypertext document collect...
Amit Gruber, Michal Rosen-Zvi, Yair Weiss
UAI
2008
13 years 6 months ago
Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression
Although fully generative models have been successfully used to model the contents of text documents, they are often awkward to apply to combinations of text data and document met...
David M. Mimno, Andrew McCallum
RIAO
2007
13 years 6 months ago
Investigating Retrieval Performance with Manually-Built Topic Models
Modeling text with topics is currently a popular research area in both Machine Learning and Information Retrieval (IR). Most of this research has focused on automatic methods thou...
Xing Wei, W. Bruce Croft
NIPS
2008
13 years 6 months ago
Syntactic Topic Models
We develop the syntactic topic model (STM), a nonparametric Bayesian model of parsed documents. The STM generates words that are both thematically and syntactically constrained, w...
Jordan L. Boyd-Graber, David M. Blei
EMNLP
2008
13 years 6 months ago
HTM: A Topic Model for Hypertexts
Previously topic models such as PLSI (Probabilistic Latent Semantic Indexing) and LDA (Latent Dirichlet Allocation) were developed for modeling the contents of plain texts. Recent...
Congkai Sun, Bin Gao, Zhenfu Cao, Hang Li