Sciweavers

374 search results - page 8 / 75
» Modeling Chinese Documents with Topical Word-Character Model...
Sort
View
96
Voted
EWMF
2005
Springer
15 years 5 months ago
Semi-automatic Construction of Topic Ontologies
In this paper, we review two techniques for topic discovery in collections of text documents (Latent Semantic Indexing and K-Means clustering) and present how we integrated them in...
Blaz Fortuna, Dunja Mladenic, Marko Grobelnik
116
Voted
ACML
2009
Springer
15 years 6 months ago
Estimating Likelihoods for Topic Models
Abstract. Topic models are a discrete analogue to principle component analysis and independent component analysis that model topic at the word level within a document. They have ma...
Wray L. Buntine
83
Voted
ICML
2009
IEEE
15 years 6 months ago
Topic-link LDA: joint models of topic and author community
Given a large-scale linked document collection, such as a collection of blog posts or a research literature archive, there are two fundamental problems that have generated a lot o...
Yan Liu, Alexandru Niculescu-Mizil, Wojciech Gryc
97
Voted
ICDM
2007
IEEE
184views Data Mining» more  ICDM 2007»
15 years 6 months ago
Bayesian Folding-In with Dirichlet Kernels for PLSI
Probabilistic latent semantic indexing (PLSI) represents documents of a collection as mixture proportions of latent topics, which are learned from the collection by an expectation...
Alexander Hinneburg, Hans-Henning Gabriel, Andr&eg...
CSL
2004
Springer
14 years 11 months ago
Contemporaneous text as side-information in statistical language modeling
We propose new methods to exploit contemporaneous text, such as on-line news articles, to improve language models for automatic speech recognition and other natural language proce...
Sanjeev Khudanpur, Woosung Kim