
ICML
2008
IEEE


Memory bounded inference in topic models



What types of algorithms and statistical techniques support learning from very large datasets over long stretches of time? We address this question through a memory-bounded version of a variational EM algorithm that approximates inference in a topic model. The algorithm alternates between two phases, "model building" and "model compression", so that it always satisfies a given memory constraint. The model building phase uses Bayesian model selection to expand its internal representation (the number of topics) as more data arrives. Compression is achieved by merging data items into clumps and caching only their sufficient statistics. Empirically, the resulting algorithm is able to handle datasets that are orders of magnitude larger than the standard batch version can.
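The compression idea in the abstract (merge data items into clumps and keep only their sufficient statistics) can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the `Clump` class, the `compress` function, the use of summed word counts as the cached statistic, and the smallest-first merge rule are all assumptions made for illustration. The abstract does not say how the items to merge are chosen.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Clump:
    """Hypothetical container: a group of data items kept only as summaries."""
    n_items: int          # how many original items were merged into this clump
    word_counts: Counter  # summed word counts (assumed sufficient statistic)

    @classmethod
    def from_item(cls, tokens):
        # A single document starts as its own clump.
        return cls(1, Counter(tokens))

    def merge(self, other):
        # Merging only adds summaries; the original items are not retained.
        return Clump(self.n_items + other.n_items,
                     self.word_counts + other.word_counts)


def compress(clumps, max_clumps):
    """Merge clumps until at most max_clumps remain.

    Placeholder rule (an assumption, not from the paper): repeatedly merge
    the two clumps that currently hold the fewest items.
    """
    if max_clumps < 1:
        raise ValueError("max_clumps must be at least 1")
    clumps = list(clumps)
    while len(clumps) > max_clumps:
        clumps.sort(key=lambda c: c.n_items)
        merged = clumps[0].merge(clumps[1])
        clumps = [merged] + clumps[2:]
    return clumps


docs = [["a", "b"], ["a"], ["c", "c"], ["b"]]
compressed = compress([Clump.from_item(d) for d in docs], max_clumps=2)
```

In this sketch the memory budget is expressed as a maximum number of clumps. The total item count and total word counts are preserved across merges, which is the property that lets later updates work from the cached summaries rather than the raw items.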

Ryan Gomes, Max Welling, Pietro Perona


Bayesian Model Selection | ICML 2008 | Machine Learning | Model Building Phase | Variational EM Algorithm


Related Content

» Efficient methods for topic model inference on streaming document collections

» Supervised topic model for automatic image annotation

» Parametric inference of memory requirements for garbage collected languages

» Randomized computations on large data sets tight lower bounds

» Relative Performance Guarantees for Approximate Inference in Latent Dirichlet Allocation

» Collapsed Variational Inference for HDP

» Precise Analysis of Memory Consumption using Program Logics

» Belief ascription under bounded resources

» Incremental learning with temporary memory

Post Info

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2008
Where	ICML
Authors	Ryan Gomes, Max Welling, Pietro Perona
