Sciweavers

109
Voted
ICML
2010
IEEE
15 years 1 months ago
Inverse Optimal Control with Linearly-Solvable MDPs
We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...
Dvijotham Krishnamurthy, Emanuel Todorov
117
Voted
ICML
2010
IEEE
15 years 1 months ago
Bayesian Nonparametric Matrix Factorization for Recorded Music
Recent research in machine learning has focused on breaking audio spectrograms into separate sources of sound using latent variable decompositions. These methods require that the ...
Matthew D. Hoffman, David M. Blei, Perry R. Cook
76
Voted
ICML
2010
IEEE
15 years 1 months ago
Exploiting Data-Independence for Fast Belief-Propagation
Maximum a posteriori (MAP) inference in graphical models requires that we maximize the sum of two terms: a data-dependent term, encoding the conditional likelihood of a certain la...
Julian John McAuley, Tibério S. Caetano
103
Voted
ICML
2010
IEEE
15 years 1 months ago
Simple and Efficient Multiple Kernel Learning by Group Lasso
We consider the problem of how to improve the efficiency of Multiple Kernel Learning (MKL). In literature, MKL is often solved by an alternating approach: (1) the minimization of ...
Zenglin Xu, Rong Jin, Haiqin Yang, Irwin King, Mic...
72
Voted
ICML
2010
IEEE
15 years 1 months ago
A fast natural Newton method
Nicolas Le Roux, Andrew Fitzgibbon
132
Voted
ICML
2010
IEEE
15 years 1 months ago
Continuous-Time Belief Propagation
Many temporal processes can be naturally modeled as a stochastic system that evolves continuously over time. The representation language of continuous-time Bayesian networks allow...
Tal El-Hay, Ido Cohn, Nir Friedman, Raz Kupferman
70
Voted
ICML
2010
IEEE
15 years 1 months ago
Learning Hierarchical Riffle Independent Groupings from Rankings
Jonathan Huang, Carlos Guestrin
ICML
2010
IEEE
15 years 1 months ago
Forgetting Counts: Constant Memory Inference for a Dependent Hierarchical Pitman-Yor Process
We propose a novel dependent hierarchical Pitman-Yor process model for discrete data. An incremental Monte Carlo inference procedure for this model is developed. We show that infe...
Nicholas Bartlett, David Pfau, Frank Wood
99
Voted
ICML
2010
IEEE
15 years 1 months ago
Convergence of Least Squares Temporal Difference Methods Under General Conditions
We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...
Huizhen Yu
70
Voted
ICML
2010
IEEE
15 years 1 months ago
Budgeted Distribution Learning of Belief Net Parameters
Liuyang Li, Barnabás Póczos, Csaba S...