An analytic solution to discrete Bayesian reinforcement learning

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms fall short of achieving this goal because the amount of exploration required is often too costly and/or too time-consuming for online learning. As a result, RL is mostly used for offline learning in simulated environments. We propose a new algorithm, called BEETLE, for effective online learning that is computationally efficient while minimizing the amount of exploration. We take a Bayesian model-based approach, framing RL as a partially observable Markov decision process. Our two main contributions are the analytical derivation that the optimal value function is the upper envelope of a set of multivariate polynomials, and an efficient point-based value iteration algorithm that exploits this simple parameterization.
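To make the "upper envelope of a set of multivariate polynomials" concrete, here is a minimal Python sketch. It is not the BEETLE algorithm. It assumes the belief over unknown transition probabilities is a Dirichlet distribution (the abstract does not state the prior), and all function names are hypothetical. Under that assumption, the expected value of a monomial has a closed form, so the value at a belief can be computed as the maximum over polynomials of their expected values.

```python
import math

def dirichlet_update(counts, observed_index):
    """Return new Dirichlet counts after observing one outcome (assumed prior family)."""
    updated = list(counts)
    updated[observed_index] += 1
    return updated

def expected_monomial(counts, exponents):
    """Closed-form E[prod_i theta_i**k_i] under Dirichlet(counts)."""
    a0 = sum(counts)
    k0 = sum(exponents)
    # Work in log space with log-gamma for numerical stability.
    log_val = math.lgamma(a0) - math.lgamma(a0 + k0)
    for a, k in zip(counts, exponents):
        log_val += math.lgamma(a + k) - math.lgamma(a)
    return math.exp(log_val)

def expected_polynomial(counts, poly):
    """poly is a list of (coefficient, exponents) terms."""
    return sum(c * expected_monomial(counts, e) for c, e in poly)

def upper_envelope(counts, polys):
    """Value at a belief: the largest expected value among the polynomials."""
    return max(expected_polynomial(counts, p) for p in polys)

# Two hypothetical polynomials over (theta_1, theta_2):
# p1(theta) = theta_1, p2(theta) = 0.4 (a constant).
polys = [
    [(1.0, (1, 0))],
    [(0.4, (0, 0))],
]

prior = [1, 1]                       # uniform belief over two outcomes
value_before = upper_envelope(prior, polys)

# Observe the second outcome twice, which shifts belief away from theta_1.
posterior = dirichlet_update(dirichlet_update(prior, 1), 1)
value_after = upper_envelope(posterior, polys)
```

In this toy setup the maximizing polynomial changes as the belief shifts, which is the sense in which the value function is an envelope rather than a single polynomial.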

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevin Regan

ICML 2006 | Machine Learning | Online Learning | RL Algorithms | Value Iteration Algorithm

» Model-Based Bayesian Reinforcement Learning in Large Structured Domains

» A Bayesian Framework for Reinforcement Learning

» Bayesian reinforcement learning in continuous POMDPs with Gaussian processes

» Bayesian Inverse Reinforcement Learning

» Smarter Sampling in Model-Based Bayesian Reinforcement Learning

» Sequential decision making with untrustworthy service providers

» Sequential decision making in repeated coalition formation under uncertainty

» Bayes-Adaptive POMDPs

Post Info

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2006
Where	ICML
Authors	Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevin Regan
