Sciweavers

JMLR 2010
Variational Methods for Reinforcement Learning
We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...
Thomas Furmston, David Barber