Sciweavers

664 search results - page 51 / 133
» Combining Reinforcement Learning with a Local Control Algori...
Sort
View
100
Voted
CORR
2010
Springer
204views Education» more  CORR 2010»
14 years 8 months ago
Predictive State Temporal Difference Learning
We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...
Byron Boots, Geoffrey J. Gordon
75
Voted
IMR
2003
Springer
15 years 3 months ago
A Local Cell Quality Metric and Variational Grid Smoothing Algorithm
A local cell quality metric is introduced and used to construct a variational functional for a grid smoothing algorithm. A maximum principle is proved and the properties of the loc...
Larisa Branets, Graham F. Carey
IROS
2006
IEEE
113views Robotics» more  IROS 2006»
15 years 4 months ago
Policy Gradient Methods for Robotics
— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...
Jan Peters, Stefan Schaal
104
Voted
ECML
2005
Springer
15 years 3 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
84
Voted
ICML
2010
IEEE
14 years 11 months ago
Inverse Optimal Control with Linearly-Solvable MDPs
We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...
Dvijotham Krishnamurthy, Emanuel Todorov