Uncertainty handling CMA-ES for reinforcement learning

13 years 2 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an adaptive uncertainty handling mechanism. Because uncertainty is a typical property of RL problems this new algorithm, termed UH-CMA-ES, is promising for RL. The UH-CMA-ES dynamically adjusts the number of episodes considered in each evaluation of a policy. It controls the signal to noise ratio such that it is just high enough for a sufficiently good ranking of candidate policies, which in turn allows the evolutionary learning to find better solutions. This significantly increases the learning speed as well as the robustness without impairing the quality of the final solutions. We evaluate the UH-CMA-ES on fully and partially observable Markov decision processes with random start states and noisy observations. A canonical natural policy gradient method and random search serve as a baseline for comparison. Categor...

Verena Heidrich-Meisner, Christian Igel

Real-time Traffic

Adaptation Evolution Strategy | Covariance Matrix Adaptation | GECCO 2009 | Matrix Adaptation Evolution | Optimization |

claim paper

Added	17 Feb 2011
Updated	17 Feb 2011
Type	Journal
Year	2009
Where	GECCO
Authors	Verena Heidrich-Meisner, Christian Igel

Sciweavers

Uncertainty handling CMA-ES for reinforcement learning

Adaptation Evolution Strategy | Covariance Matrix Adaptation | GECCO 2009 | Matrix Adaptation Evolution | Optimization |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers