Sciweavers

HEURISTICS
2008

Accelerating autonomous learning by using heuristic selection of actions

This paper investigates how to improve action selection for online policy learning in robotic scenarios using reinforcement learning (RL) algorithms. Since finding control policies with any RL algorithm can be very time-consuming, we propose combining RL algorithms with heuristic functions that select promising actions during the learning process. To this end, we investigate the use of heuristics for increasing the rate of convergence of RL algorithms and contribute a new learning algorithm, Heuristically Accelerated Q-Learning (HAQL), which incorporates heuristics for action selection into the Q-Learning algorithm. Experimental results on robot navigation show that even very simple heuristic functions significantly improve the rate of learning.
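The idea of biasing action selection with a heuristic can be illustrated with a minimal sketch. The abstract does not give the selection rule, so the additive form Q(s, a) + xi * H(s, a), the weight `xi`, the epsilon-greedy exploration, and the toy corridor task below are all assumptions made for illustration, not the authors' implementation. The Q-learning update itself is the standard one and is left untouched by the heuristic.

```python
import random


def select_action(q_row, h_row, xi=1.0, epsilon=0.1, rng=random):
    """Epsilon-greedy choice over Q(s, a) + xi * H(s, a).

    The additive combination and the weight xi are assumptions of this sketch.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))  # explore uniformly
    scores = [q + xi * h for q, h in zip(q_row, h_row)]
    # Greedy choice; ties resolve to the lowest action index.
    return max(range(len(scores)), key=scores.__getitem__)


def q_update(q, s, a, reward, s_next, alpha=0.1, gamma=0.9):
    """Standard Q-learning update; the heuristic never enters it."""
    target = reward + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


def run_episode(q, h, n_states, xi, epsilon, rng, max_steps=1000):
    """Hypothetical 1-D corridor: start at state 0, goal at the last state.

    Action 0 moves left (blocked at state 0), action 1 moves right.
    Returns the number of steps taken.
    """
    s, steps = 0, 0
    while s != n_states - 1 and steps < max_steps:
        a = select_action(q[s], h[s], xi, epsilon, rng)
        s_next = max(0, s - 1) if a == 0 else s + 1
        reward = 1.0 if s_next == n_states - 1 else 0.0
        q_update(q, s, a, reward, s_next)
        s, steps = s_next, steps + 1
    return steps


if __name__ == "__main__":
    n = 10
    rng = random.Random(0)
    q = [[0.0, 0.0] for _ in range(n)]
    h = [[0.0, 1.0] for _ in range(n)]  # very simple heuristic: prefer "right"
    for episode in range(5):
        print(episode, run_episode(q, h, n, xi=1.0, epsilon=0.1, rng=rng))
```

Because the heuristic only affects which action is tried, a poor heuristic in this sketch changes the exploration path but not the values that Q-learning converges toward, as long as exploration continues.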
Added: 10 Dec 2010
Updated: 10 Dec 2010
Type: Journal
Year: 2008
Where: HEURISTICS
Authors: Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna Helena Reali Costa