An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm

13 years 10 months ago

Download www-clmc.usc.edu

Recently, actor-critic methods have drawn much interests in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy. This paper studies an actor-critic type algorithm utilizing the RLS(recursive least-squares) method, which is one of the most eﬃcient techniques for adaptive signal processing, together with natural policy gradient. In the actor part of the studied algorithm, we follow the strategy of performing parameter update via the natural gradient method, while in its update for the critic part, the recursive least-squares method is employed in order to make the parameter estimation for the value functions more eﬃcient. The studied algorithm was applied to locomotion of a two-linked robot arm, and showed better performance compared to the conventional stochastic gradient ascent algorithm.

Jooyoung Park, Jongho Kim, Daesung Kang

Real-time Traffic

Actor-critic Type Algorithm | CIS 2005 | Gradient Ascent Algorithm | Natural Gradient Method |

claim paper

Post Info
More Details (n/a)

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	CIS
Authors	Jooyoung Park, Jongho Kim, Daesung Kang

Comments (0)

Sciweavers

An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm

Actor-critic Type Algorithm | CIS 2005 | Gradient Ascent Algorithm | Natural Gradient Method |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers