Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic

15 years 4 months ago

Download hawaii.aist-nara.ac.jp

— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control parameters in Quasi-PDW are automatically adjusted because robots often suffer from changes in their physical parameters and the surrounding environment. Reinforcement learning (RL) can be a key technology to this adaptability, and it has been shown that RL realizes Quasi-PDW in a simulation study. To apply the existing method to controlling real robots, however, requires further improvement to accelerate its learning, otherwise the robots will break down before acquiring appropriate controls. To accelerate the learning, this study employs off-policy natural actor-critic (off-NAC), and applies it to an acquisition problem of Quasi-PDW. The most important feature of the off-NAC is that it reuses the samples that has already been obtained by previous controllers. This study also shows an adaptive method of the lear...

Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To

Real-time Traffic

Humanoid Robotics | IROS 2006 | Reinforcement Learning | Robot | Robotics |

claim paper

Post Info
More Details (n/a)

Added	12 Jun 2010
Updated	12 Jun 2010
Type	Conference
Year	2006
Where	IROS
Authors	Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, Tomohiro Shibata, Koh Hosoda, Shin Ishii

Comments (0)

Sciweavers

Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic

Humanoid Robotics | IROS 2006 | Reinforcement Learning | Robot | Robotics |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers