Sciweavers

163 search results - page 18 / 33
» Policy Gradient Methods for Robotics
Sort
View
125
Voted
ICRA
2010
IEEE
133views Robotics» more  ICRA 2010»
15 years 9 days ago
Generalized model learning for Reinforcement Learning on a humanoid robot
— Reinforcement learning (RL) algorithms have long been promising methods for enabling an autonomous robot to improve its behavior on sequential decision-making tasks. The obviou...
Todd Hester, Michael Quinlan, Peter Stone
HRI
2007
ACM
15 years 5 months ago
Learning by demonstration with critique from a human teacher
Learning by demonstration can be a powerful and natural tool for developing robot control policies. That is, instead of tedious hand-coding, a robot may learn a control policy by ...
Brenna Argall, Brett Browning, Manuela M. Veloso
AAAI
2008
15 years 4 months ago
A Variance Analysis for POMDP Policy Evaluation
Partially Observable Markov Decision Processes have been studied widely as a model for decision making under uncertainty, and a number of methods have been developed to find the s...
Mahdi Milani Fard, Joelle Pineau, Peng Sun
GECCO
2009
Springer
162views Optimization» more  GECCO 2009»
14 years 11 months ago
Uncertainty handling CMA-ES for reinforcement learning
The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...
Verena Heidrich-Meisner, Christian Igel
ICRA
2008
IEEE
143views Robotics» more  ICRA 2008»
15 years 8 months ago
Adaptive workspace biasing for sampling-based planners
Abstract— The widespread success of sampling-based planning algorithms stems from their ability to rapidly discover the connectivity of a configuration space. Past research has ...
Matthew Zucker, James Kuffner, James A. Bagnell