Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

12

ICML
2002
IEEE

favoriteEmaildiscussreport

138views Machine Learning» more ICML 2002»

Reinforcement Learning and Shaping: Encouraging Intended Behaviors

14 years 5 months ago

Reinforcement Learning and Shaping: Encouraging Intended Behaviors

Download www.grappa.univ-lille3.fr

We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial increment to the native task rewards in order to encourage or discourage behaviors. Previously, shaping functions have been static: the additional rewards do not vary with experience. But some prior knowledge cannot be expressed as static shaping. We take an explanation-based approach in which the specific shaping function emerges from initial experiences with the world. We compare no shaping, static shaping, and dynamic shaping in the task of learning bipedal-walking on a simulator. We empirically evaluate the convergence rate and final performance among these conditions while varying the accuracy of the prior knowledge. We conclude that in the appropriate context, dynamic shaping can greatly improve the learning of action policies.

Adam Laud, Gerald DeJong

Real-time Traffic

Dynamic Shaping | ICML 2002 | Machine Learning | Specific Shaping Function | Static Shaping |

claim paper

Related Content

» Social reward shaping in the prisoners dilemma

» VisionBased Reinforcement Learning for Purposive Behavior Acquisition

» Emotion and Reinforcement Affective Facial Expressions Facilitate Robot Learning

» Potentialbased Shaping in Modelbased Reinforcement Learning

» Machine Learning for Intelligent Systems

» Combining manual feedback with subsequent MDP reward signals for reinforcement learning

» PeertoPeer Valuation as a Mechanism for Reinforcing Active Learning in Virtual Communities...

» Learning from human teachers with Socially Guided Exploration

» Agent Behavior Alignment A Mechanism to Overcome Problems in Agent Interactions During Run...

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2002
Where	ICML
Authors	Adam Laud, Gerald DeJong

Comments (0)