Space-indexed dynamic programming: learning to follow trajectories

14 years 5 months ago

Download www.cs.stanford.edu

We consider the task of learning to accurately follow a trajectory in a vehicle such as a car or helicopter. A number of dynamic programming algorithms such as Differential Dynamic Programming (DDP) and Policy Search by Dynamic Programming (PSDP), can efficiently compute non-stationary policies for these tasks -- such policies in general are well-suited to trajectory following since they can easily generate different control actions at different times in order to follow the trajectory. However, a weakness of these algorithms is that their policies are timeindexed, in that they apply different policies depending on the current time. This is problematic since 1) the current time may not correspond well to where we are along the trajectory and 2) the uncertainty over states can prevent these algorithms from finding any good policies at all. In this paper we propose a method for space-indexed dynamic programming that overcomes both these difficulties. We begin by showing how a dynamical s...

J. Zico Kolter, Adam Coates, Andrew Y. Ng, Yi Gu,

Real-time Traffic

Dynamic Programming Algorithms | ICML 2008 | Machine Learning | Non-stationary Policies | Space-indexed Dynamic Programming |

claim paper

» Haptic Feedback Enhances Force Skill Learning

» Policies based on Trajectory Libraries

» Wordom A userfriendly program for the analysis of molecular structures trajectories and fr...

» Biomimetic motor behavior for simultaneous adaptation of force impedance and trajectory in...

» Robust People Tracking with Global Trajectory Optimization

» Receding Horizon Differential Dynamic Programming

» Learning Impedance Control for Robotic Manipulators

» Learning ForceBased Robot Skills from Haptic Demonstration

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2008
Where	ICML
Authors	J. Zico Kolter, Adam Coates, Andrew Y. Ng, Yi Gu, Charles DuHadway

Comments (0)

Sciweavers

Space-indexed dynamic programming: learning to follow trajectories

Dynamic Programming Algorithms | ICML 2008 | Machine Learning | Non-stationary Policies | Space-indexed Dynamic Programming |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers