Search Sciweavers | Sciweavers

89 search results - page 2 / 18

» Stable Function Approximation in Dynamic Programming

ICML
2006
IEEE

256 views, Machine Learning

Automatic basis function construction for approximate dynamic programming and reinforcement learning

13 years 11 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

TSMC
1998

Universal stabilization using control Lyapunov functions, adaptive derivative feedback, and neural network approximators

13 years 5 months ago

Download www.dssl.tuc.gr

In this paper, the problem of stabilization of unknown nonlinear dynamical systems is considered. An adaptive feedback law is constructed that is based on the switching adaptiv...

Elias B. Kosmatopoulos

ICRA
2002
IEEE

147 views, Robotics

Design of Asymptotically Stable Walking for a 5-Link Planar Biped Walker via Optimization

13 years 10 months ago

Download www.eecs.umich.edu

Closed-loop, asymptotically stable walking motions are designed for a 5-link, planar bipedal robot model with one degree of underactuation. Parameter optimization is applied to...

E. R. Westervelt, J. W. Grizzle

CORR
2010
Springer

119 views, Education

Dynamic Policy Programming

13 years 5 months ago

Download www.snn.ru.nl

In this paper, we consider the problem of planning and learning in infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policy searc...

Mohammad Gheshlaghi Azar, Hilbert J. Kappen

NIPS
1994

178 views, Information Technology

Generalization in Reinforcement Learning: Safely Approximating the Value Function

13 years 6 months ago

Download www.ri.cmu.edu

To appear in: G. Tesauro, D. S. Touretzky and T. K. Leen, eds., Advances in Neural Information Processing Systems 7, MIT Press, Cambridge MA, 1995. A straightforward approach to t...

Justin A. Boyan, Andrew W. Moore
