Search Sciweavers | Sciweavers

11 search results - page 2 / 3

» Suppressing intersample behavior in Iterative Learning Contr...

click to vote

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

13 years 9 days ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

claim paper

Read More »

click to vote

TON
2008

139views more TON 2008»

Stochastic learning solution for distributed discrete power control game in wireless data networks

13 years 5 months ago

Download koala.ece.stevens-tech.edu

Distributed power control is an important issue in wireless networks. Recently, noncooperative game theory has been applied to investigate interesting solutions to this problem. Th...

Yiping Xing, Rajarathnam Chandramouli

claim paper

Read More »

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

14 years 6 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

click to vote

IROS
2007
IEEE

144views Robotics» more IROS 2007»

Bipedal walking on rough terrain using manifold control

13 years 11 months ago

Download www.cse.wustl.edu

— This paper presents an algorithm for adapting periodic behavior to gradual shifts in task parameters. Since learning optimal control in high dimensional domains is subject to t...

Tom Erez, William D. Smart

claim paper

Read More »

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

13 years 6 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

« Prev « First page 2 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers