Search Sciweavers | Sciweavers

17 search results - page 2 / 4

» Hierarchical Reinforcement Learning with the MAXQ Value Func...

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

14 years 6 months ago

Download www.machinelearning.org

Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...

Chee Wee Phua, Robert Fitch

claim paper

Read More »

click to vote

ICRA
2006
IEEE

161views Robotics» more ICRA 2006»

Quadruped Robot Obstacle Negotiation via Reinforcement Learning

13 years 11 months ago

Download www.stanford.edu

— Legged robots can, in principle, traverse a large variety of obstacles and terrains. In this paper, we describe a successful application of reinforcement learning to the proble...

Honglak Lee, Yirong Shen, Chih-Han Yu, Gurjeet Sin...

claim paper

Read More »

click to vote

NCI
2004

185views Neural Networks» more NCI 2004»

Hierarchical reinforcement learning with subpolicies specializing for learned subgoals

13 years 6 months ago

Download staff.science.uva.nl

This paper describes a method for hierarchical reinforcement learning in which high-level policies automatically discover subgoals, and low-level policies learn to specialize for ...

Bram Bakker, Jürgen Schmidhuber

claim paper

Read More »

click to vote

ICMLA
2009

171views Machine Learning» more ICMLA 2009»

Multiagent Transfer Learning via Assignment-Based Decomposition

13 years 2 months ago

Download web.engr.oregonstate.edu

We describe a system that successfully transfers value function knowledge across multiple subdomains of realtime strategy games in the context of multiagent reinforcement learning....

Scott Proper, Prasad Tadepalli

claim paper

Read More »

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

14 years 6 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

« Prev « First page 2 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers