Sciweavers

1310 search results - page 83 / 262
» Progressive Optimization in Action
Sort
View
NIPS
2007
15 years 5 months ago
Random Sampling of States in Dynamic Programming
We combine three threads of research on approximate dynamic programming: sparse random sampling of states, value function and policy approximation using local models, and using lo...
Christopher G. Atkeson, Benjamin Stephens
CORR
1998
Springer
164views Education» more  CORR 1998»
15 years 4 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris
AAAI
1998
15 years 5 months ago
The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate their action choices in multiagent systems. We examine some of the factors that...
Caroline Claus, Craig Boutilier
SAB
2010
Springer
153views Optimization» more  SAB 2010»
15 years 2 months ago
Attentional Modulation of Mutually Dependent Behaviors
In this paper, we investigate simple attentional mechanisms suitable for sensing rate regulation and action coordination in the presence of mutually dependent behaviors. We present...
Ernesto Burattini, Silvia Rossi, Alberto Finzi, Ma...
ICML
2003
IEEE
16 years 5 months ago
Q-Decomposition for Reinforcement Learning Agents
The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...
Stuart J. Russell, Andrew Zimdars