Search Sciweavers | Sciweavers

1310 search results - page 83 / 262

» Progressive Optimization in Action

NIPS
2007

Random Sampling of States in Dynamic Programming

15 years 5 months ago

Download books.nips.cc

We combine three threads of research on approximate dynamic programming: sparse random sampling of states, value function and policy approximation using local models, and using lo...

Christopher G. Atkeson, Benjamin Stephens

CORR
1998
Springer

Training Reinforcement Neurocontrollers Using the Polytope Algorithm

15 years 4 months ago

Download zeus.cs.uoi.gr

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...

Aristidis Likas, Isaac E. Lagaris

AAAI
1998

The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems

15 years 5 months ago

Download opim.wharton.upenn.edu

Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate their action choices in multiagent systems. We examine some of the factors that...

Caroline Claus, Craig Boutilier

SAB
2010
Springer

Attentional Modulation of Mutually Dependent Behaviors

15 years 2 months ago

Download people.na.infn.it

In this paper, we investigate simple attentional mechanisms suitable for sensing rate regulation and action coordination in the presence of mutually dependent behaviors. We present...

Ernesto Burattini, Silvia Rossi, Alberto Finzi, Ma...

ICML
2003
IEEE

Q-Decomposition for Reinforcement Learning Agents

16 years 5 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars
