Search Sciweavers | Sciweavers

» Ensemble Algorithms in Reinforcement Learning

GECCO
2011
Springer

Evolution of reward functions for reinforcement learning

14 years 7 months ago

The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...

Scott Niekum, Lee Spector, Andrew G. Barto

JMLR
2002

Lyapunov Design for Safe Reinforcement Learning

15 years 4 months ago

Lyapunov design methods are used widely in control engineering to design controllers that achieve qualitative objectives, such as stabilizing a system or maintaining a system'...

Theodore J. Perkins, Andrew G. Barto

ICML
2000
IEEE

Convergence Problems of General-Sum Multiagent Reinforcement Learning

16 years 5 months ago

Stochastic games are a generalization of MDPs to multiple agents, and can be used as a framework for investigating multiagent learning. Hu and Wellman (1998) recently proposed a m...

Michael H. Bowling

FLAIRS
1998

Analytical Design of Reinforcement Learning Tasks

15 years 5 months ago

Reinforcement learning (RL) problems constitute an important class of learning and control problems faced by artificial intelligence systems. In these problems, one is faced with ...

Robert E. Smith

ATAL
2005
Springer

Improving reinforcement learning function approximators via neuroevolution

15 years 10 months ago

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson
