Search Sciweavers | Sciweavers

12 search results - page 3 / 3

» Critical factors in the empirical performance of temporal di...

click to vote

ATAL
2008
Springer

176views Intelligent Agents» more ATAL 2008»

Analysis of an evolutionary reinforcement learning method in a multiagent domain

13 years 7 months ago

Download www.aamas-conference.org

Many multiagent problems comprise subtasks which can be considered as reinforcement learning (RL) problems. In addition to classical temporal difference methods, evolutionary algo...

Jan Hendrik Metzen, Mark Edgington, Yohannes Kassa...

claim paper

Read More »

click to vote

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

13 years 7 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

« Prev « First page 3 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers