Search Sciweavers | Sciweavers

49 search results - page 2 / 10

» Temporal Difference and Policy Search Methods for Reinforcem...

AAMAS
2010
Springer

190views Intelligent Agents» more AAMAS 2010»

Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning

13 years 6 months ago

Download www.cs.utexas.edu

Shimon Whiteson, Matthew E. Taylor, Peter Stone

claim paper

Read More »

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

13 years 10 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

ATAL
2007
Springer

146views Intelligent Agents» more ATAL 2007»

Transfer via inter-task mappings in policy search reinforcement learning

14 years 16 days ago

Download userweb.cs.utexas.edu

The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have f...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

13 years 7 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

13 years 7 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

« Prev « First page 2 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers