Sciweavers

2108 search results - page 36 / 422

» Tracking in Reinforcement Learning

82

ATAL
2009
Springer

77views Intelligent Agents» more ATAL 2009»

Learning with whom to communicate using relational reinforcement learning

16 years 21 days ago

Learning with whom to communicate using relational reinforcement learning

Download www.aamas-conference.org

Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, ...

claim paper

Read More »

128

FLAIRS
2003

117views Artificial Intelligence» more FLAIRS 2003»

Subgoal Discovery for Hierarchical Reinforcement Learning Using Learned Policies

15 years 7 months ago

Subgoal Discovery for Hierarchical Reinforcement Learning Using Learned Policies

Download www.cse.uta.edu

Sandeep Goel, Manfred Huber

claim paper

Read More »

119

NIPS
1996

89views Information Technology» more NIPS 1996»

Learning Decision Theoretic Utilities through Reinforcement Learning

15 years 7 months ago

Learning Decision Theoretic Utilities through Reinforcement Learning

Download papers.cnl.salk.edu

Magnus Stensmo, Terrence J. Sejnowski

claim paper

Read More »

171

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

15 years 7 months ago

Gradient Descent for General Reinforcement Learning

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

154

AUSAI
2005
Springer

123views Artificial Intelligence» more AUSAI 2005»

Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning

15 years 11 months ago

Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning

Download eprints.utas.edu.au

: In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...

Peter Vamplew, Robert Ollington

claim paper

Read More »

« Prev « First page 36 / 422 Last » Next »