Sciweavers

251 search results - page 25 / 51
» Skill Combination for Reinforcement Learning
Sort
View
ATAL
2005
Springer
15 years 5 months ago
Improving reinforcement learning function approximators via neuroevolution
Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...
Shimon Whiteson
IJAMCIGI
2010
90views more  IJAMCIGI 2010»
14 years 9 months ago
A Reinforcement Learning - Great-Deluge Hyper-Heuristic for Examination Timetabling
Hyper-heuristics are identified as the methodologies that search the space generated by a finite set of low level heuristics for solving difficult problems. One of the iterative h...
Ender Özcan, Mustafa Misir, Gabriela Ochoa, E...
IROS
2006
IEEE
113views Robotics» more  IROS 2006»
15 years 5 months ago
Policy Gradient Methods for Robotics
— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...
Jan Peters, Stefan Schaal
ATAL
2008
Springer
15 years 1 months ago
Switching dynamics of multi-agent learning
This paper presents the dynamics of multi-agent reinforcement learning in multiple state problems. We extend previous work that formally modelled the relation between reinforcemen...
Peter Vrancx, Karl Tuyls, Ronald L. Westra
ATAL
2009
Springer
15 years 6 months ago
An empirical analysis of value function-based and policy search reinforcement learning
In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...
Shivaram Kalyanakrishnan, Peter Stone