Sciweavers

1235 search results - page 145 / 247
» ABC Reinforcement Learning
ICMLA
2010
14 years 10 months ago
Multimodal Parameter-exploring Policy Gradients
Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...
Frank Sehnke, Alex Graves, Christian Osendorfer, J...
ITNG
2007
IEEE
15 years 7 months ago
Input Fuzzy Modeling for the Recognition of Handwritten Hindi Numerals
This paper presents the recognition of Handwritten Hindi Numerals based on the modified exponential membership function fitted to the fuzzy sets derived from normalized distance f...
Madasu Hanmandlu, J. Grover, Vamsi Krishna Madasu,...
ATAL
2003
Springer
15 years 6 months ago
A selection-mutation model for q-learning in multi-agent systems
Although well understood in the single-agent framework, the use of traditional reinforcement learning (RL) algorithms in multi-agent systems (MAS) is not always justified. The fe...
Karl Tuyls, Katja Verbeeck, Tom Lenaerts
ATAL
2008
Springer
15 years 2 months ago
A new perspective to the keepaway soccer: the takers
Keepaway is a sub-problem of RoboCup Soccer Simulator in which 'the keepers' try to maintain the possession of the ball, while 'the takers' try to steal the ba...
Atil Iscen, Umut Erogul
AAAI
1992
15 years 1 months ago
Automatic Programming of Robots Using Genetic Programming
The goal in automatic programming is to get a computer to perform a task by telling it what needs to be done, rather than by explicitly programming it. This paper considers the ta...
John R. Koza, James Rice