Sciweavers

664 search results - page 72 / 133
» Combining Reinforcement Learning with a Local Control Algori...
Sort
View
ICML
2008
IEEE
16 years 3 months ago
Sample-based learning and search with permanent and transient memories
We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...
David Silver, Martin Müller 0003, Richard S. ...
125
Voted
APIN
2002
90views more  APIN 2002»
15 years 2 months ago
Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning
Abstract: Locally weighted learning (LWL) is a class of techniques from nonparametric statistics that provides useful representations and training algorithms for learning about com...
Stefan Schaal, Christopher G. Atkeson, Sethu Vijay...
101
Voted
GECCO
2008
Springer
274views Optimization» more  GECCO 2008»
15 years 3 months ago
Bacterial foraging oriented by particle swarm optimization strategy for PID tuning
Proportional integral derivative (PID) controller tuning is an area of interest for researchers in many disciplines of science and engineering. This paper presents a new algorithm...
Wael Mansour Korani
ATAL
2008
Springer
15 years 4 months ago
Adaptive Kanerva-based function approximation for multi-agent systems
In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...
Cheng Wu, Waleed Meleis
GECCO
2004
Springer
103views Optimization» more  GECCO 2004»
15 years 7 months ago
Training Neural Networks with GA Hybrid Algorithms
Abstract. Training neural networks is a complex task of great importance in the supervised learning field of research. In this work we tackle this problem with five algorithms, a...
Enrique Alba, J. Francisco Chicano