Sciweavers

425 search results - page 30 / 85
» Metacognitive Control and Optimal Learning
Sort
View
78
Voted
NIPS
2001
14 years 11 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
83
Voted
ATAL
2004
Springer
15 years 3 months ago
Product Distribution Theory for Control of Multi-Agent Systems
Product Distribution (PD) theory is a new framework for controlling Multi-Agent Systems (MAS’s). First we review one motivation of PD theory, as the information-theoretic extens...
Chiu Fan Lee, David H. Wolpert
SMC
2007
IEEE
102views Control Systems» more  SMC 2007»
15 years 3 months ago
An improved immune Q-learning algorithm
—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...
Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...
GECCO
2005
Springer
175views Optimization» more  GECCO 2005»
15 years 3 months ago
Evolution of multi-loop controllers for fixed morphology with a cyclic genetic algorithm
Cyclic genetic algorithms can be used to generate single loop control programs for robots. While successful in generating controllers for individual leg movement, gait generation,...
Gary B. Parker, Ramona Georgescu
EUROGP
2007
Springer
116views Optimization» more  EUROGP 2007»
15 years 3 months ago
Genetic Programming with Fitness Based on Model Checking
Abstract. Model checking is a way of analysing programs and programlike structures to decide whether they satisfy a list of temporal logic statements describing desired behaviour. ...
Colin G. Johnson