Search Sciweavers | Sciweavers

425 search results - page 30 / 85

» Metacognitive Control and Optimal Learning

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 1 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

click to vote

ATAL
2004
Springer

168views Intelligent Agents» more ATAL 2004»

Product Distribution Theory for Control of Multi-Agent Systems

15 years 5 months ago

Download collectives.stanford.edu

Product Distribution (PD) theory is a new framework for controlling Multi-Agent Systems (MAS’s). First we review one motivation of PD theory, as the information-theoretic extens...

Chiu Fan Lee, David H. Wolpert

claim paper

Read More »

click to vote

SMC
2007
IEEE

102views Control Systems» more SMC 2007»

An improved immune Q-learning algorithm

15 years 6 months ago

Download web2.uwindsor.ca

—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

claim paper

Read More »

click to vote

GECCO
2005
Springer

175views Optimization» more GECCO 2005»

Evolution of multi-loop controllers for fixed morphology with a cyclic genetic algorithm

15 years 5 months ago

Download cs.conncoll.edu

Cyclic genetic algorithms can be used to generate single loop control programs for robots. While successful in generating controllers for individual leg movement, gait generation,...

Gary B. Parker, Ramona Georgescu

claim paper

Read More »

click to vote

EUROGP
2007
Springer

116views Optimization» more EUROGP 2007»

Genetic Programming with Fitness Based on Model Checking

15 years 6 months ago

Download www.cs.kent.ac.uk

Abstract. Model checking is a way of analysing programs and programlike structures to decide whether they satisfy a list of temporal logic statements describing desired behaviour. ...

Colin G. Johnson

claim paper

Read More »

« Prev « First page 30 / 85 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers