Search Sciweavers | Sciweavers

53 search results - page 5 / 11

» Shaping multi-agent systems with gradient reinforcement lear...

148

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 7 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

149

click to vote

CORR
2000
Springer

92views Education» more CORR 2000»

Predicting the expected behavior of agents that learn about agents: the CLRI framework

15 years 5 months ago

Download jmvidal.cse.sc.edu

We describe a framework and equations used to model and predict the behavior of multi-agent systems (MASs) with learning agents. A difference equation is used for calculating the ...

José M. Vidal, Edmund H. Durfee

claim paper

Read More »

136

click to vote

ICML
2002
IEEE

138views Machine Learning» more ICML 2002»

Reinforcement Learning and Shaping: Encouraging Intended Behaviors

16 years 6 months ago

Download www.grappa.univ-lille3.fr

We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial...

Adam Laud, Gerald DeJong

claim paper

Read More »

153

Voted

PPSN
2004
Springer

156views Distributed And Parallel Com...» more PPSN 2004»

Evolutionary Multi-agent Systems

15 years 11 months ago

Download lis.epfl.ch

In Multi-Agent learning, agents must learn to select actions that maximize their utility given the action choices of the other agents. Cooperative Coevolution oﬀers a way to evol...

Pieter Jan't Hoen, Edwin D. de Jong

claim paper

Read More »

160

click to vote

ALIFE
2002

176views Modeling And Simulation» more ALIFE 2002»

Ant Colony Optimization and Stochastic Gradient Descent

15 years 5 months ago

Download ti.arc.nasa.gov

In this paper, we study the relationship between the two techniques known as ant colony optimization (aco) and stochastic gradient descent. More precisely, we show that some empir...

Nicolas Meuleau, Marco Dorigo

claim paper

Read More »

« Prev « First page 5 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers