Search Sciweavers | Sciweavers

236 search results - page 34 / 48

» A Multiagent Reinforcement Learning Algorithm with Non-linea...

160

click to vote

NIPS
2003

108views Information Technology» more NIPS 2003»

Policy Search by Dynamic Programming

15 years 8 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

claim paper

Read More »

165

click to vote

ICML
2001
IEEE

127views Machine Learning» more ICML 2001»

Convergence of Gradient Dynamics with a Variable Learning Rate

16 years 7 months ago

Download www.cs.cmu.edu

As multiagent environments become more prevalent we need to understand how this changes the agent-based paradigm. One aspect that is heavily affected by the presence of multiple a...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

188

click to vote

AAAI
2008

204views Intelligent Agents» more AAAI 2008»

Adaptive Management of Air Traffic Flow: A Multiagent Coordination Approach

15 years 9 months ago

Download www.aaai.org

This paper summarizes recent advances in the application of multiagent coordination algorithms to air traffic flow management. Indeed, air traffic flow management is one of the fu...

Kagan Tumer, Adrian K. Agogino

claim paper

Read More »

189

click to vote

IBERAMIA
2010
Springer

245views Artificial Intelligence» more IBERAMIA 2010»

Dynamic Reward Shaping: Training a Robot by Voice

15 years 5 months ago

Download ccc.inaoep.mx

Reinforcement Learning is commonly used for learning tasks in robotics, however, traditional algorithms can take very long training times. Reward shaping has been recently used to ...

Ana C. Tenorio-Gonzalez, Eduardo F. Morales, Luis ...

claim paper

Read More »

152

click to vote

ATAL
2005
Springer

117views Intelligent Agents» more ATAL 2005»

Multi-agent reward analysis for learning in noisy domains

16 years 11 days ago

Download ti.arc.nasa.gov

In many multi agent learning problems, it is difﬁcult to determine, a priori, the agent reward structure that will lead to good performance. This problem is particularly pronoun...

Adrian K. Agogino, Kagan Tumer

claim paper

Read More »

« Prev « First page 34 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers