Search Sciweavers | Sciweavers

267 search results - page 35 / 54

» The Dynamics of Multi-Agent Reinforcement Learning

NIPS
2003

108views Information Technology» more NIPS 2003»

Policy Search by Dynamic Programming

15 years 1 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

ATAL
2004
Springer

172views Intelligent Agents» more ATAL 2004»

Adaptive Information Infrastructures for the e-Society

15 years 5 months ago

Download www.cs.unb.ca

Abstract. Positioned at the confluence between human/machine and hardware/software integration and backed by a solid proof of concept realized through several scenarios encompassin...

Mihaela Ulieru

ROBOCUP
2007
Springer

167views Robotics» more ROBOCUP 2007»

Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others

15 years 5 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suffering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...

Kentarou Noma, Yasutake Takahashi, Minoru Asada

AROBOTS
1999

87views more AROBOTS 1999»

Dynamics of a Classical Conditioning Model

14 years 11 months ago

Download www.lucs.lu.se

Abstract. Classical conditioning is a basic learning mechanism in animals and can be found in almost all organisms. If we want to construct robots with abilities matching those of ...

Christian Balkenius

IROS
2008
IEEE

125views Robotics» more IROS 2008»

Dynamic correlation matrix based multi-Q learning for a multi-robot system

15 years 6 months ago

Download www.ece.stevens-tech.edu

Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng
