Sciweavers

267 search results - page 35 / 54
» The Dynamics of Multi-Agent Reinforcement Learning
Sort
View
NIPS
2003
14 years 11 months ago
Policy Search by Dynamic Programming
We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...
J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...
ATAL
2004
Springer
15 years 3 months ago
Adaptive Information Infrastructures for the e-Society
Abstract. Positioned at the confluence between human/machine and hardware/software integration and backed by a solid proof of concept realized through several scenarios encompassin...
Mihaela Ulieru
ROBOCUP
2007
Springer
167views Robotics» more  ROBOCUP 2007»
15 years 3 months ago
Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others
The existing reinforcement learning approaches have been suffering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...
Kentarou Noma, Yasutake Takahashi, Minoru Asada
AROBOTS
1999
87views more  AROBOTS 1999»
14 years 9 months ago
Dynamics of a Classical Conditioning Model
Abstract. Classical conditioning is a basic learning mechanism in animals and can be found in almost all organisms. If we want to construct robots with abilities matching those of ...
Christian Balkenius
92
Voted
IROS
2008
IEEE
125views Robotics» more  IROS 2008»
15 years 4 months ago
Dynamic correlation matrix based multi-Q learning for a multi-robot system
—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng