Sciweavers

267 search results - page 36 / 54
» The Dynamics of Multi-Agent Reinforcement Learning
AR
2002
14 years 9 months ago
Acquiring state from control dynamics to learn grasping policies for robot hands
Abstract: A prominent emerging theory of sensorimotor development in biological systems proposes that control knowledge is encoded in the dynamics of physical interaction with the ...
Roderic A. Grupen, Jefferson A. Coelho Jr.
ANOR
2005
14 years 9 months ago
Entropic Penalties in Finite Games
The main objects here are finite-strategy games in which entropic terms are subtracted from the payoffs. After such subtraction each Nash equilibrium solves an explicit, unconstra...
Sjur Didrik Flåm, E. Cavazzuti
ATAL
2004
Springer
15 years 3 months ago
A Pheromone-Based Utility Model for Collaborative Foraging
Multi-agent research often borrows from biology, where remarkable examples of collective intelligence may be found. One interesting example is ant colonies’ use of pheromones as...
Liviu Panait, Sean Luke
JCP
2007
14 years 9 months ago
Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization
Abstract: We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an a...
Nicolas Chapados, Yoshua Bengio
SIGDIAL
2010
14 years 7 months ago
Sparse Approximate Dynamic Programming for Dialog Management
Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet, there is still a clear mismatch between the comp...
Senthilkumar Chandramohan, Matthieu Geist, Olivier...