Q-Decomposition for Reinforcement Learning Agents

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and runs its own reinforcement learning process. It supplies to a central arbitrator the Q-values (according to its own reward function) for each possible action. The arbitrator selects an action maximizing the sum of Q-values from all the subagents. This approach has advantages over designs in which subagents recommend actions. It also has the property that if each subagent runs the Sarsa reinforcement learning algorithm to learn its local Q-function, then a globally optimal policy is achieved. (On the other hand, local Q-learning leads to globally suboptimal behavior.) In some cases, this form of agent decomposition allows the local Q-functions to be expressed by much-reduced state and action spaces. These results are illustrated in two domains that require effective coordination of behaviors.
Stuart J. Russell, Andrew Zimdars
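
A minimal sketch of the scheme described in the abstract, assuming a tabular setting; the class names, the epsilon-greedy exploration in the arbitrator, and the learning-rate/discount values are illustrative assumptions, not details taken from the paper:

    import random
    from collections import defaultdict

    class SarsaSubagent:
        """One subagent: learns Q-values for its own local reward signal."""
        def __init__(self, alpha=0.1, gamma=0.95):
            self.q = defaultdict(float)          # maps (state, action) -> local Q-value
            self.alpha, self.gamma = alpha, gamma

        def q_values(self, state, actions):
            # Report this subagent's Q-values for every candidate action to the arbitrator.
            return {a: self.q[(state, a)] for a in actions}

        def update(self, s, a, local_reward, s_next, a_next):
            # Sarsa update, bootstrapping from the action the arbitrator actually chose next.
            target = local_reward + self.gamma * self.q[(s_next, a_next)]
            self.q[(s, a)] += self.alpha * (target - self.q[(s, a)])

    class Arbitrator:
        """Selects the action maximizing the sum of the subagents' Q-values."""
        def __init__(self, subagents, epsilon=0.1):
            self.subagents = subagents
            self.epsilon = epsilon               # exploration scheme is a sketch-level assumption

        def select(self, state, actions):
            if random.random() < self.epsilon:
                return random.choice(actions)
            totals = {a: sum(sub.q_values(state, actions)[a] for sub in self.subagents)
                      for a in actions}
            return max(totals, key=totals.get)

The point of using Sarsa here is that every subagent updates toward the action the arbitrator actually takes next, so the local Q-functions stay consistent with the globally executed policy; off-policy Q-learning would instead bootstrap from each subagent's own locally greedy action, which is the source of the globally suboptimal behavior mentioned in the abstract.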
Added: 17 Nov 2009
Updated: 17 Nov 2009
Type: Conference
Year: 2003
Where: ICML
Authors: Stuart J. Russell, Andrew Zimdars