Sciweavers

17 search results - page 4 / 4
» Hierarchical Reinforcement Learning with the MAXQ Value Func...
Sort
View
ICML
2007
IEEE
14 years 6 months ago
Learning state-action basis functions for hierarchical MDPs
This paper introduces a new approach to actionvalue function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...
Sarah Osentoski, Sridhar Mahadevan
JMLR
2006
153views more  JMLR 2006»
13 years 5 months ago
Collaborative Multiagent Reinforcement Learning by Payoff Propagation
In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...
Jelle R. Kok, Nikos A. Vlassis