Learning state-action basis functions for hierarchical MDPs

This paper introduces a new approach to action-value function approximation that learns basis functions from a spectral decomposition of the state-action manifold. It extends previous work on Laplacian bases for value function approximation by incorporating the agent's actions into the representation when constructing basis functions. The approach yields a nonlinear learned representation particularly suited to approximating action-value functions, without the wasteful duplication of state bases incurred by previous work. We discuss two techniques for creating state-action graphs: off-policy and on-policy. We show that these graphs have greater expressive power and outperform state-based Laplacian basis functions in domains modeled as Semi-Markov Decision Processes (SMDPs). We also present a simple graph-partitioning method to scale the approach to large discrete MDPs.
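The abstract describes deriving basis functions from a spectral decomposition of a graph over state-action pairs. As a rough illustration of that idea (not the authors' implementation), the minimal sketch below computes the smallest eigenvectors of a normalized graph Laplacian on a toy state-action graph and uses them as linear features for the action-value function; the chain-graph adjacency, `n`, `k`, and the placeholder weights `w` are all illustrative assumptions.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import laplacian
from scipy.sparse.linalg import eigsh

# Toy state-action graph: nodes are (state, action) pairs; an edge
# connects pairs observed in succession. A simple chain stands in for
# real sampled transitions (an on-policy graph would use only the
# agent's trajectories; an off-policy graph connects all actions).
n = 100                                 # number of (s, a) pairs (assumed)
ones = np.ones(n - 1)
W = csr_matrix((ones, (np.arange(n - 1), np.arange(1, n))), shape=(n, n))
W = W + W.T                             # symmetric adjacency matrix

# Normalized graph Laplacian: L = I - D^{-1/2} W D^{-1/2}
L = laplacian(W, normed=True)

# The k eigenvectors with the smallest eigenvalues are the smoothest
# functions on the graph; they serve as state-action basis functions.
k = 10                                  # number of basis functions (assumed)
_, Phi = eigsh(L, k=k, which='SA')      # Phi has shape (n, k)

# The action-value function is then approximated linearly in these
# bases, Q_hat = Phi @ w, with w fit by any linear RL method.
w = np.zeros(k)                         # placeholder weights
Q_hat = Phi @ w                         # one value estimate per (s, a) pair
```

In the paper's SMDP setting, the graph would presumably be built from sampled trajectories (including temporally extended actions) rather than a fixed chain, but the eigenvector construction is the same.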
Added: 17 Nov 2009
Updated: 17 Nov 2009
Type: Conference
Year: 2007
Where: ICML
Authors: Sarah Osentoski, Sridhar Mahadevan