Learning to Plan Using Harmonic Analysis of Diffusion Models

This paper summarizes research on an emerging framework for learning to plan using the Markov decision process (MDP) model. In this paradigm, two approaches to learning to plan have traditionally been studied: the indirect, model-based approach infers the state transition matrix and reward function from samples and then solves the Bellman equation to find the optimal (action) value function; the direct, model-free approach, most notably Q-learning, estimates the action value function directly. This paper describes a new harmonic analysis framework for planning based on estimating a diffusion model that captures information flow on a graph (discrete state space) or a manifold (continuous state space) using the Laplace heat equation. Diffusion models are significantly easier to learn than transition models, yet provide similar speedups in performance over model-free methods. Two methods for constructing novel plan representations from diffusion models are described: Fourier methods ...
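As a concrete illustration of the Fourier approach sketched in the abstract, here is a minimal Python sketch (not the authors' code): it builds a diffusion model, in the form of a graph Laplacian, for a toy grid-world state space, takes the Laplacian's smoothest eigenvectors as basis functions, and projects the value function of a random-walk policy onto that basis. The 10x10 grid, reward placement, discount factor, and basis size are illustrative assumptions, not details taken from the paper.

# Sketch of the "Fourier" (Laplacian eigenfunction) approach, under the
# assumptions stated above.
import numpy as np

# --- Toy state space: a 10x10 grid world with 4-neighbour connectivity ---
n = 10
num_states = n * n
W = np.zeros((num_states, num_states))          # adjacency (diffusion) matrix
for r in range(n):
    for c in range(n):
        s = r * n + c
        for dr, dc in ((1, 0), (0, 1)):
            rr, cc = r + dr, c + dc
            if rr < n and cc < n:
                t = rr * n + cc
                W[s, t] = W[t, s] = 1.0

# --- Combinatorial graph Laplacian L = D - W ---
D = np.diag(W.sum(axis=1))
L = D - W

# --- Fourier-style basis: the k smoothest Laplacian eigenvectors ---
k = 20
eigvals, eigvecs = np.linalg.eigh(L)            # L is symmetric, so eigh applies
Phi = eigvecs[:, :k]                            # columns are basis functions

# --- Use the diffusion basis to approximate a value function ---
# Illustrative target: the exact value function of a random-walk policy with
# reward +1 in one corner state and discount gamma = 0.95.
gamma = 0.95
P = W / W.sum(axis=1, keepdims=True)            # random-walk transition matrix
R = np.zeros(num_states)
R[n - 1] = 1.0                                  # goal-state reward
V_exact = np.linalg.solve(np.eye(num_states) - gamma * P, R)

# Least-squares projection of the value function onto the basis
w, *_ = np.linalg.lstsq(Phi, V_exact, rcond=None)
V_approx = Phi @ w
print("max approximation error:", np.abs(V_exact - V_approx).max())

The point of the sketch is that the basis is derived purely from the topology of the state space (the diffusion model W), not from the reward or transition probabilities, which is what makes it cheaper to learn than a full transition model.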
Type: Conference
Year: 2007
Where: AIPS
Authors: Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns, Kimberly Ferguson, Chang Wang