The Dependence of Effective Planning Horizon on Model Accuracy

9 years 12 months ago

Download web.eecs.umich.edu

For Markov decision processes with long horizons (i.e., discount factors close to one), it is common in practice to use reduced horizons during planning to speed computation. However, perhaps surprisingly, when the model available to the agent is estimated from data, as will be the case in most real-world problems, the policy found using a shorter planning horizon can actually be better than a policy learned with the true horizon. In this paper we provide a precise explanation for this phenomenon based on principles of learning theory. We show formally that the planning horizon is a complexity control parameter for the class of policies to be learned. In particular, it has an intuitive, monotonic relationship with a simple counting measure of complexity, and that a similar relationship can be observed empirically with a more general and data-dependent Rademacher complexity measure. Each complexity measure gives rise to a bound on the planning loss predicting that a planning horizon sh...

Nan Jiang, Alex Kulesza, Satinder Singh, Richard L

Real-time Traffic

ATAL 2015 | Intelligent Agents |

claim paper

» Motion planning under uncertainty for robotic tasks with long time horizons

» Effective teamdriven multimodel motion tracking

» Planning under uncertainty using model predictive control for information gathering

» Online NextBestView Planning for Accuracy Optimization Using an Extended ECriterion

» Multimodel Tracking using Team Actuation Models

» Quantifying the accuracy of Hammerstein model estimation

» Constraintbased dynamic programming for decentralized POMDPs with structured interactions

» Using Learned Policies in HeuristicSearch Planning

» Instructional interventions in computerbased tutoring differential impact on learning time...

Post Info
More Details (n/a)

Added	16 Apr 2016
Updated	16 Apr 2016
Type	Journal
Year	2015
Where	ATAL
Authors	Nan Jiang, Alex Kulesza, Satinder Singh, Richard L. Lewis

Comments (0)

Sciweavers

The Dependence of Effective Planning Horizon on Model Accuracy

ATAL 2015 | Intelligent Agents |

Explore & Download

Productivity Tools

Sciweavers