MDP planning | Sciweavers

ATAL
2008
Springer

104views Intelligent Agents» more ATAL 2008»

Expediting RL by using graphical structures

13 years 6 months ago

The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...

Peng Dai, Alexander L. Strehl, Judy Goldsmith

claim paper

Read More »

click to vote

SARA
2007
Springer

152views Artificial Intelligence» more SARA 2007»

Computing and Using Lower and Upper Bounds for Action Elimination in MDP Planning

13 years 10 months ago

Download www.cs.umd.edu

Abstract. We describe a way to improve the performance of MDP planners by modifying them to use lower and upper bounds to eliminate non-optimal actions during their search. First, ...

Ugur Kuter, Jiaqiao Hu

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers