Sciweavers

28 search results - page 3 / 6
» The MAXQ Method for Hierarchical Reinforcement Learning
Sort
View
ICML
2001
IEEE
14 years 7 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
ICML
1997
IEEE
14 years 7 months ago
Hierarchical Explanation-Based Reinforcement Learning
Explanation-Based Reinforcement Learning (EBRL) was introduced by Dietterich and Flann as a way of combining the ability of Reinforcement Learning (RL) to learn optimal plans with...
Prasad Tadepalli, Thomas G. Dietterich
CVPR
2011
IEEE
13 years 3 months ago
Shape Grammar Parsing via Reinforcement Learning
This paper tackles shape grammar parsing for facade segmentation using a novel optimization approach based on reinforcement learning (RL). To this end, we use a binary recursive g...
Olivier Teboul, Iasonas Kokkinos, Panagiotis Kouts...
HIS
2004
13 years 7 months ago
Reinforcement Learning Hierarchical Neuro-Fuzzy Politree Model for Control of Autonomous Agents
: This work presents a new hybrid neuro-fuzzy model for automatic learning of actions taken by agents. The main objective of this new model is to provide an agent with intelligence...
Karla Figueiredo, Marley B. R. Vellasco, Marco Aur...
FLAIRS
2004
13 years 7 months ago
State Space Reduction For Hierarchical Reinforcement Learning
er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...
Mehran Asadi, Manfred Huber