Sciweavers

1233 search results - page 91 / 247
» Reinforcement Learning in MirrorBot
Sort
View
127
Voted
PKDD
2009
Springer
181views Data Mining» more  PKDD 2009»
15 years 10 months ago
Active Learning for Reward Estimation in Inverse Reinforcement Learning
Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...
Manuel Lopes, Francisco S. Melo, Luis Montesano
130
Voted
ICML
1998
IEEE
16 years 4 months ago
The MAXQ Method for Hierarchical Reinforcement Learning
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...
Thomas G. Dietterich
131
Voted
ECML
2006
Springer
15 years 7 months ago
Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks
Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...
Sébastien Jodogne, Cyril Briquet, Justus H....
122
Voted
FLAIRS
2000
15 years 5 months ago
Resolving Conflicts Among Actions in Concurrent Behaviors
A robotic agent must coordinate its coupled concurrent behaviors to produce a coherent response to stimuli. Reinforcement learning has been used extensively in coordinating sensin...
Henry Hexmoor
117
Voted
IJCAI
2003
15 years 5 months ago
Simultaneous Adversarial Multi-Robot Learning
Multi-robot learning faces all of the challenges of robot learning with all of the challenges of multiagent learning. There has been a great deal of recent research on multiagent ...
Michael H. Bowling, Manuela M. Veloso