Search Sciweavers | Sciweavers

37 search results - page 2 / 8

» Analysis of Inverse Reinforcement Learning with Perturbed De...

ICML
2004
IEEE

214 views, Machine Learning

Apprenticeship learning via inverse reinforcement learning

14 years 6 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

IROS
2007
IEEE

144 views, Robotics

Using reinforcement learning to adapt an imitation task

13 years 11 months ago

Download lasa.epfl.ch

The goal of developing algorithms for programming robots by demonstration is to create an easy way of programming robots that can be accomplished by everyone. When a de...

Florent Guenter, Aude Billard

Publication

240 views

Bayesian multitask inverse reinforcement learning

12 years 3 months ago

Download arxiv.org

We generalise the problem of inverse reinforcement learning to multiple tasks, from multiple demonstrations. Each one may represent one expert trying to solve a different task, or ...

Christos Dimitrakakis, Constantin A. Rothkopf

posted by olethros

AAAI
2008

199 views, Intelligent Agents

Maximum Entropy Inverse Reinforcement Learning

13 years 7 months ago

Download www.andrew.cmu.edu

Recent research has shown the benefit of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of recoveri...

Brian Ziebart, Andrew L. Maas, J. Andrew Bagnell, ...

AAMAS
2007
Springer

210 views, Intelligent Agents

Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game

13 years 11 months ago

Download sequel.futurs.inria.fr

The application of reinforcement learning algorithms to multiagent domains may cause complex non-convergent dynamics. The replicator dynamics, commonly used in evolutiona...

Alessandro Lazaric, Jose Enrique Munoz de Cote, Fa...
