Search Sciweavers | Sciweavers

6 search results - page 1 / 2

» Maximum Entropy Inverse Reinforcement Learning

click to vote

AAAI
2008

199views Intelligent Agents» more AAAI 2008»

Maximum Entropy Inverse Reinforcement Learning

13 years 7 months ago

Download www.andrew.cmu.edu

Recent research has shown the benefit of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of recoveri...

Brian Ziebart, Andrew L. Maas, J. Andrew Bagnell, ...

claim paper

Read More »

click to vote

ICML
2010
IEEE

247views Machine Learning» more ICML 2010»

Inverse Optimal Control with Linearly-Solvable MDPs

13 years 5 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov

claim paper

Read More »

click to vote

ICML
2010
IEEE

215views Machine Learning» more ICML 2010»

Modeling Interaction via the Principle of Maximum Causal Entropy

13 years 5 months ago

Download www.cs.cmu.edu

The principle of maximum entropy provides a powerful framework for statistical models of joint, conditional, and marginal distributions. However, there are many important distribu...

Brian Ziebart, J. Andrew Bagnell, Anind K. Dey

claim paper

Read More »

click to vote

IROS
2009
IEEE

123views Robotics» more IROS 2009»

Planning-based prediction for pedestrians

13 years 11 months ago

Download www.cs.cmu.edu

— We present a novel approach for determining robot movements that efﬁciently accomplish the robot’s tasks while not hindering the movements of people within the environment....

Brian Ziebart, Nathan D. Ratliff, Garratt Gallaghe...

claim paper

Read More »

click to vote

CORR
2011
Springer

230views Education» more CORR 2011»

Computational Rationalization: The Inverse Equilibrium Problem

12 years 11 months ago

Download www.cs.cmu.edu

Modeling the behavior of imperfect agents from a small number of observations is a diﬃcult, but important task. In the singleagent decision-theoretic setting, inverse optimal co...

Kevin Waugh, Brian Ziebart, J. Andrew Bagnell

claim paper

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers