Preference elicitation and inverse reinforcement learning

14 years 7 months ago

Download arxiv.org

We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous work on Bayesian inverse reinforcement learning and allows us to obtain a posterior distribution on the agent's preferences, policy and optionally, the obtained reward sequence, from observations. We examine the relation of the resulting approach to other statistical methods for inverse reinforcement learning via analysis and experimental results. We show that preferences can be determined accurately, even if the observed agent's policy is sub-optimal with respect to its own preferences. In that case, significantly improved policies with respect to the agent's preferences are obtained, compared to both other methods and to the performance of the demonstrated policy.

Constantin Rothkopf, Christos Dimitrakakis

Real-time Traffic

Bayesian Statistics | Inverse Reinfoncement Learning | Preference Elicitation |

posted by olethros

» Bayesian Inverse Reinforcement Learning

» Maximum Entropy Inverse Reinforcement Learning

» Learning User Preferences for Wireless Services Provisioning

» Rational Bidding Using Reinforcement Learning

Post Info
More Details (n/a)

Added	02 Oct 2011
Updated	02 Oct 2011
Type	Conference
Year	2011
Where	ECML
Authors	Constantin Rothkopf, Christos Dimitrakakis

Comments (0)

	Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning 509 views
	Reid et al.'s Distance Bounding Protocol and Mafia Fraud Attacks over Noisy Channels 545 views
	Rollout Sampling Approximate Policy Iteration 334 views
	Bayesian variable order Markov models. 404 views
	Statistical Decision Making for Authentication and Intrusion Detection 634 views

Sciweavers

Preference elicitation and inverse reinforcement learning

Bayesian Statistics | Inverse Reinfoncement Learning | Preference Elicitation |

Explore & Download

Productivity Tools

Sciweavers