Search Sciweavers | Sciweavers

101 search results - page 2 / 21

» Multi-task reinforcement learning: a hierarchical Bayesian a...

click to vote

ICML
1998
IEEE

268views Machine Learning» more ICML 1998»

The MAXQ Method for Hierarchical Reinforcement Learning

14 years 6 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich

claim paper

Read More »

click to vote

IJCAI
2003

188views Artificial Intelligence» more IJCAI 2003»

A Bayesian Approach to Imitation in Reinforcement Learning

13 years 6 months ago

Download ijcai.org

In multiagent environments, forms of social learning such as teaching and imitation have been shown to aid the transfer of knowledge from experts to learners in reinforcement lear...

Bob Price, Craig Boutilier

claim paper

Read More »

click to vote

EWRL
2008

191views Machine Learning» more EWRL 2008»

Bayesian Reward Filtering

13 years 7 months ago

Download www.metz.supelec.fr

A wide variety of function approximation schemes have been applied to reinforcement learning. However, Bayesian filtering approaches, which have been shown efficient in other field...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

click to vote

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

12 years 4 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

click to vote

Publication

154views

Preference elicitation and inverse reinforcement learning

12 years 7 months ago

Download arxiv.org

We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous w...

Constantin Rothkopf, Christos Dimitrakakis

posted by olethros

Read More »

« Prev « First page 2 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers