Search Sciweavers | Sciweavers

101 search results - page 6 / 21

» Multi-task reinforcement learning: a hierarchical Bayesian a...

click to vote

NIPS
1997

94views Information Technology» more NIPS 1997»

Reinforcement Learning with Hierarchies of Machines

15 years 1 months ago

Download www.cs.berkeley.edu

We present a new approach to reinforcement learning in which the policies considered by the learning process are constrained by hierarchies of partially speciﬁed machines. This ...

Ronald Parr, Stuart J. Russell

claim paper

Read More »

115

click to vote

IROS
2009
IEEE

206views Robotics» more IROS 2009»

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

15 years 6 months ago

Download www.cs.cmu.edu

— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

claim paper

Read More »

click to vote

ICANN
2009
Springer

123views Neural Networks» more ICANN 2009»

Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data

15 years 3 months ago

Download www.tu-ilmenau.de

In a typical reinforcement learning (RL) setting details of the environment are not given explicitly but have to be estimated from observations. Most RL approaches only optimize th...

Alexander Hans, Steffen Udluft

claim paper

Read More »

click to vote

ATAL
2004
Springer

127views Intelligent Agents» more ATAL 2004»

Bayesian Reinforcement Learning for Coalition Formation under Uncertainty

15 years 5 months ago

Download www.cs.toronto.edu

Research on coalition formation usually assumes the values of potential coalitions to be known with certainty. Furthermore, settings in which agents lack sufﬁcient knowledge of ...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

click to vote

ICML
2010
IEEE

241views Machine Learning» more ICML 2010»

Learning Programs: A Hierarchical Bayesian Approach

15 years 27 days ago

Download www.cs.berkeley.edu

We are interested in learning programs for multiple related tasks given only a few training examples per task. Since the program for a single task is underdetermined by its data, ...

Percy Liang, Michael I. Jordan, Dan Klein

claim paper

Read More »

« Prev « First page 6 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers