Search Sciweavers | Sciweavers

397 search results - page 23 / 80

» Reinforcement Learning with Hierarchies of Machines

click to vote

ICML
2007
IEEE

185views Machine Learning» more ICML 2007»

Hierarchical Gaussian process latent variable models

16 years 19 days ago

Download www.machinelearning.org

The Gaussian process latent variable model (GP-LVM) is a powerful approach for probabilistic modelling of high dimensional data through dimensional reduction. In this paper we ext...

Neil D. Lawrence, Andrew J. Moore

claim paper

Read More »

102

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

16 years 19 days ago

Download www.machinelearning.org

Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...

Chee Wee Phua, Robert Fitch

claim paper

Read More »

102

click to vote

ICML
2001
IEEE

172views Machine Learning» more ICML 2001»

Continuous-Time Hierarchical Reinforcement Learning

16 years 19 days ago

Download www.cs.ualberta.ca

Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

109

click to vote

ICML
2000
IEEE

165views Machine Learning» more ICML 2000»

A Bayesian Framework for Reinforcement Learning

15 years 4 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

claim paper

Read More »

105

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 19 days ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

« Prev « First page 23 / 80 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers