Search Sciweavers | Sciweavers

226 search results - page 10 / 46

» Linear Bayesian Reinforcement Learning

171

click to vote

LCN
2006
IEEE

115views Computer Networks» more LCN 2006»

Sensor Networks Routing via Bayesian Exploration

16 years 10 days ago

Download www.cc.gatech.edu

There is increasing research interest in solving routing problems in sensor networks subject to constraints such as data correlation, link reliability and energy conservation. Sin...

Shuang Hao, Ting Wang

claim paper

Read More »

172

Voted

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 7 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

155

click to vote

NIPS
2001

121views Information Technology» more NIPS 2001»

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 7 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

160

click to vote

ICRA
2005
IEEE

140views Robotics» more ICRA 2005»

Fast Reinforcement Learning for Vision-guided Mobile Robots

15 years 12 months ago

Download aass.oru.se

— This paper presents a new reinforcement learning algorithm for accelerating acquisition of new skills by real mobile robots, without requiring simulation. It speeds up Q-learni...

Tomás Martínez-Marín, Tom Duc...

claim paper

Read More »

174

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 10 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

« Prev « First page 10 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers