Search Sciweavers | Sciweavers

575 search results - page 84 / 115

» Reinforcement Learning State Estimator

127

click to vote

ICML
2003
IEEE

168views Machine Learning» more ICML 2003»

Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning

16 years 5 months ago

Download webee.technion.ac.il

We present a novel Bayesian approach to the problem of value function estimation in continuous state spaces. We define a probabilistic generative model for the value function by i...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

163

click to vote

ICASSP
2011
IEEE

165views Signal Processing» more ICASSP 2011»

A sliding-window online fast variational sparse Bayesian learning algorithm

14 years 8 months ago

Download mirlab.org

In this work a new online learning algorithm that uses automatic relevance determination (ARD) is proposed for fast adaptive nonlinear ﬁltering. A sequential decision rule for i...

Thomas Buchgraber, Dmitriy Shutin, H. Vincent Poor

claim paper

Read More »

136

click to vote

NIPS
2001

104views Information Technology» more NIPS 2001»

Linking Motor Learning to Function Approximation: Learning in an Unlearnable Force Field

15 years 5 months ago

Download books.nips.cc

Reaching movements require the brain to generate motor commands that rely on an internal model of the task's dynamics. Here we consider the errors that subjects make early in...

O. Donchin, Reza Shadmehr

claim paper

Read More »

132

click to vote

FLAIRS
2004

188views Artificial Intelligence» more FLAIRS 2004»

Intelligent Control of Closed-Loop Sedation in Simulated ICU Patients

15 years 5 months ago

Download www.aaai.org

The intensive care unit is a challenging environment to both patient and caregiver. Continued shortages in staffing, principally in nursing, increase risk to patient and healthcar...

Brett L. Moore, Eric D. Sinzinger, Todd M. Quasny,...

claim paper

Read More »

141

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

15 years 2 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

« Prev « First page 84 / 115 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers