Search Sciweavers | Sciweavers

32 search results - page 3 / 7

» Reinforcement Learning and the Bayesian Control Rule

click to vote

NIPS
1996

117views Information Technology» more NIPS 1996»

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

15 years 3 months ago

Download anytime.cs.umass.edu

Closed-loop control relies on sensory feedback that is usually assumed to be free. But if sensing incurs a cost, it may be coste ective to take sequences of actions in open-loop m...

Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstei...

claim paper

Read More »

119

click to vote

ICML
2005
IEEE

196views Machine Learning» more ICML 2005»

Bayesian sparse sampling for on-line reward optimization

16 years 2 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

120

click to vote

NIPS
2008

188views Information Technology» more NIPS 2008»

Bayesian Kernel Shaping for Learning Control

15 years 3 months ago

Download eprints.pascal-network.org

In kernel-based regression learning, optimizing each kernel individually is useful when the data density, curvature of regression surfaces (or decision boundaries) or magnitude of...

Jo-Anne Ting, Mrinal Kalakrishnan, Sethu Vijayakum...

claim paper

Read More »

click to vote

FLAIRS
1998

130views Artificial Intelligence» more FLAIRS 1998»

Learning to Race: Experiments with a Simulated Race Car

15 years 3 months ago

Download www.aaai.org

Our focus is on designing adaptable agents for highly dynamic environments. Wehave implementeda reinforcement learning architecture as the reactive componentof a twolayer control ...

Larry D. Pyeatt, Adele E. Howe

claim paper

Read More »

195

Voted

NIPS
2008

149views Information Technology» more NIPS 2008»

Optimization on a Budget: A Reinforcement Learning Approach

15 years 3 months ago

Download www.cs.arizona.edu

Many popular optimization algorithms, like the Levenberg-Marquardt algorithm (LMA), use heuristic-based "controllers" that modulate the behavior of the optimizer during ...

Paul Ruvolo, Ian R. Fasel, Javier R. Movellan

claim paper

Read More »

« Prev « First page 3 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers