Search Sciweavers | Sciweavers

226 search results - page 14 / 46

» Linear Bayesian Reinforcement Learning

ICML
2006
IEEE

Automatic basis function construction for approximate dynamic programming and reinforcement learning

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

ICML
2009
IEEE

Convex variational Bayesian inference for large scale generalized linear models

Download www.kyb.tuebingen.mpg.de

We show how variational Bayesian inference can be implemented for very large generalized linear models. Our relaxation is proven to be a convex problem for any log-concave model. ...

Hannes Nickisch, Matthias W. Seeger

CG
2006
Springer

Feature Construction for Reinforcement Learning in Hearts

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-Gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

ICRA
1995
IEEE

Vision-Based Reinforcement Learning for Purposive Behavior Acquisition

Download www.er.ams.eng.osaka-u.ac.jp

This paper presents a method of vision-based reinforcement learning by which a robot learns to shoot a ball into a goal, and discusses several issues in applying the reinforcement...

Minoru Asada, Shoichi Noda, Sukoya Tawaratsumida, ...

ICML
1995
IEEE

Residual Algorithms: Reinforcement Learning with Function Approximation

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III
