Search Sciweavers | Sciweavers

132 search results - page 18 / 27

» Generalization in Reinforcement Learning: Safely Approximati...


CORR
2012
Springer

196 views · Education

PAC-Bayesian Policy Evaluation for Reinforcement Learning

13 years 7 months ago

Download www.cs.mcgill.ca

Bayesian priors offer a compact yet general means of incorporating domain knowledge into many learning tasks. The correctness of the Bayesian analysis and inference, however, lar...

Mahdi Milani Fard, Joelle Pineau, Csaba Szepesvá...


ICONIP
2009

107 views · Information Technology

Tracking in Reinforcement Learning

14 years 9 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout


GECCO
2006
Springer

159 views · Optimization

Standard and averaging reinforcement learning in XCS

15 years 3 months ago

Download www.cs.bham.ac.uk

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...

Pier Luca Lanzi, Daniele Loiacono


ICML
1995
IEEE

155 views · Machine Learning

Stable Function Approximation in Dynamic Programming

16 years 13 days ago

Download www.ri.cmu.edu

The success of reinforcement learning in practical problems depends on the ability to combine function approximation with temporal difference methods such as value iteration. Experime...

Geoffrey J. Gordon


PKDD
2010
Springer

179 views · Data Mining

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

14 years 9 months ago

Download www.cs.utexas.edu

We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone
