Search Sciweavers | Sciweavers

154 search results - page 28 / 31

» Sample-Efficient Evolutionary Function Approximation for Rei...

208

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 1 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

179

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

16 years 7 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

188

click to vote

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 10 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

claim paper

Read More »

203

click to vote

FLAIRS
2004

125views Artificial Intelligence» more FLAIRS 2004»

Multimodal Function Optimization Using Local Ruggedness Information

15 years 8 months ago

Download www.aaai.org

In multimodal function optimization, niching techniques create diversification within the population, thus encouraging heterogeneous convergence. The key to the effective diversif...

Jian Zhang 0007, Xiaohui Yuan, Bill P. Buckles

claim paper

Read More »

169

click to vote

GECCO
2008
Springer

131views Optimization» more GECCO 2008»

Self-adaptive mutation in XCSF

15 years 7 months ago

Download www.uni-wuerzburg.de

Recent advances in XCS technology have shown that selfadaptive mutation can be highly useful to speed-up the evolutionary progress in XCS. Moreover, recent publications have shown...

Martin V. Butz, Patrick O. Stalph, Pier Luca Lanzi

claim paper

Read More »

« Prev « First page 28 / 31 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers