Search Sciweavers | Sciweavers

827 search results - page 67 / 166

» Variational methods for Reinforcement Learning

167

click to vote

ICML
1998
IEEE

165views Machine Learning» more ICML 1998»

Intra-Option Learning about Temporally Abstract Actions

16 years 7 months ago

Download www.cs.ualberta.ca

tion Learning about Temporally Abstract Actions Richard S. Sutton Department of Computer Science University of Massachusetts Amherst, MA 01003-4610 rich@cs.umass.edu Doina Precup D...

Richard S. Sutton, Doina Precup, Satinder P. Singh

claim paper

Read More »

194

click to vote

CORR
2010
Springer

152views Education» more CORR 2010»

Neuroevolutionary optimization

15 years 6 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

claim paper

Read More »

189

click to vote

ICPR
2010
IEEE

171views Computer Vision» more ICPR 2010»

Variational Mixture of Experts for Classification with Applications to Landmine Detection

15 years 4 months ago

Download www.cise.ufl.edu

Abstract--In this paper, we (1) provide a complete framework for classification using Variational Mixture of Experts (VME); (2) derive the variational lower bound; and (3) apply th...

Seniha Esen Yuksel, Paul D. Gader

claim paper

Read More »

197

click to vote

GECCO
2006
Springer

198views Optimization» more GECCO 2006»

Reward allotment in an event-driven hybrid learning classifier system for online soccer games

15 years 10 months ago

Download www.cs.bham.ac.uk

This paper describes our study into the concept of using rewards in a classifier system applied to the acquisition of decision-making algorithms for agents in a soccer game. Our a...

Yuji Sato, Yosuke Akatsuka, Takenori Nishizono

claim paper

Read More »

204

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

16 years 6 days ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

« Prev « First page 67 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers