Sciweavers

827 search results - page 67 / 166
» Variational methods for Reinforcement Learning
Sort
View
102
Voted
ICML
1998
IEEE
16 years 1 months ago
Intra-Option Learning about Temporally Abstract Actions
tion Learning about Temporally Abstract Actions Richard S. Sutton Department of Computer Science University of Massachusetts Amherst, MA 01003-4610 rich@cs.umass.edu Doina Precup D...
Richard S. Sutton, Doina Precup, Satinder P. Singh
108
Voted
CORR
2010
Springer
152views Education» more  CORR 2010»
15 years 21 days ago
Neuroevolutionary optimization
Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...
Eva Volná
109
Voted
ICPR
2010
IEEE
14 years 10 months ago
Variational Mixture of Experts for Classification with Applications to Landmine Detection
Abstract--In this paper, we (1) provide a complete framework for classification using Variational Mixture of Experts (VME); (2) derive the variational lower bound; and (3) apply th...
Seniha Esen Yuksel, Paul D. Gader
127
Voted
GECCO
2006
Springer
198views Optimization» more  GECCO 2006»
15 years 4 months ago
Reward allotment in an event-driven hybrid learning classifier system for online soccer games
This paper describes our study into the concept of using rewards in a classifier system applied to the acquisition of decision-making algorithms for agents in a soccer game. Our a...
Yuji Sato, Yosuke Akatsuka, Takenori Nishizono
130
Voted
ECML
2005
Springer
15 years 6 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal