Sciweavers

162 search results - page 13 / 33
» Off-Policy Temporal Difference Learning with Function Approx...
CORR
2006
Springer
14 years 9 months ago
An associative memory for the on-line recognition and prediction of temporal sequences
This paper presents the design of an associative memory with feedback that is capable of on-line temporal sequence learning. A framework for on-line sequence learning has been prop...
Joy Bose, Stephen B. Furber, Jonathan L. Shapiro
CG
2000
Springer
15 years 1 month ago
Chess Neighborhoods, Function Combination, and Reinforcement Learning
Abstract. Over the years, various research projects have attempted to develop a chess program that learns to play well given little prior knowledge beyond the rules of the game. Ea...
Robert Levinson, Ryan Weber
GECCO
2009
Springer
15 years 2 months ago
Reinforcement learning for games: failures and successes
We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...
Wolfgang Konen, Thomas Bartz-Beielstein
ICASSP
2011
IEEE
14 years 1 months ago
Nonstationary and temporally correlated source separation using Gaussian process
Blind source separation (BSS) is a process to reconstruct source signals from the mixed signals. The standard BSS methods assume a fixed set of stationary source signals with the ...
Hsin-Lung Hsieh, Jen-Tzung Chien
CEC
2005
IEEE
15 years 3 months ago
XCS with computed prediction for the learning of Boolean functions
Computed prediction represents a major shift in learning classifier system research. XCS with computed prediction, based on linear approximators, has been applied so far to functi...
Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...