temporal difference learning

145

ML
1998
ACM

136views Machine Learning» more ML 1998»

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 5 months ago

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

143

click to vote

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 6 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

164

click to vote

FLAIRS
2003

195views Artificial Intelligence» more FLAIRS 2003»

Learning Opening Strategy in the Game of Go

15 years 6 months ago

Download vision.middlebury.edu

In this paper, we present an experimental methodology and results for a machine learning approach to learning opening strategy in the game of Go, a game for which the best compute...

Timothy Huang, Graeme Connell, Bryan McQuade

claim paper

Read More »

155

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 6 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

156

click to vote

NIPS
2008

173views Information Technology» more NIPS 2008»

On the asymptotic equivalence between differential Hebbian and temporal difference learning using a local third factor

15 years 6 months ago

Download books.nips.cc

In this theoretical contribution we provide mathematical proof that two of the most important classes of network learning - correlation-based differential Hebbian learning and rew...

Christoph Kolodziejski, Bernd Porr, Minija Tamosiu...

claim paper

Read More »

158

Voted

CG
2000
Springer

150views Computer Graphics» more CG 2000»

Chess Neighborhoods, Function Combination, and Reinforcement Learning

15 years 9 months ago

Download users.soe.ucsc.edu

Abstract. Over the years, various research projects have attempted to develop a chess program that learns to play well given little prior knowledge beyond the rules of the game. Ea...

Robert Levinson, Ryan Weber

claim paper

Read More »

166

click to vote

GECCO
2009
Springer

124views Optimization» more GECCO 2009»

Reinforcement learning for games: failures and successes

15 years 10 months ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

claim paper

Read More »

177

click to vote

CIG
2006
IEEE

202views Applied Computing» more CIG 2006»

Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation

15 years 11 months ago

Download algoval.essex.ac.uk

Abstract— This paper compares the use of temporal difference learning (TDL) versus co-evolutionary learning (CEL) for acquiring position evaluation functions for the game of Othe...

Simon M. Lucas, Thomas Philip Runarsson

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers