Online Adaptable Learning Rates for the Game Connect-4



Learning board games by self-play has a long tradition in computational intelligence for games. Building on Tesauro’s seminal success with TD-Gammon in 1994, many successful agents today use temporal difference learning. To succeed with temporal difference learning on game tasks, however, a careful selection of features and a large number of training games are often necessary. Even for board games of moderate complexity like Connect-4, we found in previous work that a very rich initial feature set and several million game plays are required. In this work we investigate whether different approaches to online-adaptable learning rates, such as Incremental Delta Bar Delta (IDBD) or Temporal Coherence Learning (TCL), have the potential to speed up learning for such a complex task. We propose a new variant of TCL with geometric step size changes. We compare those algorithms with several other state-of-the-art learning rate adaptation algorithms and perform a case study on the ...
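To make the idea of an online-adaptable learning rate concrete, the following is a minimal sketch of the IDBD update rule as published by Sutton (1992) for a linear model. It is not the paper's Connect-4 implementation: the toy regression task, the function names `idbd_update` and `demo`, the meta step size `theta=0.01`, and the initial step size of 0.05 are all assumptions chosen for illustration.

```python
import math
import random


def idbd_update(w, beta, h, x, y, theta=0.01):
    """One IDBD step for a linear model y_hat = w . x (lists updated in place).

    w    : weights
    beta : log step sizes, one per weight (alpha_i = exp(beta_i))
    h    : per-weight memory trace of recent weight changes
    theta: meta step size (illustrative value, an assumption)
    """
    delta = y - sum(wi * xi for wi, xi in zip(w, x))  # prediction error
    for i, xi in enumerate(x):
        # Adapt the log step size: grows when current and past updates agree.
        beta[i] += theta * delta * xi * h[i]
        alpha = math.exp(beta[i])
        # Ordinary delta-rule update, but with an individual step size.
        w[i] += alpha * delta * xi
        # Decay the trace and add the latest weight change.
        h[i] = h[i] * max(0.0, 1.0 - alpha * xi * xi) + alpha * delta * xi
    return delta


def demo(steps=2000, seed=0):
    """Toy task (assumed): target depends on x[0] only, x[1] is irrelevant."""
    rng = random.Random(seed)
    n = 2
    w = [0.0] * n
    beta = [math.log(0.05)] * n  # initial step size 0.05 for every weight
    h = [0.0] * n
    for _ in range(steps):
        x = [rng.uniform(-1.0, 1.0) for _ in range(n)]
        y = 2.0 * x[0]
        idbd_update(w, beta, h, x, y)
    return w, [math.exp(b) for b in beta]


if __name__ == "__main__":
    weights, alphas = demo()
    print("weights:", weights)
    print("step sizes:", alphas)
```

Each weight carries its own step size, stored in log space so it stays positive. In a temporal difference setting the error term would be the TD error rather than a supervised residual; that substitution is left out here to keep the sketch short.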

Samineh Bagheri, Markus Thill, Patrick Koch, Wolfgang Konen


Software Engineering | TCIAIG 2016 |


» Fast Online Training with Frequency-Adaptive Learning Rates for Chinese Word Segmentation a...

» IMPLANT: An Integrated MDP and POMDP Learning AgeNT for Adaptive Games

» RETALIATE: Learning Winning Policies in First-Person Shooter Games

» Spatially-Adaptive Learning Rates for Online Incremental SLAM

» Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

» Using adaptive consultation of experts to improve convergence rates in multiagent learning

» Game-Like Simulations for Online Adaptive Learning: A Case Study

» Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

» Improved Adaptive Mixture Learning for Robust Video Background Modeling

Post Info

Added	10 Apr 2016
Updated	10 Apr 2016
Type	Journal
Year	2016
Where	TCIAIG
Authors	Samineh Bagheri, Markus Thill, Patrick Koch, Wolfgang Konen
