The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy

15 years 4 months ago

Download www.hpl.hp.com

Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of states becomes too large to enumerate. Temporal-difference learning with self-play is one method successfully used to derive the value approximation function. Coevolution of the value function is also claimed to yield good results. This paper reports on a direct comparison between an agent trained to play gin rummy using temporal difference learning, and the same agent trained with co-evolution. Coevolution produced superior results.

Clifford Kotnik, Jugal K. Kalita

Real-time Traffic

Game Playing Agents | ICML 2003 | Machine Learning | Value Approximation Function | Value Function |

claim paper

Post Info
More Details (n/a)

Added	05 Jul 2010
Updated	05 Jul 2010
Type	Conference
Year	2003
Where	ICML
Authors	Clifford Kotnik, Jugal K. Kalita

Comments (0)

Sciweavers

The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy

Game Playing Agents | ICML 2003 | Machine Learning | Value Approximation Function | Value Function |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers