Search Sciweavers | Sciweavers

1233 search results - page 201 / 247

» Reinforcement Learning in MirrorBot

142

click to vote

ICML
2003
IEEE

150views Machine Learning» more ICML 2003»

The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy

15 years 6 months ago

Download www.hpl.hp.com

Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...

Clifford Kotnik, Jugal K. Kalita

claim paper

Read More »

109

click to vote

GECCO
2005
Springer

98views Optimization» more GECCO 2005»

Intelligent exploration method for XCS

15 years 7 months ago

Download www.cs.bham.ac.uk

Exploration/Exploitation equilibrium is one of the most challenging issues in reinforcement learning area as well as learning classifier systems such as XCS. In this paper1 , an i...

Ali Hamzeh, Adel Rahmani

claim paper

Read More »

118

click to vote

AE
2003
Springer

123views Artificial Intelligence» more AE 2003»

An Agent Model for First Price and Second Price Private Value Auctions

15 years 6 months ago

Download www.uea.ac.uk

The aim of this research is to develop an adaptive agent based model of auction scenarios commonly used in auction theory to help understand how competitors in auctions reach equil...

Anthony J. Bagnall, Iain Toft

claim paper

Read More »

136

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 2 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

110

click to vote

NIPS
1993

134views Information Technology» more NIPS 1993»

Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming

15 years 2 months ago

Download www.cs.cmu.edu

Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...

Christopher G. Atkeson

claim paper

Read More »

« Prev « First page 201 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers