Search Sciweavers | Sciweavers

340 search results - page 9 / 68

» Kernelized value function approximation for reinforcement le...

click to vote

ICML
2003
IEEE

150views Machine Learning» more ICML 2003»

The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy

15 years 2 months ago

Download www.hpl.hp.com

Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...

Clifford Kotnik, Jugal K. Kalita

claim paper

Read More »

102

click to vote

ICML
2007
IEEE

179views Machine Learning» more ICML 2007»

A kernel path algorithm for support vector machines

15 years 10 months ago

Download www.machinelearning.org

The choice of the kernel function which determines the mapping between the input space and the feature space is of crucial importance to kernel methods. The past few years have se...

Gang Wang, Dit-Yan Yeung, Frederick H. Lochovsky

claim paper

Read More »

click to vote

ICML
2003
IEEE

157views Machine Learning» more ICML 2003»

Action Elimination and Stopping Conditions for Reinforcement Learning

15 years 10 months ago

Download www.hpl.hp.com

We consider incorporating action elimination procedures in reinforcement learning algorithms. We suggest a framework that is based on learning an upper and a lower estimates of th...

Eyal Even-Dar, Shie Mannor, Yishay Mansour

claim paper

Read More »

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

14 years 11 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

click to vote

ESANN
2004

90views Neural Networks» more ESANN 2004»

High-accuracy value-function approximation with neural networks applied to the acrobot

14 years 11 months ago

Download remi.coulom.free.fr

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...

Rémi Coulom

claim paper

Read More »

« Prev « First page 9 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers