Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

144

CORR
1998
Springer

164views Education» more CORR 1998»

Training Reinforcement Neurocontrollers Using the Polytope Algorithm

15 years 5 months ago

Training Reinforcement Neurocontrollers Using the Polytope Algorithm

Download zeus.cs.uoi.gr

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorithm to adjust the weights of the action network so that a simple direct measure of the training performance is maximized. Experimental results from the application of the method to the pole balancing problem indicate improved training performance compared with critic-based and genetic reinforcement approaches. Key words: reinforcement learning, neurocontrol, optimization, polytope algorithm, pole balancing, genetic reinforcement

Aristidis Likas, Isaac E. Lagaris

Real-time Traffic

CORR 1998 | Education | Genetic Reinforcement | Polytope Optimization Algorithm | Reinforcement |

claim paper

Related Content

» Dual heuristic programming based nonlinear optimal control for a synchronous generator

» Blind Data Classification Using HyperDimensional Convex Polytopes

» Selecting actions for resourcebounded information extraction using reinforcement learning

» Generating a novel sort algorithm using Reinforcement Programming

» Genetic algorithmbased training for semisupervised SVM

» Dynamic Reward Shaping Training a Robot by Voice

» High speed obstacle avoidance using monocular vision and reinforcement learning

» Ensembles of Neural Networks for Robust Reinforcement Learning

» Could Active Perception Aid Navigation of Partially Observable Grid Worlds

Post Info
More Details (n/a)

Added	22 Dec 2010
Updated	22 Dec 2010
Type	Journal
Year	1998
Where	CORR
Authors	Aristidis Likas, Isaac E. Lagaris

Comments (0)