Sciweavers

664 search results - page 41 / 133
» Combining Reinforcement Learning with a Local Control Algori...
Sort
View
NIPS
1993
14 years 11 months ago
Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming
Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...
Christopher G. Atkeson
ATAL
2008
Springer
15 years 3 days ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...
COMAD
2008
14 years 11 months ago
Personalized Web-page Rendering System
Personalized rendering of web pages gives the users greater control to view only what they prefer. The goal of this work is to provide a tool that will let users customize the con...
Swapna Raj Prabakara Raj, Balaraman Ravindran
JMLR
2010
119views more  JMLR 2010»
14 years 4 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
SMC
2007
IEEE
118views Control Systems» more  SMC 2007»
15 years 4 months ago
One-class learning with multi-objective genetic programming
One-class classification naturally only provides one class of exemplars on which to construct the classification model. In this work, multiobjective genetic programming (GP) all...
Robert Curry, Malcolm I. Heywood