Sciweavers

664 search results - page 41 / 133
» Combining Reinforcement Learning with a Local Control Algori...
Sort
View
NIPS
1993
15 years 5 months ago
Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming
Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...
Christopher G. Atkeson
ATAL
2008
Springer
15 years 6 months ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...
COMAD
2008
15 years 6 months ago
Personalized Web-page Rendering System
Personalized rendering of web pages gives the users greater control to view only what they prefer. The goal of this work is to provide a tool that will let users customize the con...
Swapna Raj Prabakara Raj, Balaraman Ravindran
JMLR
2010
119views more  JMLR 2010»
14 years 11 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
139
Voted
SMC
2007
IEEE
118views Control Systems» more  SMC 2007»
15 years 10 months ago
One-class learning with multi-objective genetic programming
One-class classification naturally only provides one class of exemplars on which to construct the classification model. In this work, multiobjective genetic programming (GP) all...
Robert Curry, Malcolm I. Heywood