Search Sciweavers | Sciweavers

664 search results - page 41 / 133

» Combining Reinforcement Learning with a Local Control Algori...

click to vote

NIPS
1993

134views Information Technology» more NIPS 1993»

Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming

14 years 11 months ago

Download www.cs.cmu.edu

Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...

Christopher G. Atkeson

claim paper

Read More »

click to vote

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 3 days ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

click to vote

COMAD
2008

157views Knowledge Management» more COMAD 2008»

Personalized Web-page Rendering System

14 years 11 months ago

Download www.cse.iitb.ac.in

Personalized rendering of web pages gives the users greater control to view only what they prefer. The goal of this work is to provide a tool that will let users customize the con...

Swapna Raj Prabakara Raj, Balaraman Ravindran

claim paper

Read More »

106

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

14 years 4 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

click to vote

SMC
2007
IEEE

118views Control Systems» more SMC 2007»

One-class learning with multi-objective genetic programming

15 years 4 months ago

Download users.cs.dal.ca

One-class classiﬁcation naturally only provides one class of exemplars on which to construct the classiﬁcation model. In this work, multiobjective genetic programming (GP) all...

Robert Curry, Malcolm I. Heywood

claim paper

Read More »

« Prev « First page 41 / 133 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers