Search Sciweavers | Sciweavers

160 search results - page 1 / 32

NIPS
2008

Optimization on a Budget: A Reinforcement Learning Approach

15 years 1 month ago

Download www.cs.arizona.edu

Many popular optimization algorithms, like the Levenberg-Marquardt algorithm (LMA), use heuristic-based "controllers" that modulate the behavior of the optimizer during ...

Paul Ruvolo, Ian R. Fasel, Javier R. Movellan

SAB
2010
Springer

TeXDYNA: Hierarchical Reinforcement Learning in Factored MDPs

14 years 9 months ago

Download www.isir.upmc.fr

Reinforcement learning is one of the main adaptive mechanisms that is both well documented in animal behaviour and giving rise to computational studies in animats and robots. In th...

Olga Kozlova, Olivier Sigaud, Christophe Meyer

CORR
1998
Springer

Training Reinforcement Neurocontrollers Using the Polytope Algorithm

14 years 11 months ago

Download zeus.cs.uoi.gr

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...

Aristidis Likas, Isaac E. Lagaris

ICML
2010
IEEE

Multi-Class Pegasos on a Budget

15 years 20 days ago

Download astro.temple.edu

When equipped with kernel functions, online learning algorithms are susceptible to the "curse of kernelization" that causes unbounded growth in the model size. To addres...

Zhuang Wang, Koby Crammer, Slobodan Vucetic

WSC
2007

Optimizing time warp simulation with reinforcement learning techniques

15 years 1 month ago

Download www.informs-sim.org

Adaptive Time Warp protocols in the literature are usually based on a pre-defined analytic model of the system, expressed as a closed form function that maps system state to cont...

Jun Wang, Carl Tropper
