Sciweavers

1233 search results - page 144 / 247
» Reinforcement Learning in MirrorBot
Sort
View
ECML
2004
Springer
15 years 3 months ago
Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework
Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...
Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...
ICML
2002
IEEE
15 years 10 months ago
Reinforcement Learning and Shaping: Encouraging Intended Behaviors
We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial...
Adam Laud, Gerald DeJong
IJCNN
2006
IEEE
15 years 4 months ago
Reinforcement Learning Control for Biped Robot Walking on Uneven Surfaces
— Biped robots based on the concept of (passive) dynamic walking are far simpler than the traditional fullycontrolled walking robots, while achieving a more natural gait and cons...
Shouyi Wang, Jelmer Braaksma, Robert Babuska, Daan...
CEEMAS
2003
Springer
15 years 3 months ago
On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam's Razor
Modeling learning agents in the context of Multi-agent Systems requires an adequate understanding of their dynamic behaviour. Usually, these agents are modeled similar to the diï¬...
Karl Tuyls, Katja Verbeeck, Sam Maes
ICML
2002
IEEE
15 years 10 months ago
Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs
One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...
Carlos Guestrin, Relu Patrascu, Dale Schuurmans