Sciweavers

24 search results - page 2 / 5
» Reinforcement learning for optimized trade execution
Sort
View
ICRA
2006
IEEE
131views Robotics» more  ICRA 2006»
13 years 11 months ago
Using Reinforcement Learning to Improve Exploration Trajectories for Error Minimization
Abstract— The mapping and localization problems have received considerable attention in robotics recently. The exploration problem that drives mapping has started to generate sim...
Thomas Kollar, Nicholas Roy
ICML
1998
IEEE
14 years 6 months ago
The MAXQ Method for Hierarchical Reinforcement Learning
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...
Thomas G. Dietterich
CORR
2006
Springer
140views Education» more  CORR 2006»
13 years 5 months ago
Nearly optimal exploration-exploitation decision thresholds
While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...
Christos Dimitrakakis
IJCAI
2001
13 years 6 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz
IEEEPACT
2008
IEEE
13 years 11 months ago
Feature selection and policy optimization for distributed instruction placement using reinforcement learning
Communication overheads are one of the fundamental challenges in a multiprocessor system. As the number of processors on a chip increases, communication overheads and the distribu...
Katherine E. Coons, Behnam Robatmili, Matthew E. T...