Search Sciweavers | Sciweavers

24 search results - page 2 / 5

» Reinforcement learning for optimized trade execution

ICRA
2006
IEEE

131 views, Robotics

Using Reinforcement Learning to Improve Exploration Trajectories for Error Minimization

13 years 11 months ago

Download mapleleaf.csail.mit.edu

The mapping and localization problems have received considerable attention in robotics recently. The exploration problem that drives mapping has started to generate sim...

Thomas Kollar, Nicholas Roy

ICML
1998
IEEE

268 views, Machine Learning

The MAXQ Method for Hierarchical Reinforcement Learning

14 years 6 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich

CORR
2006
Springer

140 views, Education

Nearly optimal exploration-exploitation decision thresholds

13 years 5 months ago

Download www.idiap.ch

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...

Christos Dimitrakakis

posted by olethros

IJCAI
2001

151 views, Artificial Intelligence

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 6 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

IEEEPACT
2008
IEEE

136 views, Distributed And Parallel Com...

Feature selection and policy optimization for distributed instruction placement using reinforcement learning

13 years 11 months ago

Download userweb.cs.utexas.edu

Communication overheads are one of the fundamental challenges in a multiprocessor system. As the number of processors on a chip increases, communication overheads and the distribu...

Katherine E. Coons, Behnam Robatmili, Matthew E. T...
