Search Sciweavers | Sciweavers

14 search results - page 3 / 3

» Near-Optimal Reinforcement Learning in Polynomial Time

132

click to vote

ALT
1998
Springer

101views Machine Learning» more ALT 1998»

Predictive Learning Models for Concept Drift

15 years 10 months ago

Download www.comp.nus.edu.sg

Concept drift means that the concept about which data is obtained may shift from time to time, each time after some minimum permanence. Except for this minimum permanence, the con...

John Case, Sanjay Jain, Susanne Kaufmann, Arun Sha...

claim paper

Read More »

162

Voted

ICML
2003
IEEE

104views Machine Learning» more ICML 2003»

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

15 years 11 months ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong

claim paper

Read More »

211

click to vote

JAIR
2008

148views more JAIR 2008»

Learning Partially Observable Deterministic Action Models

15 years 6 months ago

Download www.jair.org

We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...

Eyal Amir, Allen Chang

claim paper

Read More »

132

click to vote

CACM
2010

105views more CACM 2010»

Censored exploration and the dark pool problem

15 years 6 months ago

Download www.cis.upenn.edu

We introduce and analyze a natural algorithm for multi-venue exploration from censored data, which is motivated by the Dark Pool Problem of modern quantitative finance. We prove t...

Kuzman Ganchev, Yuriy Nevmyvaka, Michael Kearns, J...

claim paper

Read More »

« Prev « First page 3 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers