value function | Sciweavers

18

ICML
1999
IEEE

152views Machine Learning» more ICML 1999»

14 years 5 months ago

Many interesting problems, such as power grids, network switches, and tra c ow, that are candidates for solving with reinforcement learningRL, alsohave properties that make distri...

Jeff G. Schneider, Weng-Keen Wong, Andrew W. Moore...

claim paper

Read More »

20

click to vote

ICML
2000
IEEE

169views Machine Learning» more ICML 2000»

Rates of Convergence for Variable Resolution Schemes in Optimal Control

14 years 5 months ago

Download sequel.futurs.inria.fr

This paper presents a general method to derive tight rates of convergence for numerical approximations in optimal control when we consider variable resolution grids. We study the ...

Andrew W. Moore, Rémi Munos

claim paper

Read More »

16

click to vote

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

14 years 5 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers