Sciweavers

192

JMLR
2002

100views more JMLR 2002»

On the Convergence of Optimistic Policy Iteration

15 years 7 months ago

We consider a finite-state Markov decision problem and establish the convergence of a special case of optimistic policy iteration that involves Monte Carlo estimation of Q-values,...

John N. Tsitsiklis

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers