Sciweavers

112
Voted
JMLR
2002
100views more  JMLR 2002»
15 years 23 hour ago
On the Convergence of Optimistic Policy Iteration
We consider a finite-state Markov decision problem and establish the convergence of a special case of optimistic policy iteration that involves Monte Carlo estimation of Q-values,...
John N. Tsitsiklis