Sciweavers

114
Voted
JMLR
2002
100views more  JMLR 2002»
15 years 2 days ago
On the Convergence of Optimistic Policy Iteration
We consider a finite-state Markov decision problem and establish the convergence of a special case of optimistic policy iteration that involves Monte Carlo estimation of Q-values,...
John N. Tsitsiklis