Sciweavers

ICML
2006
IEEE

For a Markov Decision Process with finite state space (size S) and finite action space (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
