near-optimal policies

191

ICML
2005
IEEE

133views Machine Learning» more ICML 2005»

A theoretical analysis of Model-Based Interval Estimation

16 years 8 months ago

Several algorithms for learning near-optimal policies in Markov Decision Processes have been analyzed and proven efficient. Empirical results have suggested that Model-based Inter...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

211

click to vote

ICML
2005
IEEE

127views Machine Learning» more ICML 2005»

Exploration and apprenticeship learning in reinforcement learning

16 years 8 months ago

Download ai.stanford.edu

We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers