reward horizon | Sciweavers

217

CORR
2006
Springer

140views Education» more CORR 2006»

Nearly optimal exploration-exploitation decision thresholds

15 years 6 months ago

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...

Christos Dimitrakakis

posted by olethros

Read More »

178

click to vote

ICML
2003
IEEE

104views Machine Learning» more ICML 2003»

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

15 years 12 months ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers