Sciweavers

NIPS 2004

We consider an MDP setting in which the reward function is allowed to change during each time step of play (possibly in an adversarial manner), yet the dynamics remain fixed. Simi...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour
