Sciweavers

150

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

15 years 1 months ago

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers