Sciweavers

163

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

16 years 6 days ago

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers