Sciweavers

157

Voted

NIPS
2003

108views Information Technology» more NIPS 2003»

15 years 8 months ago

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers