Sciweavers

113 search results - page 18 / 23
» Learning Representation and Control in Continuous Markov Dec...
ICML 2007 (IEEE)
Learning state-action basis functions for hierarchical MDPs
This paper introduces a new approach to action-value function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...
Sarah Osentoski, Sridhar Mahadevan
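The spectral idea behind this work can be sketched in a few lines: build a graph over states, take its Laplacian, and use the smoothest eigenvectors as basis functions for value-function approximation. The chain graph and all constants below are illustrative stand-ins, not the authors' state-action construction:

```python
import numpy as np

# Illustrative sketch: derive basis functions from the graph Laplacian
# of a small chain of states (proto-value-function style). The chain
# graph stands in for the state-action manifold used in the paper.
n = 10
A = np.zeros((n, n))
for i in range(n - 1):          # undirected chain: i <-> i+1
    A[i, i + 1] = A[i + 1, i] = 1.0

D = np.diag(A.sum(axis=1))
L = D - A                       # combinatorial graph Laplacian

# Eigenvectors of L with the smallest eigenvalues vary smoothly over
# the graph, so they make natural basis functions for approximation.
eigvals, eigvecs = np.linalg.eigh(L)    # eigenvalues in ascending order
basis = eigvecs[:, :4]                  # keep the 4 smoothest functions
print(basis.shape)                      # (10, 4)
```

A learned value function is then fit as a linear combination of these columns; the first eigenvector is constant over a connected graph, matching the intuition that the smoothest possible function is flat.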
JMLR 2010
Learning the Structure of Deep Sparse Graphical Models
Deep belief networks are a powerful way to model complex probability distributions. However, it is difficult to learn the structure of a belief network, particularly one with hidd...
Ryan Prescott Adams, Hanna M. Wallach, Zoubin Ghah...
NIPS 2001
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
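The core variance-reduction device studied in this line of work, subtracting a baseline from the reward inside a score-function gradient estimate, can be sketched on a toy 2-armed bandit. The bandit, the baseline value, and all constants are illustrative, not the paper's constructions:

```python
import numpy as np

# Minimal sketch of baseline variance reduction for a REINFORCE-style
# gradient estimate on a 2-armed bandit with a sigmoid policy.
rng = np.random.default_rng(0)
theta = 0.0                               # logit of P(action = 1)

def grad_estimates(baseline, n=5000):
    p = 1.0 / (1.0 + np.exp(-theta))      # sigmoid policy
    a = rng.random(n) < p                 # sampled actions
    r = np.where(a, 1.0, 0.0) + rng.normal(0, 1, n)  # noisy rewards
    score = np.where(a, 1.0 - p, -p)      # d/dtheta log pi(a)
    return score * (r - baseline)         # per-sample gradient samples

g_no_base = grad_estimates(baseline=0.0)
g_base = grad_estimates(baseline=0.5)     # baseline near mean reward
print(g_no_base.var(), g_base.var())      # baseline lowers variance
```

Because the baseline is independent of the action, the estimator stays unbiased (both sample means agree) while its variance drops, which is exactly the trade-off these techniques exploit.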
TSMC 2011
Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions
This paper introduces an algorithm for direct search of control policies in continuous-state, discrete-action Markov decision processes. The algorithm looks for the best closed-l...
Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...
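The cross-entropy optimization loop at the heart of such direct policy search can be sketched generically: sample candidate parameter vectors from a Gaussian, score them, refit the Gaussian to the elite fraction, and repeat. The quadratic `evaluate` function below is a stand-in for a policy's return, and all constants are illustrative:

```python
import numpy as np

# Minimal cross-entropy method (CEM) sketch for direct policy search.
# `evaluate` stands in for the return of a parameterized policy.
rng = np.random.default_rng(1)
target = np.array([0.5, -1.0])              # unknown best parameters

def evaluate(params):
    return -np.sum((params - target) ** 2)  # higher is better

mean, std = np.zeros(2), np.ones(2)
for _ in range(30):
    samples = rng.normal(mean, std, size=(50, 2))  # candidate policies
    scores = np.array([evaluate(s) for s in samples])
    elite = samples[np.argsort(scores)[-10:]]      # top 20% of samples
    mean, std = elite.mean(axis=0), elite.std(axis=0) + 1e-3
print(mean)                                 # converges toward target
```

The small constant added to `std` keeps the sampling distribution from collapsing prematurely, a common practical safeguard in CEM implementations.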
ATAL 2009 (Springer)
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...
Michael R. James, Satinder P. Singh
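The Sarsa(λ) machinery this paper builds on can be sketched in tabular form on a fully observable chain, leaving out the POMDP and landmark aspects entirely; the environment, constants, and tie-breaking rule below are illustrative:

```python
import numpy as np

# Tabular Sarsa(lambda) with accumulating eligibility traces on a
# small deterministic chain; reward 1 for reaching the right end.
n_states, n_actions = 5, 2      # actions: 0 = left, 1 = right
gamma, alpha, lam, eps = 0.9, 0.1, 0.8, 0.1
Q = np.zeros((n_states, n_actions))
rng = np.random.default_rng(2)

def step(s, a):
    s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
    return s2, float(s2 == n_states - 1), s2 == n_states - 1

def policy(s):                  # epsilon-greedy with random tie-breaks
    if rng.random() < eps:
        return int(rng.integers(n_actions))
    best = np.flatnonzero(Q[s] == Q[s].max())
    return int(rng.choice(best))

for _ in range(200):
    e = np.zeros_like(Q)        # eligibility traces, reset per episode
    s, a, done = 0, policy(0), False
    while not done:
        s2, r, done = step(s, a)
        a2 = policy(s2)
        delta = r + gamma * Q[s2, a2] * (not done) - Q[s, a]
        e[s, a] += 1.0          # accumulating trace for current pair
        Q += alpha * delta * e  # update every pair by its trace
        e *= gamma * lam        # decay all traces
        s, a = s2, a2

print(np.round(Q[:, 1], 2))     # right-action values grow toward goal
```

The trace decay `gamma * lam` is what spreads each temporal-difference error back along the recently visited state-action pairs, the property that makes Sarsa(λ) effective even under the partial observability studied here.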