Search Sciweavers | Sciweavers

473 search results - page 9 / 95

» Optimal policy switching algorithms for reinforcement learni...

179

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 7 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

215

click to vote

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

15 years 25 days ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

185

click to vote

UAI
2008

236views Artificial Intelligence» more UAI 2008»

CORL: A Continuous-state Offset-dynamics Reinforcement Learner

15 years 7 months ago

Download uai2008.cs.helsinki.fi

Continuous state spaces and stochastic, switching dynamics characterize a number of rich, realworld domains, such as robot navigation across varying terrain. We describe a reinfor...

Emma Brunskill, Bethany R. Leffler, Lihong Li, Mic...

claim paper

Read More »

119

click to vote

ML
2000
ACM

133views Machine Learning» more ML 2000»

Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms

15 years 5 months ago

Download www.cs.rutgers.edu

Satinder P. Singh, Tommi Jaakkola, Michael L. Litt...

claim paper

Read More »

170

click to vote

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

Markov Games as a Framework for Multi-Agent Reinforcement Learning

15 years 9 months ago

Download www.cs.rutgers.edu

In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....

Michael L. Littman

claim paper

Read More »

« Prev « First page 9 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers