Search Sciweavers | Sciweavers

37 search results - page 3 / 8

» Analysis of Inverse Reinforcement Learning with Perturbed De...

180

Voted

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 7 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

173

click to vote

ICRA
2010
IEEE

145views Robotics» more ICRA 2010»

Reinforcement learning of motor skills in high dimensions: A path integral approach

15 years 4 months ago

Download www-personal.acfr.usyd.edu.au

— Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far d...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

216

Voted

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

15 years 27 days ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

171

click to vote

ANOR
2005

80views more ANOR 2005»

Entropic Penalties in Finite Games

15 years 6 months ago

Download www.science.unitn.it

The main objects here are finite-strategy games in which entropic terms are subtracted from the payoffs. After such subtraction each Nash equilibrium solves an explicit, unconstra...

Sjur Didrik Flåm, E. Cavazzuti

claim paper

Read More »

162

Voted

ICML
2003
IEEE

104views Machine Learning» more ICML 2003»

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

15 years 11 months ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong

claim paper

Read More »

« Prev « First page 3 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers