Search Sciweavers | Sciweavers

163 search results - page 7 / 33

» Policy Gradient Methods for Robotics

156

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

206

Voted

ICRA
2009
IEEE

255views Robotics» more ICRA 2009»

Combining color-based invariant gradient detector with HoG descriptors for robust image detection in scenes under cast shadows

16 years 1 months ago

Download urus.upc.es

— In this work we present a robust detection method in outdoor scenes under cast shadows using color based invariant gradients in combination with HoG local features. The method ...

Michael Villamizar, Jorge Scandaliaris, Alberto Sa...

claim paper

Read More »

167

click to vote

ICRA
2009
IEEE

125views Robotics» more ICRA 2009»

A novel method for learning policies from constrained motion

16 years 1 months ago

Download www.ipab.informatics.ed.ac.uk

— Many everyday human skills can be framed in terms of performing some task subject to constraints imposed by the environment. Constraints are usually unobservable and frequently...

Matthew Howard, Stefan Klanke, Michael Gienger, Ch...

claim paper

Read More »

147

Voted

MATES
2004
Springer

87views Intelligent Agents» more MATES 2004»

Policies for Cloned Teleo-reactive Robots

16 years 19 hour ago

Download www.doc.ic.ac.uk

This paper presents a new method for predicting the values of policies for cloned multiple teleo-reactive robots operating in the context of exogenous events. A teleo-reactive robo...

Krysia Broda, Christopher J. Hogger

claim paper

Read More »

171

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 6 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

« Prev « First page 7 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers