Search Sciweavers | Sciweavers

378 search results - page 23 / 76

» Reinforcement Learning for Online Control of Evolutionary Al...

click to vote

COLT
2006
Springer

85views Machine Learning» more COLT 2006»

A Randomized Online Learning Algorithm for Better Variance Control

13 years 11 months ago

Download certis.enpc.fr

We propose a sequential randomized algorithm, which at each step concentrates on functions having both low risk and low variance with respect to the previous step prediction functi...

Jean-Yves Audibert

claim paper

Read More »

click to vote

SIGGRAPH
2010
ACM

295views Computer Graphics» more SIGGRAPH 2010»

Learning behavior styles with inverse reinforcement learning

14 years 2 days ago

Download grail.cs.washington.edu

We present a method for inferring the behavior styles of character controllers from a small set of examples. We show that a rich set of behavior variations can be captured by dete...

Seong Jae Lee, Zoran Popovic

claim paper

Read More »

click to vote

ECAI
2010
Springer

238views Artificial Intelligence» more ECAI 2010»

The Dynamics of Multi-Agent Reinforcement Learning

13 years 8 months ago

Download www.doc.ic.ac.uk

Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

claim paper

Read More »

click to vote

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

13 years 9 months ago

Download www.cs.cmu.edu

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

« Prev « First page 23 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers