Search Sciweavers | Sciweavers

128 search results - page 3 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

170

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

16 years 27 days ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

174

click to vote

ICONIP
2007

147views Information Technology» more ICONIP 2007»

Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents

15 years 8 months ago

Download www.nc.irp.oist.jp

The aim of the Cyber Rodent project [1] is to elucidate the origin of our reward and aﬀective systems by building artiﬁcial agents that share the natural biological constraints...

Eiji Uchibe, Kenji Doya

claim paper

Read More »

215

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 1 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

159

Voted

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

212

Voted

SAB
2010
Springer

189views Optimization» more SAB 2010»

TeXDYNA: Hierarchical Reinforcement Learning in Factored MDPs

15 years 4 months ago

Download www.isir.upmc.fr

Reinforcement learning is one of the main adaptive mechanisms that is both well documented in animal behaviour and giving rise to computational studies in animats and robots. In th...

Olga Kozlova, Olivier Sigaud, Christophe Meyer

claim paper

Read More »

« Prev « First page 3 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers