Search Sciweavers | Sciweavers

1236 search results - page 72 / 248

» Opposition-Based Reinforcement Learning

207

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 1 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

162

click to vote

ATAL
2003
Springer

154views Intelligent Agents» more ATAL 2003»

Coordination in multiagent reinforcement learning: a Bayesian approach

15 years 11 months ago

Download www.cs.toronto.edu

Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinfor...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

174

click to vote

IJCAI
2007

173views Artificial Intelligence» more IJCAI 2007»

Reinforcement Learning of Local Shape in the Game of Go

15 years 7 months ago

Download webdocs.cs.ualberta.ca

We explore an application to the game of Go of a reinforcement learning approach based on a linear evaluation function and large numbers of binary features. This strategy has prov...

David Silver, Richard S. Sutton, Martin Mülle...

claim paper

Read More »

178

click to vote

DAGM
2006
Springer

121views Image Processing» more DAGM 2006»

Handling Camera Movement Constraints in Reinforcement Learning Based Active Object Recognition

15 years 10 months ago

Download www5.informatik.uni-erlangen.de

In real world scenes, objects to be classified are usually not visible from every direction, since they are almost always positioned on some kind of opaque plane. When moving a cam...

Christian Derichs, Heinrich Niemann

claim paper

Read More »

208

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

15 years 12 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

« Prev « First page 72 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers