Search Sciweavers | Sciweavers

» Metric learning for reinforcement learning agents

ICML
2003
IEEE

124 views, Machine Learning

Exploration in Metric State Spaces

16 years 7 months ago

Download www.cis.upenn.edu

We present metric-E3, a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 28 days ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

ICCBR
2005
Springer

210 views, Automated Reasoning

CBR for State Value Function Approximation in Reinforcement Learning

15 years 11 months ago

Download ml.informatik.uni-freiburg.de

CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agen...

Thomas Gabel, Martin A. Riedmiller

ATAL
2009
Springer

184 views, Intelligent Agents

Multiagent reinforcement learning: algorithm converging to Nash equilibrium in general-sum discounted stochastic games

16 years 21 days ago

Download www.aamas-conference.org

This paper introduces a multiagent reinforcement learning algorithm that converges with a given accuracy to stationary Nash equilibria in general-sum discounted stochastic games. ...

Natalia Akchurina

NN
2006
Springer

72 views, Neural Networks

Neural systems implicated in delayed and probabilistic reinforcement

15 years 6 months ago

Download egret.psychol.cam.ac.uk

This review considers the theoretical problems facing agents that must learn and choose on the basis of reward or reinforcement that is uncertain or delayed, in implicit or proced...

Rudolf N. Cardinal
