Sciweavers

513 search results - page 48 / 103
» Metric learning for reinforcement learning agents
Sort
View
ICML
2003
IEEE
16 years 7 months ago
Exploration in Metric State Spaces
We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...
Sham Kakade, Michael J. Kearns, John Langford
205
Voted
JMLR
2010
189views more  JMLR 2010»
15 years 28 days ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ICCBR
2005
Springer
15 years 11 months ago
CBR for State Value Function Approximation in Reinforcement Learning
CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agen...
Thomas Gabel, Martin A. Riedmiller
ATAL
2009
Springer
16 years 21 days ago
Multiagent reinforcement learning: algorithm converging to Nash equilibrium in general-sum discounted stochastic games
This paper introduces a multiagent reinforcement learning algorithm that converges with a given accuracy to stationary Nash equilibria in general-sum discounted stochastic games. ...
Natalia Akchurina
NN
2006
Springer
15 years 6 months ago
Neural systems implicated in delayed and probabilistic reinforcement
This review considers the theoretical problems facing agents that must learn and choose on the basis of reward or reinforcement that is uncertain or delayed, in implicit or proced...
Rudolf N. Cardinal