Sciweavers

1233 search results - page 27 / 247
» Reinforcement learning
Sort
View
NIPS
2001
13 years 10 months ago
Reinforcement Learning with Long Short-Term Memory
This paper presents reinforcement learning with a Long ShortTerm Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage learning and directed exploration can...
Bram Bakker
NIPS
2001
13 years 10 months ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar
NN
2006
Springer
13 years 8 months ago
The misbehavior of value and the discipline of the will
Most reinforcement learning models of animal conditioning operate under the convenient, though fictive, assumption that Pavlovian conditioning concerns prediction learning whereas...
Peter Dayan, Yael Niv, Ben Seymour, Nathaniel D. D...
NIPS
2000
13 years 10 months ago
Programmable Reinforcement Learning Agents
We present an expressive agent design language for reinforcement learning that allows the user to constrain the policies considered by the learning process.The language includes s...
David Andre, Stuart J. Russell
ICRA
2010
IEEE
133views Robotics» more  ICRA 2010»
13 years 7 months ago
Generalized model learning for Reinforcement Learning on a humanoid robot
— Reinforcement learning (RL) algorithms have long been promising methods for enabling an autonomous robot to improve its behavior on sequential decision-making tasks. The obviou...
Todd Hester, Michael Quinlan, Peter Stone