Search Sciweavers | Sciweavers

NIPS
2001

Reinforcement Learning with Long Short-Term Memory

15 years 7 months ago

This paper presents reinforcement learning with a Long Short-Term Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage learning and directed exploration can...

Bram Bakker

NIPS
2001

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 7 months ago

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

NN
2006
Springer

The misbehavior of value and the discipline of the will

15 years 5 months ago

Most reinforcement learning models of animal conditioning operate under the convenient, though fictive, assumption that Pavlovian conditioning concerns prediction learning whereas...

Peter Dayan, Yael Niv, Ben Seymour, Nathaniel D. D...

NIPS
2000

Programmable Reinforcement Learning Agents

15 years 7 months ago

We present an expressive agent design language for reinforcement learning that allows the user to constrain the policies considered by the learning process. The language includes s...

David Andre, Stuart J. Russell

ICRA
2010
IEEE

Generalized model learning for Reinforcement Learning on a humanoid robot

15 years 4 months ago

Reinforcement learning (RL) algorithms have long been promising methods for enabling an autonomous robot to improve its behavior on sequential decision-making tasks. The obviou...

Todd Hester, Michael Quinlan, Peter Stone
