Search Sciweavers | Sciweavers

1233 search results - page 106 / 247

» Feudal Reinforcement Learning

AAAI
2006

127 views, Intelligent Agents

Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance

15 years 7 months ago

Download robotic.media.mit.edu

As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...

Andrea Lockerd Thomaz, Cynthia Breazeal

ESANN
2008

278 views, Neural Networks

Learning to play Tetris applying reinforcement learning methods

15 years 7 months ago

Download www.dice.ucl.ac.be

In this paper the application of reinforcement learning to Tetris is investigated; in particular, the idea of temporal difference learning is applied to estimate the state value funct...

Alexander Groß, Jan Friedland, Friedhelm Sch...

NIPS
2001

101 views, Information Technology

Reinforcement Learning with Long Short-Term Memory

15 years 7 months ago

Download staff.science.uva.nl

This paper presents reinforcement learning with a Long Short-Term Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage learning and directed exploration can...

Bram Bakker

NIPS
2001

121 views, Information Technology

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 7 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state-action ...

Gregory Z. Grudic, Lyle H. Ungar

BROADNETS
2007
IEEE

119 views, Computer Networks

Reinforcement learning based routing in all-optical networks with physical impairments

15 years 10 months ago

Download www.tsp.ece.mcgill.ca

We present and evaluate a reinforcement learning-based RWA algorithm for all-optical networks subject to physical impairments. The technique is suitable for decentralized...

Yvan Pointurier, Fariba Heidari
