Sciweavers

1233 search results - page 106 / 247
» Feudal Reinforcement Learning
Sort
View
154
Voted
AAAI
2006
15 years 4 months ago
Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance
As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...
Andrea Lockerd Thomaz, Cynthia Breazeal
175
Voted
ESANN
2008
15 years 4 months ago
Learning to play Tetris applying reinforcement learning methods
In this paper the application of reinforcement learning to Tetris is investigated, particulary the idea of temporal difference learning is applied to estimate the state value funct...
Alexander Groß, Jan Friedland, Friedhelm Sch...
129
Voted
NIPS
2001
15 years 4 months ago
Reinforcement Learning with Long Short-Term Memory
This paper presents reinforcement learning with a Long ShortTerm Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage learning and directed exploration can...
Bram Bakker
123
Voted
NIPS
2001
15 years 4 months ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar
BROADNETS
2007
IEEE
15 years 7 months ago
Reinforcement learning based routing in all-optical networks with physical impairments
Abstract-- We present and evaluate a reinforcement learningbased RWA algorithm for all-optical networks subject to physical impairments. The technique is suitable for decentralized...
Yvan Pointurier, Fariba Heidari