Sciweavers

1630 search results - page 121 / 326
» Coordinated Reinforcement Learning
Sort
View
ISDA
2009
IEEE
15 years 10 months ago
Postponed Updates for Temporal-Difference Reinforcement Learning
This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...
Harm van Seijen, Shimon Whiteson
60
Voted
ATAL
2009
Springer
15 years 10 months ago
Transferring experience in reinforcement learning through task decomposition
Ioannis Partalas, Grigorios Tsoumakas, Konstantino...