Sciweavers

1262 search results - page 118 / 253
» Reinforcement Learning: An Introduction
Sort
View
ICAART
2010
INSTICC
14 years 5 months ago
A Reinforcement Learning Approach for Multiagent Navigation
Francisco Martinez-Gil, Fernando Barber, Miguel Lo...
ICAART
2010
INSTICC
14 years 5 months ago
A Cautious Approach to Generalization in Reinforcement Learning
Raphael Fonteneau, Susan A. Murphy, Louis Wehenkel...
ISDA
2009
IEEE
14 years 2 months ago
Postponed Updates for Temporal-Difference Reinforcement Learning
This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...
Harm van Seijen, Shimon Whiteson