ATAL
2009
Springer
14 years 6 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
ATAL
2009
Springer
14 years 6 months ago
An agent oriented hotel information system
Armando Robles P, Pablo Noriega, Francisco J. Cant...