Sciweavers

1235 search results - page 33 / 247
» Reinforcement learning in a nutshell
Sort
View
ATAL
2009
Springer
14 years 3 months ago
Learning with whom to communicate using relational reinforcement learning
Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, ...
NIPS
1998
13 years 10 months ago
Gradient Descent for General Reinforcement Learning
A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...
Leemon C. Baird III, Andrew W. Moore
AUSAI
2005
Springer
14 years 2 months ago
Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning
: In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...
Peter Vamplew, Robert Ollington