Sciweavers

» Reinforcement Learning with Time
NIPS
1998
13 years 10 months ago
Gradient Descent for General Reinforcement Learning
A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcement learning algorithms. These algorithms solve a number ...
Leemon C. Baird III, Andrew W. Moore
AUSAI
2005
Springer
14 years 2 months ago
Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning
In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...
Peter Vamplew, Robert Ollington
IAT
2003
IEEE
14 years 2 months ago
Integrating Reinforcement Learning, Bidding and Genetic Algorithms
This paper presents a multi-agent reinforcement learning bidding approach (MARLBS) for performing multi-agent reinforcement learning. MARLBS integrates reinforcement learning, bid...
Dehu Qi, Ron Sun