Sciweavers

70 search results - page 9 / 14
» Reinforcement Learning: Past, Present and Future
Sort
View
BMEI
2008
IEEE
14 years 1 months ago
A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy
Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...
Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...
GECCO
2008
Springer
128views Optimization» more  GECCO 2008»
13 years 8 months ago
Multi-agent task allocation: learning when to say no
This paper presents a communication-less multi-agent task allocation procedure that allows agents to use past experience to make non-greedy decisions about task assignments. Exper...
Adam Campbell, Annie S. Wu, Randall Shumaker
ECAI
2008
Springer
13 years 9 months ago
Learning to Select Object Recognition Methods for Autonomous Mobile Robots
Selecting which algorithms should be used by a mobile robot computer vision system is a decision that is usually made a priori by the system developer, based on past experience and...
Reinaldo A. C. Bianchi, Arnau Ramisa, Ramon L&oacu...
ICANN
2007
Springer
14 years 1 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...
ECML
2007
Springer
14 years 1 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber