Sciweavers

473 search results - page 41 / 95
» Optimal policy switching algorithms for reinforcement learni...
Sort
View
IROS
2006
IEEE
113views Robotics» more  IROS 2006»
14 years 1 months ago
Policy Gradient Methods for Robotics
— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...
Jan Peters, Stefan Schaal
ECML
2004
Springer
14 years 1 months ago
Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework
Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...
Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...
ATAL
2005
Springer
14 years 1 months ago
Coordinated exploration in multi-agent reinforcement learning: an application to load-balancing
This paper is concerned with how multi-agent reinforcement learning algorithms can practically be applied to real-life problems. Recently, a new coordinated multi-agent exploratio...
Katja Verbeeck, Ann Nowé, Karl Tuyls
NIPS
2003
13 years 9 months ago
Gaussian Processes in Reinforcement Learning
We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP mod...
Carl Edward Rasmussen, Malte Kuss
ICML
1996
IEEE
14 years 8 months ago
Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning
Research in reinforcementlearning (RL)has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the averagereward frame...
Sridhar Mahadevan