Sciweavers

1233 search results - page 52 / 247
» Reinforcement learning
Sort
View
ICML
2005
IEEE
14 years 10 months ago
Reinforcement learning with Gaussian processes
Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...
Yaakov Engel, Shie Mannor, Ron Meir
ATAL
2005
Springer
14 years 3 months ago
Behavior transfer for value-function-based reinforcement learning
Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...
Matthew E. Taylor, Peter Stone
TFS
2011
239views Education» more  TFS 2011»
13 years 4 months ago
Systems Control With Generalized Probabilistic Fuzzy-Reinforcement Learning
—Reinforcement learning (RL) is a valuable learning method when the systems require a selection of control actions whose consequences emerge over long periods for which input– ...
William M. Hinojosa, Samia Nefti, Uzay Kaymak
NETWORKING
2007
13 years 11 months ago
Reinforcement Learning-Based Load Shared Sequential Routing
We consider event dependent routing algorithms for on-line explicit source routing in MPLS networks. The proposed methods are based on load shared sequential routing in which load ...
Fariba Heidari, Shie Mannor, Lorne Mason
AR
2007
105views more  AR 2007»
13 years 10 months ago
Reinforcement learning of a continuous motor sequence with hidden states
—Reinforcement learning is the scheme for unsupervised learning in which robots are expected to acquire behavior skills through self-explorations based on reward signals. There a...
Hiroaki Arie, Tetsuya Ogata, Jun Tani, Shigeki Sug...