Search Sciweavers | Sciweavers

ICML
2005
IEEE

100 views, Machine Learning

Reinforcement learning with Gaussian processes

16 years 6 months ago

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

ATAL
2005
Springer

130 views, Intelligent Agents

Behavior transfer for value-function-based reinforcement learning

15 years 11 months ago

Download www.cs.huji.ac.il

Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...

Matthew E. Taylor, Peter Stone

TFS
2011

239 views, Education

Systems Control With Generalized Probabilistic Fuzzy-Reinforcement Learning

15 years 26 days ago

Download www.triteq.com

Reinforcement learning (RL) is a valuable learning method when the systems require a selection of control actions whose consequences emerge over long periods for which input– ...

William M. Hinojosa, Samia Nefti, Uzay Kaymak

NETWORKING
2007

110 views, Computer Networks

Reinforcement Learning-Based Load Shared Sequential Routing

15 years 7 months ago

Download www.ece.mcgill.ca

We consider event dependent routing algorithms for on-line explicit source routing in MPLS networks. The proposed methods are based on load shared sequential routing in which load ...

Fariba Heidari, Shie Mannor, Lorne Mason

AR
2007

105 views

Reinforcement learning of a continuous motor sequence with hidden states

15 years 6 months ago

Download www.bdc.brain.riken.go.jp

Reinforcement learning is the scheme for unsupervised learning in which robots are expected to acquire behavior skills through self-explorations based on reward signals. There a...

Hiroaki Arie, Tetsuya Ogata, Jun Tani, Shigeki Sug...
