Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

175

Voted

NIPS
2003

105views Information Technology» more NIPS 2003»

Gaussian Processes in Reinforcement Learning

15 years 8 months ago

Gaussian Processes in Reinforcement Learning

Download books.nips.cc

We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP model allows evaluation of the value function in closed form. The resulting policy iteration algorithm is demonstrated on a simple problem with a two dimensional state space. Further, we speculate that the intrinsic ability of GP models to characterise distributions of functions would allow the method to capture entire distributions over future values instead of merely their expectation, which has traditionally been the focus of much of reinforcement learning.

Carl Edward Rasmussen, Malte Kuss

Real-time Traffic

Gp Models | NIPS 2003 | NIPS 2007 | Reinforcement Learning | State Space |

claim paper

Related Content

» Reinforcement learning with Gaussian processes

» Graph Kernels and Gaussian Processes for Relational Reinforcement Learning

» Gaussian Processes for Sample Efficient Reinforcement Learning with RMAXLike Exploration

» Bayesian reinforcement learning in continuous POMDPs with gaussian processes

» Gaussian Processes and Reinforcement Learning for Identification and Control of an Autonom...

» Adaptive autonomous control using online value iteration with gaussian processes

» Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian p...

» Reinforcement learning agents with primary knowledge designed by analytic hierarchy proces...

» Autonomous blimp control using modelfree reinforcement learning in a continuous state and ...

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	NIPS
Authors	Carl Edward Rasmussen, Malte Kuss

Comments (0)