Search Sciweavers | Sciweavers

582 search results - page 6 / 117

» Reinforcement learning with Gaussian processes

180

click to vote

IJCAI
2001

163views Artificial Intelligence» more IJCAI 2001»

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 8 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

186

Voted

DAGM
2010
Springer

277views Image Processing» more DAGM 2010»

Gaussian Mixture Modeling with Gaussian Process Latent Variable Models

15 years 7 months ago

Download www.kyb.tuebingen.mpg.de

Density modeling is notoriously difficult for high dimensional data. One approach to the problem is to search for a lower dimensional manifold which captures the main characteristi...

Hannes Nickisch, Carl Edward Rasmussen

claim paper

Read More »

160

click to vote

ICML
2003
IEEE

168views Machine Learning» more ICML 2003»

Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning

16 years 7 months ago

Download webee.technion.ac.il

We present a novel Bayesian approach to the problem of value function estimation in continuous state spaces. We define a probabilistic generative model for the value function by i...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

199

click to vote

ICML
2005
IEEE

147views Machine Learning» more ICML 2005»

Learning Gaussian processes from multiple tasks

16 years 7 months ago

Download wwwbrauer.in.tum.de

We consider the problem of multi-task learning, that is, learning multiple related functions. Our approach is based on a hierarchical Bayesian framework, that exploits the equival...

Kai Yu, Volker Tresp, Anton Schwaighofer

claim paper

Read More »

191

Voted

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 10 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

claim paper

Read More »

« Prev « First page 6 / 117 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers