Search Sciweavers | Sciweavers

50 search results - page 4 / 10

» Nonparametric Return Distribution Approximation for Reinforc...

112

Voted

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

14 years 9 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

148

click to vote

CORR
2012
Springer

196views Education» more CORR 2012»

PAC-Bayesian Policy Evaluation for Reinforcement Learning

13 years 10 months ago

Download www.cs.mcgill.ca

Bayesian priors oﬀer a compact yet general means of incorporating domain knowledge into many learning tasks. The correctness of the Bayesian analysis and inference, however, lar...

Mahdi Milani Fard, Joelle Pineau, Csaba Szepesv&aa...

claim paper

Read More »

138

click to vote

ICAC
2008
IEEE

99views Applied Computing» more ICAC 2008»

Utility-Based Reinforcement Learning for Reactive Grids

15 years 9 months ago

Download hal.inria.fr

—Large scale production grids are an important case for autonomic computing. They follow a mutualization paradigm: decision-making (human or automatic) is distributed and largely...

Julien Perez, Cécile Germain-Renaud, Bal&aa...

claim paper

Read More »

147

click to vote

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

15 years 8 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

121

click to vote

ICML
2009
IEEE

122views Machine Learning» more ICML 2009»

Tractable nonparametric Bayesian inference in Poisson processes with Gaussian process intensities

16 years 3 months ago

Download www.cs.toronto.edu

The inhomogeneous Poisson process is a point process that has varying intensity across its domain (usually time or space). For nonparametric Bayesian modeling, the Gaussian proces...

Ryan Prescott Adams, Iain Murray, David J. C. MacK...

claim paper

Read More »

« Prev « First page 4 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers