Search Sciweavers | Sciweavers

995 search results - page 84 / 199

» Learning Useful Horn Approximations

164

Voted

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 3 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

158

click to vote

CORR
2012
Springer

196views Education» more CORR 2012»

PAC-Bayesian Policy Evaluation for Reinforcement Learning

13 years 11 months ago

Download www.cs.mcgill.ca

Bayesian priors oﬀer a compact yet general means of incorporating domain knowledge into many learning tasks. The correctness of the Bayesian analysis and inference, however, lar...

Mahdi Milani Fard, Joelle Pineau, Csaba Szepesv&aa...

claim paper

Read More »

150

click to vote

ML
2002
ACM

168views Machine Learning» more ML 2002»

On Average Versus Discounted Reward Temporal-Difference Learning

15 years 3 months ago

Download web.mit.edu

We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...

John N. Tsitsiklis, Benjamin Van Roy

claim paper

Read More »

163

Voted

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

15 years 9 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

125

Voted

NN
2008
Springer

89views Neural Networks» more NN 2008»

Multilayer in-place learning networks for modeling functional layers in the laminar cortex

15 years 3 months ago

Download www.cse.msu.edu

Currently, there is a lack of general-purpose in-place learning networks that model feature layers in the cortex. By "general-purpose" we mean a general yet adaptive hig...

Juyang Weng, Tianyu Luwang, Hong Lu, Xiangyang Xue

claim paper

Read More »

« Prev « First page 84 / 199 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers