Search Sciweavers | Sciweavers

86 search results - page 2 / 18

» Estimation and Approximation Bounds for Gradient-Based Reinf...

212

click to vote

CORR
2012
Springer

196views Education» more CORR 2012»

PAC-Bayesian Policy Evaluation for Reinforcement Learning

14 years 2 months ago

Download www.cs.mcgill.ca

Bayesian priors oﬀer a compact yet general means of incorporating domain knowledge into many learning tasks. The correctness of the Bayesian analysis and inference, however, lar...

Mahdi Milani Fard, Joelle Pineau, Csaba Szepesv&aa...

claim paper

Read More »

184

click to vote

COLT
1991
Springer

159views Machine Learning» more COLT 1991»

Approximation and Estimation Bounds for Artificial Neural Networks

15 years 10 months ago

Download www.stat.yale.edu

Andrew R. Barron

claim paper

Read More »

201

click to vote

ICCBR
2005
Springer

210views Automated Reasoning» more ICCBR 2005»

CBR for State Value Function Approximation in Reinforcement Learning

16 years 15 days ago

Download ml.informatik.uni-freiburg.de

CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agen...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

218

click to vote

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

15 years 8 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

claim paper

Read More »

180

click to vote

ICML
2010
IEEE

189views Machine Learning» more ICML 2010»

Nonparametric Return Distribution Approximation for Reinforcement Learning

15 years 8 months ago

Download www.icml2010.org

Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...

Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...

claim paper

Read More »

« Prev « First page 2 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers