Search Sciweavers | Sciweavers

» Reinforcement Using Supervised Learning for Policy Generaliz...

ICMLA
2010

Machine Learning

Multimodal Parameter-exploring Policy Gradients

Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

ILP
2007
Springer

Automated Reasoning

Learning Relational Options for Inductive Transfer in Relational Reinforcement Learning

In reinforcement learning problems, an agent has the task of learning a good or optimal strategy from interaction with his environment. At the start of the learning task, the agent...

Tom Croonenborghs, Kurt Driessens, Maurice Bruynoo...

ICML
1997
IEEE

Machine Learning

Hierarchical Explanation-Based Reinforcement Learning

Explanation-Based Reinforcement Learning (EBRL) was introduced by Dietterich and Flann as a way of combining the ability of Reinforcement Learning (RL) to learn optimal plans with...

Prasad Tadepalli, Thomas G. Dietterich

GECCO
2009
Springer

Optimization

Uncertainty handling CMA-ES for reinforcement learning

The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

IEICET
2007

Generalization Error Estimation for Non-linear Learning Methods

Estimating the generalization error is one of the key ingredients of supervised learning since a good generalization error estimator can be used for model selection. An unbiased g...

Masashi Sugiyama
