Search Sciweavers | Sciweavers

827 search results - page 22 / 166

» Variational methods for Reinforcement Learning

ECCV
2008
Springer

Local Regularization for Multiclass Classification Facing Significant Intraclass Variations

We propose a new local learning scheme that is based on the principle of decisiveness: the learned classifier is expected to exhibit large variability in the direction of the test ...

Lior Wolf, Yoni Donner

TSMC
2008

Ensemble Algorithms in Reinforcement Learning

This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and fin...

Marco A. Wiering, Hado van Hasselt

AAAI
2006

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

AAAI
2012

Kernel-Based Reinforcement Learning on Representative States

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...

Branislav Kveton, Georgios Theocharous

IWLCS
2005
Springer

Counter Example for Q-Bucket-Brigade Under Prediction Problem

Aiming to clarify the convergence or divergence conditions for Learning Classifier System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara
