Search Sciweavers | Sciweavers

164 search results - page 19 / 33

» Self-Optimizing Memory Controllers: A Reinforcement Learning...

153

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

16 years 28 days ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

236

click to vote

ECML
2006
Springer

146views Machine Learning» more ECML 2006»

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

15 years 10 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

207

click to vote

ICML
2005
IEEE

196views Machine Learning» more ICML 2005»

Bayesian sparse sampling for on-line reward optimization

16 years 7 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

226

click to vote

NIPS
1992

160views Information Technology» more NIPS 1992»

Explanation-Based Neural Network Learning for Robot Control

15 years 8 months ago

Download www.ri.cmu.edu

How can artificial neural nets generalize better from fewer examples? In order to generalize successfully, neural network learning methods typically require large training data se...

Tom M. Mitchell, Sebastian Thrun

claim paper

Read More »

175

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

16 years 1 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

« Prev « First page 19 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers