Search Sciweavers | Sciweavers

85 search results - page 10 / 17

» Approximate Policy Iteration with a Policy Language Bias

click to vote

JMLR
2010

135views more JMLR 2010»

Finite-sample Analysis of Bellman Residual Minimization

13 years 2 months ago

Download jmlr.csail.mit.edu

We consider the Bellman residual minimization approach for solving discounted Markov decision problems, where we assume that a generative model of the dynamics and rewards is avai...

Odalric-Ambrym Maillard, Rémi Munos, Alessa...

claim paper

Read More »

click to vote

ICMLA
2008

195views Machine Learning» more ICMLA 2008»

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

13 years 8 months ago

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux

claim paper

Read More »

click to vote

AIPS
2004

82views Artificial Intelligence» more AIPS 2004»

Learning Domain-Specific Control Knowledge from Random Walks

13 years 8 months ago

Download www2.parc.com

We describe and evaluate a system for learning domainspecific control knowledge. In particular, given a planning domain, the goal is to output a control policy that performs well ...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

click to vote

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

13 years 7 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

click to vote

ICMLA
2010

211views Machine Learning» more ICMLA 2010»

Ensembles of Neural Networks for Robust Reinforcement Learning

13 years 5 months ago

Download ahans.de

Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, their traini...

Alexander Hans, Steffen Udluft

claim paper

Read More »

« Prev « First page 10 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers