Search Sciweavers | Sciweavers

154 search results - page 7 / 31

» Sample-Efficient Evolutionary Function Approximation for Rei...

188

click to vote

ESANN
2008

164views Neural Networks» more ESANN 2008»

Multilayer Perceptrons with Radial Basis Functions as Value Functions in Reinforcement Learning

15 years 8 months ago

Download www.dice.ucl.ac.be

Using multilayer perceptrons (MLPs) to approximate the state-action value function in reinforcement learning (RL) algorithms could become a nightmare due to the constant possibilit...

Victor Uc Cetina

claim paper

Read More »

177

click to vote

ICML
2002
IEEE

156views Machine Learning» more ICML 2002»

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs

16 years 7 months ago

Download select.cs.cmu.edu

One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...

Carlos Guestrin, Relu Patrascu, Dale Schuurmans

claim paper

Read More »

179

Voted

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

188

click to vote

ICMLA
2008

195views Machine Learning» more ICMLA 2008»

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

15 years 8 months ago

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux

claim paper

Read More »

175

click to vote

ECAL
2005
Springer

119views Artificial Intelligence» more ECAL 2005»

The Quantitative Law of Effect is a Robust Emergent Property of an Evolutionary Algorithm for Reinforcement Learning

16 years 6 days ago

Download www.psychology.emory.edu

An evolutionary reinforcement-learning algorithm, the operation of which was not associated with an optimality condition, was instantiated in an artificial organism. The algorithm ...

J. J. McDowell, Zahra Ansari

claim paper

Read More »

« Prev « First page 7 / 31 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers