Search Sciweavers | Sciweavers

272 search results - page 10 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

188

click to vote

ESANN
2008

164views Neural Networks» more ESANN 2008»

Multilayer Perceptrons with Radial Basis Functions as Value Functions in Reinforcement Learning

15 years 8 months ago

Download www.dice.ucl.ac.be

Using multilayer perceptrons (MLPs) to approximate the state-action value function in reinforcement learning (RL) algorithms could become a nightmare due to the constant possibilit...

Victor Uc Cetina

claim paper

Read More »

188

click to vote

ICMLA
2008

195views Machine Learning» more ICMLA 2008»

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

15 years 8 months ago

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux

claim paper

Read More »

244

click to vote

EWRL
2008

191views Machine Learning» more EWRL 2008»

Bayesian Reward Filtering

15 years 8 months ago

Download www.metz.supelec.fr

A wide variety of function approximation schemes have been applied to reinforcement learning. However, Bayesian filtering approaches, which have been shown efficient in other field...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

146

click to vote

ICML
2009
IEEE

120views Machine Learning» more ICML 2009»

Fast gradient-descent methods for temporal-difference learning with linear function approximation

16 years 7 months ago

Download carbon.videolectures.net

Csaba Szepesvári, David Silver, Doina Precu...

claim paper

Read More »

267

click to vote

Publication

334views

Rollout Sampling Approximate Policy Iteration

16 years 3 months ago

Download www.springerlink.com

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

« Prev « First page 10 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers