Search Sciweavers | Sciweavers

51 search results - page 7 / 11

» Characterizing reinforcement learning methods through parame...

168

click to vote

GECCO
2008
Springer

128views Optimization» more GECCO 2008»

Adapted Pittsburgh classifier system: building accurate strategies in non markovian environments

15 years 7 months ago

Download www.cs.bham.ac.uk

This paper focuses on the study of the behavior of a genetic algorithm based classiﬁer system, the Adapted Pittsburgh Classiﬁer System (A.P.C.S), on maze type environments con...

Gilles Énée, Mathias Péroumal...

claim paper

Read More »

199

click to vote

CI
2005

106views more CI 2005»

Incremental Learning of Procedural Planning Knowledge in Challenging Environments

15 years 6 months ago

Download www.sunnyhome.org

Autonomous agents that learn about their environment can be divided into two broad classes. One class of existing learners, reinforcement learners, typically employ weak learning ...

Douglas J. Pearson, John E. Laird

claim paper

Read More »

150

click to vote

ATAL
2003
Springer

172views Intelligent Agents» more ATAL 2003»

Resource allocation games with changing resource capacities

15 years 12 months ago

Download www.isi.edu

In this paper we study a class of resource allocation games which are inspired by the El Farol Bar problem. We consider a system of competitive agents that have to choose between ...

Aram Galstyan, Shashikiran Kolar, Kristina Lerman

claim paper

Read More »

243

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

15 years 4 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

182

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 25 days ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

« Prev « First page 7 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers