Search Sciweavers | Sciweavers

164 search results - page 21 / 33

» Self-Optimizing Memory Controllers: A Reinforcement Learning...

185

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 27 days ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

153

click to vote

AAAI
2010

194views Intelligent Agents» more AAAI 2010»

Learning Simulation Control in General Game-Playing Agents

15 years 8 months ago

Download www.ru.is

The aim of General Game Playing (GGP) is to create intelligent agents that can automatically learn how to play many different games at an expert level without any human interventi...

Hilmar Finnsson, Yngvi Björnsson

claim paper

Read More »

167

Voted

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 10 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

168

click to vote

ECML
2005
Springer

101views Machine Learning» more ECML 2005»

Model-Based Online Learning of POMDPs

16 years 8 days ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difﬁcult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

claim paper

Read More »

179

click to vote

SMC
2007
IEEE

118views Control Systems» more SMC 2007»

One-class learning with multi-objective genetic programming

16 years 1 months ago

Download users.cs.dal.ca

One-class classiﬁcation naturally only provides one class of exemplars on which to construct the classiﬁcation model. In this work, multiobjective genetic programming (GP) all...

Robert Curry, Malcolm I. Heywood

claim paper

Read More »

« Prev « First page 21 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers