Sciweavers

164 search results - page 21 / 33
» Self-Optimizing Memory Controllers: A Reinforcement Learning...
Sort
View
ECML
2007
Springer
14 years 1 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
AAAI
2010
13 years 9 months ago
Learning Simulation Control in General Game-Playing Agents
The aim of General Game Playing (GGP) is to create intelligent agents that can automatically learn how to play many different games at an expert level without any human interventi...
Hilmar Finnsson, Yngvi Björnsson
ECML
2006
Springer
13 years 11 months ago
Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks
Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...
Sébastien Jodogne, Cyril Briquet, Justus H....
ECML
2005
Springer
14 years 1 months ago
Model-Based Online Learning of POMDPs
Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...
Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony
SMC
2007
IEEE
118views Control Systems» more  SMC 2007»
14 years 1 months ago
One-class learning with multi-objective genetic programming
One-class classification naturally only provides one class of exemplars on which to construct the classification model. In this work, multiobjective genetic programming (GP) all...
Robert Curry, Malcolm I. Heywood