Sciweavers

827 search results - page 87 / 166
» Variational methods for Reinforcement Learning
Sort
View
ICANN
2007
Springer
14 years 2 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...
CLOR
2006
13 years 11 months ago
Sequential Learning of Layered Models from Video
Abstract. A popular framework for the interpretation of image sequences is the layers or sprite model, see e.g. [1], [2]. Jojic and Frey [3] provide a generative probabilistic mode...
Michalis K. Titsias, Christopher K. I. Williams
IJCAI
1993
13 years 9 months ago
Learning of Resource Allocation Strategies for Game Playing
Human chess players exhibit a large variation in the amount of time they allocate for each move. Yet, the problem of devising resource allocation strategies for game playing did n...
Shaul Markovitch, Yaron Sella
ECML
2006
Springer
13 years 11 months ago
Efficient Non-linear Control Through Neuroevolution
Abstract. Many complex control problems are not amenable to traditional controller design. Not only is it difficult to model real systems, but often it is unclear what kind of beha...
Faustino J. Gomez, Jürgen Schmidhuber, Risto ...
WCE
2007
13 years 9 months ago
Neural Networks for Optimal Control of Aircraft Landing Systems
Abstract—In this work we present a variational formulation for a multilayer perceptron neural network. With this formulation any learning task for the neural network is defined ...
Kevin Lau, Roberto Lopez, Eugenio Oñate