Search Sciweavers | Sciweavers

827 search results - page 87 / 166

» Variational methods for Reinforcement Learning

172

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

16 years 1 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

240

click to vote

CLOR
2006

158views Computer Vision» more CLOR 2006»

Sequential Learning of Layered Models from Video

15 years 11 months ago

Download homepages.inf.ed.ac.uk

Abstract. A popular framework for the interpretation of image sequences is the layers or sprite model, see e.g. [1], [2]. Jojic and Frey [3] provide a generative probabilistic mode...

Michalis K. Titsias, Christopher K. I. Williams

claim paper

Read More »

177

click to vote

IJCAI
1993

81views Artificial Intelligence» more IJCAI 1993»

Learning of Resource Allocation Strategies for Game Playing

15 years 8 months ago

Download dli.iiit.ac.in

Human chess players exhibit a large variation in the amount of time they allocate for each move. Yet, the problem of devising resource allocation strategies for game playing did n...

Shaul Markovitch, Yaron Sella

claim paper

Read More »

179

click to vote

ECML
2006
Springer

84views Machine Learning» more ECML 2006»

Efficient Non-linear Control Through Neuroevolution

15 years 11 months ago

Download www.idsia.ch

Abstract. Many complex control problems are not amenable to traditional controller design. Not only is it difficult to model real systems, but often it is unclear what kind of beha...

Faustino J. Gomez, Jürgen Schmidhuber, Risto ...

claim paper

Read More »

175

click to vote

WCE
2007

118views Electrical And Computer Engi...» more WCE 2007»

Neural Networks for Optimal Control of Aircraft Landing Systems

15 years 8 months ago

Download www.iaeng.org

Abstract—In this work we present a variational formulation for a multilayer perceptron neural network. With this formulation any learning task for the neural network is deﬁned ...

Kevin Lau, Roberto Lopez, Eugenio Oñate

claim paper

Read More »

« Prev « First page 87 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers