Search Sciweavers | Sciweavers

1630 search results - page 227 / 326

» Coordinated Reinforcement Learning

ECML
2007
Springer

192 views, Machine Learning

Policy Gradient Critics

15 years 10 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

ACSE
2000
ACM

271 views, Theoretical Computer Science

The information environments program - a new design based IT degree

15 years 8 months ago

Download www.itee.uq.edu.au

The University of Queensland has recently established a new design-focused, studio-based IT degree at a new “flexible-learning” campus. The Bachelor of Information Environment...

Michael Docherty, Peter Sutton, Margot Brereton, S...

ICCS
1993
Springer

99 views, Applied Computing

Towards Domain-Independent Machine Intelligence

15 years 8 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS) is a learning system framework which, given little initial domain knowledge, increases its decision-making abilities in complex problem domains....

Robert Levinson

NIPS
2008

110 views, Information Technology

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 5 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

ICIP
2006
IEEE

121 views, Image Processing

Image Manifold Interpolation using Free-Form Deformations

16 years 6 months ago

Download www.cs.wustl.edu

An important class of image data sets depicts an object undergoing deformation. When there are only a few underlying causes of the deformation, these images have a natural low-dimen...

Richard Souvenir, Qilong Zhang, Robert Pless
