Search Sciweavers | Sciweavers

102 search results - page 7 / 21

» Efficient Asymptotic Approximation in Temporal Difference Le...

198

click to vote

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 9 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

190

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 8 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

183

click to vote

ICPR
2002
IEEE

135views computer vision» more ICPR 2002»

Unsupervised Learning Using Locally Linear Embedding: Experiments with Face Pose Analysis

16 years 8 months ago

Download www.ee.oulu.fi

This paper considers a recently proposed method for unsupervised learning and dimensionality reduction, locally linear embedding (LLE). LLE computes a compact representation of hi...

Abdenour Hadid, Matti Pietikäinen, Olga Kouro...

claim paper

Read More »

243

click to vote

UAI
2008

230views Artificial Intelligence» more UAI 2008»

Efficient Inference in Persistent Dynamic Bayesian Networks

15 years 8 months ago

Download uai2008.cs.helsinki.fi

Numerous temporal inference tasks such as fault monitoring and anomaly detection exhibit a persistence property: for example, if something breaks, it stays broken until an interve...

Tomás Singliar, Denver Dash

claim paper

Read More »

191

click to vote

ICML
2004
IEEE

161views Machine Learning» more ICML 2004»

Using relative novelty to identify useful temporal abstractions in reinforcement learning

16 years 7 months ago

Download www.cs.umass.edu

lative Novelty to Identify Useful Temporal Abstractions in Reinforcement Learning ?Ozg?ur S?im?sek ozgur@cs.umass.edu Andrew G. Barto barto@cs.umass.edu Department of Computer Scie...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

« Prev « First page 7 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers