Search Sciweavers | Sciweavers

1799 search results - page 37 / 360

» Filtered Reinforcement Learning

203

click to vote

ICML
2009
IEEE

166views Machine Learning» more ICML 2009»

Analytic moment-based Gaussian process filtering

16 years 7 months ago

Download isas.uka.de

We propose an analytic moment-based filter for nonlinear stochastic dynamic systems modeled by Gaussian processes. Exact expressions for the expected value and the covariance matr...

Marc Peter Deisenroth, Marco F. Huber, Uwe D. Hane...

claim paper

Read More »

156

click to vote

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

16 years 1 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

190

click to vote

ICML
2004
IEEE

161views Machine Learning» more ICML 2004»

Using relative novelty to identify useful temporal abstractions in reinforcement learning

16 years 7 months ago

Download www.cs.umass.edu

lative Novelty to Identify Useful Temporal Abstractions in Reinforcement Learning ?Ozg?ur S?im?sek ozgur@cs.umass.edu Andrew G. Barto barto@cs.umass.edu Department of Computer Scie...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

199

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 10 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

155

click to vote

PRIMA
2009
Springer

102views Intelligent Agents» more PRIMA 2009»

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

16 years 1 months ago

Download teamcore.usc.edu

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...

Itsuki Noda

claim paper

Read More »

« Prev « First page 37 / 360 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers