Search Sciweavers | Sciweavers

7 search results - page 2 / 2

» A Counterexample Guided Abstraction-Refinement Framework for...

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

14 years 8 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

click to vote

IJRR
2011

218views more IJRR 2011»

Motion planning under uncertainty for robotic tasks with long time horizons

13 years 2 months ago

Download deslab.mit.edu

Abstract Partially observable Markov decision processes (POMDPs) are a principled mathematical framework for planning under uncertainty, a crucial capability for reliable operation...

Hanna Kurniawati, Yanzhu Du, David Hsu, Wee Sun Le...

claim paper

Read More »

« Prev « First page 2 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers