Search Sciweavers | Sciweavers

358 search results - page 43 / 72

» Online Testing with Reinforcement Learning

ICRA
2010
IEEE

148 views, Robotics

Body schema acquisition through active learning

13 years 8 months ago

Download users.isr.ist.utl.pt

We present an active learning algorithm for the problem of body schema learning, i.e. estimating a kinematic model of a serial robot. The learning process is done online using ...

Ruben Martinez-Cantin, Manuel Lopes, Luis Montesan...

ACE
2004

315 views, Education

Applying SPICE to e-Learning: An e-Learning Maturity Model?

13 years 11 months ago

Download crpit.com

The Capability Maturity Model and SPICE approach to software process improvement has resulted in a robust system for improving development process capability in the field of softw...

Stephen Marshall, Geoff Mitchell

UAI
2008

242 views, Artificial Intelligence

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

13 years 11 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

ICML
2008
IEEE

117 views, Machine Learning

Sample-based learning and search with permanent and transient memories

14 years 10 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both sample-based learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller, Richard S. ...

ICML
2009
IEEE

131 views, Machine Learning

Monte-Carlo simulation balancing

14 years 10 months ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro
