Search Sciweavers | Sciweavers

121 search results - page 3 / 25

» Toward Off-Policy Learning Control with Function Approximati...

188

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 1 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

186

click to vote

ICRA
2007
IEEE

155views Robotics» more ICRA 2007»

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

16 years 29 days ago

Download sugiyama-www.cs.titech.ac.jp

— The least squares approach works efﬁciently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

claim paper

Read More »

191

Voted

ICCBR
2007
Springer

196views Automated Reasoning» more ICCBR 2007»

An Analysis of Case-Based Value Function Approximation by Approximating State Transition Graphs

16 years 24 days ago

Download www.ni.uos.de

We identify two fundamental points of utilizing CBR for an adaptive agent that tries to learn on the basis of trial and error without a model of its environment. The ﬁrst link co...

Thomas Gabel, Martin Riedmiller

claim paper

Read More »

175

click to vote

CDC
2009
IEEE

172views Control Systems» more CDC 2009»

Approximate dynamic programming using fluid and diffusion approximations with applications to power management

15 years 11 months ago

Download www.cs.caltech.edu

—TD learning and its reﬁnements are powerful tools for approximating the solution to dynamic programming problems. However, the techniques provide the approximate solution only...

Wei Chen, Dayu Huang, Ankur A. Kulkarni, Jayakrish...

claim paper

Read More »

206

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

16 years 18 days ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

« Prev « First page 3 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers