Search Sciweavers | Sciweavers

67 search results - page 5 / 14

» Learning predictive state representations using non-blind po...

187

Voted

NIPS
2001

121views Information Technology» more NIPS 2001»

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

233

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 6 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

221

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 8 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

203

Voted

ICRA
2005
IEEE

91views Robotics» more ICRA 2005»

Learning to Steer on Winding Tracks Using Semi-Parametric Control Policies

16 years 1 months ago

Download www.cs.ubc.ca

— We present a semi-parametric control policy representation and use it to solve a series of nonholonomic control problems with input state spaces of up to 7 dimensions. A neares...

Kenneth Robert Alton, Michiel van de Panne

claim paper

Read More »

201

click to vote

NN
2007
Springer

162views Neural Networks» more NN 2007»

Learning grammatical structure with Echo State Networks

15 years 7 months ago

Download cseweb.ucsd.edu

Echo State Networks (ESNs) have been shown to be effective for a number of tasks, including motor control, dynamic time series prediction, and memorizing musical sequences. Howeve...

Matthew H. Tong, Adam D. Bickett, Eric M. Christia...

claim paper

Read More »

« Prev « First page 5 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers