Search Sciweavers | Sciweavers

86 search results - page 9 / 18

» Estimation and Approximation Bounds for Gradient-Based Reinf...

188

click to vote

ESANN
2004

90views Neural Networks» more ESANN 2004»

High-accuracy value-function approximation with neural networks applied to the acrobot

15 years 8 months ago

Download remi.coulom.free.fr

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...

Rémi Coulom

claim paper

Read More »

193

click to vote

ICAC
2006
IEEE

112views Applied Computing» more ICAC 2006»

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation

16 years 1 months ago

Download userweb.cs.utexas.edu

— Reinforcement Learning (RL) provides a promising new approach to systems performance management that differs radically from standard queuing-theoretic approaches making use of ...

Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mo...

claim paper

Read More »

172

click to vote

ATAL
2008
Springer

131views Intelligent Agents» more ATAL 2008»

A new perspective to the keepaway soccer: the takers

15 years 9 months ago

Download www.aamas-conference.org

Keepaway is a sub-problem of RoboCup Soccer Simulator in which 'the keepers' try to maintain the possession of the ball, while 'the takers' try to steal the ba...

Atil Iscen, Umut Erogul

claim paper

Read More »

179

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 8 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

173

click to vote

JMLR
2010

132views more JMLR 2010»

On the Impact of Kernel Approximation on Learning Accuracy

15 years 1 months ago

Download www.cs.nyu.edu

Kernel approximation is commonly used to scale kernel-based algorithms to applications containing as many as several million instances. This paper analyzes the effect of such appr...

Corinna Cortes, Mehryar Mohri, Ameet Talwalkar

claim paper

Read More »

« Prev « First page 9 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers