Search Sciweavers | Sciweavers

226 search results - page 38 / 46

» A Convergent Reinforcement Learning Algorithm in the Continu...

197

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 2 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

168

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

16 years 6 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

156

click to vote

ESANN
2004

90views Neural Networks» more ESANN 2004»

High-accuracy value-function approximation with neural networks applied to the acrobot

15 years 7 months ago

Download remi.coulom.free.fr

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...

Rémi Coulom

claim paper

Read More »

240

click to vote

ECCV
2010
Springer

251views Computer Vision» more ECCV 2010»

Discriminative Tracking by Metric Learning

15 years 10 months ago

Download www.eecs.northwestern.edu

We present a discriminative model that casts appearance modeling and visual matching into a single objective for visual tracking. Most previous discriminative models for visual tra...

claim paper

Read More »

163

click to vote

IBERAMIA
2004
Springer

168views Artificial Intelligence» more IBERAMIA 2004»

Mobile Robotic Supported Collaborative Learning (MRSCL)

15 years 11 months ago

Download www2.ing.puc.cl

In this paper we describe MRSCL Geometry a collaborative educational activity that explores the use of robotic technology and wirelessly connected Pocket PCs as tools for teaching ...

Rubén Mitnik, Miguel Nussbaum, Alvaro Soto

claim paper

Read More »

« Prev « First page 38 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers