Search Sciweavers | Sciweavers

340 search results - page 14 / 68

» Kernelized value function approximation for reinforcement le...

121

click to vote

NIPS
2007

80views Information Technology» more NIPS 2007»

Stable Dual Dynamic Programming

15 years 3 months ago

Download webdocs.cs.ualberta.ca

Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

106

click to vote

ECML
2004
Springer

139views Machine Learning» more ECML 2004»

Batch Reinforcement Learning with State Importance

15 years 7 months ago

Download www.research.rutgers.edu

Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classiﬁer mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

116

click to vote

ICML
2005
IEEE

121views Machine Learning» more ICML 2005»

Combining model-based and instance-based learning for first order regression

16 years 3 months ago

Download www.cs.kuleuven.ac.be

T ORDER REGRESSION (EXTENDED ABSTRACT) Kurt Driessensa Saso Dzeroskib a Department of Computer Science, University of Waikato, Hamilton, New Zealand (kurtd@waikato.ac.nz) b Departm...

Kurt Driessens, Saso Dzeroski

claim paper

Read More »

161

click to vote

ECML
2006
Springer

146views Machine Learning» more ECML 2006»

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

15 years 6 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

124

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

16 years 3 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

« Prev « First page 14 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers