Search Sciweavers | Sciweavers

8 search results - page 2 / 2

» Global Versus Local Constructive Function Approximation for ...

ECML
2007
Springer

167 views, Machine Learning

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

15 years 6 months ago

Download www.igi.tugraz.at

Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...

Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass

IJCNN
2007
IEEE

222 views, Neural Networks

Agnostic Learning versus Prior Knowledge in the Design of Kernel Machines

15 years 9 months ago

Download theoval.cmp.uea.ac.uk

Abstract— The optimal model parameters of a kernel machine are typically given by the solution of a convex optimisation problem with a single global optimum. Obtaining the best p...

Gavin C. Cawley, Nicola L. C. Talbot

JMLR
2010

119 views

A Convergent Online Single Time Scale Actor Critic Algorithm

14 years 9 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir
