Search Sciweavers | Sciweavers

340 search results - page 19 / 68

» Kernelized value function approximation for reinforcement le...

119

click to vote

ICCV
2009
IEEE

307views Computer Vision» more ICCV 2009»

Kernel map compression using generalized radial basis functions

15 years 6 days ago

Download ivalab.ece.gatech.edu

The use of Mercer kernel methods in statistical learning theory provides for strong learning capabilities, as seen in kernel principal component analysis and support vector machin...

Omar Arif, Patricio A. Vela

claim paper

Read More »

142

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 1 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

175

click to vote

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

14 years 9 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

138

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 3 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

152

Voted

IAT
2005
IEEE

180views Intelligent Agents» more IAT 2005»

Self-Organizing Cognitive Agents and Reinforcement Learning in Multi-Agent Environment

15 years 8 months ago

Download www3.ntu.edu.sg

This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value ...

Ah-Hwee Tan, Dan Xiao

claim paper

Read More »

« Prev « First page 19 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers