Search Sciweavers | Sciweavers

536 search results - page 46 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

232

click to vote

NECO
2007

258views more NECO 2007»

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

15 years 6 months ago

Download www.coneural.org

The persistent modiﬁcation of synaptic efﬁcacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spiketiming-dependent plasticity (...

Razvan V. Florian

claim paper

Read More »

178

click to vote

ICML
1995
IEEE

155views Machine Learning» more ICML 1995»

Stable Function Approximation in Dynamic Programming

16 years 7 months ago

Download www.ri.cmu.edu

The success ofreinforcement learninginpractical problems depends on the ability to combine function approximation with temporal di erence methods such as value iteration. Experime...

Geoffrey J. Gordon

claim paper

Read More »

157

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

16 years 7 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

218

click to vote

ICML
1999
IEEE

152views Machine Learning» more ICML 1999»

Distributed Value Functions

16 years 7 months ago

Download www.ri.cmu.edu

Many interesting problems, such as power grids, network switches, and tra c ow, that are candidates for solving with reinforcement learningRL, alsohave properties that make distri...

Jeff G. Schneider, Weng-Keen Wong, Andrew W. Moore...

claim paper

Read More »

189

click to vote

HEURISTICS
2008

170views more HEURISTICS 2008»

Accelerating autonomous learning by using heuristic selection of actions

15 years 7 months ago

Download www.fei.edu.br

This paper investigates how to make improved action selection for online policy learning in robotic scenarios using reinforcement learning (RL) algorithms. Since finding control po...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

claim paper

Read More »

« Prev « First page 46 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers