Search Sciweavers | Sciweavers

1630 search results - page 87 / 326

» Coordinated Reinforcement Learning

NECO
2007

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 2 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

NECO
2007

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

15 years 2 months ago

Download www.coneural.org

The persistent modification of synaptic efficacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spike-timing-dependent plasticity (...

Razvan V. Florian

CSL
2012
Springer

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

13 years 11 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurčíček, Blaise Thomson, Steve Young

PKDD
2009
Springer

Compositional Models for Reinforcement Learning

15 years 10 months ago

Download userweb.cs.utexas.edu

Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, ...

Nicholas K. Jong, Peter Stone

KES
2007
Springer

Making Financial Trading by Recurrent Reinforcement Learning

15 years 9 months ago

Download www.sms.dsems.unile.it

In this paper we propose a financial trading system whose strategy is developed by means of an artificial neural network approach based on a recurrent reinforcement learning algo...

Francesco Bertoluzzo, Marco Corazza
