Search Sciweavers | Sciweavers

286 search results - page 19 / 58

» Using inaccurate models in reinforcement learning

201

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 7 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

250

click to vote

NECO
2007

258views more NECO 2007»

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

15 years 7 months ago

Download www.coneural.org

The persistent modiﬁcation of synaptic efﬁcacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spiketiming-dependent plasticity (...

Razvan V. Florian

claim paper

Read More »

241

click to vote

PKDD
2010
Springer

179views Data Mining» more PKDD 2010»

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

15 years 5 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

claim paper

Read More »

222

Voted

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

201

click to vote

FSTTCS
2006
Springer

149views Software Engineering» more FSTTCS 2006»

Testing Probabilistic Equivalence Through Reinforcement Learning

15 years 11 months ago

Download www2.ift.ulaval.ca

We propose a new approach to verification of probabilistic processes for which the model may not be available. We use a technique from Reinforcement Learning to approximate how far...

Josee Desharnais, François Laviolette, Sami...

claim paper

Read More »

« Prev « First page 19 / 58 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers