Search Sciweavers | Sciweavers

1262 search results - page 83 / 253

» Reinforcement Learning: An Introduction

112

Voted

AIPS
2006

141views Artificial Intelligence» more AIPS 2006»

Combining Stochastic Task Models with Reinforcement Learning for Dynamic Scheduling

15 years 4 months ago

Download www.aaai.org

We view dynamic scheduling as a sequential decision problem. Firstly, we introduce a generalized planning operator, the stochastic task model (STM), which predicts the effects of ...

Malcolm J. A. Strens

claim paper

Read More »

111

click to vote

ATAL
2010
Springer

181views Intelligent Agents» more ATAL 2010»

Basis function construction for hierarchical reinforcement learning

15 years 4 months ago

Download www.cs.brown.edu

This paper introduces an approach to automatic basis function construction for Hierarchical Reinforcement Learning (HRL) tasks. We describe some considerations that arise when con...

Sarah Osentoski, Sridhar Mahadevan

claim paper

Read More »

152

click to vote

JAIR
2002

163views more JAIR 2002»

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

15 years 2 months ago

Download www.jair.org

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is main...

Xin Xu, Hangen He, Dewen Hu

claim paper

Read More »

133

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 2 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

158

click to vote

NECO
2007

258views more NECO 2007»

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

15 years 2 months ago

Download www.coneural.org

The persistent modiﬁcation of synaptic efﬁcacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spiketiming-dependent plasticity (...

Razvan V. Florian

claim paper

Read More »

« Prev « First page 83 / 253 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers