Sciweavers

223 search results - page 26 / 45
» Least-Squares Temporal Difference Learning
ICCV
2001
IEEE
14 years 10 months ago
Learning Image Statistics for Bayesian Tracking
This paper describes a framework for learning probabilistic models of objects and scenes and for exploiting these models for tracking complex, deformable, or articulated objects i...
Hedvig Sidenbladh, Michael J. Black
ATAL
2005
Springer
14 years 2 months ago
Behavior transfer for value-function-based reinforcement learning
Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...
Matthew E. Taylor, Peter Stone
ESANN
2006
13 years 10 months ago
Learning and discrimination through STDP in a top-down modulated associative memory
This article underlines the learning and discrimination capabilities of a model of associative memory based on artificial networks of spiking neurons. Inspired from neuro...
Anthony Mouraud, Hélène Paugam-Moisy
BC
2002
13 years 8 months ago
Spike-timing-dependent plasticity: common themes and divergent vistas
Recent experimental observations of spike-timing-dependent synaptic plasticity (STDP) have revitalized the study of synaptic learning rules. The most surprising aspect of ...
Ádám Kepecs, Mark C. W. van Rossum, ...
GECCO
2006
Springer
14 years 8 days ago
On-line evolutionary computation for reinforcement learning in stochastic domains
In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...
Shimon Whiteson, Peter Stone