Sciweavers

223 search results - page 34 / 45
» Least-Squares Temporal Difference Learning
ICCV
2001
IEEE
14 years 9 months ago
Human Tracking with Mixtures of Trees
Tree-structured probabilistic models admit simple, fast inference. However, they are not well suited to phenomena such as occlusion, where multiple components of an object may dis...
Sergey Ioffe, David A. Forsyth
ICVGIP
2004
13 years 8 months ago
Multi-Cue Exemplar-Based Nonparametric Model for Gesture Recognition
This paper presents an approach for a multi-cue, view-based recognition of gestures. We describe an exemplar-based technique that combines two different forms of exemplars - shape e...
Vinay D. Shet, V. Shiv Naga Prasad, Ahmed M. Elgam...
NIPS
2007
13 years 9 months ago
Hippocampal Contributions to Control: The Third Way
Recent experimental studies have focused on the specialization of different neural structures for different types of instrumental behavior. Recent theoretical work has provided no...
Máté Lengyel, Peter Dayan
NN
2002
Springer
13 years 7 months ago
Opponent interactions between serotonin and dopamine
Anatomical and pharmacological evidence suggests that the dorsal raphe serotonin system and the ventral tegmental and substantia nigra dopamine system may act as mutual opponents....
Nathaniel D. Daw, Sham Kakade, Peter Dayan
AAAI
2011
12 years 7 months ago
Differential Eligibility Vectors for Advantage Updating and Gradient Methods
In this paper we propose differential eligibility vectors (DEV) for temporal-difference (TD) learning, a new class of eligibility vectors designed to bring out the contribution of...
Francisco S. Melo