Sciweavers

4544 search results - page 149 / 909
» Reinforcement Learning with Time
Sort
View
102
Voted
CEC
2007
IEEE
15 years 9 months ago
Evolving neuromodulatory topologies for reinforcement learning-like problems
— Environments with varying reward contingencies constitute a challenge to many living creatures. In such conditions, animals capable of adaptation and learning derive an advanta...
Andrea Soltoggio, Peter Dürr, Claudio Mattius...
125
Voted
GECCO
2006
Springer
175views Optimization» more  GECCO 2006»
15 years 6 months ago
A computational theory of adaptive behavior based on an evolutionary reinforcement mechanism
Two mathematical and two computational theories from the field of human and animal learning are combined to produce a more general theory of adaptive behavior. The cornerstone of ...
J. J. McDowell, Paul L. Soto, Jesse Dallery, Saule...
124
Voted
NN
2006
Springer
15 years 2 months ago
Neural systems implicated in delayed and probabilistic reinforcement
This review considers the theoretical problems facing agents that must learn and choose on the basis of reward or reinforcement that is uncertain or delayed, in implicit or proced...
Rudolf N. Cardinal
139
Voted

Publication
222views
15 years 11 months ago
Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration
Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...
Christos Dimitrakakis, Michail G. Lagoudakis
140
Voted
SIGGRAPH
2010
ACM
15 years 7 months ago
Gesture controllers
We introduce gesture controllers, a method for animating the body language of avatars engaged in live spoken conversation. A gesture controller is an optimal-policy controller tha...
Sergey Levine, Philipp Krähenbühl, Sebastian Thr...