Sciweavers

2108 search results - page 131 / 422
» Tracking in Reinforcement Learning
Sort
View
BC
2008
56views more  BC 2008»
13 years 10 months ago
An implementation of reinforcement learning based on spike timing dependent plasticity
Patrick D. Roberts, Roberto A. Santiago, Gerardo L...
IJAIT
2008
60views more  IJAIT 2008»
13 years 10 months ago
A Hybrid Multiagent Reinforcement Learning Approach Using Strategies and Fusion
Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...
CORR
2007
Springer
73views Education» more  CORR 2007»
13 years 10 months ago
Universal Reinforcement Learning
—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence futu...
Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...