Sciweavers

1512 search results - page 133 / 303
» Qualitative reinforcement learning
Sort
View
BC
2008
56views more  BC 2008»
13 years 9 months ago
An implementation of reinforcement learning based on spike timing dependent plasticity
Patrick D. Roberts, Roberto A. Santiago, Gerardo L...
IJAIT
2008
60views more  IJAIT 2008»
13 years 9 months ago
A Hybrid Multiagent Reinforcement Learning Approach Using Strategies and Fusion
Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...
CORR
2007
Springer
73views Education» more  CORR 2007»
13 years 9 months ago
Universal Reinforcement Learning
—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence futu...
Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...