Sciweavers

1630 search results - page 246 / 326
» Coordinated Reinforcement Learning
Sort
View
ATAL
2008
Springer
15 years 6 months ago
Social reward shaping in the prisoner's dilemma
Reward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to nearoptimal behavior. In this paper, we introduce social reward sha...
Monica Babes, Enrique Munoz de Cote, Michael L. Li...
EWCBR
2008
Springer
15 years 5 months ago
Discovering Feature Weights for Feature-based Indexing of Q-tables
In this paper we propose an approach to address the old problem of identifying the feature conditions under which a gaming strategy can be effective. For doing this, we will build ...
Chad Hogg, Stephen Lee-Urban, Bryan Auslander, H&e...
AIPS
2006
15 years 5 months ago
Reusing and Building a Policy Library
Policy Reuse is a method to improve reinforcement learning with the ability to solve multiple tasks by building upon past problem solving experience, as accumulated in a Policy Li...
Fernando Fernández, Manuela M. Veloso
NN
2006
Springer
140views Neural Networks» more  NN 2006»
15 years 4 months ago
Neural mechanism for stochastic behaviour during a competitive game
Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another...
Alireza Soltani, Daeyeol Lee, Xiao-Jing Wang
TSMC
2008
135views more  TSMC 2008»
15 years 4 months ago
Wholesale Power Price Dynamics Under Transmission Line Limits: A Use of an Agent-Based Intelligent Simulator
Abstract--This research proposes a use of an agent-based intelligent simulator to numerically examine the influence of a transmission line limit on the dynamics of a wholesale powe...
Toshiyuki Sueyoshi, Gopalakrishna Reddy Tadiparthi