Sciweavers

417 search results - page 47 / 84
» The Dynamics of Reinforcement Learning in Cooperative Multia...
CDC
2010
IEEE
13 years 2 months ago
An approximate dual subgradient algorithm for multi-agent non-convex optimization
We consider a multi-agent optimization problem where agents aim to cooperatively minimize a sum of local objective functions subject to a global inequality constraint and a global ...
Minghui Zhu, Sonia Martínez
NIPS
2003
13 years 9 months ago
Extending Q-Learning to General Adaptive Multi-Agent Systems
Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...
Gerald Tesauro
IBERAMIA
2010
Springer
13 years 6 months ago
Dynamic Reward Shaping: Training a Robot by Voice
Reinforcement Learning is commonly used for learning tasks in robotics; however, traditional algorithms can require very long training times. Reward shaping has been recently used to ...
Ana C. Tenorio-Gonzalez, Eduardo F. Morales, Luis ...
NIPS
2004
13 years 9 months ago
Convergence and No-Regret in Multiagent Learning
Learning in a multiagent system is a challenging problem due to two key factors. First, if other agents are simultaneously learning then the environment is no longer stationary, t...
Michael H. Bowling
ATAL
2007
Springer
14 years 1 month ago
Cooperative evolution of service ontologies
Most of the proposed approaches in automatic service selection assume the existence of a common ontology among communicating agents. However, this assumption becomes difficult to...
Murat Sensoy, Pinar Yolum