Sciweavers

343 search results - page 38 / 69
» Action discovery for reinforcement learning
IWLCS
2005
Springer
14 years 1 month ago
Counter Example for Q-Bucket-Brigade Under Prediction Problem
Aiming to clarify the convergence or divergence conditions for Learning Classifier Systems (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...
Atsushi Wada, Keiki Takadama, Katsunori Shimohara
COST
2009
Springer
13 years 5 months ago
How an Agent Can Detect and Use Synchrony Parameter of Its Own Interaction with a Human?
Psychology claims synchrony as a crucial parameter of any social interaction: to give a human a feeling of natural interaction, a feeling of agency [17], an agent must be a...
Ken Prepin, Philippe Gaussier
ATAL
2003
Springer
14 years 29 days ago
Coordination in multiagent reinforcement learning: a Bayesian approach
Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinfor...
Georgios Chalkiadakis, Craig Boutilier
ABIALS
2008
Springer
13 years 9 months ago
Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning
In order to establish autonomous behavior for technical systems, the well-known trade-off between reactive control and deliberative planning has to be considered. Within ...
Matthias Rungger, Hao Ding, Olaf Stursberg
ATAL
2007
Springer
14 years 1 month ago
Multiagent reinforcement learning and self-organization in a network of agents
To cope with large scale, agents are usually organized in a network such that an agent interacts only with its immediate neighbors in the network. Reinforcement learning technique...
Sherief Abdallah, Victor R. Lesser