Sciweavers

178 search results - page 22 / 36
» Probabilistic policy reuse in a reinforcement learning agent
Sort
View
CIA
2007
Springer
14 years 2 months ago
Agent Behavior Alignment: A Mechanism to Overcome Problems in Agent Interactions During Runtime
When two or more agents interacting, their behaviors are not necessarily matching. Automated ways to overcome conicts in the behavior of agents can make the execution of interacti...
Gerben G. Meyer, Nicolae B. Szirbik
SASO
2009
IEEE
14 years 3 months ago
Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems
—Large-scale agent-based systems are required to self-optimize towards multiple, potentially conflicting, policies of varying spatial and temporal scope. As a result, not all ag...
Ivana Dusparic, Vinny Cahill
AAAI
2006
13 years 10 months ago
Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning
Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...
Shimon Whiteson, Peter Stone
FLAIRS
2003
13 years 10 months ago
Learning from Reinforcement and Advice Using Composite Reward Functions
1 Reinforcement learning has become a widely used methodology for creating intelligent agents in a wide range of applications. However, its performance deteriorates in tasks with s...
Vinay N. Papudesi, Manfred Huber
ATAL
2005
Springer
14 years 2 months ago
Behavior transfer for value-function-based reinforcement learning
Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...
Matthew E. Taylor, Peter Stone