Search Sciweavers | Sciweavers

178 search results - page 22 / 36

» Probabilistic policy reuse in a reinforcement learning agent

CIA
2007
Springer

89 views, Intelligent Agents

Agent Behavior Alignment: A Mechanism to Overcome Problems in Agent Interactions During Runtime

14 years 2 months ago

Download www.bdk.rug.nl

When two or more agents interact, their behaviors do not necessarily match. Automated ways to overcome conflicts in the behavior of agents can make the execution of interacti...

Gerben G. Meyer, Nicolae B. Szirbik

SASO
2009
IEEE

172 views, Control Systems

Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems

14 years 3 months ago

Download www.scss.tcd.ie

Large-scale agent-based systems are required to self-optimize towards multiple, potentially conflicting, policies of varying spatial and temporal scope. As a result, not all ag...

Ivana Dusparic, Vinny Cahill

AAAI
2006

161 views, Intelligent Agents

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

13 years 10 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

FLAIRS
2003

141 views, Artificial Intelligence

Learning from Reinforcement and Advice Using Composite Reward Functions

13 years 10 months ago

Download ranger.uta.edu

Reinforcement learning has become a widely used methodology for creating intelligent agents in a wide range of applications. However, its performance deteriorates in tasks with s...

Vinay N. Papudesi, Manfred Huber

ATAL
2005
Springer

130 views, Intelligent Agents

Behavior transfer for value-function-based reinforcement learning

14 years 2 months ago

Download www.cs.huji.ac.il

Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...

Matthew E. Taylor, Peter Stone
