Search Sciweavers | Sciweavers

458 search results - page 31 / 92

» Q-Decomposition for Reinforcement Learning Agents

click to vote

ROBOCUP
2005
Springer

134views Robotics» more ROBOCUP 2005»

Simultaneous Learning to Acquire Competitive Behaviors in Multi-agent System Based on Modular Learning System

14 years 1 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suﬀering from the policy alternation of others in multiagent dynamic environments. A typical example is a case of RoboCup...

Yasutake Takahashi, Kazuhiro Edazawa, Kentarou Nom...

claim paper

Read More »

click to vote

CIG
2005
IEEE

120views Applied Computing» more CIG 2005»

Adapting Reinforcement Learning for Computer Games: Using Group Utility Functions

14 years 1 months ago

Download cswww.essex.ac.uk

AbstractGroup utility functions are an extension of the common team utility function for providing multiple agents with a common reinforcement learning signal for learning cooperat...

Jay Bradley, Gillian Hayes

claim paper

Read More »

click to vote

ATAL
2008
Springer

151views Intelligent Agents» more ATAL 2008»

Graph Laplacian based transfer learning in reinforcement learning

13 years 9 months ago

Download www.ifaamas.org

The aim of transfer learning is to accelerate learning in related domains. In reinforcement learning, many different features such as a value function and a policy can be transfer...

Yi-Ting Tsao, Ke-Ting Xiao, Von-Wun Soo

claim paper

Read More »

click to vote

AAAI
2006

127views Intelligent Agents» more AAAI 2006»

Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance

13 years 9 months ago

Download robotic.media.mit.edu

As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...

Andrea Lockerd Thomaz, Cynthia Breazeal

claim paper

Read More »

click to vote

AAAI
1996

191views Intelligent Agents» more AAAI 1996»

Evolution-Based Discovery of Hierarchical Behaviors

13 years 9 months ago

Download www.aaai.org

Procedural representations of control policies have two advantages when facing the scale-up problem in learning tasks. First they are implicit, with potential for inductive genera...

Justinian P. Rosca, Dana H. Ballard

claim paper

Read More »

« Prev « First page 31 / 92 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers