Sciweavers

417 search results - page 26 / 84
» The Dynamics of Reinforcement Learning in Cooperative Multia...
Sort
View
RAS
2006
105views more  RAS 2006»
13 years 7 months ago
Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
A class of biped locomotion called Passive Dynamic Walking (PDW) has been recognized to be efficient in energy consumption and a key to understand human walking. Although PDW is s...
Kentarou Hitomi, Tomohiro Shibata, Yutaka Nakamura...
ATAL
2008
Springer
13 years 9 months ago
Adaptive Kanerva-based function approximation for multi-agent systems
In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...
Cheng Wu, Waleed Meleis
AAAI
2010
13 years 9 months ago
Multi-Agent Learning with Policy Prediction
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...
Chongjie Zhang, Victor R. Lesser
ATAL
2005
Springer
14 years 1 months ago
An integrated framework for adaptive reasoning about conversation patterns
We present an integrated approach for reasoning about and learning conversation patterns in multiagent communication. The approach is based on the assumption that information abou...
Michael Rovatsos, Felix A. Fischer, Gerhard Wei&sz...
UAI
2008
13 years 9 months ago
Model-Based Bayesian Reinforcement Learning in Large Structured Domains
Model-based Bayesian reinforcement learning has generated significant interest in the AI community as it provides an elegant solution to the optimal exploration-exploitation trade...
Stéphane Ross, Joelle Pineau