Sciweavers

» Reinforcement Learning in MirrorBot
ATAL
2008
Springer
14 years 28 days ago
Sequential decision making in repeated coalition formation under uncertainty
The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learnin...
Georgios Chalkiadakis, Craig Boutilier
JDCTA
2010
13 years 5 months ago
Learning and Decision Making in Human During a Game of Matching Pennies
To gain insights into the neural basis of such adaptive decision-making processes, we investigated the nature of the learning process in humans playing a competitive game with binary ...
Jianfeng Hu, Xiaofeng Li, Jinghai Yin
PDPTA
2003
14 years 9 days ago
Java Resources for Teaching Reinforcement Learning
In this paper we present a library of classes for programming reinforcement learning simulations in Java. This library is based upon the standard by Sutton and Santamaria [1], ...
Amy J. Kerr, Todd W. Neller, Christopher J. La Pil...
ICCCI
2011
Springer
12 years 10 months ago
Evolving Equilibrium Policies for a Multiagent Reinforcement Learning Problem with State Attractors
Multiagent reinforcement learning problems are especially difficult because of their dynamism and the size of joint state space. In this paper a new benchmark problem is proposed, ...
Florin Leon
NECO
2010
13 years 9 months ago
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...