Search Sciweavers | Sciweavers

1233 search results - page 29 / 247

» Reinforcement Learning in MirrorBot

143

click to vote

ATAL
2008
Springer

160views Intelligent Agents» more ATAL 2008»

Sequential decision making in repeated coalition formation under uncertainty

15 years 4 months ago

Download www.aamas-conference.org

The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learnin...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

128

click to vote

JDCTA
2010

160views more JDCTA 2010»

Learning and Decision Making in Human During a Game of Matching Pennies

14 years 9 months ago

Download www.aicit.org

To gain insights into the neural basis of such adaptive decision-making processes, we investigated the nature of learning process in humans playing a competitive game with binary ...

Jianfeng Hu, Xiaofeng Li, Jinghai Yin

claim paper

Read More »

click to vote

PDPTA
2003

110views Distributed And Parallel Com...» more PDPTA 2003»

Java Resources for Teaching Reinforcement Learning

15 years 4 months ago

Download cs.gettysburg.edu

— In this paper we present a library of classes for programming reinforcement learning simulations in Java. This library is based upon the standard by Sutton and Santamaria [1], ...

Amy J. Kerr, Todd W. Neller, Christopher J. La Pil...

claim paper

Read More »

214

click to vote

ICCCI
2011
Springer

223views Intelligent Agents» more ICCCI 2011»

Evolving Equilibrium Policies for a Multiagent Reinforcement Learning Problem with State Attractors

14 years 2 months ago

Download florinleon.byethost24.com

Multiagent reinforcement learning problems are especially difficult because of their dynamism and the size of joint state space. In this paper a new benchmark problem is proposed, ...

Florin Leon

claim paper

Read More »

127

click to vote

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 1 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

« Prev « First page 29 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers