Sciweavers

» Reinforcement Learning with Time
ATAL
2008
Springer
13 years 11 months ago
Sequential decision making in repeated coalition formation under uncertainty
The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learnin...
Georgios Chalkiadakis, Craig Boutilier
ICCBR
2005
Springer
14 years 2 months ago
CBR for State Value Function Approximation in Reinforcement Learning
CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agen...
Thomas Gabel, Martin A. Riedmiller
ML
2002
ACM
13 years 8 months ago
Finite-time Analysis of the Multiarmed Bandit Problem
Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...
Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...
JDCTA
2010
13 years 3 months ago
Learning and Decision Making in Human During a Game of Matching Pennies
To gain insights into the neural basis of such adaptive decision-making processes, we investigated the nature of learning process in humans playing a competitive game with binary ...
Jianfeng Hu, Xiaofeng Li, Jinghai Yin
PDPTA
2003
13 years 10 months ago
Java Resources for Teaching Reinforcement Learning
In this paper we present a library of classes for programming reinforcement learning simulations in Java. This library is based upon the standard by Sutton and Santamaria [1], ...
Amy J. Kerr, Todd W. Neller, Christopher J. La Pil...