payoff functions | Sciweavers

186

CORR
2008
Springer

136views Education» more CORR 2008»

15 years 6 months ago

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of n trials so as to maximize the total payoff of the chosen strategies. While ...

Robert Kleinberg, Aleksandrs Slivkins, Eli Upfal

claim paper

Read More »

165

click to vote

CEC
2005
IEEE

99views Artificial Intelligence» more CEC 2005»

XCS with computed prediction for the learning of Boolean functions

16 years 5 days ago

Download www.eskimo.com

Computed prediction represents a major shift in learning classiﬁer system research. XCS with computed prediction, based on linear approximators, has been applied so far to functi...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...

claim paper

Read More »

171

click to vote

LICS
2007
IEEE

121views Automated Reasoning» more LICS 2007»

Limits of Multi-Discounted Markov Decision Processes

16 years 25 days ago

Download www.labri.fr

Markov decision processes (MDPs) are controllable discrete event systems with stochastic transitions. The payoff received by the controller can be evaluated in different ways, dep...

Hugo Gimbert, Wieslaw Zielonka

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers