Search Sciweavers | Sciweavers

211

COLT
2008
Springer

179views Machine Learning» more COLT 2008»

Adapting to a Changing Environment: the Brownian Restless Bandits

15 years 9 months ago

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

claim paper

Read More »

219

click to vote

JAIR
2008

119views more JAIR 2008»

A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics

15 years 7 months ago

Download www.ece.utk.edu

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Due to the complexity of the problem, the majority of the previo...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

195

click to vote

WIOPT
2010
IEEE

151views Computer Networks» more WIOPT 2010»

Evolutionary forwarding games in Delay Tolerant Networks

15 years 5 months ago

Download hal.archives-ouvertes.fr

—In this paper, we apply evolutionary games to non-cooperative forwarding control of Delay Tolerant Networks (DTN). We focus our study on the probability to deliver a message fro...

Rachid El Azouzi, Francesco De Pellegrini, Vijay K...

claim paper

Read More »

180

Voted

NIPS
2007

146views Information Technology» more NIPS 2007»

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

15 years 8 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett

claim paper

Read More »

214

click to vote

COLT
2006
Springer

132views Machine Learning» more COLT 2006»

Online Learning with Variable Stage Duration

15 years 11 months ago

Download www.ece.mcgill.ca

We consider online learning in repeated decision problems, within the framework of a repeated game against an arbitrary opponent. For repeated matrix games, well known results esta...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers