Search Sciweavers | Sciweavers

205 search results - page 38 / 41

» One-Counter Stochastic Games

191

click to vote

SIAMCOMP
2002

124views more SIAMCOMP 2002»

The Nonstochastic Multiarmed Bandit Problem

15 years 7 months ago

Download homes.dsi.unimi.it

Abstract. In the multiarmed bandit problem, a gambler must decide which arm of K nonidentical slot machines to play in a sequence of trials so as to maximize his reward. This class...

Peter Auer, Nicolò Cesa-Bianchi, Yoav Freun...

claim paper

Read More »

199

Voted

GLOBECOM
2008
IEEE

133views Communications» more GLOBECOM 2008»

Foresighted Resource Reciprocation Strategies in P2P Networks

16 years 1 months ago

Download medianetlab.ee.ucla.edu

—We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...

Hyunggon Park, Mihaela van der Schaar

claim paper

Read More »

178

Voted

ICRA
2007
IEEE

128views Robotics» more ICRA 2007»

Adaptive Play Q-Learning with Initial Heuristic Approximation

16 years 1 months ago

Download www.damas.ift.ulaval.ca

Abstract— The problem of an effective coordination of multiple autonomous robots is one of the most important tasks of the modern robotics. In turn, it is well known that the lea...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

203

Voted

ACMICEC
2004
ACM

161views ECommerce» more ACMICEC 2004»

The 2003 Supply Chain Management Trading Agent Competition

16 years 26 days ago

Download www.eecs.harvard.edu

Supply Chain Management deals with the planning and coordination of bidding, production and procurement activities across the multiple organizations involved in the delivery of on...

Raghu Arunachalam, Norman M. Sadeh

claim paper

Read More »

200

Voted

ECML
2004
Springer

137views Machine Learning» more ECML 2004»

Analyzing Multi-agent Reinforcement Learning Using Evolutionary Dynamics

16 years 24 days ago

Download www.personeel.unimaas.nl

In this paper, we show how the dynamics of Q-learning can be visualized and analyzed from a perspective of Evolutionary Dynamics (ED). More speciﬁcally, we show how ED can be use...

Pieter Jan't Hoen, Karl Tuyls

claim paper

Read More »

« Prev « First page 38 / 41 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers