Search Sciweavers | Sciweavers

227 search results - page 40 / 46

» Generalized multiagent learning with performance bound

197

click to vote

ATAL
2007
Springer

106views Intelligent Agents» more ATAL 2007»

Determining confidence when integrating contributions from multiple agents

15 years 10 months ago

Download dancorkill.home.comcast.net

Integrating contributions received from other agents is an essential activity in multi-agent systems (MASs). Not only must related contributions be integrated together, but the co...

Raphen Becker, Daniel D. Corkill

claim paper

Read More »

164

Voted

CDC
2008
IEEE

104views Control Systems» more CDC 2008»

A structured multiarmed bandit problem and the greedy policy

16 years 1 months ago

Download web.mit.edu

—We consider a multiarmed bandit problem where the expected reward of each arm is a linear function of an unknown scalar with a prior distribution. The objective is to choose a s...

Adam J. Mersereau, Paat Rusmevichientong, John N. ...

claim paper

Read More »

196

Voted

ACG
2009
Springer

292views Computer Graphics» more ACG 2009»

Monte-Carlo Tree Search in Settlers of Catan

16 years 1 months ago

Download ticc.uvt.nl

Abstract. Games are considered important benchmark tasks of artiﬁcial intelligence research. Modern strategic board games can typically be played by three or more people, which m...

Istvan Szita, Guillaume Chaslot, Pieter Spronck

claim paper

Read More »

160

click to vote

COLT
2004
Springer

78views Machine Learning» more COLT 2004»

Online Geometric Optimization in the Bandit Setting Against an Adaptive Adversary

16 years 2 days ago

Download www.cs.cmu.edu

We give an algorithm for the bandit version of a very general online optimization problem considered by Kalai and Vempala [1], for the case of an adaptive adversary. In this proble...

H. Brendan McMahan, Avrim Blum

claim paper

Read More »

179

click to vote

ATAL
2006
Springer

153views Intelligent Agents» more ATAL 2006»

Evolutionary Optimization of ZIP60: A Controlled Explosion in Hyperspace

15 years 10 months ago

Download www.cs.bham.ac.uk

The "ZIP" adaptive trading algorithm has been demonstrated to outperform human traders in experimental studies of continuous double auction (CDA) markets. The original Z...

Dave Cliff

claim paper

Read More »

« Prev « First page 40 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers