Search Sciweavers | Sciweavers

227 search results - page 31 / 46

» Generalized multiagent learning with performance bound


ATAL
2010
Springer

134 views | Intelligent Agents

Cultivating desired behaviour: policy teaching via environment-dynamics tweaks

15 years 8 months ago

Download eprints.ecs.soton.ac.uk

In this paper we study, for the first time explicitly, the implications of endowing an interested party (i.e. a teacher) with the ability to modify the underlying dynamics of the ...

Zinovi Rabinovich, Lachlan Dufton, Kate Larson, Ni...


ALT
2009
Springer

128 views | Machine Learning

Pure Exploration in Multi-armed Bandits Problems

16 years 3 months ago

Download sequel.futurs.inria.fr

We consider the framework of stochastic multi-armed bandit problems and study the possibilities and limitations of strategies that explore sequentially the arms. The stra...

Sébastien Bubeck, Rémi Munos, Gilles...


ECAI
2008
Springer

208 views | Artificial Intelligence

Optimal Coalition Structure Generation In Partition Function Games

15 years 8 months ago

Download www.csc.liv.ac.uk

In multi-agent systems (MAS), coalition formation is typically studied using characteristic function game (CFG) representations, where the performance of any coalition is indepen...

Tomasz P. Michalak, Andrew Dowell, Peter McBurney,...


ALT
2008
Springer

110 views | Machine Learning

Entropy Regularized LPBoost

16 years 3 months ago

Download www.columbia.edu

In this paper we discuss boosting algorithms that maximize the soft margin of the produced linear combination of base hypotheses. LPBoost is the most straightforward boosting algor...

Manfred K. Warmuth, Karen A. Glocer, S. V. N. Vish...


NCA
2007
IEEE

90 views | Computer Networks

Implementing Atomic Data through Indirect Learning in Dynamic Networks

16 years 1 month ago

Download www.engr.uconn.edu

Developing middleware services for dynamic distributed systems, e.g., ad-hoc networks, is a challenging task given that such services deal with dynamically changing membership and...

Kishori M. Konwar, Peter M. Musial, Nicolas C. Nic...
