Search Sciweavers | Sciweavers

161 search results - page 32 / 33

» Convergence Problems of General-Sum Multiagent Reinforcement...


WELCOM
2001
Springer

132 views, ECommerce

Incentives for Sharing in Peer-to-Peer Networks

15 years 11 months ago

Download crypto.stanford.edu

The recent and unprecedented surge of public interest in peer-to-peer file sharing has led to a variety of interesting research questions. In this paper, we will address the ince...

Philippe Golle, Kevin Leyton-Brown, Ilya Mironov, ...



UAI
2008

242 views, Artificial Intelligence

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 8 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...



ATAL
2006
Springer

168 views, Intelligent Agents

Efficient agent-based cluster ensembles

15 years 10 months ago

Download web.engr.oregonstate.edu

Numerous domains ranging from distributed data acquisition to knowledge reuse need to solve the cluster ensemble problem of combining multiple clusterings into a single unified cl...

Adrian K. Agogino, Kagan Tumer



JMLR
2006

124 views

Policy Gradient in Continuous Time

15 years 6 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos



ATAL
2003
Springer

123 views, Intelligent Agents

Team formation and communication restrictions in collectives

16 years 3 days ago

Download ti.arc.nasa.gov

A collective of agents often needs to maximize a “world utility” function which rates the performance of an entire system, while subject to communication restrictions among th...

Adrian K. Agogino, Kagan Tumer

