Sciweavers

2027 search results - page 8 / 406
» When is it best to best-respond
Sort
View
ICML
2003
IEEE
14 years 10 months ago
AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Oppon
A satisfactory multiagent learning algorithm should, at a minimum, learn to play optimally against stationary opponents and converge to a Nash equilibrium in self-play. The algori...
Vincent Conitzer, Tuomas Sandholm
ALDT
2009
Springer
142views Algorithms» more  ALDT 2009»
14 years 4 months ago
Finding Best k Policies
Abstract. An optimal probabilistic-planning algorithm solves a problem, usually modeled by a Markov decision process, by finding its optimal policy. In this paper, we study the k ...
Peng Dai, Judy Goldsmith
CVPR
2007
IEEE
14 years 12 months ago
Active Visual Object Reconstruction using D-, E-, and T-Optimal Next Best Views
In visual 3-D reconstruction tasks with mobile cameras, one wishes to move the cameras so that they provide the views that lead to the best reconstruction result. When the camera ...
Stefan Wenhardt, Benjamin Deutsch, Elli Angelopoul...
COLT
2010
Springer
13 years 8 months ago
Best Arm Identification in Multi-Armed Bandits
We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...
Jean-Yves Audibert, Sébastien Bubeck, R&eac...
DCOSS
2011
Springer
12 years 9 months ago
Link quality ranking: Getting the best out of unreliable links
—Link quality estimation has been an active area of research within the wireless sensor network community. It is now well known that the estimation of reliable links requires few...
Marco Zuniga, Izabela Irzynska, Jan-Hinrich Hauer,...