— Multi-agent coordination problems can be cast as distributed optimization tasks. Probability Collectives (PCs) are techniques that deal with such problems in discrete and continuous spaces [1]. In this paper we are going to propose a new variation of PCs, Sequentially updated Probability Collectives. Our objective is to show how standard techniques from the statistics literature, Sequential Monte Carlo methods and kernel regression, can be used as building blocks within PCs instead of the ad hoc approaches taken previously to produce samples and estimate values in continuous action spaces. We test our algorithm in three different simulation scenarios with continuous action spaces. Two classical distributed optimization functions, the three and six dimensional Hartman functions [14] and a vehicle target assignment type game [6]. The results for the Hartman functions were close to the global optimum, and the agents managed to coordinate to the optimal solution of the target assignmen...
Michalis Smyrnakis, David S. Leslie