Sciweavers

227 search results for "Generalized multiagent learning with performance bound" (page 20 of 46)
ICML
2005
IEEE
14 years 9 months ago
Using additive expert ensembles to cope with concept drift
We consider online learning where the target concept can change over time. Previous work on expert prediction algorithms has bounded the worst-case performance on any subsequence ...
Jeremy Z. Kolter, Marcus A. Maloof
ML
2012
ACM
12 years 4 months ago
An alternative view of variational Bayes and asymptotic approximations of free energy
Bayesian learning, widely used in many applied data-modeling problems, is often accomplished with approximation schemes because it requires intractable computation of the posterio...
Kazuho Watanabe
ATAL
2006
Springer
14 years 23 days ago
Multi-robot learning with particle swarm optimization
We apply an adapted version of Particle Swarm Optimization to distributed unsupervised robotic learning in groups of robots with only local information. The performance of the lea...
Jim Pugh, Alcherio Martinoli
ICAC
2009
IEEE
13 years 6 months ago
Using distributed W-learning for multi-policy optimization in decentralized autonomic systems
Distributed W-Learning (DWL) is a reinforcement learning-based algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...
Ivana Dusparic, Vinny Cahill
ATAL
2008
Springer
13 years 11 months ago
Value-based observation compression for DEC-POMDPs
Representing agent policies compactly is essential for improving the scalability of multi-agent planning algorithms. In this paper, we focus on developing a pruning technique that...
Alan Carlin, Shlomo Zilberstein