Sciweavers

1863 search results - page 15 / 373
» Multiagent learning using a variable learning rate
Sort
View
ICML
2006
IEEE
14 years 8 months ago
Full Bayesian network classifiers
The structure of a Bayesian network (BN) encodes variable independence. Learning the structure of a BN, however, is typically of high computational complexity. In this paper, we e...
Jiang Su, Harry Zhang
ICML
2005
IEEE
14 years 8 months ago
New d-separation identification results for learning continuous latent variable models
Learning the structure of graphical models is an important task, but one of considerable difficulty when latent variables are involved. Because conditional independences using hid...
Ricardo Silva, Richard Scheines
ICMLA
2004
13 years 9 months ago
Variable resolution discretization in the joint space
We present JoSTLe, an algorithm that performs value iteration on control problems with continuous actions, allowing this useful reinforcement learning technique to be applied to p...
Christopher K. Monson, David Wingate, Kevin D. Sep...
SASO
2008
IEEE
14 years 1 months ago
Leveraging Organizational Guidance Policies with Learning to Self-Tune Multiagent Systems
As organization-based multiagent systems are applied to more complex problems, configuring and tuning the systems can become nearly as complex as the original problem a system wa...
Scott J. Harmon, Scott A. DeLoach, Robby, Doina Ca...
IJCAI
2007
13 years 9 months ago
Heuristic Selection of Actions in Multiagent Reinforcement Learning
This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the wellknown Multiagent Reinforcement Learni...
Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...