Search Sciweavers | Sciweavers

1863 search results - page 15 / 373

» Multiagent learning using a variable learning rate

ICML
2006
IEEE

145 views, Machine Learning

Full Bayesian network classifiers

16 years 7 months ago

Download delivery.acm.org

The structure of a Bayesian network (BN) encodes variable independence. Learning the structure of a BN, however, is typically of high computational complexity. In this paper, we e...

Jiang Su, Harry Zhang

ICML
2005
IEEE

104 views, Machine Learning

New d-separation identification results for learning continuous latent variable models

16 years 7 months ago

Download www.machinelearning.org

Learning the structure of graphical models is an important task, but one of considerable difficulty when latent variables are involved. Because conditional independences using hid...

Ricardo Silva, Richard Scheines

ICMLA
2004

109 views, Machine Learning

Variable resolution discretization in the joint space

15 years 8 months ago

Download highentropy.com

We present JoSTLe, an algorithm that performs value iteration on control problems with continuous actions, allowing this useful reinforcement learning technique to be applied to p...

Christopher K. Monson, David Wingate, Kevin D. Sep...

SASO
2008
IEEE

116 views, Control Systems

Leveraging Organizational Guidance Policies with Learning to Self-Tune Multiagent Systems

16 years 1 month ago

Download people.cis.ksu.edu

As organization-based multiagent systems are applied to more complex problems, configuring and tuning the systems can become nearly as complex as the original problem a system wa...

Scott J. Harmon, Scott A. DeLoach, Robby, Doina Ca...

IJCAI
2007

179 views, Artificial Intelligence

Heuristic Selection of Actions in Multiagent Reinforcement Learning

15 years 8 months ago

Download www.ijcai.org

This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the well-known Multiagent Reinforcement Learni...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...
