Search Sciweavers | Sciweavers

102 search results - page 6 / 21

» Efficient Asymptotic Approximation in Temporal Difference Le...

176

Voted

DSMML
2004
Springer

107views Machine Learning» more DSMML 2004»

Variational Bayes Estimation of Mixing Coefficients

15 years 10 months ago

Download www.gla.ac.uk

We investigate theoretically some properties of variational Bayes approximations based on estimating the mixing coefficients of known densities. We show that, with probability 1 a...

Bo Wang 0002, D. M. Titterington

claim paper

Read More »

206

click to vote

CORR
2010
Springer

152views Education» more CORR 2010»

Neuroevolutionary optimization

15 years 7 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

claim paper

Read More »

186

click to vote

JMLR
2006

153views more JMLR 2006»

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

15 years 7 months ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis

claim paper

Read More »

220

click to vote

TMM
2010

270views Management» more TMM 2010»

Sequence Multi-Labeling: A Unified Video Annotation Scheme With Spatial and Temporal Context

15 years 1 months ago

Download www.jdl.ac.cn

Abstract--Automatic video annotation is a challenging yet important problem for content-based video indexing and retrieval. In most existing works, annotation is formulated as a mu...

Yuanning Li, YongHong Tian, Ling-Yu Duan, Jingjing...

claim paper

Read More »

200

click to vote

IAT
2008
IEEE

161views Intelligent Agents» more IAT 2008»

Scaling Up Multi-agent Reinforcement Learning in Complex Domains

15 years 7 months ago

Download www3.ntu.edu.sg

TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...

Dan Xiao, Ah-Hwee Tan

claim paper

Read More »

« Prev « First page 6 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers