Search Sciweavers | Sciweavers

536 search results - page 56 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

187

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

15 years 22 days ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

112

click to vote

JMLR
2010

136views more JMLR 2010»

Approximate Riemannian Conjugate Gradient Learning for Fixed-Form Variational Bayes

14 years 9 months ago

Download jmlr.csail.mit.edu

Variational Bayesian (VB) methods are typically only applied to models in the conjugate-exponential family using the variational Bayesian expectation maximisation (VB EM) algorith...

Antti Honkela, Tapani Raiko, Mikael Kuusela, Matti...

claim paper

Read More »

141

click to vote

AGENTS
1999
Springer

126views Security Privacy» more AGENTS 1999»

General Principles of Learning-Based Multi-Agent Systems

15 years 7 months ago

Download web.engr.oregonstate.edu

We consider the problem of how to design large decentralized multiagent systems (MAS’s) in an automated fashion, with little or no hand-tuning. Our approach has each agent run a...

David Wolpert, Kevin R. Wheeler, Kagan Tumer

claim paper

Read More »

148

click to vote

CORR
2006
Springer

140views Education» more CORR 2006»

Nearly optimal exploration-exploitation decision thresholds

15 years 2 months ago

Download www.idiap.ch

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...

Christos Dimitrakakis

posted by olethros

Read More »

137

click to vote

CC
2010
Springer

120views System Software» more CC 2010»

Lower Bounds for Agnostic Learning via Approximate Rank

15 years 8 days ago

Download www.cs.utexas.edu

We prove that the concept class of disjunctions cannot be pointwise approximated by linear combinations of any small set of arbitrary real-valued functions. That is, suppose that t...

Adam R. Klivans, Alexander A. Sherstov

claim paper

Read More »

« Prev « First page 56 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers