Search Sciweavers | Sciweavers

50 search results - page 5 / 10

» Convergence and Divergence in Standard and Averaging Reinfor...

click to vote

ATAL
2003
Springer

154views Intelligent Agents» more ATAL 2003»

Coordination in multiagent reinforcement learning: a Bayesian approach

14 years 28 days ago

Download www.cs.toronto.edu

Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinfor...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

click to vote

COLT
2007
Springer

174views Machine Learning» more COLT 2007»

Regret to the Best vs. Regret to the Average

14 years 1 months ago

Download www.math.tau.ac.il

Abstract. We study online regret minimization algorithms in a bicriteria setting, examining not only the standard notion of regret to the best expert, but also the regret to the av...

Eyal Even-Dar, Michael J. Kearns, Yishay Mansour, ...

claim paper

Read More »

click to vote

COLT
2000
Springer

129views Machine Learning» more COLT 2000»

Average-Case Complexity of Learning Polynomials

14 years 1 days ago

Download www-alg.ist.hokudai.ac.jp

The present paper deals with the averagecase complexity of various algorithms for learning univariate polynomials. For this purpose an appropriate framework is introduced. Based o...

Frank Stephan, Thomas Zeugmann

claim paper

Read More »

click to vote

UAI
2001

129views Artificial Intelligence» more UAI 2001»

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

13 years 9 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

claim paper

Read More »

click to vote

ECAI
2010
Springer

238views Artificial Intelligence» more ECAI 2010»

The Dynamics of Multi-Agent Reinforcement Learning

13 years 8 months ago

Download www.doc.ic.ac.uk

Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

claim paper

Read More »

« Prev « First page 5 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers