Search Sciweavers | Sciweavers

3694 search results - page 16 / 739

» Stochastic complexity in learning

158

click to vote

SIAMCO
2000

117views more SIAMCO 2000»

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

15 years 6 months ago

Download eprints.iisc.ernet.in

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

claim paper

Read More »

257

click to vote

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

15 years 1 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

194

click to vote

ATAL
2007
Springer

128views Intelligent Agents» more ATAL 2007»

Advice taking in multiagent reinforcement learning

16 years 1 months ago

Download homepages.inf.ed.ac.uk

This paper proposes the β-WoLF algorithm for multiagent reinforcement learning (MARL) in the stochastic games framework that uses an additional “advice” signal to inform agen...

Michael Rovatsos, Alexandros Belesiotis

claim paper

Read More »

171

click to vote

DEDS
2006

78views more DEDS 2006»

The Equivalence between Ordinal Optimization in Deterministic Complex Problems and in Stochastic Simulation Problems

15 years 7 months ago

Download www.cfins.au.tsinghua.edu.cn

In the last decade ordinal optimization (OO) has been successfully applied in many stochastic simulation-based optimization problems (SP) and deterministic complex problems (DCP). ...

Yu-Chi Ho, Qing-Shan Jia, Qianchuan Zhao

claim paper

Read More »

193

click to vote

GECCO
2007
Springer

162views Optimization» more GECCO 2007»

Learning noise

16 years 1 months ago

Download www.cs.bham.ac.uk

In this paper we propose a genetic programming approach to learning stochastic models with unsymmetrical noise distributions. Most learning algorithms try to learn from noisy data...

Michael D. Schmidt, Hod Lipson

claim paper

Read More »

« Prev « First page 16 / 739 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers