Search Sciweavers | Sciweavers

945 search results - page 15 / 189

» Dialog Convergence and Learning

156

click to vote

SIAMCO
2000

117views more SIAMCO 2000»

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

15 years 6 months ago

Download eprints.iisc.ernet.in

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

claim paper

Read More »

181

click to vote

ESANN
2003

151views Neural Networks» more ESANN 2003»

Accelerating the convergence speed of neural networks learning methods using least squares

15 years 8 months ago

Download www.dice.ucl.ac.be

In this work a hybrid training scheme for the supervised learning of feedforward neural networks is presented. In the proposed method, the weights of the last layer are obtained em...

Oscar Fontenla-Romero, Deniz Erdogmus, José...

claim paper

Read More »

193

click to vote

ICML
2000
IEEE

192views Machine Learning» more ICML 2000»

Convergence Problems of General-Sum Multiagent Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Stochastic games are a generalization of MDPs to multiple agents, and can be used as a framework for investigating multiagent learning. Hu and Wellman (1998) recently proposed a m...

Michael H. Bowling

claim paper

Read More »

213

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

15 years 8 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir

claim paper

Read More »

160

click to vote

FUZZIEEE
2007
IEEE

132views Fuzzy Logic» more FUZZIEEE 2007»

Fuzzy Approximation for Convergent Model-Based Reinforcement Learning

16 years 1 months ago

Download www.montefiore.ulg.ac.be

— Reinforcement learning (RL) is a learning control paradigm that provides well-understood algorithms with good convergence and consistency properties. Unfortunately, these algor...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

« Prev « First page 15 / 189 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers