Search Sciweavers | Sciweavers

43 search results - page 3 / 9

» The O.D.E. Method for Convergence of Stochastic Approximatio...


NIPS
1993


Convergence of Stochastic Iterative Dynamic Programming Algorithms

13 years 10 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms, includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...


ICML
1998
IEEE


Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm

14 years 9 months ago

Download www.lirmm.fr

In this paper, we adopt general-sum stochastic games as a framework for multiagent reinforcement learning. Our work extends previous work by Littman on zero-sum stochastic games t...

Junling Hu, Michael P. Wellman


AAMAS
2007
Springer


Parallel Reinforcement Learning with Linear Function Approximation

13 years 8 months ago

Download www.aamas-conference.org

In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by us...

Matthew Grounds, Daniel Kudenko


JAIR
2002


Efficient Reinforcement Learning Using Recursive Least-Squares Methods

13 years 8 months ago

Download www.jair.org

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is main...

Xin Xu, Hangen He, Dewen Hu


ICML
2000
IEEE


Reinforcement Learning in POMDP's via Direct Gradient Ascent

14 years 9 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled POMDPs. We introduce GPOMDP, a...

Jonathan Baxter, Peter L. Bartlett

