Search Sciweavers | Sciweavers

54 search results - page 6 / 11

» Convergence Results for Single-Step On-Policy Reinforcement-...

177

Voted

NIPS
1994

90views Information Technology» more NIPS 1994»

Reinforcement Learning with Soft State Aggregation

15 years 9 months ago

Download www.eecs.umich.edu

It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortun...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

174

click to vote

ICML
2003
IEEE

146views Machine Learning» more ICML 2003»

TD(0) Converges Provably Faster than the Residual Gradient Algorithm

16 years 8 months ago

Download www.hpl.hp.com

In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...

Ralf Schoknecht, Artur Merke

claim paper

Read More »

289

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

15 years 7 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

221

click to vote

IJCAI
2001

119views Artificial Intelligence» more IJCAI 2001»

Rational and Convergent Learning in Stochastic Games

15 years 9 months ago

Download reference.kfupm.edu.sa

This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

242

click to vote

JAIR
2002

163views more JAIR 2002»

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

15 years 7 months ago

Download www.jair.org

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is main...

Xin Xu, Hangen He, Dewen Hu

claim paper

Read More »

« Prev « First page 6 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers