Search Sciweavers | Sciweavers

16 search results - page 1 / 4

» On Average Versus Discounted Reward Temporal-Difference Lear...

204

click to vote

ML
2002
ACM

168views Machine Learning» more ML 2002»

On Average Versus Discounted Reward Temporal-Difference Learning

15 years 6 months ago

Download web.mit.edu

We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...

John N. Tsitsiklis, Benjamin Van Roy

claim paper

Read More »

158

click to vote

ALT
2006
Springer

109views Machine Learning» more ALT 2006»

General Discounting Versus Average Reward

16 years 3 months ago

Download www.idsia.ch

Consider an agent interacting with an environment in cycles. In every interaction cycle the agent is rewarded for its performance. We compare the average reward U from cycle 1 to ...

Marcus Hutter

claim paper

Read More »

186

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 7 months ago

Download reference.kfupm.edu.sa

Research in reinforcementlearning (RL)has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the averagereward frame...

Sridhar Mahadevan

claim paper

Read More »

201

click to vote

CIG
2006
IEEE

202views Applied Computing» more CIG 2006»

Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation

16 years 1 months ago

Download algoval.essex.ac.uk

Abstract— This paper compares the use of temporal difference learning (TDL) versus co-evolutionary learning (CEL) for acquiring position evaluation functions for the game of Othe...

Simon M. Lucas, Thomas Philip Runarsson

claim paper

Read More »

180

click to vote

ICML
2001
IEEE

172views Machine Learning» more ICML 2001»

Continuous-Time Hierarchical Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 1 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers