Search Sciweavers | Sciweavers

30 search results - page 1 / 6

» Model-Based Average Reward Reinforcement Learning

154

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 9 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

209

click to vote

AI
1998
Springer

177views Artificial Intelligence» more AI 1998»

Model-Based Average Reward Reinforcement Learning

15 years 5 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

claim paper

Read More »

139

click to vote

COLT
2004
Springer

99views Machine Learning» more COLT 2004»

Reinforcement Learning for Average Reward Zero-Sum Games

15 years 10 months ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The ﬁrst is based on relative Q-learning and the ...

Shie Mannor

claim paper

Read More »

154

click to vote

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 6 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

153

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 6 months ago

Download reference.kfupm.edu.sa

Research in reinforcementlearning (RL)has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the averagereward frame...

Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 1 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers