Search Sciweavers | Sciweavers

1233 search results - page 52 / 247

» Reinforcement Learning in MirrorBot

122

click to vote

EWCBR
2008
Springer

251views Automated Reasoning» more EWCBR 2008»

Recognizing the Enemy: Combining Reinforcement Learning with Strategy Selection Using Case-Based Reasoning

15 years 4 months ago

Download www.cse.lehigh.edu

This paper presents CBRetaliate, an agent that combines Case-Based Reasoning (CBR) and Reinforcement Learning (RL) algorithms. Unlike most previous work where RL is used to improve...

Bryan Auslander, Stephen Lee-Urban, Chad Hogg, H&e...

claim paper

Read More »

100

Voted

ESANN
2008

123views Neural Networks» more ESANN 2008»

Safe exploration for reinforcement learning

15 years 4 months ago

Download ahans.de

In this paper we define and address the problem of safe exploration in the context of reinforcement learning. Our notion of safety is concerned with states or transitions that can ...

Alexander Hans, Daniel Schneegaß, Anton Maxi...

claim paper

Read More »

133

click to vote

ESANN
2008

164views Neural Networks» more ESANN 2008»

Multilayer Perceptrons with Radial Basis Functions as Value Functions in Reinforcement Learning

15 years 4 months ago

Download www.dice.ucl.ac.be

Using multilayer perceptrons (MLPs) to approximate the state-action value function in reinforcement learning (RL) algorithms could become a nightmare due to the constant possibilit...

Victor Uc Cetina

claim paper

Read More »

111

click to vote

NIPS
1994

178views Information Technology» more NIPS 1994»

Generalization in Reinforcement Learning: Safely Approximating the Value Function

15 years 4 months ago

Download www.ri.cmu.edu

To appear in: G. Tesauro, D. S. Touretzky and T. K. Leen, eds., Advances in Neural Information Processing Systems 7, MIT Press, Cambridge MA, 1995. A straightforward approach to t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

113

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

14 years 9 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

« Prev « First page 52 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers