Search Sciweavers | Sciweavers

199 search results - page 2 / 40

» Efficient Reinforcement Learning with Relocatable Action Mod...

click to vote

SCAI
2008

246views Artificial Intelligence» more SCAI 2008»

Fast Learning in an Actor-Critic Architecture with Reward and Punishment

13 years 9 months ago

Download www.lucs.lu.se

Abstract. A reinforcement architecture is introduced that consists of three complementary learning systems with different generalization abilities. The ACTOR learns state-action as...

Christian Balkenius, Stefan Winberg

claim paper

Read More »

click to vote

ICANN
2005
Springer

151views Neural Networks» more ICANN 2005»

Reinforcement Learning in MirrorBot

14 years 1 months ago

Download fias.uni-frankfurt.de

For this special session of EU projects in the area of NeuroIT, we will review the progress of the MirrorBot project with special emphasis on its relation to reinforcement learning...

Cornelius Weber, David Muse, Mark Elshaw, Stefan W...

claim paper

Read More »

click to vote

CORR
1998
Springer

164views Education» more CORR 1998»

Training Reinforcement Neurocontrollers Using the Polytope Algorithm

13 years 7 months ago

Download zeus.cs.uoi.gr

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...

Aristidis Likas, Isaac E. Lagaris

claim paper

Read More »

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

14 years 8 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

« Prev « First page 2 / 40 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers