Search Sciweavers | Sciweavers

70 search results - page 5 / 14

» Reinforcement Learning: Past, Present and Future

click to vote

UAI
2001

129views Artificial Intelligence» more UAI 2001»

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

13 years 8 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

claim paper

Read More »

click to vote

CCIA
2005
Springer

117views Artificial Intelligence» more CCIA 2005»

Direct Policy Search Reinforcement Learning for Robot Control

14 years 28 days ago

Download vicorob.udg.es

— This paper proposes a high-level Reinforcement Learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, whe...

Andres El-Fakdi, Marc Carreras, Narcís Palo...

claim paper

Read More »

click to vote

ICANN
2010
Springer

157views Neural Networks» more ICANN 2010»

Using Reinforcement Learning to Guide the Development of Self-organised Feature Maps for Visual Orienting

13 years 8 months ago

Download personalpages.manchester.ac.uk

We present a biologically inspired neural network model of visual orienting (using saccadic eye movements) in which targets are preferentially selected according to their reward va...

Kevin Brohan, Kevin N. Gurney, Piotr Dudek

claim paper

Read More »

click to vote

AAAI
2007

117views Intelligent Agents» more AAAI 2007»

Optimizing Anthrax Outbreak Detection Using Reinforcement Learning

13 years 9 months ago

Download www.aaai.org

The potentially catastrophic impact of a bioterrorist attack makes developing effective detection methods essential for public health. In the case of anthrax attack, a delay of ho...

Masoumeh T. Izadi, David L. Buckeridge

claim paper

Read More »

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

14 years 8 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

« Prev « First page 5 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers