Search Sciweavers | Sciweavers

» Reinforcement Learning: Past, Present and Future

BMEI
2008
IEEE

153views Biomedical Imaging» more BMEI 2008»

A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy

14 years 1 months ago

Download eprints.lancs.ac.uk

Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...

Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...

GECCO
2008
Springer

128views Optimization» more GECCO 2008»

Multi-agent task allocation: learning when to say no

13 years 8 months ago

Download www.cs.ucf.edu

This paper presents a communication-less multi-agent task allocation procedure that allows agents to use past experience to make non-greedy decisions about task assignments. Exper...

Adam Campbell, Annie S. Wu, Randall Shumaker

ECAI
2008
Springer

123views Artificial Intelligence» more ECAI 2008»

Learning to Select Object Recognition Methods for Autonomous Mobile Robots

13 years 9 months ago

Download www.iiia.csic.es

Selecting which algorithms should be used by a mobile robot computer vision system is a decision that is usually made a priori by the system developer, based on past experience and...

Reinaldo A. C. Bianchi, Arnau Ramisa, Ramon Ló...

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

14 years 1 months ago

Download www.idsia.ch

This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

14 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber
