Search Sciweavers | Sciweavers

138

NIPS
1992

99views Information Technology» more NIPS 1992»

15 years 6 months ago

This paper describes the adaption and application of an algorithm called Feudal Reinforcement Learning to a complex gridworld navigation problem. The algorithm proved to be not ea...

Peter Dayan, Geoffrey E. Hinton

claim paper

Read More »

139

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 6 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

139

click to vote

ECML
2007
Springer

133views Machine Learning» more ECML 2007»

Transfer Learning in Reinforcement Learning Problems Through Partial Policy Recycling

15 years 12 months ago

Download dtai.cs.kuleuven.be

In this paper we investigate the relation between transfer learning in reinforcement learning with function approximation and supervised learning with concept drift. We present a n...

Jan Ramon, Kurt Driessens, Tom Croonenborghs

claim paper

Read More »

174

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

16 years 6 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

115

click to vote

IJCNN
2006
IEEE

91views Neural Networks» more IJCNN 2006»

Global Reinforcement Learning in Neural Networks with Stochastic Synapses

15 years 11 months ago

Download pavel.physics.sunysb.edu

— We have found a more general formulation of the REINFORCE learning principle which had been proposed by R. J. Williams for the case of artiﬁcial neural networks with stochast...

Xiaolong Ma, Konstantin Likharev

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers