Sciweavers

779 search results - page 60 / 156
» Reinforcement Using Supervised Learning for Policy Generaliz...
Sort
View
NIPS
2003
13 years 9 months ago
Approximate Planning in POMDPs with Macro-Actions
Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...
Georgios Theocharous, Leslie Pack Kaelbling
PRL
2011
13 years 2 months ago
Object recognition using proportion-based prior information: Application to fisheries acoustics
: This paper addresses the inference of probabilistic classification models using weakly supervised learning. The main contribution of this work is the development of learning meth...
Riwal Lefort, Ronan Fablet, Jean-Marc Boucher
ESANN
2007
13 years 9 months ago
The Recurrent Control Neural Network
This paper presents our Recurrent Control Neural Network (RCNN), which is a model-based approach for a data-efficient modelling and control of reinforcement learning problems in di...
Anton Maximilian Schäfer, Steffen Udluft, Han...
ICML
2007
IEEE
14 years 8 months ago
Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation
Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...
Chee Wee Phua, Robert Fitch
ATAL
2007
Springer
14 years 2 months ago
Model-based function approximation in reinforcement learning
Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...
Nicholas K. Jong, Peter Stone