Search Sciweavers | Sciweavers

779 search results - page 60 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

123

Voted

NIPS
2003

148views Information Technology» more NIPS 2003»

Approximate Planning in POMDPs with Macro-Actions

15 years 3 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling

claim paper

Read More »

149

click to vote

PRL
2011

219views Computer Networks» more PRL 2011»

Object recognition using proportion-based prior information: Application to fisheries acoustics

14 years 9 months ago

Download archimer.ifremer.fr

: This paper addresses the inference of probabilistic classification models using weakly supervised learning. The main contribution of this work is the development of learning meth...

Riwal Lefort, Ronan Fablet, Jean-Marc Boucher

claim paper

Read More »

Voted

ESANN
2007

122views Neural Networks» more ESANN 2007»

The Recurrent Control Neural Network

15 years 3 months ago

Download www.dice.ucl.ac.be

This paper presents our Recurrent Control Neural Network (RCNN), which is a model-based approach for a data-eﬃcient modelling and control of reinforcement learning problems in di...

Anton Maximilian Schäfer, Steffen Udluft, Han...

claim paper

Read More »

125

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

16 years 3 months ago

Download www.machinelearning.org

Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...

Chee Wee Phua, Robert Fitch

claim paper

Read More »

116

click to vote

ATAL
2007
Springer

162views Intelligent Agents» more ATAL 2007»

Model-based function approximation in reinforcement learning

15 years 8 months ago

Download userweb.cs.utexas.edu

Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

« Prev « First page 60 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers