Search Sciweavers | Sciweavers

1235 search results - page 148 / 247

» Reinforcement learning in a nutshell

DAGSTUHL
2003

116 views, Software Engineering

Maximizing Learning Progress: An Internal Reward System for Development

15 years 4 months ago

Download www.csl.sony.fr

This chapter presents a generic internal reward system that drives an agent to increase the complexity of its behavior. This reward system does not reinforce a predefined task. It...

Frédéric Kaplan, Pierre-Yves Oudeyer

ICRA
2009
IEEE

227 views, Robotics

Adaptive autonomous control using online value iteration with Gaussian processes

15 years 10 months ago

Download www-personal.acfr.usyd.edu.au

In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, o...

Axel Rottmann, Wolfram Burgard

Publication

233 views

Sparse reward processes

14 years 1 month ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

NIPS
2008

165 views, Information Technology

Regularized Policy Iteration

15 years 4 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

NIPS
2003

148 views, Information Technology

Approximate Planning in POMDPs with Macro-Actions

15 years 4 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling
