Search Sciweavers | Sciweavers

165 search results - page 5 / 33

» Exploration and apprenticeship learning in reinforcement lea...

click to vote

ICASSP
2011
IEEE

153views Signal Processing» more ICASSP 2011»

Reinforcement learning for energy-efficient wireless transmission

12 years 11 months ago

Download mirlab.org

We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified fra...

Nicholas Mastronarde, Mihaela van der Schaar

claim paper

Read More »

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

13 years 2 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

click to vote

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

14 years 8 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

click to vote

IJRR
2008

151views more IJRR 2008»

Trajectory Optimization using Reinforcement Learning for Map Exploration

13 years 7 months ago

Download mapleleaf.csail.mit.edu

Automatically building maps from sensor data is a necessary and fundamental skill for mobile robots; as a result, considerable research attention has focused on the technical chall...

Thomas Kollar, Nicholas Roy

claim paper

Read More »

click to vote

EVOW
2003
Springer

141views Artificial Intelligence» more EVOW 2003»

Exploring the T-Maze: Evolving Learning-Like Robot Behaviors Using CTRNNs

14 years 28 days ago

Download infoscience.epfl.ch

Abstract. This paper explores the capabilities of continuous time recurrent neural networks (CTRNNs) to display reinforcement learning-like abilities on a set of T-Maze and double ...

Jesper Blynel, Dario Floreano

claim paper

Read More »

« Prev « First page 5 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers