1262 search results - page 33 / 253

» Reinforcement Learning: An Introduction

NIPS
2001

Reinforcement Learning with Long Short-Term Memory

15 years 4 months ago

Download staff.science.uva.nl

This paper presents reinforcement learning with a Long Short-Term Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage learning and directed exploration can...

Bram Bakker

NIPS
2001

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 4 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

BROADNETS
2007
IEEE

Reinforcement learning based routing in all-optical networks with physical impairments

15 years 7 months ago

Download www.tsp.ece.mcgill.ca

Abstract: We present and evaluate a reinforcement learning-based RWA algorithm for all-optical networks subject to physical impairments. The technique is suitable for decentralized...

Yvan Pointurier, Fariba Heidari

ICML
2009
IEEE

The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning

16 years 4 months ago

Download www.research.rutgers.edu

The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an ...

Carlos Diuk, Lihong Li, Bethany R. Leffler

IJAIT
2008

Learning to Behave in Space: a Qualitative Spatial Representation for Robot Navigation with Reinforcement Learning

15 years 3 months ago

Download www.aussagekraft.de

ion mechanism to create a representation of space consisting of the circular order of detected landmarks and the relative position of walls towards the agent's moving directio...

Lutz Frommberger
