Search Sciweavers | Sciweavers

1630 search results

» Coordinated Reinforcement Learning

SIGGRAPH
2010
ACM

295 views; Computer Graphics

Learning behavior styles with inverse reinforcement learning

15 years 8 months ago

Download grail.cs.washington.edu

We present a method for inferring the behavior styles of character controllers from a small set of examples. We show that a rich set of behavior variations can be captured by dete...

Seong Jae Lee, Zoran Popovic

NIPS
2001

101 views; Information Technology

Reinforcement Learning with Long Short-Term Memory

15 years 4 months ago

Download staff.science.uva.nl

This paper presents reinforcement learning with a Long Short-Term Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage learning and directed exploration can...

Bram Bakker

NIPS
2001

121 views; Information Technology

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 4 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

BROADNETS
2007
IEEE

119 views; Computer Networks

Reinforcement learning based routing in all-optical networks with physical impairments

15 years 7 months ago

Download www.tsp.ece.mcgill.ca

We present and evaluate a reinforcement learning-based RWA algorithm for all-optical networks subject to physical impairments. The technique is suitable for decentralized...

Yvan Pointurier, Fariba Heidari

ICML
2009
IEEE

160 views; Machine Learning

The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning

16 years 4 months ago

Download www.research.rutgers.edu

The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an ...

Carlos Diuk, Lihong Li, Bethany R. Leffler
