Search Sciweavers | Sciweavers

272 search results - page 26 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...


NIPS
2007


Stable Dual Dynamic Programming

15 years 8 months ago

Download webdocs.cs.ualberta.ca

Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...



JMLR
2010


A Generalized Path Integral Control Approach to Reinforcement Learning

15 years 1 month ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal



ICML
2008
IEEE


Learning all optimal policies with multiple criteria

16 years 7 months ago

Download leon.barrettnexus.com

We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear pref...

Leon Barrett, Srini Narayanan



ICML
1998
IEEE


Value Function Based Production Scheduling

16 years 7 months ago

Download www.ri.cmu.edu

Production scheduling, the problem of sequentially configuring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...

Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...



CCGRID
2008
IEEE


Grid Differentiated Services: A Reinforcement Learning Approach

16 years 1 months ago

Download hal.inria.fr

Large scale production grids are a major case for autonomic computing. Following the classical definition of Kephart, an autonomic computing system should optimize its own beha...

Julien Perez, Cécile Germain-Renaud, Balá...

