Search Sciweavers | Sciweavers

4544 search results - page 18 / 909

» Reinforcement Learning with Time

208

click to vote

ICML
2003
IEEE

104views Machine Learning» more ICML 2003»

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

16 years 21 days ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong

claim paper

Read More »

229

Voted

ICMLA
2010

211views Machine Learning» more ICMLA 2010»

Ensembles of Neural Networks for Robust Reinforcement Learning

15 years 5 months ago

Download ahans.de

Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, their traini...

Alexander Hans, Steffen Udluft

claim paper

Read More »

184

Voted

HPDC
2009
IEEE

108views Distributed And Parallel Com...» more HPDC 2009»

Maestro: a self-organizing peer-to-peer dataflow framework using reinforcement learning

15 years 11 months ago

Download www.cs.vu.nl

In this paper we describe Maestro, a dataflow computation framework for Ibis, our Java-based grid middleware. The novelty of Maestro is that it is a self-organizing peer-to-peer s...

C. van Reeuwijk

claim paper

Read More »

210

click to vote

WEBI
2009
Springer

120views Internet Technology» more WEBI 2009»

Adapting Reinforcement Learning for Trust: Effective Modeling in Dynamic Environments

16 years 2 months ago

Download mas.cmpe.boun.edu.tr

—In open multiagent systems, agents need to model their environments in order to identify trustworthy agents. Models of the environment should be accurate so that decisions about...

Özgür Kafali, Pinar Yolum

claim paper

Read More »

191

Voted

CCGRID
2008
IEEE

127views Distributed And Parallel Com...» more CCGRID 2008»

Grid Differentiated Services: A Reinforcement Learning Approach

16 years 1 months ago

Download hal.inria.fr

—Large scale production grids are a major case for autonomic computing. Following the classical deﬁnition of Kephart, an autonomic computing system should optimize its own beha...

Julien Perez, Cécile Germain-Renaud, Bal&aa...

claim paper

Read More »

« Prev « First page 18 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers