Search Sciweavers | Sciweavers

239 search results - page 23 / 48

» Use of Simulation in Optimization of Maintenance Policies

174

click to vote

INFOCOM
2007
IEEE

185views Communications» more INFOCOM 2007»

Two-Tier Load Balancing in OSPF Wireless Back-Hauls

16 years 4 days ago

Download web.eng.fiu.edu

Abstract— High-speed wireless communication technology (e.g. WiMAX) makes it feasible and cost-effective to build wireless back-hauls for Internet access. Compared to wired count...

Xiaowen Zhang, Hao Zhu

claim paper

Read More »

184

click to vote

TMC
2012

215views Logical Reasoning» more TMC 2012»

Message Drop and Scheduling in DTNs: Theory and Practice

13 years 8 months ago

Download www-sop.inria.fr

Abstract—In order to achieve data delivery in Delay Tolerant Networks (DTN), researchers have proposed the use of store-carryand-forward protocols: a node there may store a messa...

Amir Krifa, Chadi Barakat, Thrasyvoulos Spyropoulo...

claim paper

Read More »

138

click to vote

AAMAS
2007
Springer

142views Intelligent Agents» more AAMAS 2007»

Parallel Reinforcement Learning with Linear Function Approximation

15 years 6 months ago

Download www.aamas-conference.org

In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by us...

Matthew Grounds, Daniel Kudenko

claim paper

Read More »

151

click to vote

WSC
2007

234views Modeling And Simulation» more WSC 2007»

Feasibility study of variance reduction in the logistics composite model

15 years 8 months ago

Download www.informs-sim.org

The Logistics Composite Model (LCOM) is a stochastic, discrete-event simulation that relies on probabilities and random number generators to model scenarios in a maintenance unit ...

George P. Cole III, Alan W. Johnson, J. O. Miller

claim paper

Read More »

149

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 7 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

« Prev « First page 23 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers