On improving the performance of simulation-based algorithms for average reward processes with application to network pricing

14 years 4 months ago

Download home.gwu.edu

We address performance issues associated with simulationbased algorithms for optimizing Markov reward processes. Specifically, we are concerned with algorithms that exploit the regenerative structure of the process in estimating the gradient of the objective function with the respect to control parameters. In many applications, states which initially have short expected return-times may eventually become infrequently visited as the control parameters are updated. As a result, unbiased updates to the control parameters can become so infrequent as to render the algorithm impractical. The performance of these algorithms can be significantly improved by adapting the state which is used to mark regenerative cycles. In this paper, we introduce such an adaptation procedure, give initial arguments for its convergence properties, and illustrate its application in two numerical examples. The examples relate to the optimal pricing of communication network resources for congestion-controlled traf...

Enrique Campos-Náñez, Stephen D. Pat

Real-time Traffic

Algorithms | Control Parameters | Markov Reward Processes | Modeling And Simulation | WSC 2001 |

claim paper

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2001
Where	WSC
Authors	Enrique Campos-Náñez, Stephen D. Patek

Comments (0)

Sciweavers

On improving the performance of simulation-based algorithms for average reward processes with application to network pricing

Algorithms | Control Parameters | Markov Reward Processes | Modeling And Simulation | WSC 2001 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers