On the myopic policy for a class of restless bandit problems with applications in dynamic multichannel access

14 years 7 months ago

Download www.ece.ucdavis.edu

We consider a class of restless multi-armed bandit problems that arises in multi-channel opportunistic communications, where channels are modeled as independent and stochastically identical GilbertElliot channels and channel state observations are subject to errors. We show that the myopic channel selection policy has a semi-universal structure that obviates the need to know the Markovian transition probabilities of the channel states. Based on this semi-universal structure, we establish closed-form lower and upper bounds on the maximum throughput (i.e., average reward) achieved by the myopic policy. Furthermore, we characterize the approximation factor of the myopic policy by considering a genie-aided system.

Keqin Liu, Qing Zhao

Real-time Traffic

CDC 2009 | Channel States | Control Systems | Myopic Channel Selection | Myopic Policy |

claim paper

Post Info
More Details (n/a)

Added	21 Jul 2010
Updated	21 Jul 2010
Type	Conference
Year	2009
Where	CDC
Authors	Keqin Liu, Qing Zhao

Comments (0)

Sciweavers

On the myopic policy for a class of restless bandit problems with applications in dynamic multichannel access

CDC 2009 | Channel States | Control Systems | Myopic Channel Selection | Myopic Policy |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers