Distributed learning in cognitive radio networks: Multi-armed bandit with distributed multiple players

15 years 8 months ago

Download www.ece.ucdavis.edu

—We consider a cognitive radio network with distributed multiple secondary users, where each user independently searches for spectrum opportunities in multiple channels without exchanging information with others. The occupancy of each channel is modeled as an i.i.d. Bernoulli process with unknown mean. Users choosing the same channel collide, and none or only one receives reward depending on the collision model. This problem can be formulated as a decentralized multi-armed bandit problem. We measure the performance of a decentralized policy by the system regret, deﬁned as the total reward loss with respect to the optimal performance under the perfect scenario where all channel parameters are known to all users and collisions among secondary users are eliminated through perfect scheduling. We show that the minimum system regret grows with time at the same logarithmic order as in the centralized counterpart, where users exchange observations and make decisions jointly. We propose a b...

Keqin Liu, Qing Zhao

Real-time Traffic

Cognitive Radio Network | Decentralized Multi-armed Bandit | ICASSP 2010 | Signal Processing | Total Reward Loss |

claim paper

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Keqin Liu, Qing Zhao

Comments (0)

Sciweavers

Distributed learning in cognitive radio networks: Multi-armed bandit with distributed multiple players

Cognitive Radio Network | Decentralized Multi-armed Bandit | ICASSP 2010 | Signal Processing | Total Reward Loss |

Explore & Download

Productivity Tools

Sciweavers