On the myopic policy for a class of restless bandit problems with applications in dynamic multichannel access
- 1 December 2009
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 3592-3597
- https://doi.org/10.1109/cdc.2009.5400366
Abstract
We consider a class of restless multi-armed bandit problems that arises in multi-channel opportunistic communications, where channels are modeled as independent and stochastically identical Gilbert-Elliot channels and channel state observations are subject to errors. We show that the myopic channel selection policy has a semi-universal structure that obviates the need to know the Markovian transition probabilities of the channel states. Based on this structure, we establish closed-form lower and upper bounds on the steady-state throughput achieved by the myopic policy. Furthermore, we characterize the approximation factor of the myopic policy to bound its worst-case performance loss with respect to the optimal performance.Keywords
This publication has 12 references indexed in Scilit:
- Optimality of Myopic Sensing in Multichannel Opportunistic AccessIEEE Transactions on Information Theory, 2009
- Algorithms for Dynamic Spectrum Access With Learning for Cognitive RadioIEEE Transactions on Signal Processing, 2009
- On myopic sensing for multi-channel opportunistic access: structure, optimality, and performanceIEEE Transactions on Wireless Communications, 2008
- Multi-UAV dynamic routing with partial observations using restless bandit allocation indicesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- Joint Design and Separation Principle for Opportunistic Spectrum Access in the Presence of Sensing ErrorsIEEE Transactions on Information Theory, 2008
- Performance improvement with predictive channel selection for cognitive radiosPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- A Near Optimal Policy for Channel Allocation in Cognitive RadioLecture Notes in Computer Science, 2008
- Low-Complexity Approaches to Spectrum Opportunity TrackingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Decentralized cognitive MAC for opportunistic spectrum access in ad hoc networks: A POMDP frameworkIEEE Journal on Selected Areas in Communications, 2007
- The Complexity of Optimal Queuing Network ControlMathematics of Operations Research, 1999