Q-Learning Based Energy Management Policies for a Single Sensor Node with Finite Buffer
- 27 November 2012
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Wireless Communications Letters
- Vol. 2 (1), 82-85
- https://doi.org/10.1109/wcl.2012.112012.120754
Abstract
In this paper, we consider the problem of finding optimal energy management policies in the presence of energy harvesting sources to maximize network performance. We formulate this problem in the discounted cost Markov decision process framework and apply two reinforcement learning algorithms. Prior work obtains optimal policy in the case when the conversion function mapping energy to data transmitted is linear and provides heuristic policies in the case when the same is nonlinear. Our algorithms, however, provide optimal policies regardless of the form of the conversion function. Through simulations, our policies are seen to outperform those of in the nonlinear case.Keywords
This publication has 2 references indexed in Scilit:
- Transmission with Energy Harvesting Nodes in Fading Wireless Channels: Optimal PoliciesIEEE Journal on Selected Areas in Communications, 2011
- Optimal energy management policies for energy harvesting sensor nodesIEEE Transactions on Wireless Communications, 2010