Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch
- 21 December 2012
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews)
- Vol. 42 (6), 1742-1751
- https://doi.org/10.1109/tsmcc.2012.2218596
Abstract
This paper proposes a fully distributed multiagent-based reinforcement learning method for optimal reactive power dispatch. According to the method, two agents communicate with each other only if their corresponding buses are electrically coupled. The global rewards that are required for learning are obtained with a consensus-based global information discovery algorithm, which has been demonstrated to be efficient and reliable. Based on the discovered global rewards, a distributed Q-learning algorithm is implemented to minimize the active power loss while satisfying operational constraints. The proposed method does not require accurate system model and can learn from scratch. Simulation studies with power systems of different sizes show that the method is very computationally efficient and able to provide near-optimal solutions. It can be observed that prior knowledge can significantly speed up the learning process and decrease the occurrences of undesirable disturbances. The proposed method has good potential for online implementation.Keywords
This publication has 23 references indexed in Scilit:
- Stable Multi-Agent-Based Load Shedding Algorithm for Power SystemsIEEE Transactions on Power Systems, 2011
- Novel Multiagent Based Load Restoration Algorithm for MicrogridsIEEE Transactions on Smart Grid, 2011
- Hysteretic Q-learning : an algorithm for Decentralized Reinforcement Learning in Cooperative Multi-Agent TeamsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- A Q-decomposition and bounded RTDP approach to resource allocationPublished by Association for Computing Machinery (ACM) ,2007
- Consensus and Cooperation in Networked Multi-Agent SystemsProceedings of the IEEE, 2007
- Handbook of Learning and Approximate Dynamic ProgrammingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Implementation of adaptive critic-based neurocontrollers for turbogenerators in a multimachine power systemIEEE Transactions on Neural Networks, 2003
- Dual heuristic programming excitation neurocontrol for generators in a multimachine power systemIEEE Transactions on Industry Applications, 2003
- Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogeneratorIEEE Transactions on Neural Networks, 2002
- Adaptive critic designsIEEE Transactions on Neural Networks, 1997