TY - JOUR
T1 - Cooperative Dispatch of Renewable-Penetrated Microgrids Alliances Using Risk-Sensitive Reinforcement Learning
AU - Zhu, Ziqing
AU - Gao, Xiang
AU - Bu, Siqi
AU - Chan, Ka Wing
AU - Zhou, Bin
AU - Xia, Shiwei
N1 - This work was supported in part by Guang dong Basic and Applied Basic Research Foundation under Grant 2023A1515110105, in part by the Scientific Research Start up Fund for Shenzhen High-Caliber Personnel of SZPT under Grant 6022310042k, and in part by the National Natural Science Foundation of China under Grant 52077075.
Publisher Copyright:
IEEE
PY - 2024/10
Y1 - 2024/10
N2 - The integration of individual microgrids (MGs) into Microgrid Alliances (MGAs) significantly improves the reliability and flexibility of energy supply. The dispatch of MGAs is the key challenge to ensure the secure and economic operation of the distribution network. Currently, there is a lack of coordination mechanism that aligns the individual MGs' objectives with the overall welfare of the alliance. In addition, current optimization method cannot simultaneously achieve requirements of MGAs' dispatch, including fast computation speed, scalability, foresight-seeing capability, and risk mitigation against uncertainty due to high penetration of renewable distributed energy resources. In this paper, a cooperation mechanism for MGs in the MGA is proposed to harmonize MGs' own profit and the global profit of the MGA, with the guarantee of fairness. Aligned with this mechanism, a novel Risk-Sensitive Trust Region Policy Optimization (RS-TRPO), as a risk-averse multi-agent reinforcement learning algorithm, is proposed to help MGs to optimize their own dispatch strategy. This algorithm tackles the deficiencies of conventional methods, enabling the distributed, fast-speed, and foresight-seeing dispatch of MGs in a scalable manner, while considering the uncertain risks. In particular, the optimality of this algorithm is theoretically guaranteed. The outstanding computational performance is demonstrated in comparison with conventional algorithms in a modified IEEE 30-Bus Test System with 4 MGs.
AB - The integration of individual microgrids (MGs) into Microgrid Alliances (MGAs) significantly improves the reliability and flexibility of energy supply. The dispatch of MGAs is the key challenge to ensure the secure and economic operation of the distribution network. Currently, there is a lack of coordination mechanism that aligns the individual MGs' objectives with the overall welfare of the alliance. In addition, current optimization method cannot simultaneously achieve requirements of MGAs' dispatch, including fast computation speed, scalability, foresight-seeing capability, and risk mitigation against uncertainty due to high penetration of renewable distributed energy resources. In this paper, a cooperation mechanism for MGs in the MGA is proposed to harmonize MGs' own profit and the global profit of the MGA, with the guarantee of fairness. Aligned with this mechanism, a novel Risk-Sensitive Trust Region Policy Optimization (RS-TRPO), as a risk-averse multi-agent reinforcement learning algorithm, is proposed to help MGs to optimize their own dispatch strategy. This algorithm tackles the deficiencies of conventional methods, enabling the distributed, fast-speed, and foresight-seeing dispatch of MGs in a scalable manner, while considering the uncertain risks. In particular, the optimality of this algorithm is theoretically guaranteed. The outstanding computational performance is demonstrated in comparison with conventional algorithms in a modified IEEE 30-Bus Test System with 4 MGs.
KW - Microgrid alliances
KW - distributed dispatch
KW - multi-agent reinforcement learning
KW - risk mitigation
UR - http://www.scopus.com/inward/record.url?scp=85194877608&partnerID=8YFLogxK
U2 - 10.1109/TSTE.2024.3406590
DO - 10.1109/TSTE.2024.3406590
M3 - Journal article
AN - SCOPUS:85194877608
SN - 1949-3029
VL - 15
SP - 2194
EP - 2208
JO - IEEE Transactions on Sustainable Energy
JF - IEEE Transactions on Sustainable Energy
IS - 4
ER -