月刊
ISSN 1000-7229
CN 11-2583/TM
电力建设 ›› 2024, Vol. 45 ›› Issue (5): 80-93.doi: 10.12204/j.issn.1000-7229.2024.05.009
收稿日期:
2023-08-25
出版日期:
2024-05-01
发布日期:
2024-04-29
通讯作者:
李晓露(1971),女,博士,副教授,研究方向为电网调度自动化及电力系统分析与运行等,E-mail:lixiaolu_sh@163.com。作者简介:
周祥(1998),男,硕士研究生,研究方向为配电网优化运行与控制,E-mail:zxdxyx1@163.com;基金资助:
ZHOU Xiang1(), LI Xiaolu1(), LIU Jinsong2, LIN Shunfu1
Received:
2023-08-25
Published:
2024-05-01
Online:
2024-04-29
Supported by:
摘要:
智能软开关能够有效解决分布式光伏大规模接入配电网引起的电压波动问题,但会导致区域间协作程度加深,而现阶段使用多智能体深度强化学习算法进行电压优化时,各智能体仅使用各自区域内的奖励进行训练,导致智能体缺乏协同,输出策略难以保证最优性。为此提出考虑区域间辅助奖励的配电网电压优化方法,首先建立基于多智能体深度强化学习的多时间尺度电压优化框架,其次针对控制智能软开关的智能体,将各自区域内奖励定义为主奖励,邻近区域内奖励定义为辅助奖励,然后通过主、辅助奖励损失函数关于网络参数梯度的数量积分析辅助奖励对训练的有利程度,并采用演化博弈方法自适应修改辅助奖励参与因子;最后,在改进的IEEE 33节点系统验证了所提方法能够稳定智能体训练过程,提升智能体策略的优化效果。
中图分类号:
周祥, 李晓露, 柳劲松, 林顺富. 考虑区域间辅助奖励的配电网电压优化控制[J]. 电力建设, 2024, 45(5): 80-93.
ZHOU Xiang, LI Xiaolu, LIU Jinsong, LIN Shunfu. Voltage Optimization Control of Distribution Networks Considering Inter-Regional Auxiliary Rewards[J]. ELECTRIC POWER CONSTRUCTION, 2024, 45(5): 80-93.
[1] | 陶鹏, 张冰玉, 韩桂楠, 等. 计及源荷双侧风险管理的光储微网两阶段低碳运行优化研究[J]. 智慧电力, 2023, 51(11): 1-6. |
TAO Peng, ZHANG Bingyu, HAN Guinan, et al. Two-stage low carbon operation optimization of photovoltaic storage microgrid considering risk management of both source and load sides[J]. Smart Power, 2023, 51(11): 1-6. | |
[2] | 李长宇, 唐文秀. 基于数据驱动的多微电网互联系统分布鲁棒运行优化[J]. 智慧电力, 2022, 50(5): 1-8. |
LI Changyu, TANG Wenxiu. Distributed robust operation optimization of multi-microgrid interconnection system based on data driven[J]. Smart Power, 2022, 50(5): 1-8. | |
[3] | 虎智峰, 陈静, 张婧菲, 等. 考虑新能源不确定性边界的主动配电网优化调度[J]. 智慧电力, 2022, 50(11): 48-55. |
HU Zhifeng, CHEN Jing, ZHANG Jingfei, et al. Optimal dispatch of active distribution network considering uncertainty boundary of renewable power generation[J]. Smart Power, 2022, 50(11): 48-55. | |
[4] | 曹宏基, 刘道兵, 李世春, 等. 计及UPFC的主动配电网日前-实时优化策略[J]. 智慧电力, 2022, 50(7): 65-73. |
CAO Hongji, LIU Daobing, LI Shichun, et al. Day ahead real-time optimization strategy of active distribution network considering UPFC[J]. Smart Power, 2022, 50(7): 65-73. | |
[5] | 郭清元, 吴杰康, 莫超, 等. 基于混合整数二阶锥规划的新能源配电网电压无功协同优化模型[J]. 中国电机工程学报, 2018, 38(5): 1385-1396. |
GUO Qingyuan, WU Jiekang, MO Chao, et al. A model for multi-objective coordination optimization of voltage and reactive power in distribution networks based on mixed integer second-order cone programming[J]. Proceedings of the CSEE, 2018, 38(5): 1385-1396. | |
[6] | KIM Y J, KIRTLEY J L, NORFORD L K. Reactive power ancillary service of synchronous DGs in coordination with voltage control devices[J]. IEEE Transactions on Smart Grid, 2017, 8(2): 515-527. |
[7] | 高锋阳, 乔垚, 杜强, 等. 考虑光伏出力相关性的配电网动态无功优化[J]. 太阳能学报, 2018, 39(1): 101-109. |
GAO Fengyang, QIAO Yao, DU Qiang, et al. Dynamic reactive power optimization of distribution network considering correlated photovoltaic power output[J]. Acta Energiae Solaris Sinica, 2018, 39(1): 101-109. | |
[8] | 蒋平, 梁乐. 基于内点法和遗传算法相结合的交直流系统无功优化[J]. 高电压技术, 2015, 41(3): 724-729. |
JIANG Ping, LIANG Le. Reactive power optimization of hybrid AC/HVDC power system combining interior point algorithm and genetic algorithm[J]. High Voltage Engineering, 2015, 41(3): 724-729. | |
[9] |
PRASAD S, VINOD KUMAR D M. Trade-offs in PMU and IED deployment for active distribution state estimation using multi-objective evolutionary algorithm[J]. IEEE Transactions on Instrumentation and Measurement, 2018, 67(6): 1298-1307.
doi: 10.1109/TIM.2018.2792890 URL |
[10] | 陈瑞捷, 鲁宗相, 乔颖. 基于多场景模糊集和改进二阶锥方法的配电网优化调度[J]. 电网技术, 2021, 45(12): 4621-4629. |
CHEN Ruijie, LU Zongxiang, QIAO Ying. Optimal dispatch based on multi-scene ambiguity set and modified second-order cone algorithm for distribution network[J]. Power System Technology, 2021, 45(12): 4621-4629. | |
[11] |
GUO Y F, ZHANG Q Z, WANG Z Y. Cooperative peak shaving and voltage regulation in unbalanced distribution feeders[J]. IEEE Transactions on Power Systems, 2021, 36(6): 5235-5244.
doi: 10.1109/TPWRS.2021.3069781 URL |
[12] | 孙峰洲, 刘海涛, 陈庆, 等. 考虑新能源波动区间的交直流配电网下垂斜率鲁棒优化方法[J]. 电力系统自动化, 2020, 44(14): 62-70. |
SUN Fengzhou, LIU Haitao, CHEN Qing, et al. Robust optimization method for droop slopes in AC/DC distribution network considering fluctuation interval of renewable energy source[J]. Automation of Electric Power Systems, 2020, 44(14): 62-70. | |
[13] |
DUAN J J, SHI D, DIAO R S, et al. Deep-reinforcement-learning-based autonomous voltage control for power grid operations[J]. IEEE Transactions on Power Systems, 2020, 35(1): 814-817.
doi: 10.1109/TPWRS.59 URL |
[14] | 范士雄, 李立新, 王松岩, 等. 人工智能技术在电网调控中的应用研究[J]. 电网技术, 2020, 44(2): 401-411. |
FAN Shixiong, LI Lixin, WANG Songyan, et al. Application analysis and exploration of artificial intelligence technology in power grid dispatch and control[J]. Power System Technology, 2020, 44(2): 401-411. | |
[15] | 张沛, 朱驻军, 谢桦. 基于深度强化学习近端策略优化的电网无功优化方法[J]. 电网技术, 2023, 47(2): 562-572. |
ZAHNG Pei, ZHU Zhujun, XIE Hua. Reactive power optimization based on proximal policy optimization of deep reinforcement learning[J]. Power System Technology, 2023, 47(2): 562-572.
doi: 10.52783/pst.226 URL |
|
[16] | 李琦, 乔颖, 张宇精. 配电网持续无功优化的深度强化学习方法[J]. 电网技术, 2020, 44(4): 1473-1480. |
LI Qi, QIAO Ying, ZHANG Yujing. Continuous reactive power optimization of distribution network using deep reinforcement learning[J]. Power System Technology, 2020, 44(4): 1473-1480. | |
[17] | 李鹏, 姜磊, 王加浩, 等. 基于深度强化学习的新能源配电网双时间尺度无功电压优化[J]. 中国电机工程学报, 2023, 43(16): 6255-6265. |
LI Peng, JIANG Lei, WANG Jiahao, et al. Optimization of dual-time scale reactive voltage for distribution network with renewable energy based on deep reinforcement learning[J]. Proceedings of the CSEE, 2023, 43(16): 6255-6265. | |
[18] | 习伟, 李鹏, 李鹏,, 等. 基于深度强化学习的分布式电源就地自适应电压控制方法[J]. 电力系统自动化, 2022, 46(22): 25-31. |
XI Wei, LI Peng, LI Peng,, et al. Adaptive local voltage control method for distributed generator based on deep reinforcement learning[J]. Automation of Electric Power Systems, 2022, 46(22): 25-31. | |
[19] |
CAO D, HU W H, ZHAO J B, et al. A multi-agent deep reinforcement learning based voltage regulation using coordinated PV inverters[J]. IEEE Transactions on Power Systems, 2020, 35(5): 4120-4123.
doi: 10.1109/TPWRS.59 URL |
[20] | 胡丹尔, 彭勇刚, 韦巍, 等. 多时间尺度的配电网深度强化学习无功优化策略[J]. 中国电机工程学报, 2022, 42(14): 5034-5045. |
HU Daner, PENG Yonggang, WEI Wei, et al. Multi-timescale deep reinforcement learning for reactive power optimization of distribution network[J]. Proceedings of the CSEE, 2022, 42(14): 5034-5045. | |
[21] |
WU Z, LI Y Q, GU W, et al. Multi-timescale voltage control for distribution system based on multi-agent deep reinforcement learning[J]. International Journal of Electrical Power & Energy Systems, 2023, 147: 108830.
doi: 10.1016/j.ijepes.2022.108830 URL |
[22] | 王成山, 季节, 冀浩然, 等. 配电系统智能软开关技术及应用[J]. 电力系统自动化, 2022, 46(4): 1-14. |
WANG Chengshan, JI Jie, JI Haoran, et al. Technologies and application of soft open points in distribution networks[J]. Automation of Electric Power Systems, 2022, 46(4): 1-14. | |
[23] | LOWE R, WU Y, TAMAR A, et al. Multi-agent actor-critic for mixed cooperative-competitive environments[C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. ACM, 2017: 6382-6393. |
[24] | RUDER S. An overview of gradient descent optimization algorithms[EB/OL]. [2023-08-05]. http://arxiv.org/abs/1609.04747. |
[25] | LILLICRAP T P, HUNT J J, PRITZEL A, et al. Continuous control with deep reinforcement learning[EB/OL]. [2023-08-05]. http://arxiv.org/abs/1509.02971. |
[26] | 全欢, 彭显刚, 刘涵予, 等. 基于深度强化学习的配电网实时电压优化控制方法[J]. 电网技术, 2023, 47(5): 2029-2039. |
QUAN Huan, PENG Xiangang, LIU Hanyu, et al. Voltage optimal control of distribution network based on deep reinforcement learning[J]. Power System Technology, 2023, 47(5): 2029-2039. | |
[27] | JANG E, GU S X, POOLE B. Categorical reparameterization with gumbel-softmax[EB/OL]. [2023-08-05]. http://arxiv.org/abs/1611.01144. |
[28] |
WANG S Y, DUAN J J, SHI D, et al. A data-driven multi-agent autonomous voltage control framework using deep reinforcement learning[J]. IEEE Transactions on Power Systems, 2020, 35(6): 4644-4654.
doi: 10.1109/TPWRS.59 URL |
[29] |
HUO Y D, LI P, JI H R, et al. Data-driven adaptive operation of soft open points in active distribution networks[J]. IEEE Transactions on Industrial Informatics, 2021, 17(12): 8230-8242.
doi: 10.1109/TII.2021.3064370 URL |
[30] | Elia Group. Transparency on grid data[DB/OL]. (2014-06-03) [2023-08-05]. https://www.elia.be/en/grid-data/power-generation/solar-pv-power-generation-data. |
[31] | 张玉莹, 曾博, 周吟雨, 等. 碳减排驱动下的数据中心与配电网交互式集成规划研究[J]. 电工技术学报, 2023, 38(23): 6433-6450. |
ZHANG Yuying, ZENG Bo, ZHOU Yinyu, et al. Research on interactive integration planning of data centers and distribution network driven by carbon emission reduction[J]. Transactions of China Electrotechnical Society, 2023, 38(23): 6433-6450. |
[1] | 李扬, 马文捷, 卜凡金, 杨震, 王彬, 韩猛. 多智能体深度强化学习驱动的跨园区能源交互优化调度[J]. 电力建设, 2024, 45(5): 59-70. |
[2] | 范辉, 罗蓬, 王弘利, 梁纪峰, 李乾, 杨军, 吴赋章. 基于社交网上演化博弈的光伏台区用户需求响应特性研究[J]. 电力建设, 2022, 43(8): 150-158. |
[3] | 边辉, 陈丽娜, 马凡琳, 张新, 姜金朋. 基于中枢解耦与演化博弈的多农业园区综合能源系统优化运行[J]. 电力建设, 2022, 43(2): 26-36. |
[4] | 赵洱岽, 王浩, 林弘杨. 现货市场中基于演化博弈的火电企业阶梯报价策略[J]. 电力建设, 2020, 41(8): 68-71. |
[5] | 耿世平, 牛东晓, 郭晓鹏, 陈梦, 谭彩霞. 计及多能源柔性负荷调度的微能源网多目标演化博弈[J]. 电力建设, 2020, 41(11): 101-115. |
[6] | 刘敦楠, 汪伟业, 李鹏飞, 李根柱, 范晨凯, 韩金山. 绿电消纳责任权重下火电厂商市场行为的演化博弈[J]. 电力建设, 2020, 41(10): 20-29. |
[7] | 伍文城 余波 周德才 王彦峰. ±500kV等级3200MW直流运行电压优化[J]. 电力建设, 2011, 32(6): 39-45. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||
版权所有 © 2020 《电力建设》编辑部
地址:北京市昌平区北七家未来科技城北区国家电网公司办公区 邮编:102209 电话:010-66602697
京ICP备18017181号-1 国网安备4511A3CPZ号
本系统由北京玛格泰克科技发展有限公司设计开发