Deep Reinforcement Learning-Based Real-Time Trading Decision Support for Virtual Power Plants with Intelligent Assistant Integration

Abstract

With the profound transformation of the energy structure and the advancement of the “dual carbon” goal, the virtual power plant (VPP), centered on distributed energy resources, has emerged as a key technology for enhancing the flexibility and integration capacity of modern power grids. However, the diversity and volatility of VPP internal resources, coupled with the complexity of the electricity market, impose significant challenges on the response speed and economic efficiency of real-time trading decisions.To address these challenges, this paper proposes and develops a real-time trading decision support system for VPPs driven by an Intelligent Assistant (IA). The system leverages hydropower plants—with their fast response and energy storage capabilities—as the core regulating resources and coordinates multiple distributed energy sources, including photovoltaics, wind power, and energy storage systems. At its core, the IA integrates deep learning–based forecasting models, reinforcement learning–based decision modules, and a natural language processing (NLP)–based interaction component. The IA assists operators in real time by analyzing multidimensional data such as market prices, grid loads, meteorological information, and hydropower inflows, accurately predicting generation and price trends, and dynamically optimizing bidding and regulation strategies through reinforcement learning algorithms to maximize overall benefits.This paper details the overall architecture and key technological components of the proposed system and conducts a simulation case study using a regional VPP containing multiple hydropower plants. Specifically, the core decision-making module employs the Soft Actor-Critic (SAC) deep reinforcement learning algorithm. The system was trained for 2 million steps over 72 hours on a V100 GPU, utilizing one year of real historical operational and market data from a VPP in Southwest China. The simulation results demonstrate that the proposed IA-DRL strategy outperforms a traditional rolling-horizon Mixed-Integer Linear Programming (MILP) method, achieving a 14.7% increase in net profit, a 47.8% reduction in deviation assessment costs, and a remarkable 99.7% acceleration in decision-making time. These results confirm the significant technical and economic advantages of the proposed framework, providing a new theoretical foundation and practical solution for intelligent VPP operation and business model innovation, while also offering enhanced interpretability through the intelligent assistant.

References

Zare, A., & Shafiyi, M. A. (2025). Virtual power plant models and market participation: A deep dive into optimization and real-world applications. Results in Engineering, 26, 105548. https://doi.org/10.1016/j.rineng.2025.105548

Abdelkader, S., Amissah, J., & Abdel-Rahim, O. (2024). Virtual power plants: an in-depth analysis of their advancements and importance as crucial players in modern power systems. Energy, Sustainability and Society, 14(1), 52. https://doi.org/10.1186/s13705-024-00483-y

Ruan, G., Qiu, D., Sivaranjani, S., Awad, A. S. A., & Strbac, G. (2024). Data-driven energy management of virtual power plants: A review. Advances in Applied Energy, 14, 100170. https://doi.org/10.1016/j.adapen.2024.100170

Al-Shetwi, A. Q., Al-Shaalan, A. M., El-Ela, A. A., & El-Sehiemy, R. A. (2023). Adaptive power management strategy for microgrids considering virtual power plants. Sustainable Energy Technologies and Assessments, 59, 103328. https://doi.org/10.1016/j.seta.2023.103328

Juma, S. A., Ayeng’o, S. P., & Kimambo, C. Z. M. (2024). A review of control strategies for optimized microgrid operations. IET Renewable Power Generation, 18(14), 2785–2818. https://doi.org/10.1049/rpg2.13056

Alam, M. M., Hossain, M. J., Habib, M. A., Arafat, M. Y., & Hannan, M. A. (2025). Artificial intelligence integrated grid systems: Technologies, potential frameworks, challenges, and research directions. Renewable and Sustainable Energy Reviews, 211, 115251. https://doi.org/10.1016/j.rser.2024.115251

Kumar, A., Maulik, A., & Chinmaya, K. A. (2025). Energy Management Strategies for Active Distribution Networks and Microgrids – A Comprehensive Survey. IETE Technical Review, 1-20. https://doi.org/10.1080/02564602.2025.2522083

Alsaigh, R., Mehmood, R., & Katib, I. (2023). AI explainability and governance in smart energy systems: A review. Frontiers in Energy Research, 11. https://doi.org/10.3389/fenrg.2023.1071291

Li, Y., Chang, W., & Yang, Q. (2025). Deep reinforcement learning based hierarchical energy management for virtual power plant with aggregated multiple heterogeneous microgrids. Applied Energy, 382, 125333.https://doi.org/10.1016/j.apenergy.2025.125333

Gong, X., Li, X., & Zhong, Z. (2025). Strategic bidding of virtual power plants in integrated electricity-carbon-green certificate market with renewable energy uncertainties. Sustainable Cities and Society, 121, 106176.https://doi.org/10.1016/j.scs.2025.106176

Qiu, D., Wang, Y., Hua, W., & Strbac, G. (2023). Reinforcement learning for electric vehicle applications in power systems: A critical review. Renewable and Sustainable Energy Reviews, 173, 113052. https://doi.org/10.1016/j.rser.2022.113052

Xu, Y., Liao, Y., Kuang, S., Ma, J., & Wen, T. (2025). Virtual Power Plant Optimization Process Under the Electricity–Carbon–Certificate Multi-Market: A Case Study in Southern China. Processes, 13(7), 2148.https://doi.org/10.3390/pr13072148

Wang, S., Sheng, W., Shang, Y., & Liu, K. (2024). Distribution network voltage control considering virtual power plants cooperative optimization with transactive energy. Applied Energy, 371, 123680. https://doi.org/10.1016/j.apenergy.2024.123680

Mahmood, M., Chowdhury, P., Yeassin, R., Hasan, M., Ahmad, T., & Chowdhury, N. U. R. (2024). Impacts of digitalization on smart grids, renewable energy, and demand response: An updated review of current applications. Energy Conversion and Management: X, 24, 100790.https://doi.org/10.1016/j.ecmx.2024.100790

Sun, Z., & Lu, T. (2024). Collaborative operation optimization of distribution system and virtual power plants using multi‐agent deep reinforcement learning with parameter‐sharing mechanism. IET Generation, Transmission & Distribution, 18(1), 39-49.https://doi.org/10.1049/gtd2.13037

Tang, X., & Wang, J. (2025). Deep Reinforcement Learning-Based Multi-Objective Optimization for Virtual Power Plants and Smart Grids: Maximizing Renewable Energy Integration and Grid Efficiency. Processes, 13(6), 1809.https://doi.org/10.3390/pr13061809

Feng, B., Liu, Z., Huang, G., & Guo, C. (2023). Robust federated deep reinforcement learning for optimal control in multiple virtual power plants with electric vehicles. Applied Energy, 349, 121615. https://doi.org/10.1016/j.apenergy.2023.121615

Li, G., Zhang, R., Bu, S., Zhang, J., & Gao, J. (2024). Probabilistic prediction-based multi-objective optimization approach for multi-energy virtual power plant. International Journal of Electrical Power & Energy Systems, 161, 110200.https://doi.org/10.1016/j.ijepes.2024.110200

Yan, C., & Qiu, Z. (2025). Review of Power Market Optimization Strategies Based on Industrial Load Flexibility. Energies, 18(7), 1569. https://doi.org/10.3390/en18071569

Boulkroune, A., Hamel, S., Zouari, F., Boukabou, A., & Ibeas, A. (2017). Output‐Feedback Controller Based Projective Lag‐Synchronization of Uncertain Chaotic Systems in the Presence of Input Nonlinearities. Mathematical Problems in Engineering, 2017(1), 8045803.https://doi.org/10.1155/2017/8045803

Boulkroune, A., Zouari, F., & Boubellouta, A. (2025). Adaptive fuzzy control for practical fixed-time synchronization of fractional-order chaotic systems. Journal of Vibration and Control, 10775463251320258. https://doi.org/10.1177/10775463251320258

Zouari, F., Saad, K. B., & Benrejeb, M. (2013, March). Adaptive backstepping control for a class of uncertain single input single output nonlinear systems. In 10th International Multi-Conferences on Systems, Signals & Devices 2013 (SSD13) (pp. 1-6).IEEE.https://doi.org/10.1109/SSD.2013.6564134

Rigatos, G., Abbaszadeh, M., Sari, B., Siano, P., Cuccurullo, G., & Zouari, F. (2023). Nonlinear optimal control for a gas compressor driven by an induction motor. Results in Control and Optimization, 11, 100226.https://doi.org/10.1016/j.rico.2023.100226

Merazka, L., Zouari, F., & Boulkroune, A. (2017, May). High-gain observer-based adaptive fuzzy control for a class of multivariable nonlinear systems. In 2017 6th International Conference on Systems and Control (ICSC) (pp. 96-102). IEEE.https://doi.org/ 10.1109/ICoSC.2017.7958728

Baur, L., Ditschuneit, K., Schambach, M., Kaymakci, C., Wollmann, T., & Sauer, A. (2024). Explainability and Interpretability in Electric Load Forecasting Using Machine Learning Techniques – A Review. Energy and AI, 16, 100358. https://doi.org/10.1016/j.egyai.2024.100358

Authors

  • Shaohua Zhao
  • Cong Zhang
  • Fuming Liu
  • Yuqian Tian
  • Jifan Ouyang
  • Zhenkai Hu CGS POWER GENERATION(GUANGDONG)ENERGY STORAGE TECHNOLOGY CO.,LTD, Guangzhou,Guangdong, 510630, China

DOI:

https://doi.org/10.31449/inf.v49i35.12599

Downloads

Published

12/16/2025

How to Cite

Zhao, S., Zhang, C., Liu, F., Tian, Y., Ouyang, J., & Hu, Z. (2025). Deep Reinforcement Learning-Based Real-Time Trading Decision Support for Virtual Power Plants with Intelligent Assistant Integration. Informatica, 49(35). https://doi.org/10.31449/inf.v49i35.12599