Mixed strategy Nash equilibrium analysis in real-time pricing and demand response for future smart retail market

Ze Hu, Ziqing Zhu*, Xiang Wei, Ka Wing Chan, Siqi Bu

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

1 Citation (Scopus)

Abstract

Real-time pricing and demand response (RTP-DR) is a key problem for profit-maximizing and policy-making in the deregulated retail electricity market (REM). However, previous studies overlooked the non-convexity and multi-equilibria caused by the network constraints and the temporally-related non-linear power consumption characteristics of end-users (EUs) in a privacy-protected environment. This paper employs mixed strategy Nash equilibrium (MSNE) to analyze the multiple equilibria in the non-convex game of the RTP-DR problem, providing a comprehensive view of the potential transaction results. A novel multi-agent Q-learning algorithm is developed to estimate subgame perfect equilibrium (SPE) in the proposed game. As a multi-agent reinforcement learning (MARL) algorithm, it enables players in the game to be rational “agents” that learn from “trial and error” to make optimal decisions across time periods. Moreover, the proposed algorithm has a bi-level structure and adopts probability distributions to denote Q-values, representing the belief in environmental response. Through validation on a Northern Illinois utility dataset, our proposed approach demonstrates notable advantages over benchmark algorithms. Specifically, it provides more profitable pricing decisions for monopoly retailers in REM, leading to strategic outcomes for EUs. The numerical results also find that multiple optimal pricing decisions over a day exist simultaneously by providing almost identical profits to the retailer, while leading to different energy consumption patterns and also significant differences in total energy usage on the demand side.

Original languageEnglish
Article number125815
Number of pages11
JournalApplied Energy
Volume391
Early online date14 Apr 2025
DOIs
Publication statusE-pub ahead of print - 14 Apr 2025

User-Defined Keywords

  • Demand response
  • Mixed strategy Nash equilibrium
  • Real-time pricing
  • Reinforcement learning
  • Stackelberg game

Fingerprint

Dive into the research topics of 'Mixed strategy Nash equilibrium analysis in real-time pricing and demand response for future smart retail market'. Together they form a unique fingerprint.

Cite this