Skip to main content
Log in

Demand Response Management in Smart Grid Networks: a Two-Stage Game-Theoretic Learning-Based Approach

  • Published:
Mobile Networks and Applications Aims and scope Submit manuscript

Abstract

In this paper, the combined problem of power company selection and demand response management (DRM) in a smart grid network consisting of multiple power companies and multiple customers is studied via adopting a reinforcement learning and game-theoretic technique. Each power company is characterized by its reputation and competitiveness. The customers, acting as learning automata select the most appropriate power company to be served, in terms of price and electricity needs’ fulfillment, via a reinforcement learning based mechanism. Given customers’ power company selection, the DRM problem is formulated as a two-stage game-theoretic optimization framework. At the first stage the optimal customers’ electricity consumption is determined and at the second stage the optimal power companies’ pricing is obtained. The output of the DRM problem feeds the learning system to build knowledge and to conclude to the optimal power company selection. To realize the aforementioned framework a two-stage Power Company learning selection and Demand Response Management (PC-DRM) iterative algorithm is introduced. The performance evaluation of the proposed approach is achieved via modeling and simulation and its superiority against other approaches is illustrated.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Swift J, Carey JD, O'Connor DL (2002) 2000 market monitor: electric industry restructuring. Division of Energy resources, Office of Consumer Affairs and Business Regulation, Cambridge, pp 1–6

    Google Scholar 

  2. Nelson DL, Anderson KW, Marquez BM (2015) Report to the 84th Texas legislature: scope of com-petition in Electric Markets in Texas. Public Utility Commission of Texas, Austin, pp 1–142

    Google Scholar 

  3. Mohsenian-Rad H, Wong VWS, Jatskevich J, Schober R, Leon-Garcia A (2010) Autonomous Demand-Side Management Based on Game-Theoretic Energy Consumption Scheduling for the Future Smart Grid. IEEE Trans on Smart Grid 1(3):320–331

    Article  Google Scholar 

  4. Li N, Chen L, Dahleh MA (2015) Demand response using linear supply function bidding. IEEE Transactions on Smart Grid 6(4):1827–1838

    Article  Google Scholar 

  5. Chai J, Chen Z, Yang Z (2014) Demand response management with multiple utility companies: a two-level game approach. IEEE Trans on Smart Grid 5(2):722–731

    Article  Google Scholar 

  6. Palensky P, Dietrich D (2011) Demand Side Management: Demand Response, Intelligent Energy Systems, and Smart Loads. IEEE Transactions on Industrial Informatics 7(3):381–388

    Article  Google Scholar 

  7. Ibars MN, Giupponi L (2010) “Distributed demand Management in Smart Grid with a congestion game,” 2010 first IEEE international conference on smart grid communications. MD, Gaithersburg, pp 495–500

    Book  Google Scholar 

  8. Chen SK, Snyder LV (2011) An innovative RTP-based residential power scheduling scheme for smart grids. 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, pp. 5956–5959

  9. Erkoc M, Al-Ahmadi E, Celik N, Saad W (2015) A game theoretic approach for load-shifting in the smart grid. 2015 IEEE International Conference on Smart Grid Communications, Miami, pp 187–192

    Google Scholar 

  10. N. Yaagoubi and H. T. Mouftah (2013) A Comfort Based Game Theoretic Approach for Load Management in the Smart Grid. 2013 IEEE Green Technologies Conference, Denver, pp. 35–41

  11. Yang P, Tang G, Nehorai A (2013) A game-theoretic approach for optimal time-of-use electricity pricing. IEEE Trans on Power Systems 28(2):884–892

    Article  Google Scholar 

  12. Shi W, Li N, Xie X, Chu CC, Gadh R (2014) Optimal residential demand response in distribution networks. IEEE J on Selected Areas in Comm 32(7):1441–1450

    Article  Google Scholar 

  13. Xu Y, Li N, Low S (2016) Demand response with capacity constrained supply function bidding. 2016 IEEE Power and Energy Society General Meeting (PESGM), Boston, pp 1–1

    Google Scholar 

  14. Yu M, Hong SH (2016) A Real-Time Demand-Response Algorithm for Smart Grids: A Stackelberg Game Approach. IEEE Transactions on Smart Grid 7(2):879–888

    Google Scholar 

  15. Maharjan S, Zhu Q, Zhang Y, Gjessing S, Basar T (2013) Dependable Demand Response Management in the Smart Grid: A Stackelberg Game Approach. IEEE Trans on Smart Grid 4(1):120–132

    Article  Google Scholar 

  16. Alshehri K, Liu J, Chen X, Başar T (2015) A Stackelberg game for multi-period demand response management in the smart grid. 54th IEEE Conf on Decision and Control (CDC), pp. 5889–5894

  17. Popov I, Krylatov A, Zakharov V, Ivanov D (2017) Competitive energy consumption under transmission constraints in a multi-supplier power grid system. Int J Syst Sci 48(5):994–1001

    Article  MathSciNet  MATH  Google Scholar 

  18. Vamvakas P, Tsiropoulou EE, Papavassiliou S (2018) Dynamic provider selection & power resource Management in Competitive Wireless Communication Markets. Mobile Networks and Applications, Springer 23(1):86–99

    Article  Google Scholar 

  19. Narendra KS, Thathachar MA (1974) Learning automata-a survey. IEEE Trans on systems, man, and cybernetics, 4, pp. 323–334, SMC-4

  20. Tsiropoulou EE, Vamvakas P, Katsinis G, Papavassiliou S (2015) Combined power and rate allocation in self- optimized multi-service two-tier femtocell networks. Computer Communications, Elsevier 72:38–48

    Article  Google Scholar 

  21. Yates RD (1995) A framework for uplink power control in cellular radio systems. IEEE J on Sel Areas of Comm 13:1341–1347

    Article  Google Scholar 

  22. Saraydar CU, Mandayam NB, Goodman DJ (2002) Efficient power control via pricing in wireless data networks. IEEE Trans on Communications 50:291–303

    Article  Google Scholar 

  23. Tsiropoulou EE, Vamvakas P, Papavassiliou S (2017) Joint Customized Price and Power Control for Energy-Efficient Multi-Service Wireless Networks via S-Modular Theory. IEEE Transactions on Green Communications and Networking 1(1):17–28

    Article  Google Scholar 

Download references

Acknowledgements

The research of Dr. Eirini Eleni Tsiropoulou was conducted as part of the UNM Research Allocation Committee award and the UNM Women in STEM Faculty Development Fund.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Symeon Papavassiliou.

Appendices

Appendix 1

1.1 Proof of Theorem 1

Towards determining customer’s best response strategy \( B{R}_i\left({\mathrm{e}}_{-\mathrm{i}}^{\left(\mathrm{t}\right)}\right)={e}_{i,j}^{(t)\ast } \), the first and the second order derivatives of customer’s utility function \( {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right) \)with respect to \( {e}_{i,j}^{(t)} \) are used.

$$ \frac{\partial {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right)}{\partial {e}_{i,j}^{(t)}}=\frac{1}{E_{-i}^{(t)}}\cdot \left[{s_i}^{\prime}\left({r}_i^{(t)}\right)-{a}_i^{(t)}\cdot {p}_j^{(t)}\right] $$
(14)
$$ \frac{\partial^2{U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right)}{\partial {e_{i,j}^{(t)}}^2}=\frac{1}{{E_{-i}^{(t)}}^2}\cdot {s_i}^{\prime^{\prime }}\left({r}_i^{(t)}\right) $$
(15)

As stated in Section 2.2, customer’s satisfaction function \( {s}_i\left({r}_i^{(t)}\right) \) is an increasing concave function with respect to \( {r}_i^{(t)} \), thus \( {s_i}^{\prime^{\prime }}\left({r}_i^{(t)}\right)<0 \) and \( \frac{\partial^2{U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right)}{\partial {e_{i,j}^{(t)}}^2}<0 \). We set \( \tau =\underset{r_i^{(t)}\to \infty }{\lim }{s^{\prime}}^{-1}\left({r}_i^{(t)}\right) \). Since \( {s_i}^{\prime}\left({r}_i^{(t)}\right) \) is a strictly decreasing function (due to \( {s_i}^{\prime^{\prime }}\left({r}_i^{(t)}\right)<0 \)) and as \( {s_i}^{\prime}\left({r}_i^{(t)}\right)>0 \), we know that \( \tau <{s_i}^{\prime}\left({r}_i^{(t)}\right)\le {s_i}^{\prime }(0) \) and 0 ≤ τ < si(0). Hence, for \( 0\le {a}_i^{(t)}\cdot {p}_j^{(t)}\le \tau \), we have \( \frac{\partial {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right)}{\partial {e}_{i,j}^{(t)}}>0 \) and thus \( {U}_i^{(t)} \) is an increasing function of \( {e}_{i,j}^{(t)} \). In this case, the best response strategy for customer is to demand her maximum electricity consumption, i.e., \( {e}_{i,j,\max}^{(t)} \). So, for \( 0\le {a}_i^{(t)}\cdot {p}_j^{(t)}\le \tau \), we have \( B{R}_i\left({\mathrm{e}}_{-\mathrm{i}}^{\left(\mathrm{t}\right)}\right)={e}_{i,j,\max}^{(t)} \) for all . For \( \tau <{a}_i^{(t)}\cdot {p}_j^{(t)}\le {s_i}^{\prime }(0) \), the equation \( \frac{\partial {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right)}{\partial {e}_{i,j}^{(t)}}=0 \) is equivalent to \( {s_i}^{\prime}\left({r}_i^{(t)}\right)={a}_i^{(t)}\cdot {p}_j^{(t)}\iff \) . Note that as \( {s_i}^{\prime}\left({r}_i^{(t)}\right) \) is a strictly decreasing function, its inverse (i.e., \( {s_i^{\prime}}^{-1} \)) exists, and that \( {\widehat{r}}_i^{(t)} \) is a decreasing function of \( {a}_i^{(t)}\cdot {p}_j^{(t)} \). Since \( {s_i}^{\prime^{\prime }}\left({r}_i^{(t)}\right)<0 \) for all \( {r}_i^{(t)} \) and hence \( \frac{\partial^2{U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right)}{\partial {e_{i,j}^{(t)}}^2}<0 \), the roots of \( \frac{\partial {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right)}{\partial {e}_{i,j}^{(t)}}=0 \) maximize \( {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right) \) for given electricity consumption of the rest of the customers, i.e., \( {E}_{-i}^{(t)} \). An one-to-one relation exists between \( {r}_i^{(t)} \) and \( {e}_{i,j}^{(t)} \), and thus the best response electricity consumption in response to \( {\mathrm{e}}_{-\mathrm{i}}^{\left(\mathrm{t}\right)} \) that maximizes \( {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right) \) is also unique and is equal to \( {e}_{i,j}^{(t)\ast }={E}_{-i}^{(t)}\cdot {\widehat{r}}_i^{(t)}={E}_{-i}^{(t)}\cdot {s_i^{\prime}}^{-1}\left({a}_i^{(t)}\cdot {p}_j^{(t)}\right) \). If \( {e}_{i,j}^{(t)\ast }>{e}_{i,j,\max}^{(t)} \) customer does not request for \( {e}_{i,j}^{(t)\ast } \). In this case, since \( {e}_{i,j}^{(t)\ast } \) is the unique maximizer of \( {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right) \), then \( {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right) \) is an increasing function of \( {e}_{i,j}^{(t)} \) in \( {e}_{i,j}^{(t)}\le {e}_{i,j,\max}^{(t)}\le {e}_{i,j}^{(t)\ast } \) for fixed \( {\mathrm{e}}_{-\mathrm{i}}^{\left(\mathrm{t}\right)} \). Therefore, the best response to \( {\mathrm{e}}_{-\mathrm{i}}^{\left(\mathrm{t}\right)} \) is the maximum value of customer’s electricity consumption, i.e., \( B{R}_i\left({\mathrm{e}}_{-\mathrm{i}}^{\left(\mathrm{t}\right)}\right)={e}_{i,j,\max}^{(t)} \). This implies that for \( \tau <{a}_i^{(t)}\cdot {p}_j^{(t)}\le {s_i}^{\prime }(0) \). For \( {a}_i^{(t)}\cdot {p}_j^{(t)}>{s_i}^{\prime }(0) \), we have \( \frac{\partial {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right)}{\partial {e}_{i,j}^{(t)}}<0 \), thus \( {U}_i^{(t)}\left({\mathrm{e}}^{\left(\mathrm{t}\right)}\right) \) is a decreasing function of \( {e}_{i,j}^{(t)} \). In this case, the imposed price by the companies is extremely high for customers to afford it, thus .

Appendix 2

1.1 Proof of Theorem 3

Given customers’ electricity consumption, the welfare function of each utility company is written as follows.

(16)

Considering the first order derivative of \( {W}_j^{(t)} \) with respect to \( {p}_j^{(t)} \), we have:

(17)

Thus, the critical points of \( {W}_j^{(t)}\left({\mathrm{p}}^{\left(\mathrm{t}\right)}\right) \) are as follows.

(18)

The second order derivative of \( {W}_j^{(t)}\left({\mathrm{p}}^{\left(\mathrm{t}\right)}\right) \) is as follows.

(19)

As observed via (19), we have \( \frac{\partial^2{W}_j^{(t)}\left({\mathrm{p}}^{\left(\mathrm{t}\right)}\right)}{\partial {p}_j^{(t)2}}<0 \), thus \( {p}_j^{(t)\ast } \) as determined in Eq. (18) maximizes utility company’s welfare.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Apostolopoulos, P.A., Tsiropoulou, E.E. & Papavassiliou, S. Demand Response Management in Smart Grid Networks: a Two-Stage Game-Theoretic Learning-Based Approach. Mobile Netw Appl 26, 548–561 (2021). https://doi.org/10.1007/s11036-018-1124-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11036-018-1124-x

Keywords

Navigation