Next Article in Journal
The Hawaiian Islands: Conceptualizing an Industrial Ecology Holarchic System
Next Article in Special Issue
Significant Implication of Optimal Capacitor Placement and Sizing for a Sustainable Electrical Operation in a Building
Previous Article in Journal
Genetic Algorithm for Embodied Energy Optimisation of Steel-Concrete Composite Beams
Previous Article in Special Issue
Energy Efficiency and GHG Emissions Mapping of Buildings for Decision-Making Processes against Climate Change at the Local Level
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Deep Learning Approach to Forecasting Monthly Demand for Residential–Sector Electricity

Department of Architectural Engineering, Chung-Ang University, Seoul 06974, Korea
*
Author to whom correspondence should be addressed.
Sustainability 2020, 12(8), 3103; https://doi.org/10.3390/su12083103
Submission received: 18 March 2020 / Revised: 31 March 2020 / Accepted: 3 April 2020 / Published: 13 April 2020
(This article belongs to the Special Issue Energy Efficiency and Sustainability in Buildings)

Abstract

:
Forecasting electricity demand at the regional or national level is a key procedural element of power-system planning. However, achieving such objectives in the residential sector, the primary driver of peak demand, is challenging given this sector’s pattern of constantly fluctuating and gradually increasing energy usage. Although deep learning algorithms have recently yielded promising results in various time series analyses, their potential applicability to forecasting monthly residential electricity demand has yet to be fully explored. As such, this study proposed a forecasting model with social and weather-related variables by introducing long short-term memory (LSTM), which has been known to be powerful among deep learning-based approaches for time series forecasting. The validation of the proposed model was performed using a set of data spanning 22 years in South Korea. The resulting forecasting performance was evaluated on the basis of six performance measures. Further, this model’s performance was subjected to a comparison with the performance of four benchmark models. The performance of the proposed model was exceptional according to all of the measures employed. This model can facilitate improved decision-making regarding power-system planning by accurately forecasting the electricity demands of the residential sector, thereby contributing to the efficient production and use of resources.

1. Introduction

Forecasting electricity demand at the regional or national level is vital for efficient power-system planning aimed at ensuring optimal energy management [1] and secure electricity supply [2]. Overestimating electricity demand not only wastes resources but also introduces unnecessary environmental pollution from thermal power plants by devising expensive expansion plans for the relevant power systems [2,3]. On the other hand, underestimating electricity demand can also have disastrous social consequences, such as blackouts caused by insufficient power supply [2]. According to the Annual Energy Outlook 2019 by the U.S. Energy Information Administration (EIA) [4], residential electricity usage accounted for close to 30% of electricity consumption worldwide in 2018 and is fairly volatile relative to the electricity demand in, for example, industrial and commercial sectors. Meanwhile, it is projected that residential electricity consumption will account for more than one-third of electricity consumption worldwide in 2040 and about 36% in 2050; the amount of electricity used in the residential sector is expected to more than double compared to 2018 [4]. In this context, forecasting monthly electricity demand in the residential sector represents an urgent concern for efficient power-system planning, but is also challenging on account of its tendency to follow a complex pattern characterized by fluctuations and increases. Residential demand depends on various non-linear variables, including economic [5,6], demographic [6,7], and weather conditions [6,7]. Weather conditions, such as wind speed and temperature, vary monthly, pursuant to seasonal trends. By contrast, economic and demographic variables tend to elicit increases in demand. Forecasting residential electricity demand requires consideration of multiple non-linear variables subject to diverse patterns and is, therefore, a highly complex problem in the context of power-system planning.
To preemptively resolve this problem, previous studies (e.g., [8,9,10]) have employed various statistical methods and data mining techniques. Although statistical methods have the advantage of interpretive transparency [3], data mining techniques have been shown to perform better at forecasting residential electricity demand because it is non-linear [2,6]. Recently, support vector regression (SVR) was found to have superior performance in forecasting electricity demand relative to other data mining techniques [2]. Despite the promising results with SVR, the model’s accuracy with respect to mean absolute percentage error (MAPE) and unpaired peak accuracy (UPA) were measured at 2.13 and −5.89, respectively, leaving quite a bit of room for improvement [2].
Recently, deep learning has proven to be a particularly powerful tool for time series analyses [11]. Deep learning is an advanced machine learning approach that utilizes a deep architecture to generate accurate models, but conventional machine learning algorithms employ a relatively shallow architecture that transforms the input data only once or twice. Specifically, in deep learning, the input data is transformed multiple times by a linear or non-linear method and the output is subsequently derived from deep architecture [12]. Deep learning has performed exceptionally well in various time series analyses, such as stock market forecasting, speech recognition, and human motion modeling [11]. However, to our knowledge, its potential in forecasting monthly residential electricity demand at the regional or national level remains unexplored.
This study aimed to propose a model that can accurately forecast monthly residential electricity demand based on long short-term memory (LSTM). The LSTM model has been regarded as an advanced and powerful algorithm for time series data analyses among deep learning-based approaches. This paper is organized as follows: first, Section 2 provides a review of related works on forecasting monthly electricity demand; next, Section 3 outlines the characteristics of the methodology; Section 4 provides the dataset, implementation, benchmark models, and performance measures; Section 5 discusses the experimental results; and, finally, Section 6 contains conclusions and recommendations, highlighting this study’s contribution and areas worthy of future research.

2. Related Works

A consensus has developed around the accurate forecasting of monthly electricity demand, especially regarding approaches that employ data mining techniques because their performance has been consistently superior to that of traditional statistical methods. Several studies have used data mining techniques to explore the problem of forecasting monthly electricity demand.
Bunnoon et al. [8] employed a multi-regional artificial neural network (ANN) to study forecasting of monthly electricity demand in Thailand for 1997 and 2007, the results of which calculated a MAPE value of 1.42 for their model, which renders it more reliable than a traditional ANN. Hamzaçebi et al. [9] proposed a model using the grey system theory to study forecasting monthly electricity demand in Turkey for 1987 and 2014. In terms of MAPE, their model demonstrated an accuracy value of 5.18. Son and Kim [2] used an SVR model to study the forecasting of monthly residential electricity demand in South Korea for 1991 and 2012. Their model selected socioeconomic and weather variables from previous studies, after which they employed a fuzzy-rough and particle swarm optimization algorithm to identify the relevant variables and handle the problems associated with non-linear time series. Their results achieved a MAPE of 2.13 and a UPA of −5.89. Guo et al. [10] proposed a model to study the forecasting of monthly electricity demand in China between 2000 and 2014. Their model is based on the self-adaptive screening and seasonal vector error correction model (SAS-SVECM). They obtained a MAPE value of 1.66, which signifies a higher level of accuracy than time series based on X–12–ARIMA, which is an extension of the autoregressive integrated moving average (ARIMA), SVR with X–12–ARIMA, ANN, growth equation method, and support vector machine (SVM) with X–12–ARIMA.
These studies have proved that it is possible to forecast electricity demand but, thus far, they have yielded results that illustrate the unsatisfactory nature of the models employed, even though data mining techniques have shown superior results relative to those derived from conventional statistical methods. As electricity demand grows, it will become even more difficult to forecast monthly residential electricity demand. Therefore, the previous methods developed to solve this problem still require refinement in terms of their actual forecasting performance.
In recent years, deep learning-based approaches have been applied to various subjects relevant to a wide range of electricity demand forecasting problems (e.g., [13,14,15,16,17,18]). Various forecasting resolutions have been studied from multiple times a day (e.g., minute-by-minute [18], half-hourly [14,16], hourly [18], and every two hours [16]) to daily [15,17,18], weekly [18], monthly [13], or yearly depending on the purposes of forecasting. There have been various target units, such as a single household consumer [15,18], a business consumer (e.g., a manufacturing company in [14] and small and medium enterprise consumer in [16]), any end-user not distinguished as a household or business consumer [13], and a region or country (e.g., UT Chandigarh, India, in [17]). At the regional or national level, the problem of forecasting electricity demand is then broken down by end-use sectors, such as residential, industrial, commercial, and transportation. The characteristics used for each subject’s historical demand series and the variables that affect the series can differ and it is sometimes difficult to consider additional variables depending on the forecasting resolution and target unit. Although deep learning-based approaches showed promise as alternatives to conventional statistical methods in [14,18] and machine learning algorithms in [17], their potential for forecasting monthly residential electricity demand at the regional or national level needs to be explored. To that end, this paper hereby proposed a forecasting model based on LSTM that utilizes a deep learning algorithm with great potential for time series analyses.

3. Long Short-Term Memory

LSTM is an advanced and deep architecture of recurrent neural network (RNN) [19]. RNN can handle time series data because the activation of a recurrent hidden state at each time step depends on the hidden state of a previous time step, whereas conventional neural network transmits information to the next layer without reference to the previous time step [20,21]. As illustrated in Figure 1, t , which signifies a hidden state at each time, is updated at each time step by the input as well as the hidden state at time t and t   1 , respectively. Therefore, an RNN is regarded as an example of deep network architecture, which assigns one layer to each time step. Although an RNN is theoretically suitable for forecasting time-based feature sequences, training the sequences with long time steps is challenging due to RNN’s inherent limitation, i.e., exploding and vanishing gradients [19,22,23]. In this study, a prolonged historical data sequence, related to monthly residential electricity demand, was considered for reliable time series forecasting and LSTM, which is more effective than conventional RNN, was employed to handle the time series with long time steps [19,24].
The main idea of LSTM involves replacing the traditional neuron in an RNN with a memory cell comprised of three sigmoid layers, known as the input, output, and forget gates [25], as illustrated in Figure 2. The first step decides whether to keep or discard the information in the state of the cell, a result achieved by the forget gate. In this process, the gate generates a value between 0 and 1 by referring to the input as well as the hidden state at time t and t   1 , which it discards and keeps, respectively. The generated value is multiplied by the hidden state at time t   1 to perform this process. Next, newly-generated information to be stored is determined by multiplying the outputs from the following two processes in the state of the cell. First, the input gate generates the value to denote the updated information of the state of the cell at time t , then the t a n h layer returns a new candidate vector. Here, the vector can be used to update the state of the cell. The outputs from these two processes are multiplied and the result of that calculation is then added to the forget gate’s output. The output of the memory cell is translated by the output gate using the new hidden state at time t . In the process of generating output, the output gate decides which information to use as the output based on the state of the cell. Finally, the memory cell’s output is generated by multiplying the vector, element-wise, from the output gate and the state of the cell through the t a n h layer.
In the context of forecasting electricity demand, x = ( x 1 , x 2 , , x T ) refers to the historical input data and y = ( y 1 , y 2 , , y T ) refers to the forecasted data. The forecasted electricity demand can be calculated as [26,27]:
i t = σ s ( W x i x t + W h i h t 1 + W c i c t 1 + b i ) ,
f t = σ s ( W x f x t + W h f h t 1 + W c f c t 1 + b f ) ,
o t = σ s ( W x o x t + W h o h t 1 + W c o c t 1 + b o ) ,
c t = f t ° c t 1 + i t ° σ c ( W x c x t + W h c h t 1 + b c ) ,
h t = o t ° σ h ( c t ) ,
where i denotes the input gate, f refers to the forget gate, o signifies the output gate, c represents each cell’s activation vectors, h stands for each memory block’s activation vectors, W denotes the weight matrix of each gate unit, b represents the bias vectors, and ° denotes the element-wise multiplication. The input gate determines what information is stored in the memory cell. The forget gate serves to determine what information to take and store in the memory cell from the previous state, that is, whether to retain the information. Then the output gate helps to determine the output of the information from the memory cell for the new hidden state. The operation of each gate that constitutes the LSTM is represented in Equations (1)–(3), respectively. By the operation of the input, forget, and output gates, the states of the cell and the hidden layers are determined; these are shown in Equations (4) and (5), respectively. For this series of processes, LSTM features the capability to extract and remember information for a long time.
The activation function σ s ( · ) used for the input, forget, and output gates, which represents a sigmoid-type function, is written as:
σ s ( x ) = 1 1 + e x ,
while each of the activation function σ c ( · ) and σ h ( · ) used for the activation gate is expressed as
t a n h ( x ) = 1 2 e 2 x + 1 = 2 σ s ( 2 x ) 1 .

4. Experiments

4.1. Dataset

Data were collected from 1991 when the monthly data for the input considered in this study began to be officially published. Data spanning 22 years (1991–2012, a total of 264 observations) were used for the training, test, and validation, among which the period from January 2011 to December 2012 was considered the validation period instead of more recent years. This was because the electricity demand during this period showed complex seasonal and monthly patterns, significant fluctuations, and gradual increases as well as abnormal cases in forecasting electricity demand. While residential electricity demand peaked in August in most years, in 2011 the electricity demand in January was higher than in August. This was presumed to be due to the fact that the electricity demand in January 2011 was affected by a severe cold wave that led to exceptionally low temperatures and the second-coldest winter since 1981. In September 2011, South Korea experienced its first-ever serious demand and supply imbalance and this led to nationwide rolling blackouts. In August, a month before this point, humid weather continued and heavy rainfall events replaced the heat waves that occurred in previous years. Instead, a late heat wave came in early- and mid-September. In August 2012, an unexpected capacity shortage due to abnormal electricity demand occurred. This was also presumed to happen due to a severe heat wave with the highest average temperature since 1994. The monthly electricity demand growth rate was 3.29% on average between 2011 and 2012, whereas it increased drastically (by 12.58%) in August 2012 compared to the same time during the previous year. In South Korea, Korea Electric Power Corporation provides the state of the amount of reserved power by categorizing the state as normal or five levels of insufficiency (below 5 × 103 MWh). In August 2012, the amount of reserved power fell below 3 × 103 MWh, resulting in a level 3 insufficiency warning. This indicated that this validation period constitutes a forecasting challenge.
Figure 3 shows a graphical plot of the time series for the electricity demand, presenting complex seasonal and monthly patterns and significant fluctuations. The data indicate a considerable upward pattern lasting until the end of 2012, particularly linked to the economic and demographic growth of the country. Moreover, a cyclic yearly trend that correlated with annual global climatic changes was imposed on seasonal and monthly patterns. Further, the amplitude of the values obtained denoted holidays, workdays, and various other days incorporated into the series.
According to previous studies [28,29,30,31], electricity demand series is affected by several non-linear variables, such as social and weather conditions. In this study, related variables that Son and Kim [2] found to impact the time series for monthly residential electricity demand were employed to reflect such influences. Table 1 summarizes 10 related variables, including 2 social variables and 8 weather-related ones. In the present study, data on monthly residential electricity demand and associated variables were assembled from published statistics by the Korea Energy Statistical Information System, Korea Meteorological Administration, and Korean Statistical Information Service.
For weather variables, this study collected the monthly weather data according to the representative regions of South Korea, including the Seoul metropolitan area (including Seoul, Incheon, and Gyeongggi), Gangwon, North Chungcheong, South Chungcheong (including Daejeon and Sejong), North Jeolla, South Jeolla (including Gwangju), North Gyeongsang (including Daegu), South Gyeongsang (including Busan and Ulsan), and the Jeju special self-governing province. Each region was linked to a climatic condition, each of which represents an area of similar climate. The data for weather variables, collected from these nine regions, were averaged according to the population ratio of their regions. Notably, because climate affects the electricity demand via people’s responses to the weather, the population was used as an indicator [32]. That is to say, people vary their use of electric air conditioning or heating appliances depending on the degree of hotness or coldness of the weather, so a population’s density is likely to increase the electricity demand [33]. Similarly, a stronger correlation exists between the population and the regional electricity demand [34].
Figure 4 and Figure 5 compare the normalized values of the weather and social-related variables in the surveyed time series data, each with its own characteristics, and the trend of normalized demand values of electricity in the residential sector. Although the normalized values of the weather-related variables (Figure 4) could delineate the fluctuations in the demand for electricity, they could not account for the long-term upward pattern of electricity demand.
On the other hand, while the normalized values of social variables (Figure 5) followed an upward pattern of normalized demand for electricity values annually, they showed less fluctuation than the weather-related variables, especially the consumer price index.

4.2. Implementation

The proposed LSTM model was implemented using Keras with a Tensorflow backend [35]. To consider the seasonal and annual rising patterns of residential electricity demand, the input layer’s time step was set to 24. The sigmoid-type function was used for the input, forget, and output gates, and the activation function t a n h ( · ) was used for the activation gate. An LSTM model, consisting of one input, two hidden, and one output layers was trained. For the two hidden layers, 125 hidden neurons were used in this study. The architecture was experimentally determined by exploring the optimal configurations under the different numbers of hidden layers in the network from 1 to 3 and neurons in each hidden layer at 25 intervals from 25 to 125. The proposed LSTM model was trained on the mean-squared error (MSE) loss function and optimized through adaptive moment estimation (ADAM) optimization scheme [36].

4.3. Benchmark Models

Four forecasting models (SVR, ANN, ARIMA, and multiple linear regression (MLR)) were used for a comparative demonstration of the proposed LSTM model’s performance in forecasting monthly residential electricity demand.

4.3.1. Support Vector Regression

SVR is a modified version of the SVM devised by Müller et al. [37] to address the time series forecasting problem. It has gained increasing attention over the past decades, especially in electricity demand forecasting [38]. The main objective of SVR is to find a hyperplane function that can recognize patterns in the given time series data. One advantage of this model is that the upper boundary of the generalized error, rather than the learning error, is minimized [39]. SVR was selected as a benchmark model because previous studies have proven that an SVR-based model can yield satisfactory performance across various electricity demand forecasting [2,40,41]. In this study, the hyperparameters of SVR, penalty factor and gamma, were experimentally determined as 100 and 0.001, respectively.

4.3.2. Artificial Neural Networks

An ANN is a mathematical model designed to simulate the processes and functionality of the human brain [42]. Generally, the main objective in this context involves finding a relationship that can automatically map output to input through the training stage, by which the network is iteratively trained to minimize forecasting error [39]. ANN has the advantage of being applicable to multivariate models and capable of describing non-linear relationships among output and input from given historical data [39]. A back-propagation ANN, combining a back-propagation algorithm with a feedforward multi-layer perceptron is the most widely used network structure and was adopted in this study; the momentum and learning-rate parameters were experimentally determined as 0.2 and 0.3, respectively.

4.3.3. Autoregressive Integrated Moving Average (ARIMA)

ARIMA, the most well-known statistical method for time series analyses, can model complex patterns and forecast in univariate time series data [43]. The ARIMA model function has three important factors, denoted as p , d , and q [44,45], which represent the autoregressive, integration, and moving average factors, respectively. An expression of the ARIMA ( p , d , q ) model in general is as follows [44]:
Φ p ( B O ) ( 1 B O ) d Y t = θ q ( B O ) e t + δ ,
where Φ p indicates the autoregressive parameter of order p , p indicates the number of autoregressive terms, B O represents the backshift operator, d indicates the number of differences which are non-seasonal, Y t indicates the actual value in the given time series data, θ q is the order p ’s autoregressive parameter, q indicates the number of forecasting errors, which are lagged in the forecasting equation, e t is a random perturbation or white noise, and δ indicates a constant value.

4.3.4. Multiple Linear Regression (MLR)

MLR, one of the most common statistical regression models, was also employed as a benchmark model. The MLR’s objective is to model the relationship among a dependent variable and multiple independent variables [44]. The relationship between the vector of regressors and the dependent variable is assumed to be linear in MLR. The equation of linear regression maps the forecasting model to the observations in the given time series, x and y values, for forecasting purposes. An expression of the MLR model is as follows [46]:
y = ε + β 1 x 1 + β 2 x 2 + · · · + β n x n   ,
where ϵ represents an error or a random disturbance, β 1 , β 2 , , β n are the coefficients, and n refers to the number of observations. The resulting model is used for forecasting the y value with additional observations x .

4.4. Performance Measures

In this study, the forecasting results were evaluated using the values of six measures, including mean absolute error (MAE), root-mean-squared error (RMSE), MAPE, post-error ratio C , mean bias error (MBE), and UPA. The use of these performance measures represents various ways of evaluating the models. MAE and RMSE are absolute performance measures that allowed us to compare the actual deviation between the actual and forecasted values. On the other hand, MAPE is a relative measure that is scale-invariant and represents the relative criterion of the forecasting error between the actual and forecasted values [47]. The ratio C , represents the ratio between the standard deviations of the actual values and the forecasting errors between the actual and forecasted values. A smaller value of C indicates that the model has a higher level of accuracy. It is possible to determine whether and by how much forecasted values are overestimated or underestimated based on whether the MBE is positive or negative, where a smaller value of MBE is more accurate. The value of UPA represents the forecasting model’s performance in estimating peak demand, but is incapable of pairing the forecasted value with actual value in terms of space or time. In this study, employed performance measures were calculated as the following equations:
MAE = i = 1 n | y i x i | n ,
RMSE = i = 1 n ( x i y i ) 2 n ,
MAPE = 1 n i = 1 n | y i x i y i | × 100 ,
C = S 2 S 1 ,
MBE = i = 1 n ( x i y i ) n ,
U P A = x m a x y m a x y m a x × 100 ,
where n is the number of validation points, refers to the forecasted values, y i refers to the actual values, and S 1 and S 2 denote the standard deviations of the actual values and the forecasting errors, respectively.

5. Results and Discussion

The performances of the models for forecasting monthly residential electricity demand were tested and validated using the dataset and the variables described in the Dataset section. The results for the proposed LSTM model were compared with those of the four benchmark models: SVR, ANN, ARIMA, and MLR. The results were obtained by training based on 240 months and then forecasting the remaining 24 months. A comparison of the forecasting performance, in terms of MAE, RMSE, MAPE, C , MBE, and UPA is summarized in Table 2.
In each of the measures considered in this study, the proposed LSTM model outperformed the four benchmark models. As the values of MAE and RMSE indicate a comparison of the absolute error between the actual and forecasted demand, no absolute criterion exists for both measures’ reliability. Therefore, a smaller value of MAE and RMSE indicates that the forecasted demand during the validation period was closer to the actual values, relative to the other forecasting models. Thus, the proposed LSTM model with the lowest MAE and RMSE values suggests that, within the validation period, it could provide the most accurate forecast, relative to the other tested models.
The proposed LSTM model also showed a superior performance for MAPE by achieving the lowest MAPE value, of less than or equal to 1. When a MAPE value is less than 10, it is interpreted as highly accurate forecasting [48] or acceptable forecasting accuracy [49], and when a MAPE value is less than 1, it is interpreted as near-perfect forecasting [49]. So, the LSTM model was proven to yield superior performance in terms of absolute and relative error. In terms of the ratio   C , the proposed LSTM model performed the best, yielding the lowest value of C . As this value represents the change rate of the error in forecasting, a lower value implies a higher forecasting performance. Moreover, it achieved the lowest C -value of less than or equal to 0.35, indicating near-perfect forecasting furnished by that model [49,50].
For MBE, positive and negative values represent underestimated and overestimated values, respectively, and an absolute value of MBE closer to zero indicates a more accurate forecasting performance. The proposed LSTM model attained a lower absolute value for MBE than SVR, which achieved the next best performance among the benchmark models. Additionally, the ability to forecast the peak demand month is critical for efficient power-system planning as well as for identifying errors in monthly demand forecasts [51]. Therefore, the UPA value, which is a measure for evaluating the ability to forecast peak demand, is also meaningful when evaluating the model’s forecasting performance. As described in Section 4.1, there were abnormal peak demands in 2011 and 2012. In 2011, the peak demand was recorded in January instead of August. In August 2012, there was a drastic surge in electricity demand compared to August 2011. Within the validation period, the proposed LSTM model gave the best peak-demand forecasting performance, with UPA values of 0.02 and −0.44 for 2011 and 2012, respectively, which means that the proposed LSTM model overestimated and underestimated the electricity usage peak in the residential areas by 0.02% and 0.44% for each year of the validation period. Although the proposed model showed better results than other models with respect to deviation between the actual and forecasted values during peak demand in 2012, the results were worse than in the year 2011.
Figure 6 shows a graphical comparison of residential electricity demand during the validation period versus the electricity demand forecasted by the proposed LSTM model. As shown in the graph, the electricity demand used for validation gradually increased monthly and showed strong fluctuations, according to seasonal trends as well as according to increasing national economic and demographic growth. So, the data for the validation had a relatively complex pattern relative to data from the whole observation period considered in this study. Nonetheless, the proposed LSTM model, represented by the blue graph, forecasted residential electricity demand that was very close to the actual electricity demand. This result not only means that the proposed LSTM model can forecast future patterns of electricity demand pattern but also that the forecasted monthly demand is highly accurate, as indicated by the statistical results in Table 2.
The graphical comparisons of the results are shown in Figure 7 and Figure 8. Figure 7 also provides a visual comparison of the accuracy of the electricity demand forecasts between the proposed LSTM model and the four models employed for comparison. As indicated, the benchmark models clearly deviated from the actual demand, but the deviation between the actual and forecasted values cannot be visually confirmed in the proposed LSTM model.
Figure 8 shows the deviation between the actual and forecasted values in each month to render a sharper and more explicit comparison of the performance in forecasting residential electricity demand. As indicated, the benchmark models clearly reflected deviation between actual and forecasted values, indicating underestimation or overestimation, whereas the proposed LSTM model exhibited few deviations. To visually evaluate the proposed model’s performance, Figure 8b illustrates the deviations, expanded by 10-times magnification of the vertical axis of the portion, marked as blue in Figure 8a.
As indicated, the proposed model showed a deviation of less than 50 kWh (27.47 kWh), even in August 2012 when the aforementioned abnormal peak demand occurred. On the other hand, as shown in Figure 8c–f, the deviations between the forecasted electricity demand and the actual consumption by the benchmark models were significant as they were outside of the blue zone. These results visually demonstrated that the proposed model forecasted monthly demand for electricity in the residential sector more accurately than the benchmark models. Therefore, the experimental results prove that an accurate model was achieved using the deep learning technique in forecasting monthly residential electricity demand in the present dataset, which is in agreement with recently introduced research that also used deep learning techniques to investigate a wide range of electricity demand forecasting problems.

6. Conclusions

As the residential sector contributes significantly to electricity consumption worldwide and its proportion of consumption is on the rise, developing a forecasting model capable of making accurate predictions is important for efficient power-system planning. In this regard, this study aimed to propose and develop an accurate forecasting model for monthly residential electricity demand. To achieve this objective, LSTM, one of the most powerful deep learning algorithms, was trained and validated on 22 years of data collected from South Korea. The electricity demand series, together with 10 social and weather-related variables, was used as the input. Then, the model’s forecasting performance was evaluated and compared to the models employed in previous studies in terms of six performance measures: MAE, RMSE, MAPE, C , MBE, and UPA.
Some remarkable findings can be extrapolated from the results of this study. Previous studies demonstrated relatively little understanding of the value of employing deep learning algorithms, especially to forecast monthly residential electricity demand at the regional or national level. The proposed model achieved a highly accurate forecasting performance with a MAPE value of less than 1 by considering social and weather-related variables. Although the proposed LSTM-based model achieved a superior forecasting performance in all of the evaluated performance measures relative to the four benchmark models in the present dataset, comprehensive evaluation using data from across the world is recommended for future research to generalize these findings and conclusions in other countries and different situations.
The proposed model is expected to contribute to efficient power-system planning by accurately forecasting monthly residential electricity usage, which accounts for the majority of total electricity usage. The ability to forecast monthly electricity demand with high accuracy can enable power-system personnel and market players to make sustainable power-system planning decisions to ensure efficient production and utilization of resources. This accurate deep learning-based forecasting model could also be applied to other problems associated with analyzing time series and optimizing energy processes, energy conservation, and sustainable use of energy resources.
Recommended future research includes studies of different forecast resolutions, such as 10 to 15 min in the future, for smart grid applications, which are currently receiving tremendous attention worldwide. In addition, by using detailed data for each city or groups of residential buildings, the deep learning–based model could be adapted for use in fields other than power–system planning.

Author Contributions

Conceptualization, H.S. and C.K.; Data curation, H.S. and C.K.; Formal analysis, H.S.; Funding acquisition, C.K.; Investigation, H.S. and C.K.; Methodology, H.S. and C.K.; Project administration, C.K.; Resources, H.S. and C.K.; Software, H.S.; Supervision, C.K.; Validation, H.S. and C.K.; Visualization, H.S.; Writing – original draft, H.S. and C.K.; Writing – review & editing, C.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF – 2018R1D1A1B07049846).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhou, K.; Yang, S.; Shao, Z. Household monthly electricity consumption pattern mining: A fuzzy clustering-based model and a case study. J. Clean. Prod. 2017, 141, 900–908. [Google Scholar] [CrossRef]
  2. Son, H.; Kim, C. Short-term forecasting of electricity demand for the residential sector using weather and social variables. Resour. Conserv. Recycl. 2017, 123, 200–207. [Google Scholar] [CrossRef]
  3. Vu, D.H.; Muttaqi, K.M.; Agalgaonkar, A.P.; Bouzerdoum, A. Short-term electricity demand forecasting using autoregressive based time varying model incorporating representative data adjustment. Appl. Energy 2017, 205, 790–801. [Google Scholar] [CrossRef]
  4. IEA. World Energy Outlook 2019; IEA: Paris, France, 2019. Available online: https://www.iea.org/reports/world-energy-outlook-2019 (accessed on 30 March 2020).
  5. Chen, Y.-T. The factors affecting electricity consumption and the consumption characteristics in the residential sector—A case example of Taiwan. Sustainability 2017, 9, 1484. [Google Scholar] [CrossRef] [Green Version]
  6. Lusis, P.; Khalilpour, K.R.; Andrew, L.; Liebman, A. Short-term residential load forecasting: Impact of calendar effects and forecast granularity. Appl. Energy 2017, 205, 654–669. [Google Scholar] [CrossRef]
  7. Al-Musaylh, M.S.; Deo, R.C.; Adamowski, J.F.; Li, Y. Short-term electricity demand forecasting with MARS, SVR and ARIMA models using aggregated demand data in Queensland, Australia. Adv. Eng. Inform. 2018, 35, 1–16. [Google Scholar] [CrossRef]
  8. Bunnoon, P.; Chalermyanont, K.; Limsakul, C. Multi-substation control central load area forecasting by using HP-filter and double neural networks (HP-DNNs). Int. J. Electr. Power Energy Syst. 2013, 44, 561–570. [Google Scholar] [CrossRef]
  9. Hamzaçebi, C. Primary energy sources planning based on demand forecasting: The case of Turkey. J. Energy S. Afr. 2016, 27, 1–10. [Google Scholar] [CrossRef] [Green Version]
  10. Guo, H.; Chen, Q.; Xia, Q.; Kang, C.; Zhang, X. A monthly electricity consumption forecasting method based on vector error correction model and self-adaptive screening method. Int. J. Electr. Power Energy Syst. 2018, 95, 427–439. [Google Scholar] [CrossRef]
  11. Längkvist, M.; Karlsson, L.; Loutfi, A. A review of unsupervised feature learning and deep learning for time-series modeling. Pattern Recognit. Lett. 2014, 42, 11–24. [Google Scholar] [CrossRef] [Green Version]
  12. Fan, C.; Xiao, F.; Zhao, Y. A short-term building cooling load prediction method using deep learning algorithms. Appl. Energy 2017, 195, 222–233. [Google Scholar] [CrossRef]
  13. Berriel, R.F.; Lopes, A.T.; Rodrigues, A.; Varejão, F.M.; Oliveira-Santos, T. Monthly energy consumption forecast: A deep learning approach. In Proceedings of the 2017 International Joint Conference on Neural Network, Anchorage, AK, USA, 14–19 May 2017; pp. 4283–4290. [Google Scholar]
  14. Chandramitasari, W.; Kurniawan, B.; Fujimura, S. Building deep neural network model for short term electricity consumption forecasting. In Proceedings of the 2018 International Symposium on Advanced Intelligent Informatics, Yogyakarta, Indonesia, 29–30 August 2018; pp. 43–48. [Google Scholar]
  15. Nair, A.S.; Hossen, T.; Campion, M.; Ranganathan, P. Optimal operation of residential EVs using DNN and clustering based energy forecast. In Proceedings of the 2018 North American Power Symposium, Fargo, ND, USA, 9–11 September 2018; pp. 1–6. [Google Scholar]
  16. Balaji, A.J.; Ram, D.S.H.; Nair, B.B. A deep learning approach to electric energy consumption modeling. J. Intell. Fuzzy Syst. 2019, 36, 4049–4055. [Google Scholar] [CrossRef]
  17. Bedi, J.; Toshniwal, D. Deep learning framework to forecast electricity demand. Appl. Energy 2019, 238, 1312–1326. [Google Scholar] [CrossRef]
  18. Kim, T.-Y.; Cho, S.-B. Predicting residential energy consumption using CNN-LSTM neural networks. Energy 2019, 182, 72–81. [Google Scholar] [CrossRef]
  19. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
  20. Chung, J.; Gulcehre, C.; Cho, K.; Bengio, Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv 2014, arXiv:1412.3555. [Google Scholar]
  21. Wang, H.; Yang, Z.; Yu, Q.; Hong, T.; Lin, X. Online reliability time series prediction via convolutional neural network and long short term memory for service-oriented systems. Knowl. Based Syst. 2018, 159, 132–147. [Google Scholar] [CrossRef]
  22. Bengio, Y.; Simard, P.; Frasconi, P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 1994, 5, 157–166. [Google Scholar] [CrossRef]
  23. Kraus, M.; Feuerriegel, S. Decision support from financial disclosures with deep neural networks and transfer learning. Decis. Support Syst. 2017, 104, 38–48. [Google Scholar] [CrossRef] [Green Version]
  24. Graves, A.; Mohamed, A.R.; Hinton, G. Speech recognition with deep recurrent neural networks. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada, 26–31 May 2013; pp. 6645–6649. [Google Scholar]
  25. Guo, L.; Li, N.; Jia, F.; Lei, Y.; Lin, J. A recurrent neural network based health indicator for remaining useful life prediction of bearings. Neurocomputing 2017, 240, 98–109. [Google Scholar] [CrossRef]
  26. Ke, J.; Zheng, H.; Yang, H.; Chen, X.M. Short-term forecasting of passenger demand under on-demand ride services: A spatio-temporal deep learning approach. Transp. Res. Part C Emerg. Technol. 2017, 85, 591–608. [Google Scholar] [CrossRef]
  27. Zhang, D.; Lindholm, G.; Ratnaweera, H. Use long short-term memory to enhance Internet of Things for combined sewer overflow monitoring. J. Hydrol. 2018, 556, 409–418. [Google Scholar] [CrossRef]
  28. Kavaklioglu, K.; Ceylan, H.; Ozturk, H.K.; Canyurt, O.E. Modeling and prediction of Turkey’s electricity consumption using artificial neural networks. Energy Convers. Manag. 2009, 50, 2719–2727. [Google Scholar] [CrossRef]
  29. Swan, L.G.; Ugursal, V.I. Modeling of end-use energy consumption in the residential sector: A review of modeling techniques. Renew. Sustain. Energy Rev. 2009, 13, 1819–1835. [Google Scholar] [CrossRef]
  30. Aroonruengsawat, A.; Auffhammer, M. Impacts of Climate Change on Residential Electricity Consumption: Evidence from Billing Data; Chapter in NBER book The Economics of Climate Change: Adaptations Past and Present; Libecap, G.D., Steckel, R.H., Eds.; University of Chicago Press: Chicago, IL, USA, 2011; pp. 311–342. [Google Scholar]
  31. Apadula, F.; Bassini, A.; Elli, A.; Scapin, S. Relationships between meteorological variables and monthly electricity demand. Appl. Energy 2012, 98, 346–356. [Google Scholar] [CrossRef]
  32. Hondroyiannis, G. Estimating residential demand for electricity in Greece. Energy Econ. 2004, 26, 319–334. [Google Scholar] [CrossRef]
  33. Valor, E.; Meneu, V.; Caselles, V. Daily air temperature and electricity load in Spain. J. Appl. Meteorol. 2001, 40, 1413–1421. [Google Scholar] [CrossRef]
  34. Sailor, D.J.; Muñoz, J.R. Sensitivity of electricity and natural gas consumption to climate in the USA—Methodology and results for eight states. Energy 1997, 22, 987–998. [Google Scholar] [CrossRef]
  35. Chollet, F. Keras. Available online: https://github.com/fchollet/keras (accessed on 30 March 2020).
  36. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
  37. Müller, K.R.; Smola, A.J.; Rätsch, G.; Schölkopf, B.; Kohlmorgen, J.; Vapnik, V. Predicting time series with support vector machines. In Proceedings of the 7th International Conference on Artificial Neural Networks, Lausanne, Switzerland, 8–10 October 1997; pp. 999–1004. [Google Scholar]
  38. Massidda, L.; Marrocu, M. Decoupling weather influence from user habits for an optimal electric load forecast system. Energies 2017, 10, 2171. [Google Scholar] [CrossRef] [Green Version]
  39. Sousa, J.C.; Jorge, H.M.; Neves, L.P. Short-term load forecasting based on support vector regression and load profiling. Int. J. Energy Res. 2014, 38, 350–362. [Google Scholar] [CrossRef]
  40. Chen, Y.; Xu, P.; Chu, Y.; Li, W.; Wu, Y.; Ni, L.; Bao, Y.; Wang, K. Short-term electrical load forecasting using the support vector regression (SVR) model to calculate the demand response baseline for office buildings. Appl. Energy 2017, 195, 659–670. [Google Scholar] [CrossRef]
  41. Khosravi, A.; Koury, R.N.N.; Machado, L.; Pabon, J.J.G. Prediction of hourly solar radiation in Abu Musa Island using machine learning algorithms. J. Clean. Prod. 2018, 176, 63–75. [Google Scholar] [CrossRef]
  42. Son, H.; Kim, C. Early prediction of the performance of green building projects using pre-project planning variables: Data mining approaches. J. Clean. Prod. 2015, 109, 144–151. [Google Scholar] [CrossRef]
  43. Alani, A.Y.; Osunmakinde, I.O. Short-term multiple forecasting of electric energy loads for sustainable demand planning in smart grids for smart homes. Sustainability 2017, 9, 1972. [Google Scholar] [CrossRef] [Green Version]
  44. Adamowski, J.; Chan, H.F.; Prasher, S.O.; Ozga-Zielinski, B.; Sliusarieva, A. Comparison of multiple linear and nonlinear regression, autoregressive integrated moving average, artificial neural network, and wavelet artificial neural network methods for urban water demand forecasting in Montreal, Canada. Water Resour. Res. 2012, 48. [Google Scholar] [CrossRef]
  45. Kia, A.N.; Fathian, M.; Gholamian, M.R. Using MLP and RBF neural networks to improve the prediction of exchange rate time series with ARIMA. Int. J. Inform. Electron. Eng. 2012, 2, 543–546. [Google Scholar] [CrossRef]
  46. Dudek, G. Pattern-based local linear regression models for short-term load forecasting. Electr. Power Syst. Res. 2016, 130, 139–147. [Google Scholar] [CrossRef]
  47. Hu, Z.; Bao, Y.; Xiong, T. Comprehensive learning particle swarm optimization based memetic algorithm for model selection in short-term load forecasting using support vector regression. Appl. Soft Comput. 2014, 25, 15–25. [Google Scholar] [CrossRef]
  48. Lewis, C.D. Industrial and Business Forecasting Methods: A Practical Guide to Exponential Smoothing and Curve Fitting; Butterworth Scientific: London, UK; Boston, MA, USA, 1982. [Google Scholar]
  49. Hamzacebi, C.; Es, H.A. Forecasting the annual electricity consumption of Turkey using an optimized grey model. Energy 2014, 70, 165–171. [Google Scholar] [CrossRef]
  50. Jiang, Y.; Yao, Y.; Deng, S.; Ma, Z. Applying grey forecasting to predicting the operating energy performance of air cooled water chillers. Int. J. Refrig. 2004, 27, 385–392. [Google Scholar] [CrossRef]
  51. McSharry, P.E.; Bouwman, S.; Bloemhof, G. Probabilistic forecasts of the magnitude and timing of peak electricity demand. IEEE Trans. Power Syst. 2005, 20, 1166–1172. [Google Scholar] [CrossRef]
Figure 1. The basic architecture of RNN [19].
Figure 1. The basic architecture of RNN [19].
Sustainability 12 03103 g001
Figure 2. The long short-term memory (LSTM) memory cell structure.
Figure 2. The long short-term memory (LSTM) memory cell structure.
Sustainability 12 03103 g002
Figure 3. Monthly residential electricity demand () over the data collection period.
Figure 3. Monthly residential electricity demand () over the data collection period.
Sustainability 12 03103 g003
Figure 4. Normalized values of electricity demand () for the weather-related variables: (a) wind speed, (b) vapor pressure, (c) maximum temperature, (d) global solar radiation, (e) daylight time, and (f) cooling degree days.
Figure 4. Normalized values of electricity demand () for the weather-related variables: (a) wind speed, (b) vapor pressure, (c) maximum temperature, (d) global solar radiation, (e) daylight time, and (f) cooling degree days.
Sustainability 12 03103 g004
Figure 5. Normalized values of electricity demand (—) for the social variables: (a) real electricity price and (b) consumer price index.
Figure 5. Normalized values of electricity demand (—) for the social variables: (a) real electricity price and (b) consumer price index.
Sustainability 12 03103 g005
Figure 6. Actual () and forecasted (---) values obtained by using the proposed LSTM model.
Figure 6. Actual () and forecasted (---) values obtained by using the proposed LSTM model.
Sustainability 12 03103 g006
Figure 7. A visual comparison of the forecasting results for the five models.
Figure 7. A visual comparison of the forecasting results for the five models.
Sustainability 12 03103 g007
Figure 8. Graphical representation of the deviations between the actual and forecasted values: (a) the proposed model, (b) 10-times magnification of the vertical axis of the blue area in (a), (c) SVR, (d) ANN, (e) ARIMA, and (f) MLR.
Figure 8. Graphical representation of the deviations between the actual and forecasted values: (a) the proposed model, (b) 10-times magnification of the vertical axis of the blue area in (a), (c) SVR, (d) ANN, (e) ARIMA, and (f) MLR.
Sustainability 12 03103 g008aSustainability 12 03103 g008b
Table 1. Variables for modeling the monthly demand for electricity in the residential sector.
Table 1. Variables for modeling the monthly demand for electricity in the residential sector.
Variable NameUnit
Social variables
Real electricity pricewon/ kWh
Consumer price index
Weather variables
Wind speedm/s
Vapor pressure0.1 hPa
Total cooling degree daysDegree-days
Mean temperature°C
Maximum temperature°C
Cooling degree daysDegree-days
Global solar radiation0.01 MJ/m2
Daylight timehours
Table 2. Summary of the forecasting results for the five models.
Table 2. Summary of the forecasting results for the five models.
MethodProposedSVRANNARIMAMLR
Measure
MAE3,808.67113,870.67522,503.25242,005.30361,170.52
RMSE6,446.66149,287.54633,073.65261,671.62426,779.00
MAPE0.072.139.924.717.15
C0.020.371.010.350.79
MBE930.2545,695.11500,754.67224,571.82300,159.62
UPA in 20110.022.1112.162.10−0.41
UPA in 2012−0.44−5.8914.03−2.57−5.83
Note: The values are rounded to two decimal places.

Share and Cite

MDPI and ACS Style

Son, H.; Kim, C. A Deep Learning Approach to Forecasting Monthly Demand for Residential–Sector Electricity. Sustainability 2020, 12, 3103. https://doi.org/10.3390/su12083103

AMA Style

Son H, Kim C. A Deep Learning Approach to Forecasting Monthly Demand for Residential–Sector Electricity. Sustainability. 2020; 12(8):3103. https://doi.org/10.3390/su12083103

Chicago/Turabian Style

Son, Hyojoo, and Changwan Kim. 2020. "A Deep Learning Approach to Forecasting Monthly Demand for Residential–Sector Electricity" Sustainability 12, no. 8: 3103. https://doi.org/10.3390/su12083103

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop