Calculation of solar irradiation prediction intervals combining volatility and kernel density estimates

doi:10.1016/j.energy.2016.07.167

Energy

Volume 114, 1 November 2016, Pages 266-274

https://doi.org/10.1016/j.energy.2016.07.167 Get rights and content

Highlights

•
This work explores uncertainty forecasting models to build prediction intervals.
•
Kernel density estimators, exponential smoothing and GARCH models are compared.
•
An optimal combination of methods provides the best results.
•
A good compromise between coverage and average interval width is shown.

Abstract

In order to integrate solar energy into the grid it is important to predict the solar radiation accurately, where forecast errors can lead to significant costs. Recently, the increasing statistical approaches that cope with this problem is yielding a prolific literature. In general terms, the main research discussion is centred on selecting the “best” forecasting technique in accuracy terms. However, the need of the users of such forecasts require, apart from point forecasts, information about the variability of such forecast to compute prediction intervals. In this work, we will analyze kernel density estimation approaches, volatility forecasting models and combination of both of them in order to improve the prediction intervals performance. The results show that an optimal combination in terms of prediction interval statistical tests can achieve the desired confidence level with a lower average interval width. Data from a facility located in Spain are used to illustrate our methodology.

Introduction

Solar power generation has been steadily increasing worldwide as a response to environmental concerns. Unfortunately, the integration of solar energy into the energy mix of a country brings new challenges. The main problem is due to the variability of the solar energy, which is not available “on demand”. Therefore, reasonably accurate forecasts are required to make this kind of energy economically viable. Depending on the objective, different forecasting horizons are needed, for instance, long-term forecasts are useful for locating potential solar power plants, energy resource planning and scheduling employs mid-term (up to 48 h) solar forecasts, whereas intraday forecasts are required for load following and predispatch [20]. Kraas et al. [21] economically quantified the impact of forecasting errors in the Spanish electricity system for a concentrating solar power plant. In Spain, forecasts of production have to be provided to the transmission system operator (TSO). In case of deviations from the scheduled production, the TSO applies a cost penalty. If the forecast is higher than the real production, the TSO charges falling penalties, and conversely, charges rising penalties for a production dispatch above the forecasted value. In that reference, forecasting improvement may significantly reduce penalty charges by 47.6% compared to the simple persistence forecasts.

In general terms, most of the published literature on solar energy forecasting is based on the application of different techniques in order to provide point forecasts. Among the diverse techniques provided, the selection of the best technique is usually based on choosing the one with the lowest forecast error. Nevertheless, the need of the users of such forecast (or stakeholders) require more information apart from the point forecast. In particular, they require both uncertainty and variability forecasts (chapter 14 [20]), that they can be even more useful than typical point forecasts [10]. In this context, point forecasts and its associated uncertainty are usually given as hourly-average time series, whereas irradiance variability or ramp events are measured in an intra-hour time scale, for example, minutes. Note that this difference between variability and uncertainty might not be unanimous in other research disciplines out of the solar energy literature. For instance, in supply chain applications, variability is commonly employed as a measure of uncertainty. Nonetheless, in this work we follow the distinction made in (chapter 14, [20]).

The aim of this work is twofold. Firstly, to bridge the gap in the solar energy forecasting literature by focusing on uncertainty forecasting, which has remained overlooked in comparison with point forecasting. Secondly, to use such uncertainty forecasts to compute prediction intervals on the basis of a novel methodology that combines them in an optimal manner.

Essentially, uncertainty can be quantified by the standard deviation of the forecast error, also known as volatility in finance terms, and this is assumed to be homoskedastic, independent and normally distributed. However, since the solar irradiance time series involves very complex processes, it is expected that those assumptions may be violated. Therefore, this work explores different models depending on the assumption that might not be fulfilled. For instance, if the forecast error is not normal a potential solution is to use Kernel Density estimates [31]. On the other hand, if the forecast error is neither homoskedastic nor independent, then, volatility estimators as either Generalized Autoregressive Conditional Heteroskedastic (GARCH) [4] or exponential smoothing models [34] can be employed. Since it is also possible that any of the aforementioned assumptions may be fulfilled, the novel methodology proposed intend to compute prediction intervals by combining Kernel Density estimates and volatility models. Furthermore, the combination suggested is optimal in the sense that maximizes the conditional coverage Christoffersen test p-value [9].

The results show that such a combination provides a robust performance by achieving a compromise between prediction interval coverage and average interval width.

In order to illustrate the methodology employed, we are going to focus on one-step-ahead uncertainty forecasts obtained from Global Horizontal Irradiation (GHI) data that is crucial for photovoltaic generators. Note that this study can be extended in a straightforward manner to analyze the Direct Normal Irradiation (DNI) that is more relevant for Concentrated Solar Power applications.

The rest of the paper is organised as follows: Section 2 reviews the literature on prediction intervals and it describes the models employed in this article. Section 3 describes the case study dataset. Section 4 lays out the experimental setup and the discussion of initial results. Section 5 defines a new approach to compute prediction intervals based on combining previous models, and finally, Section 6 presents the concluding remarks.

Section snippets

Prediction intervals

There are two principal streams to compute prediction intervals. In the first place, a theoretical prediction interval can be calculated based on a forecasting model that assumes to reproduce the data correctly and that the forecast error follows a determined distribution [8]. Since the model is assumed to be specified correctly, the forecasts are unbiased and the forecast errors have zero mean and constant variance, which is function of a certain forecasting horizon and the model parameters

Case study data

Solar irradiance data have been collected by the Spanish Institute for Concentration Photovoltaics Systems (ISFOC), located in Ciudad Real in the region of Castilla-La Mancha in Spain (at 38.67 °N, 4.15 °W, 687 m). Minute-by-minute solar irradiance measurements have been recorded using pyranometers and pyrheliometers, which comply with the international standards of Baseline Surface Radiation Network (BSRN) [24].

Global Horizontal Irradiance (GHI) data have been provided. GHI is the total solar

Experimental setup

The data (8760 observations) have been split down in two parts of the same size approximately. The first part (4392 observations) has been used to estimate the parameters of the ARIMA model (in-sample data) and the GARCH parameters. Once the forecasts have been calculated, we have removed the nights by eliminating observations with an elevation angle lower than 15°. Note that we have not removed the nights in the calculation of the ARIMA-GARCH model in order to use the same model than [28] with

Combination of prediction intervals

Since the non-parametric approach gives a higher hit rate and the parametric approaches, specially the SES approach, tend to be more independent, another alternative is to combine both methods to compute a prediction interval. Actually, Christoffersen [9] pointed out that combining non-parametric error distribution with time-varying variance estimators is likely to present a favorable alternative. In this section we will test such an option.

Given that the non-parametric and SES were the methods

Conclusions

Forecasts of solar irradiation are required to incorporate the electricity generated into the grid. Recently, a large variety of approaches have proliferated so as to provide point forecasts. Nonetheless, the literature about the uncertainty associated to such forecasts is scarce. This work examines the aforementioned uncertainty through prediction intervals computed by means of non-parametric kernel density estimates and parametric approaches based on volatility forecasting models. The results

Acknowledgment

The author is grateful to Alberto Martín and ISFOC for kindly providing the data used in this paper and he is also grateful to Diego Pedregal and three anonymous reviewers for their constructive comments. This work was supported by the European Regional Development Fund and Spanish Government (MINECO/FEDER, UE) under the project with reference DPI2015-64133-R

References (38)

F.H. Al-Sadah et al.
Hourly solar radiation over Bahrain
Energy
(1990)
A. Baig et al.
A novel approach to estimate the clear day global radiation
Renew Energy
(1991)
J. Boland
Time-series analysis of climatic variables
Sol Energy
(1995)
T. Bollerslev
Generalized autoregressive conditional heteroskedasticity
J Econ
(1986)
Y. Chu et al.
Real-time prediction intervals for intra-hour DNI forecasts
Renew Energy
(2015)
M. Diagne et al.
Review of solar irradiance forecasting methods and a proposition for small-scale insular grids
Renew Sustain Energy Rev
(2013)
Z. Dong et al.
Short-term solar irradiance forecasting using exponential smoothing state space model
Energy
(2013)
S. Kaplanis
New methodologies to estimate the hourly global solar radiation; comparisons with existing models
Renew Energy
(2006)
B. Kraas et al.
Economic merits of a state-of-the-art concentrating solar power forecasting system for participation in the spanish electricity market
Sol Energy
(2013)
Y.S. Lee et al.
Empirical prediction intervals revisited
Int J Forecast
(2014)

G. Reikard

Predicting solar radiation at high resolutions: a comparison of time series forecasts

Sol Energy

(2009)

A. Sfetsos et al.

Univariate and multivariate forecasting of hourly solar radiation with artificial intelligence techniques

Sol Energy

(2000)

L. Tashman

Out-of-sample tests of forecasting accuracy: an analysis and review

Int J Forecast

(2000)

J.W. Taylor

Volatility forecasting with smooth transition exponential smoothing

Int J Forecast

(2004)

J.R. Trapero et al.

Short-term solar irradiation forecasting based on dynamic harmonic regression

Energy

(2015)

D. Yang et al.

Forecasting of global horizontal irradiance by exponential smoothing, using decompositions

Energy

(2015)

J. Bollinger

Bollinger on bollinger bands

(2001)

J. Boudoukh et al.

Investigation of a class of volatility estimators

J Deriv

(1997)

G.E.P. Box et al.

Time series analysis: forecasting and control

(1994)

Cited by (33)

Probabilistic-based electricity demand forecasting with hybrid convolutional neural network-extreme learning machine model
2024, Engineering Applications of Artificial Intelligence
Implementing key engineering solutions to optimise the operation of energy industries requires daily electricity demand forecasting and including uncertainty, to promote markets insight analysis as part of their strategic planning, regulating and supplying electricity to consumers. This paper proposes hybrid artificial intelligence models combining convolutional neural networks (CNN) as a feature extraction algorithm with extreme learning machines (ELM) as a framework to predict electricity demand with confidence intervals generated by Kernel Density Estimation (KDE) approaches. In order to develop CELM-KDE model, time-lagged series of daily electricity demand with local climate variables based on the air temperature, atmospheric vapour pressure, evaporation, solar radiation, humidity and sea level pressures are used to train the proposed CELM-KDE hybrid model. In order to fully evaluate the newly developed model from a point-based, as well as a probabilistic prediction strategy, the observed and predicted electricity demand as well as the probability distribution of errors are analysed using KDE method that operates without prior data distribution assumptions. Based on observed and predicted electricity demand and the relevant probabilistic confidence intervals generated by the CELM-KDE model, the final results show that the proposed method attains significantly better probability interval predictions than traditionally-used point-based models. The proposed CELM-KDE model is demonstrated to be highly effective in providing a comprehensive coverage of predicted errors, as well as providing greater insights into the average bandwidth and detailed predicted electricity demand in the testing stage. The results also indicate that the proposed hybrid model is a reliable decision support tool to develop engineering solutions in area of energy modelling, monitoring and forecasting, which could potentially be useful to the industry policymakers. We show that the point-and probabilistic-based electricity demand predictive models can be employed as an effective tool to improve accuracy of forecasting and provision of insights for national electricity markets and key energy industry stakeholder application tools.
Interval forecasting of photovoltaic power generation on green ship under Multi-factors coupling
2023, Sustainable Energy Technologies and Assessments
Shipboard photovoltaic power generation is affected by various factors, such as meteorological factors, navigation, and ship rolling. Traditional power prediction methods of the land-based grid do not apply to solar ships. Considering the unavoidable Machine Learning algorithm errors and solar energy fluctuations, an interval prediction framework based on a combination of neural network and kernel density estimation methods is proposed. Its generality is demonstrated by comparing different prediction engines. An improved Extreme Learning Machine approach is used to enhance model robustness, considering the computational speed constraints of online prediction. The improved ELM prediction method is at least 5.26% more accurate than the other forecast engines. Considering the sensitivity of the prediction results to the input data, the K-Means clustering is deployed to cluster the historical data to improve the forecast accuracy. The enhanced prediction framework performs well at different confidence levels (85%-98%). Through experimental verification and comparison with diverse state-of-art benchmarks, the effectiveness and stability of the method are proved. At a confidence level of 98%, the prediction interval average width of the proposed method is at least 26.38% smaller relative to other advanced interval prediction methods. It provides the potential to be applied in a ship’s power system.
Interval forecasting for urban water demand using PSO optimized KDE distribution and LSTM neural networks
2022, Applied Soft Computing
Citation Excerpt :
Yu et al. [33]proposed a KDE model with fixed bandwidth to estimate the distribution of wind power predictive errors. Trapero et al. [34] combined volatility forecasting model and the KDE method to improve the performance of solar irradiation PIs. However, a KDE model with optimized bandwidth is more suitable for estimating the PI than a single KDE method since the bandwidth is a key parameter to improve the fitting performance of PDF curves obtained by the KDE model [35].
The current literature on water demand forecasting mostly focuses on giving accurate point predictions of water demand. However, the water demand point forecasting will encounter uninformative and unreliable problems when the uncertainty level of data increases. To solve the above problem, a hybrid model (KDE-PSO-LSTM), which combines long short-term memory networks (LSTM) to kernel density estimation (KDE) optimized by using the particle swarm optimization (PSO) algorithm, is proposed to acquire the water demand prediction interval (PI) to quantify the likely uncertainties in the predictions. At first, the prediction errors are obtained by the difference between the real values of water demand and the predictive values based on the LSTM model. Then, a novel splitting strategy is proposed to divided point predictions into different levels to deal with the problem that it is difficult to fit the prediction errors of the whole water demand using a single probability density function (PDF). Next, the PSO is used to optimize the hyper-parameter of the KDE method for fitting the PDF curves of different levels prediction errors. Moreover, due to the irregular distribution of prediction errors, a search method called confidence-window shifting is presented to determine the optimal prediction error interval from the fitted PDF curves. After that, the upper bounds and the lower bounds of the best intervals of prediction errors are added to the point predictions to attain the final PI of urban water demand. Finally, to demonstrate the superiorities of the proposed model, the proposed KDE-PSO distribution is compared to other well-known distributions, i.e, the KDE distribution, the Beta-PSO distribution and the normal distribution. The experimental results show that the comprehensive performances of the PIs generated from the proposed KDE-PSO-LSTM model are better than that of KDE-PSO-BP, KDE-PSO-RNN, ND-LSTM, KDE-LSTM, Beta-PSO-LSTM and KDE-GA-LSTM. Therefore, it can be demonstrated that the KDE-PSO-LSTM model can provide reliable decision support to policy-makers for making the optimal water supplying management.
Prediction intervals estimation of solar generation based on gated recurrent unit and kernel density estimation
2021, Neurocomputing
With the increasing attention to the energy crisis and global warming, solar generation has become an important way to use clean solar energy and is playing an increasingly important role. Due to the highly-variable patterns of solar generation, the estimation of prediction intervals is receiving more attention, which is conducive to the safe and stable operation of the power system. In order to further improve the performance of prediction intervals of solar generation, this paper proposes a prediction intervals estimation method for solar generation based on gated recurrent unit (GRU) neural networks and kernel density estimation (KDE). GRU, a commonly used recurrent neural networks, is utilized to obtain the deterministic forecast of solar generation. In addition, according to the characteristics of solar generation, attention mechanism is designed on the GRU prediction model to further improve the prediction performance. Then, the KDE method is used to fit the prediction errors of solar generation obtained by the deterministic forecasting method. In order to verify the effectiveness of the proposed method, we have carried out a large number of experiments on freely available datasets. The experimental results show that the proposed method outperforms competing methods and can generate high-quality prediction intervals.
Adjusted combination of moving averages: A forecasting system for medium-term solar irradiance
2021, Applied Energy
Global Horizontal Irradiation forecasts are necessary for an efficient use of fluctuating energy output from photovoltaic plants. The purpose of this paper is to provide efficient, easy-to-implement Global Horizontal Irradiation forecasts on an hourly basis for a medium-term horizon (longer than 48 h). These forecasts are essential for the strategic deployment of tasks including unit commitment, transmission management, trading, hedging, planning, asset optimization, maintenance scheduling and spinning of power units. Nonetheless, forecasting models for medium term horizons are scarce. This work intends to bridge that gap by proposing a method based on an adjusted combination of moving averages which includes the yearly cycle in addition to the day–night cycle. This method is straightforward to implement and easy to integrate into corporate computing systems. We compare our approach with several well-known alternative methods, such as deep recurrent neural networks or autoregressive integrated moving average models, among others. The results show that recurrent neural networks are 10% more accurate than the second best method, and 90% better than our proposal for very short horizons (up to 5 h ahead). However, this advantage is lost when longer horizons are tested, especially for horizons longer than 30 h ahead, for which our method produces forecasts which are 20% more accurate than those made by recurrent neural networks. The reason behind that improvement at longer horizons is the inclusion of the yearly cycle in the proposed approach.
Impact of the forecast price on economic results for methanol production from olive waste
2021, Fuel
Citation Excerpt :
Forecasting is a vital task in energy applications, particularly, in a renewable energy context due to the lack of solutions for energy storage in power plants. See, for instance, some works related to wind energy [31] and solar energy [32–34]. Regarding price forecasting, literature is mainly focused on electricity time series [35,36] and crude oil [37], whereas, price forecasting of other crucial energy variables as Natural gas is less common [38].
The development of circular economies due to the limitation of natural resources is becoming a common strategy of paramount importance among different countries. In Spain, given the strategic nature of its olive industry, trying to value one of its main residuals (olive pomace) alone or together with other residues through its chemical transformation in methanol is a promising research line. One of the key variables that would make the investment advisable or not is the future value of the methanol price. However, most of the literature do not consider its future price volatility on the economic evaluation of the chemical processes. This work bridges that gap by proposing three econometric models based on Unobserved Components to forecast the methanol price over the life cycle plant. Those probabilistic forecasts feed a Monte Carlo simulation that provides an exhaustive investment risk assessment in terms of Net Present Value, Internal Rate of Return and Value at Risk metrics. The results showed the relationship between forecasting models and the investment profitability with an average Internal Rate of Return ranging from 23% to 31%. Additionally, the previous analysis was completed by adding other variables subject to uncertainty (olive pomace feed, capital investment, feedstock price, labor costs and discount rate). In this case, assuming a potential underestimation error up to 100% of the capital cost, the probability of obtaining a profitable investment was significantly reduced ranging the Value at Risk from 48% to 98%.

View all citing articles on Scopus

View full text

Calculation of solar irradiation prediction intervals combining volatility and kernel density estimates

Highlights

Abstract

Introduction

Section snippets

Prediction intervals

Case study data

Experimental setup

Combination of prediction intervals

Conclusions

Acknowledgment

Energy

Renew Energy

Sol Energy

J Econ

Renew Energy

Renew Sustain Energy Rev

Energy

Renew Energy

Sol Energy

Int J Forecast

Sol Energy

Sol Energy

Int J Forecast

Int J Forecast

Energy

Energy

Bollinger on bollinger bands

Investigation of a class of volatility estimators

J Deriv

Time series analysis: forecasting and control