1 Introduction

Probabilistic forecasts are required to quantify the inherent uncertainty associated with any prediction of the future [23, 45]. These probabilistic forecasts are crucial for many applications such as stabilising energy systems [11], managing congestion in traffic systems [39], or sizing servers of web applications to cope with a certain number of daily visits [36]. Despite this necessity for probabilistic forecasts, many modern forecasting methods still generate point forecasts [44]. Although many recent machine learning libraries offer support for probabilistic loss functions to simplify the generation of probabilistic forecasts, this may not be possible if an existing point forecast model that cannot easily be modified or retrained is already in use.

One solution to overcome this challenge is to generate probabilistic forecasts based on these existing point forecasts. For many years, such forecasts have been generated by analysing the residual errors of the point forecast. Based on these errors’ standard deviation or quantiles, prediction intervals can be calculated to generate probabilistic forecasts [29, 56]. Moreover, such probabilistic forecasts can be generated by using machine learning methods exploiting the residual errors [5, 54], by applying the Bayesian theory of probability to a point method [37], or by considering Monte-Carlo sampling methods [29]. Although these methods may be effective, they also have various limitations. For example, the prediction interval-based approaches can only generate prediction intervals as probabilistic forecasts, while machine learning methods depend on the point forecast and must be retrained if the point forecast is altered. Ideally, such probabilistic forecasts should be generated directly from arbitrary point forecasts and should not require retraining if the point forecast changes.

Therefore, in the present article, we present an approach that generates probabilistic forecasts from arbitrary point forecasts by using a Conditional Invertible Neural Network (cINN) to learn the underlying distribution of the time series data. Since time series have an inherent component of randomness [29], we propose using this uncertainty within the distribution of the time series data to generate probabilistic forecasts. However, the underlying system responsible for this uncertainty typically generates observations from an unknown probability distribution. Therefore, with our approach, we first map this unknown probability distribution of the underlying time series data to a known and tractable distribution by applying a cINN. Then, we use the output of a trained arbitrary point forecast method as an input to the trained cINN and consider the representation of this forecast in the known and tractable distribution. We then analyse the neighbourhood of this representation in the known and tractable distribution to quantify the uncertainty associated with the representation. Finally, we use the backward pass of the cINN to incorporate this uncertainty information into the forecast. In our approach, the cINN is trained independently of the point forecast and does not need to be retrained when the point forecast is altered.

Thus, the main contribution of the present article is twofold. First, we provide a novel approach for generating probabilistic forecasts from arbitrary point forecasts whose training is independent of the point forecast. Second, we empirically evaluate the approach using different data sets from various domains. In this empirical evaluation, we compare our approach to six probabilistic benchmarks, evaluate multiple metrics, and recreate the setting of the Global Energy Forecasting Competition 2014 (GEFCom2014).

The remainder of our article is structured as follows. First, we present related work and highlight the research gap that the present article addresses in Section 2. In Section 3, we then explain our approach in detail and highlight how we use a cINN to generate probabilistic forecasts from an arbitrary point forecast. We detail the experimental setup in Section 4, before presenting our results in Section 5. In Section 6 we discuss our evaluation and key insights. Finally, we conclude and suggest possible directions for future work in Section 7.

Table 1 An overview of previous research related to the present article. None of the identified articles proposes methods capable of generating probabilistic forecasts from existing point forecasts without being limited to only generating prediction intervals or involving a training process that is dependent on the training of the point forecast

2 Related work

Our article is closely related to two research fields: previous work that generates probabilistic forecasts based on point forecasts and previous work focusing on probabilistic forecasts using a cINN. Table 1 presents an overview of the identified related articles, and in this section, we present and discuss these articles in more detail and highlight the research gap the present article addresses.

Generating probabilistic forecasts from point forecasts Determining the uncertainty associated with a point prediction is one of the key research areas of uncertainty quantification [52]. Many methods focus on generating probabilistic prediction intervals from existing point forecasts by using the residual errors between the point forecast and the true value [29]. These prediction intervals can be generated by assuming a Gaussian distribution of the errors [29], using the empirical distribution of the errors [56], or considering nonconformity errors [53, 8, 59]. While effective, these methods are designed to generate prediction intervals rather than approximate the full probability distribution, which may be a limitation. Furthermore, if the point forecaster used is changed, new residual or nonconformity errors must be calculated to apply these methods. Although this calculation is not a retraining process, it does require additional effort.

Similar approaches also use residual errors in combination with further machine learning algorithms. [5], for example, train a neural network to forecast the standard deviation of the residual errors and generate probabilistic forecasts as realisations of a Gaussian distribution centred around the original point forecasts. Similarly, [54] use the residual errors from a point forecast to train a Generative Adversarial Network (GAN). This trained GAN is then used to generate multiple residual scenarios, which are combined with the point forecast to form probabilistic forecasts. The main limitation of both approaches is that the additional machine-learning models used to predict the uncertainty (i.e. standard deviation or residual scenarios) depend on the selected point forecast [5, 54]. Therefore, these machine-learning models must be retrained whenever the point forecast is altered.

Further approaches include a Bayesian method involving assumed priors [32], integrating uncertainty into the prediction via an ensemble of predictions [10], and considering uncertainty through Monte Carlo sampling approaches or similar [4]. The main limitation of these approaches, apart from the assumption regarding the Bayesian prior, is the computational complexity resulting from sampling or generating a large ensemble pool.

Probabilistic forecasts using cINNs To generate probabilistic forecasts, cINNs, also referred to as normalising flows [1], are combined with other machine learning methods. [2], for example, apply normalising flows to learn the parameters of Bernstein polynomials, which are, in turn, used to generate a probabilistic forecast. Moreover, [46] combine normalising flows with recurrent neural networks to generate probabilistic forecasts. Normalising flows are also combined with quantile regression networks and copulas [55], or used to generate a conditional approximation of a Gaussian mixture model [31] to improve the accuracy of the resulting probabilistic forecasts. Whilst these methods are all effective, normalising flows are used to enrich existing complex probabilistic forecasting methods, but not to provide probabilistic forecasts themselves.

An alternative method that directly uses normalising flows in the context of probabilistic forecasts is to learn multi-dimensional distributions of electricity price differences to predict the trajectory of intraday electricity prices [9]. Similarly, normalising flows may be applied multiple times to generate scenario-based probabilistic forecasts [15, 60, 21], or to generate a proxy for weather ensemble prediction systems based on numerical weather prediction models [18]. These methods use the generative nature of normalising flows to generate multiple predictions drawn from the same distribution. However, the forecasts are only probabilistic as an ensemble, with each individual forecast still being a point forecast. Furthermore, these forecasts assume that the underlying learned distribution remains constant and only partly consider external features. Finally, these methods all focus on directly generating probabilistic forecasts. Therefore, such methods cannot be applied to generate probabilistic forecasts from existing, well-designed point forecasts.

Research gap As shown in Table 1, we identify a lack of existing work that directly generates probabilistic forecasts from arbitrary point forecasts without the training process being dependent on this point forecast or being limited to only generating prediction intervals. In the present article, we aim to fill this research gap by presenting an easy-to-use approach described in the following section.

3 Generating probabilistic forecasts with a cINN

To generate probabilistic forecasts from arbitrary point forecasts, we directly exploit the uncertainty in the underlying time series. This uncertainty usually reflects the inherent randomness or unpredictability of the measured underlying system. However, this underlying system typically generates observations from an unknown distribution. Although this data is not purely random, the distribution is still unknown, and it is challenging to include the corresponding uncertainty directly in a forecast.

Fig. 1

Overview of the proposed approach. In the first step, an arbitrary point forecaster (a) and cINN (b) are trained independently, both considering exogenous features, past observations, or past historical data. To generate a probabilistic forecast, as shown in (c), the arbitrary point forecaster first generates a point forecast based on historical data and exogenous features. The resulting point forecast is combined with the exogenous features as inputs to a bijective mapping realised by the trained cINN. This mapping generates a representation of the forecast in a known and tractable distribution. We analyse the neighbourhood of this known and tractable representation to gain information about its uncertainty. Finally, we map this representation back to the unknown distribution to generate a probabilistic forecast

To solve this challenge, we aim to find a bijective mapping from the unknown distribution to a known and tractable distribution. Since many time series are affected by exogenous features such as weather, this bijective mapping should also be able to consider such exogenous features, as shown in Fig. 1. If such a mapping g exists, we can map a point forecast from the unknown distribution to its representation in a known and tractable distribution. In the known and tractable distribution, we can then analyse the neighbourhood of this representation and gain information about its uncertainty. Finally, we can map this uncertainty information back to the unknown distribution using the inverse mapping \(g^{-1}\) to generate probabilistic forecasts.

In this section, we demonstrate that this mapping g does exist under certain conditions, and we show how this mapping can be used to generate probabilistic forecasts. We then explain how to apply this approach with a cINN, starting with the training of this cINN, before describing how we generate probabilistic forecasts using arbitrary point forecasts.

3.1 Including uncertainty from the underlying distribution of the data

This section demonstrates that a bijective mapping from an unknown distribution to a known and tractable distribution exists. Given the existence of this mapping, we highlight the equivalence of the uncertainty in the image and the inverse image of the considered mapping. Finally, we describe how this mapping is realised with a cINN.

Bijective mapping To introduce the bijective mapping, let us consider a time series \(\textbf{y} = \{y_t\}_{t \in T}\) consisting of T observations as realisations of a random variable \(Y \sim f_Y(\textbf{y})\) with a probability density function (PDF) \(f_Y(\textbf{y})\) in the realisation space \(\mathbb {Y}\). Furthermore, we consider a bijective mapping \(g : \mathbb {Y} \rightarrow \mathbb {Z}\) from the realisation space \(\mathbb {Y}\) to the space of the tractable distribution \(\mathbb {Z}\), where \(\textbf{y} \mapsto g(\textbf{y}, \circ ) = \textbf{z}\) and g is a continuously differentiable function.Footnote 1 To calculate the PDF \(f_Z(\textbf{z})\) in terms of \(f_Y(\textbf{y})\), we can apply the change of variables formula [12, 42], i.e.

$$\begin{aligned} f_Z(\textbf{z}) = f_Y(g^{-1}(\textbf{z}, \circ )) \left| \det \left( \frac{\partial g^{-1}}{\partial \textbf{z}} \right) \right| , \end{aligned}$$
(1)

where \(\frac{\partial g^{-1}}{\partial \textbf{z}}\) is the Jacobian matrix. Since g is bijective, this equation describes a bijective mapping from the unknown distribution \(f_Y(\textbf{y})\) to the known and tractable distribution \(f_Z(\textbf{z})\). Therefore, the change of variable formula provides us with the required mapping.
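
To make the change of variables formula (1) concrete, the following minimal sketch checks it for a simple one-dimensional bijection, \(g(y) = \log (y)\), which maps a lognormal distribution to a standard Gaussian; the example is purely illustrative and not part of our approach.

```python
import numpy as np
from scipy import stats

# Toy check of the change of variables formula (1): for Y lognormal and the
# bijection g(y) = log(y), the transformed density is the standard Gaussian.
z = np.linspace(-3, 3, 7)
f_Y = stats.lognorm(s=1.0).pdf        # "unknown" distribution f_Y (known here for the check)
g_inv = np.exp                        # inverse mapping g^{-1}(z) = exp(z)
jac = np.exp(z)                       # |det(d g^{-1} / d z)| in the scalar case
f_Z = f_Y(g_inv(z)) * jac             # right-hand side of (1)
assert np.allclose(f_Z, stats.norm.pdf(z))   # matches the tractable target density
```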

Equivalence of uncertainty After introducing the bijective mapping, we need to show the equivalence of the uncertainty in the unknown distribution and the known and tractable distribution when applying (1). More specifically, we show the equivalence of quantiles in both the realisation space and the tractable distribution space, since quantiles serve as a non-parametric representation of the uncertainty.

To show this equivalence, we first consider the cumulative distribution function (CDF) of the random variable \(Z=g(Y,\circ ) \sim f_Z(\textbf{z})\), defined as

$$\begin{aligned} F_Z(\textbf{z}) = \int _{-\infty }^\textbf{z} f_Z(\textbf{u}) d\textbf{u}. \end{aligned}$$
(2)

If we use the expression for \(f_Z(\textbf{z})\) from the change of variables formula (1) in the definition of the CDF (2), we obtain

$$\begin{aligned} F_Z(\textbf{z}) = \int _{-\infty }^\textbf{z} f_Y(g^{-1}(\textbf{u}, \circ )) \left| \det \left( \frac{\partial g^{-1}}{\partial \textbf{u}} \right) \right| d\textbf{u}, \end{aligned}$$
(3)

describing the CDF \(F_Z(\textbf{z})\) in terms of the CDF \(F_Y(\textbf{y})\). Since g is, by definition, a continuously differentiable function, we can apply integration by substitution for multiple variables to rewrite (3) as

$$\begin{aligned} F_Z(\textbf{z}) = \int _{-\infty }^{g^{-1}(\textbf{z})} f_Y(\textbf{v}) d\textbf{v} = F_Y(g^{-1}(\textbf{z}, \circ )), \end{aligned}$$

which is simply the CDF of Y evaluated at the inverse of g. Further, the quantiles \(\textbf{z}_\alpha \) of Z are defined by the inverse of the CDF, i.e.

$$\begin{aligned} \textbf{z}_\alpha = F_Z^{-1}(\alpha )&= \inf \{ \textbf{z} \mid F_Z(\textbf{z}) \ge \alpha \} \\&= \inf \{ \textbf{z} \mid F_Y(g^{-1}(\textbf{z}, \circ )) \ge \alpha \} \end{aligned}$$

where \(\inf \) refers to the infimum, the smallest value of \(\textbf{z}\) that fulfils the condition, and \(\alpha \in [0,1]\) is the considered quantile. Consequently, if we know that the \(\alpha \) quantile of \(F_Z\) is \(\textbf{z}_\alpha \), then we can also calculate the \(\alpha \) quantile of \(F_Y\) as \(g^{-1}(\textbf{z}_\alpha , \circ ) = \textbf{y}_\alpha \). From this follows an equivalence between the quantiles of Z and the quantiles of Y, which implies an equivalence in the uncertainty.

Given the mathematical equivalence of the uncertainty in the two considered distributions, we can include uncertainty in a tractable and known distribution \(f_Z(\textbf{z})\) and use the inverse mapping \(g^{-1} : \mathbb {Z} \rightarrow \mathbb {Y}\) to map this uncertainty to the original distribution \(f_Y(\textbf{y})\).

Realising the bijective mapping To realise this bijective mapping g, we use a cINN [1, 27]. A cINN is a neural network that consists of multiple specially designed conditional affine coupling blocks [1]. As shown by [1], these coupling blocks ensure that the mapping \(g: \mathbb {Y} \rightarrow \mathbb {Z}\) learnt by the cINN is bijective. Furthermore, with the conditional information, the cINN is able to consider additional information, such as exogenous features, extracted statistical features from the time series, or calendar information, when learning the mapping [1, 27]. As a result, the cINN is designed to learn an approximation of \(f_Z(\textbf{z})\) and a mapping g, which is by definition bijective, thus ensuring we can apply (1) as described previously. Since cINNs are specifically designed to efficiently calculate the inverse of a function [1], a well-trained cINN should be capable of learning the bijective mapping g, even if this mapping is non-trivial.
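
For illustration, the following sketch shows one possible conditional affine coupling block in PyTorch, loosely following the GLOW-style coupling described by [1]; the layer sizes, the tanh clamping, and the interface are illustrative assumptions rather than the exact architecture used in our experiments.

```python
import torch
import torch.nn as nn

class ConditionalAffineCoupling(nn.Module):
    """One conditional affine coupling block: the first half of the input is left
    unchanged and parameterises an affine transformation of the second half."""

    def __init__(self, dim, cond_dim, hidden=64):
        super().__init__()
        self.d = dim // 2
        # Subnetwork predicting scale s and translation t from the first half and
        # the conditional input c (e.g. calendar and exogenous features).
        self.subnet = nn.Sequential(
            nn.Linear(self.d + cond_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 2 * (dim - self.d)),
        )

    def forward(self, y, c):
        y1, y2 = y[:, :self.d], y[:, self.d:]
        s, t = self.subnet(torch.cat([y1, c], dim=1)).chunk(2, dim=1)
        s = torch.tanh(s)                        # bound the scales for numerical stability
        z2 = y2 * torch.exp(s) + t               # affine transformation of the second half
        log_det_j = s.sum(dim=1)                 # log |det(dz/dy)| of this block
        return torch.cat([y1, z2], dim=1), log_det_j

    def inverse(self, z, c):
        z1, z2 = z[:, :self.d], z[:, self.d:]
        s, t = self.subnet(torch.cat([z1, c], dim=1)).chunk(2, dim=1)
        s = torch.tanh(s)
        y2 = (z2 - t) * torch.exp(-s)            # exact inverse of the forward pass
        return torch.cat([z1, y2], dim=1)
```

A complete cINN stacks several such blocks, typically with permutations in between; the log-determinants of the individual blocks sum to the \(\log \mid J \mid \) required by the loss introduced below.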

3.2 Applying our approach

In the following, we describe how we realise the inclusion of uncertainty with a cINN.Footnote 2 We first detail how we train a cINN that learns the distribution of the underlying data. Second, we describe how we use this trained cINN to generate probabilistic forecasts.

Training To apply our approach, firstly an arbitrary base point forecaster \(\mathcal {F}(\circ ,\psi )\) with trainable parameters \(\psi \) must be trained, as shown in Fig. 1a. However, since the cINN is trained on historical time series realisations and not on the point forecasts, the cINN is trained completely independently of the point forecaster and does not need to be retrained if the point forecast is altered. Furthermore, the cINN in our approach can be applied to any arbitrary point forecast, including a point forecast that has been previously trained and implemented. Since these arbitrary point forecasters may have very different training procedures, we refrain from a more detailed description of the training of the base point forecaster in the present article. As a result of this training, however, the base point forecaster \(\mathcal {F}(\circ ,\hat{\psi }_{\text {OPT}})\) has optimised parameters \(\hat{\psi }_{\text {OPT}}\).

We use a cINN to realise the continuously differentiable function g described previously. In addition to the original realisation \(\textbf{y}\), we also consider conditional information \(\textbf{c}\) as an input to the function g. This conditional information always includes calendar features, such as the time of day and the day of the week, but depending on the time series may also include additional exogenous features that are available for the forecast period. The calendar information extracted from the time series is necessary conditional information to account for the temporal dependencies of the time series, whilst the exogenous features are optional. Furthermore, statistical features extracted from the time series can also be included as conditional information. We train the cINN using past exogenous features and past observations, as shown in Fig. 1b. The aim of the training is to ensure that the cINN learns the function g, so that the resulting realisations \(\textbf{z}=g(\textbf{y},\textbf{c})\) follow a known and tractable latent space distribution \(f_Z(\textbf{z})\). In our approach, we define this known and tractable latent space distribution as a multi-dimensional Gaussian distribution, where the number of dimensions is equal to the forecast horizon. Therefore, we apply the change of variables formula to derive the loss function

$$\begin{aligned} \mathcal {L}_{\text {cINN}} = \mathbb {E}\left[ \frac{\parallel g(\textbf{y}; \textbf{c}, \theta )\parallel ^2_2 }{2} - \log \mid J \mid \right] + \lambda \parallel \theta \parallel ^2_2, \end{aligned}$$

where \(J = \det (\partial g / \partial \textbf{y})\) is the determinant of the Jacobian, \(\theta \) is the set of all trainable parameters, and \(\lambda \parallel \theta \parallel ^2_2\) is an L2 regularisation [1, 27].Footnote 3 Training a cINN with this loss function results in a network with the optimised parameters \(\hat{\theta }_{\text {OPT}}\) and ensures that the realised latent space distribution \(f_Z(\textbf{z})\) achieves the best possible approximation of the desired multi-dimensional Gaussian distribution [1].
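
Assuming a cINN whose forward pass returns the latent representation \(\textbf{z}\) together with \(\log \mid J \mid \), the loss above can be written compactly as in the following sketch; the regularisation weight lam is an illustrative placeholder.

```python
import torch

def cinn_loss(z, log_det_j, parameters, lam=1e-5):
    """Maximum-likelihood loss for a standard Gaussian latent prior plus L2 regularisation."""
    nll = (0.5 * z.pow(2).sum(dim=1) - log_det_j).mean()    # E[||z||^2 / 2 - log|J|]
    l2 = sum(p.pow(2).sum() for p in parameters)            # ||theta||^2_2
    return nll + lam * l2
```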

Algorithm 1

Forecasting with our approach.

Forecasting The process of generating probabilistic forecasts with our approach is shown in Algorithm 1. The process begins with the output of the trained base point forecaster

$$\begin{aligned} \hat{\textbf{y}} = \mathcal {F}(\textbf{y}_{\text {hist}},\textbf{c},\hat{\psi }_{\text {OPT}}). \end{aligned}$$

We then combine this output with the associated conditional information \(\textbf{c}\) and pass it through the trained cINN to obtain a latent space representation of the output, i.e.

$$\begin{aligned} \hat{\textbf{z}} = g(\hat{\textbf{y}},\textbf{c},\hat{\theta }_{\text {OPT}}). \end{aligned}$$

Given this latent space representation of the point forecast, we explore the uncertainty in the neighbourhood of the forecast with

$$\begin{aligned} \tilde{\textbf{z}}_i = \hat{\textbf{z}} + \textbf{r}_i, \quad i \in \{1,\dots , I\}, \quad \textbf{r}_i \sim \mathcal {N}(0, \sigma ). \end{aligned}$$
(4)

Using (4), we draw random noise \(\textbf{r}_i\) from a normal distribution with mean 0 and variance \(\sigma \) and add this noise to the realisation \(\hat{\textbf{z}}\). We define the variance \(\sigma \) used for the sampling process as the sampling hyperparameter, which must be selected in advance of generating a probabilistic forecast. Due to the equivalence of uncertainty in both spaces shown in Section 3.1, we can process this perturbed sample via a backward pass of the cINN, i.e.

$$\begin{aligned} \tilde{\textbf{y}}_i = g^{-1}(\tilde{\textbf{z}}_i, \textbf{c}, \hat{\theta }_{\text {OPT}}), \end{aligned}$$

to obtain a perturbed sample in the realisation space \(\tilde{\textbf{y}}_i\). Based on the selected \(\sigma \), we repeat the sampling process I times to obtain multiple realisations of \(\tilde{\textbf{z}}_i\) and, in turn, multiple realisations \(\tilde{\textbf{y}}_i\) that are all similar but not identical to the original forecast. If we combine all these samples in a set \(\hat{\mathcal {Y}}^{\sigma }\), i.e.

$$\begin{aligned} \hat{\mathcal {Y}}^{\sigma } = \bigcup _{i=1}^{I} \tilde{\textbf{y}}_i, \end{aligned}$$

then this set of realisations provides a representation of the uncertainty in the neighbourhood of the forecast, as schematically shown in Fig. 2 on the left. Given this set, there are multiple possibilities for generating a probabilistic forecast. We can use all the samples as an ensemble forecast, perform a density estimation over the samples to generate a distribution forecast, or calculate the quantiles of these samples. In the present paper, we calculate the quantiles of these samples in the original realisation space as schematically shown in Fig. 2 on the right. The resulting quantiles represent a probabilistic forecast \(\hat{\textbf{y}}_{\text {prob}}\), derived from the original arbitrary point forecast.
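
The following sketch summarises this forecasting procedure, assuming a trained cinn object with hypothetical forward and inverse methods; the interface, sample size, and quantile levels are illustrative rather than the exact implementation.

```python
import numpy as np

def probabilistic_forecast(y_hat, c, cinn, sigma, n_samples=100,
                           quantiles=(0.01, 0.25, 0.5, 0.75, 0.99)):
    """Sketch of Algorithm 1: perturb the latent representation of a point
    forecast and map the samples back to the realisation space."""
    z_hat = cinn.forward(y_hat, c)                  # latent representation of the point forecast
    samples = []
    for _ in range(n_samples):
        r = np.random.normal(0.0, np.sqrt(sigma), size=np.shape(z_hat))  # noise with variance sigma
        samples.append(cinn.inverse(z_hat + r, c))  # backward pass to the realisation space
    samples = np.stack(samples)                     # shape: (n_samples, forecast horizon)
    return np.quantile(samples, quantiles, axis=0)  # quantile forecast per time step
```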

Fig. 2

A schematic representation of the probabilistic forecast generated with our approach. Initially, a set of samples represents the uncertainty. In the present paper, we then calculate the quantiles of these samples to generate the probabilistic forecast \(\hat{\textbf{y}}_{\text {prob}}\). However, it would also be possible to consider all samples as an ensemble forecast or to perform a density estimation to obtain a distribution forecast

4 Experimental setup

This section describes the experimental setup we use to evaluate our approach. We first introduce the data used, before explaining the evaluation metrics. Furthermore, we describe the selected base forecasters used to generate the point forecasts, introduce the benchmarks we compare our approach to, and detail the implementation of the used cINN.

4.1 Data

We evaluate our proposed approach on four different openly available data sets. In this section, we briefly introduce each of these data sets before we describe their preprocessing.

The first considered data set is Electricity, namely the UCI Electricity Load DatasetFootnote 4 [13]. From this data set, we select the time series MT_158 and resample it to an hourly resolution.

The second data set, Price, contains zonal electricity price data recorded at a single location at an hourly resolution and taken from the electricity price track of the GEFCom2014 [28]. To evaluate our approach on a period longer than a single day, we combine data from all tasks in the GEFCom2014 price track.

Third, we consider a Solar data set which contains hourly real-world solar power generation from a solar plant in Australia. This data set is taken from the solar power forecasting track of the GEFCom2014 [28] and, again, we combine data from all tasks to enable evaluation on a period longer than a day.

The fourth considered data set, Bike, contains hourly records of rented bikes from the UCI Bikesharing Dataset [17, 13].Footnote 5

We normalise each of the above data sets before creating separate test, validation, and training subsets for the training and testing of our approach. An overview of these splits and the exogenous variables considered for each data set is presented in Table 15 in Appendix A.

4.2 Evaluation metrics

When evaluating probabilistic forecasts, it is important to consider both sharpness and calibration [23]. According to [23], probabilistic forecasts should aim to maximise sharpness subject to calibration. This aim implies, for example, that a prediction interval should be as narrow as possible while still maintaining coverage close to the nominal coverage rate. Probabilistic forecasts that are too sharp provide misleading information about the uncertainty present, whilst probabilistic forecasts that only focus on calibration may not be sharp enough to deliver any useful information [23]. With these considerations in mind, we aim to evaluate our approach comprehensively, considering sharpness, calibration, and the trade-off between the two, and therefore consider multiple evaluation metrics. In the following, we briefly present these metrics in the order they appear in the evaluation.

Continuous ranked probability score To evaluate the quality of the probabilistic forecasts, we consider the Continuous Ranked Probability Score (CRPS) [41]. The CRPS is a proper scoring rule that measures both the calibration and sharpness of a predictive cumulative distribution function F [22]. The CRPS is defined as

$$\begin{aligned} \text {CRPS}(F, y) = \int _{-\infty }^{\infty } \left( F(z) - \mathbb {1}\{y \le z\} \right) ^2 dz, \end{aligned}$$

where \(\mathbb {1}\{y \le z\}\) is the indicator function, which is one if \(y \le z\) and zero otherwise. Since our approach and the benchmarks provide samples drawn from a distribution, we use the sample-based variant of the CRPS implemented in the properscoring library.Footnote 6
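
As a usage illustration, the sample-based CRPS can be computed with the properscoring library roughly as follows; the array shapes are illustrative and not taken from our experiments.

```python
import numpy as np
import properscoring as ps

# Sample-based CRPS for an ensemble forecast.
y_true = np.array([1.0, 2.0, 3.0])                       # observations over a 3-step horizon
samples = np.random.normal(y_true, 0.5, size=(100, 3))   # 100 ensemble members per step
crps_per_step = ps.crps_ensemble(y_true, samples.T)      # members expected along the last axis
mean_crps = crps_per_step.mean()                         # average CRPS over the horizon
```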

Quantile deviation To analyse the calibration of our forecasts, we consider the deviation of the forecast quantiles from the theoretical quantiles. We define the quantile deviation for the \(\alpha \)-quantile as

$$\begin{aligned} \text {QD}_\alpha = \frac{1}{n} \sum \limits _{i=1}^n \mathbb {1}\{y_i \le \hat{y}_{i,\alpha }\} - \alpha , \end{aligned}$$

where \(\hat{y}_{i,\alpha }\) is the \(\alpha \)-quantile forecast, \(y_i\) the true value, and \(\mathbb {1}\{\cdot \}\) the indicator function. Ideally, QD\(_\alpha \) should be zero for all values of \(\alpha \). A positive value indicates that the quantile forecast overestimates the theoretical quantile, whilst a negative value indicates that the quantile forecast underestimates it. To account for the total quantile deviation across all considered quantiles \(\alpha \in Q\), we calculate the Mean Absolute Quantile Deviation (MAQD), defined as

$$\begin{aligned} \text {MAQD} = \frac{1}{\mid Q \mid } \sum _{\alpha \in Q} \mid \text {QD}_\alpha \mid . \end{aligned}$$
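
A minimal sketch of this calibration metric, assuming the quantile forecasts are stored per quantile level and following the coverage-based definition above:

```python
import numpy as np

def maqd(y_true, quantile_forecasts, quantiles):
    """Mean absolute quantile deviation; `quantile_forecasts[alpha]` holds the
    alpha-quantile forecast per time step (an assumed layout)."""
    deviations = []
    for alpha in quantiles:
        coverage = np.mean(y_true <= quantile_forecasts[alpha])  # empirical coverage of alpha
        deviations.append(abs(coverage - alpha))                 # |QD_alpha|
    return np.mean(deviations)
```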

Normalised prediction interval width To measure the sharpness of the probabilistic forecasts, we consider the normalised Mean \(\beta \)–PI Width (nMPI(\(\beta \))). The nMPI(\(\beta \)) is defined as

$$\begin{aligned} \text {nMPI}(\beta ) = \frac{1}{\bar{y}} \left( \frac{1}{n} \sum \limits _{i=1}^n |\hat{y}_{i,\frac{1+\beta }{2}} - \hat{y}_{i,\frac{1-\beta }{2}}| \right) , \end{aligned}$$

where \(\hat{y}_{i,\frac{1+\beta }{2}}\) is the predicted upper quantile, \(\hat{y}_{i,\frac{1-\beta }{2}}\) the predicted lower quantile for the forecast value \(\hat{y}_i\), and \(\bar{y}\) the mean of the target time series. We consider the nMPI(\(\beta \)) to enable a comparison between different data sets.
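
In code, the nMPI(\(\beta \)) reduces to the following minimal sketch, with illustrative argument names:

```python
import numpy as np

def nmpi(y_true, lower, upper):
    """Normalised mean prediction interval width for one coverage level beta, given
    the lower and upper quantile forecasts per time step."""
    return np.mean(np.abs(upper - lower)) / np.mean(y_true)
```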

Winkler score To jointly assess calibration and sharpness, we assess the quality of the prediction intervals of our probabilistic forecasts with the Winkler score [57]. As defined by [29], if the \(100\cdot (1-\alpha )\%\) prediction interval for observation i is given as \([\ell _{\alpha ,i}, u_{\alpha ,i}]\), then the Winkler score for the \(\alpha \)-quantile is defined as

$$\begin{aligned} {\text {W}}_{\alpha ,i} = {\left\{ \begin{array}{ll} (u_{\alpha ,i} - \ell _{\alpha ,i}) + \frac{2}{\alpha } (\ell _{\alpha ,i} - y_i) &{} y_i < \ell _{\alpha ,i} \\ (u_{\alpha ,i} - \ell _{\alpha ,i}) &{} \ell _{\alpha ,i} \le y_i \le u_{\alpha ,i} \\ (u_{\alpha ,i} - \ell _{\alpha ,i}) + \frac{2}{\alpha } (y_i - u_{\alpha ,i}) &{} y_i > u_{\alpha ,i}, \end{array}\right. } \end{aligned}$$

where \(y_i\) is the true value. In this manner, a Winkler score without any violations is simply the width of the prediction interval, whilst true values falling outside the prediction interval are penalised. Therefore, low Winkler scores suggest narrow but reasonably calibrated prediction intervals. In our evaluation, we consider the Mean Winkler (MW) score across all considered quantiles \(\alpha \in Q\), defined as

$$\begin{aligned} {\text {MW}} = \frac{1}{n \mid Q \mid } \sum _{\alpha \in Q} \sum _{i=1}^n {\text {W}}_{\alpha ,i}. \end{aligned}$$
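
A minimal sketch of the Winkler score and its mean over several interval levels, assuming the prediction intervals are stored as lower and upper arrays per level:

```python
import numpy as np

def winkler_score(y, lower, upper, alpha):
    """Winkler score per observation for a 100*(1 - alpha)% prediction interval."""
    width = upper - lower
    penalty = np.where(y < lower, 2.0 / alpha * (lower - y),
                       np.where(y > upper, 2.0 / alpha * (y - upper), 0.0))
    return width + penalty

def mean_winkler(y, intervals):
    """Mean Winkler score over all levels; `intervals[alpha]` is a (lower, upper) pair."""
    scores = [winkler_score(y, lo, up, alpha).mean() for alpha, (lo, up) in intervals.items()]
    return np.mean(scores)
```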

Pinball loss improvement To evaluate our approach in the recreated GEFCom2014 competition, we consider the scoring mechanism used in this competition. This mechanism relies on the Pinball Loss (PL), a scoring rule that minimises the loss when issuing a point forecast for the \(\alpha \)-quantile [24]. For a set of considered quantiles \(Q=[0.01,\dots ,0.99]\), the PL is calculated with

$$\begin{aligned} {\text {PL}} = \frac{1}{n\mid Q \mid }\sum _{\alpha \in Q} \sum \limits _{i=1}^n {\left\{ \begin{array}{ll} (y_i - \hat{y}_{i,\alpha }) \cdot \alpha \quad y_i \ge \hat{y}_{i,\alpha } \\ (\hat{y}_{i, \alpha }-y_i) \cdot (1-\alpha ) \quad \hat{y}_{i, \alpha } > y_i, \end{array}\right. } \end{aligned}$$

where \(y_i\) is the true value and \(\hat{y}_{i,\alpha }\) is the quantile forecast for the quantile \(\alpha \). For the GEFCom2014, the relative improvement of the PL compared to a given baseline forecast is considered, i.e.

$$\begin{aligned} {\text {PL}}_{\%} = \frac{{\text {PL}}_{\text {Forecast}}}{{\text {PL}}_{\text {Baseline}}}\cdot 100, \end{aligned}$$

where \({\text {PL}}_{\text {Forecast}}\) is the PL for the considered forecast and \({\text {PL}}_{\text {Baseline}}\) the PL for the baseline provided in the GEFCom2014.
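
The pinball loss and the relative score can be sketched as follows, assuming the quantile forecasts are stored per quantile level; the GEFCom2014 baseline forecasts are assumed to be given.

```python
import numpy as np

def pinball_loss(y_true, quantile_forecasts, quantiles):
    """Average pinball loss over all quantiles in Q and all time steps;
    `quantile_forecasts[alpha]` holds the alpha-quantile forecast per time step."""
    losses = []
    for alpha in quantiles:
        diff = y_true - quantile_forecasts[alpha]
        losses.append(np.mean(np.where(diff >= 0, diff * alpha, -diff * (1 - alpha))))
    return np.mean(losses)

# Relative score used for the GEFCom2014 ranking (baseline forecasts assumed given):
# pl_percent = 100 * pinball_loss(y, forecast_q, Q) / pinball_loss(y, baseline_q, Q)
```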

4.3 Selected base forecasters

Our proposed approach can be applied to forecasts from arbitrary point forecasters. Thus, we evaluate our approach on four simple and two state-of-the-art point forecasting methods. As simple base point forecasters we consider a Linear Regression (LR), a Random Forest (RF), a Feed-Forward Neural Network (NN), and the eXtreme Gradient Boosting (XGBoost) Regressor. We select these methods due to their robust performance in multiple studies, e.g. [16, 19, 47, 49, 50], and [51]. The two state-of-the-art base point forecasters are Neural Hierarchical Interpolation for Time Series Forecasting (N-HiTS) [6] and Temporal Fusion Transformer (TFT) [38]. We provide implementation details for each of the selected base point forecasters in Table 16 in Appendix A.

When applying the base forecasters with the cINN to generate probabilistic forecasts, we manually select the sampling parameter \(\sigma \) that minimises the CRPS on the validation data set. An overview of the selected sampling parameters is presented in Table 17 in Appendix A. Furthermore, all selected base point forecasters are implemented in a pipeline with pyWATTSFootnote 7 [26].

4.4 Probabilistic benchmarks

To assess the quality of the probabilistic forecasts generated with our approach, we compare them to multiple probabilistic benchmarks. These benchmarks can be classified into the following two groups: probabilistic forecasts generated from existing point forecasts and directly generated probabilistic forecasts. We focus on a selection of benchmarks that have achieved state-of-the-art performance whilst being relatively computationally inexpensive and therefore exclude computationally expensive benchmarks that may generalise poorly, such as Bayesian Neural Networks [30]. In the following, we introduce the benchmarks of both groups.

4.4.1 Probabilistic forecasts based on existing point forecasts

The first group of probabilistic benchmarks considers methods that generate probabilistic forecasts from existing point forecasts. All of these benchmarks operate on a similar principle. They consider the empirical errors

$$\begin{aligned} \epsilon _i = \mid \hat{y}_i - y_i \mid , \end{aligned}$$

between the point forecasts \(\hat{y}_i\) and true values \(y_i\) on a validation data set. These empirical errors are then used to generate prediction intervals. The benchmarks differ in how these empirical errors are used to generate prediction intervals.Footnote 8

The first considered benchmark is the Gaussian Prediction Interval (Gaussian PI). In this case, the empirical errors are assumed to be distributed according to a Gaussian distribution and the prediction intervals are calculated based on the standard deviation of these errors [29].

Second, we consider the Empirical Prediction Interval (Empirical PI). This benchmark does not assume any parametric distribution but instead uses the empirical distribution of these empirical errors to calculate the prediction intervals [56].

Finally, we consider a Conformal Prediction Interval (Conformal PI). This benchmark, introduced for multi-horizon time series forecasts by [53], calculates a critical nonconformity score for each of the empirical errors and applies Bonferroni and finite sample correction to ensure temporal dependence between these critical scores across the forecast horizon. These critical nonconformity scores are combined with the point forecast to generate the prediction intervals [53].
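
To illustrate the shared principle of these benchmarks, the following sketch derives the Gaussian PI and the Empirical PI from validation errors; the Conformal PI with its Bonferroni and finite sample correction is omitted for brevity, and the exact implementations used in our experiments may differ.

```python
import numpy as np
from scipy import stats

def gaussian_pi(point_forecast, val_errors, beta):
    """Gaussian PI: assume Gaussian errors and scale the interval by the
    standard deviation of the validation errors."""
    z = stats.norm.ppf(0.5 + beta / 2.0)                 # upper critical value for a central interval
    half_width = z * np.std(val_errors)
    return point_forecast - half_width, point_forecast + half_width

def empirical_pi(point_forecast, val_errors, beta):
    """Empirical PI: use the empirical beta-quantile of the absolute validation
    errors as a symmetric interval half-width (one possible construction)."""
    half_width = np.quantile(np.abs(val_errors), beta)
    return point_forecast - half_width, point_forecast + half_width
```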

4.4.2 Direct Probabilistic Forecasts

The second group of probabilistic benchmarks considers methods that directly generate probabilistic forecasts. The first of these benchmarks is DeepAR [48], which is an autoregressive recurrent neural network-based approach for probabilistic forecasting. We implement DeepAR using the PyTorch Forecasting libraryFootnote 9.

The second benchmark method is a Quantile Regression Neural Network (QRNN). It trains a NN to directly forecast selected or multiple quantiles instead of the mean or median [35]. To realise the QRNN, we use a separate simple feed-forward NN to forecast each of the selected quantiles, training each NN with the appropriate pinball loss function. The QRNN is implemented using TensorFlowFootnote 10 with the KerasFootnote 11 library and the pinball loss function.
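
As an illustration of this training scheme, a pinball loss for a fixed quantile can be defined as a custom Keras loss roughly as follows; the network configuration used in our experiments may differ.

```python
import tensorflow as tf

def keras_pinball_loss(alpha):
    """Pinball loss for a fixed quantile alpha, usable as a custom Keras loss."""
    def loss(y_true, y_pred):
        error = y_true - y_pred
        return tf.reduce_mean(tf.maximum(alpha * error, (alpha - 1.0) * error))
    return loss

# Example: one feed-forward network per quantile, each compiled with its own loss, e.g.
# model.compile(optimizer="adam", loss=keras_pinball_loss(0.9))
```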

The third benchmark method uses the Nearest Neighbour Quantile Filter (NNQF) proposed by [25]. Similar to the QRNN, this method also forecasts quantiles. However, instead of using a custom quantile loss function to directly learn the quantiles, the NNQF finds similar values for each time step based on similarity in the target variable to determine quantiles in the data. A forecasting method is then trained to predict these calculated quantiles [25]. To realise the NNQF, we use a multi-layer feed-forward NN with one output per quantile, which is implemented using sklearn [43] and pyWATTS [26].

4.5 Used cINN

In the evaluation, we use the same cINN architecture (see Table 18 in Appendix A) for each of the considered data sets. It is based on GLOW coupling layers that consider conditional input [34]. Similar to [1, 27], the conditional input is provided by a fully connected NN, which uses the same exogenous information available to the base forecaster as conditional information (see Table 15). We detail the implementation information for the used cINN in Tables 18, 19 and 20 in Appendix A. When training the used cINN, we apply the Adam optimiser with a maximum of 100 epochs. Furthermore, when sampling in the latent space to generate probabilistic forecasts, we consider a sample size of 100. We implement the cINN in a pipeline with pyWATTS [26].

Fig. 3

An overview of the normalised evaluation metrics for each base point forecaster when combined with the cINN for each considered evaluation metric across all data sets. The value of each metric is normalised between 0.1 and 1 for illustrative purposes to facilitate the comparison, with lower values indicating better performance. The base point forecasters are ranked from best to worst for each metric

5 Evaluation

We evaluate our proposed approach in three steps. First, we compare the probabilistic forecasts generated from our approach when using different base point forecasters. Second, we compare our approach with existing probabilistic benchmarks. For each of these two steps, we first consider an overview of the normalised evaluation metrics for each model across all metrics and all data sets to establish an overview of the results. We then consider the probabilistic forecasts’ quality, calibration, sharpness, and prediction intervals separately. Finally, in the third step, we recreate the price track of GEFCom2014 and compare our approach to the competition winners.

5.1 Comparison of different base point forecasters

In this section, we compare the performance of the different base point forecasts combined with the cINN in our approach. An overview of the normalised evaluation metrics for each base point forecaster when combined with the cINN for all considered evaluation metrics and data sets is shown in Fig. 3. For comparison purposes, the performance in each of the considered metrics is normalised so that the best-performing base point forecaster achieves a score of 0.1 and the worst-performing base point forecaster a score of 1. We observe that the best-performing base forecaster depends on the considered metric and the data set. However, the TFT performs consistently well according to all metrics across all data sets by ranking within the top three in all cases apart from the CRPS and nMPI(\(\beta \)) for both \(\beta \) on the Electricity data set. Furthermore, certain base point forecasters exhibit highly variable performance. For example, the LR achieves the best performance with regards to MAQD on the Price data set but consistently performs as one of the worst models on all other data sets. In the following, we report the results in more detail by comparing the forecast quality, the calibration, the sharpness, and the prediction intervals.

Quality For each of the base point forecasters combined with the cINN, we report the average CRPS across five runs in Table 2.

Table 2 The average CRPS calculated on the test data set for each of the considered base point forecasters combined with the cINN over five runs. The best values for each data set are highlighted in bold

We observe that the best-performing point forecaster combined with the cINN again depends on the data set considered, although one of the base point forecasters combined with the cINN results in the lowest CRPS on two of the four data sets. Furthermore, although all base point forecasters combined with the cINN perform similarly on the Price data set, there are noticeable differences in the results on the other data sets. For example, the N-HiTS and TFT point forecasters combined with the cINN result in a noticeably lower CRPS on the Bike data set than the other point forecasters when combined with the cINN.

Calibration To analyse the calibration of the probabilistic forecasts generated by combining different point forecasters with the cINN, we report the average MAQD across five runs in Table 3.

Table 3 The average MAQD between the theoretical and forecast quantiles calculated on the test data set for each of the considered base point forecasters combined with the cINN over five runs. The best values for each data set are highlighted in bold

We observe that the base forecaster that results in the lowest MAQD when combined with the cINN depends on the data set considered. However, XGBoost, when combined with the cINN, achieves the lowest MAQD on two of the four data sets. Furthermore, the MAQD varies noticeably between the data sets. On the Electricity data set, almost all base point forecasters achieve a similar MAQD when combined with the cINN. However, on the Bike data set, only the TFT combined with the cINN results in a MAQD under 0.1.

Sharpness We evaluate the sharpness of the probabilistic forecasts generated when combining different base learners with the cINN by reporting the average nMPI(\(\beta \)) over five runs in Table 4.

Table 4 The average nMPI(\(\beta \)) calculated on the test data set for each of the considered base point forecasters combined with the cINN for \(\beta =98\%\) and \(\beta =70\%\) over five runs. The best values for each data set and \(\beta \) are highlighted in bold

Depending on the considered data set, we observe that different base point forecasters, when combined with the cINN, result in the best nMPI(\(\beta \)). Only the RF, when combined with the cINN, achieves the best nMPI(\(\beta \)) on more than one data set, whilst the TFT as base point forecaster performs best on the Price data set, and N-HiTS as a base forecaster on the Bike data set. We observe varying nMPI(\(\beta \))s across the data sets, with the largest nMPI(\(\beta \)) of 1.2953 for \(\beta =98\%\) on the Electricity data set and the narrowest nMPI(\(\beta \)) of 0.1329 for \(\beta =70\%\) on the Price data set.

Prediction intervals To evaluate calibration and sharpness at the same time, we report the average MW score across five runs as a measure of the quality of the prediction intervals generated by different point forecasters in Table 5.

Again, the performance varies depending on the considered data set. Probabilistic forecasts generated when combining the TFT with the cINN result in the lowest MW score for the Price and Bike data sets, and RF as the base point forecaster results in the lowest MW score for the Electricity and Solar data sets. Regarding the MW scores, the performance varies noticeably depending on which base point forecaster is used on all data sets, although this variance is smaller on the Price data set.

5.2 Comparison to benchmarks

In the second step of our evaluation, we compare probabilistic benchmarks with the probabilistic forecasts generated when combining a cINN with the XGBoost, N-HiTS, and TFT base point forecasters. First, we compare the probabilistic forecasts from our approach to benchmarks that also use these same point forecasters to generate probabilistic forecasts. Second, we compare our approach to methods that directly generate probabilistic forecasts.

Table 5 The average MW score calculated on the test data set for each of the considered base point forecasters combined with the cINN over five runs. The best values for each data set are highlighted in bold
Fig. 4

An overview of the normalised evaluation metrics for the TFT base forecaster when combined with the cINN or used with the benchmarks based on existing point forecasts for each considered evaluation metric across all data sets. The value of each metric is normalised between 0.1 and 1 for illustrative purposes to facilitate the comparison, with lower values indicating better performance. The considered models are ranked from best to worst for each metric

5.2.1 Probabilistic forecasts based on existing point forecasts

We report an overview of the normalised evaluation metrics for the TFT as the base point forecaster when combined with the cINN and the other benchmarks based on existing point forecasts for all considered evaluation metrics and data sets in Fig. 4. For comparison purposes, the performance in each of the considered metrics is normalised so that the best-performing model achieves a score of 0.1 and the worst-performing model a score of 1. We first observe that for CRPS, MW, and nMPI(\(\beta \)) with \(\beta =98\%\), our approach using the cINN results in the best performance on all data sets. Additionally, our approach also achieves the lowest nMPI(\(\beta \)) for \(\beta =70\%\) on the Price and Bike data sets and only performs slightly worse than Conformal PI and Empirical PI on the remaining data sets. The benchmarks only perform better than our approach in terms of MAQD, with our approach never achieving the lowest MAQD. The results for the two remaining base point forecasters, namely XGBoost and N-HiTS, are similar and can be found in Appendix C. In the remainder of this section, we analyse quality, calibration, sharpness, and the prediction intervals for each considered data set in more detail.

Quality We evaluate the quality of the different probabilistic forecasts by reporting the average CRPS across five runs in Table 6.

Table 6 A comparison of the average CRPS when generating probabilistic forecasts based on existing point forecasts. The average CRPS is calculated across five runs on the test data set, and the best values for each base point forecaster in each data set are highlighted in bold

First, our approach using a cINN generally performs better than or similarly to the benchmarks. The cINN results in probabilistic forecasts with the lowest CRPS for each considered point forecaster on the Price and Solar data sets and for two of the three point forecasters on the Electricity and Bike data sets. In the remaining two cases, the Conformal PI results in the lowest CRPS; however, the CRPS resulting from our approach is similar in both cases. Second, we observe that the Gaussian PI consistently results in the highest CRPS. Finally, we note that, across all data sets and for all considered point forecasters, the Empirical PI generates probabilistic forecasts resulting in almost identical CRPSs to those from the Conformal PI.

Calibration To evaluate the calibration of the considered probabilistic forecasts, we report the average MAQD for each data set calculated across five runs in Table 7.

Table 7 Comparison of the average MAQD when generating probabilistic forecasts based on existing point forecasts. The average MAQD is calculated using five runs on the test data set, and the best values for each base point forecaster in each data set are highlighted in bold

We first observe that the results depend strongly on the base point forecaster and the data set considered. Whilst the Conformal PI results in the lowest MAQD for all base point forecasters on the Electricity data set, the results for the other data sets are not as clear. On the Price and Solar data sets, the Conformal PI and Empirical PI achieve the lowest MAQD depending on the considered point forecaster, whilst each of the three considered benchmarks performs best on the Bike data set for one of the applied base point forecasters. Our approach using the cINN never achieves the lowest MAQD.

Sharpness To assess the sharpness of probabilistic forecasts generated from point forecasts, we report the average nMPI(\(\beta \)) over five runs in Table 8.

Table 8 Comparison of the average nMPI(\(\beta \)) when generating probabilistic forecasts based on existing point forecasts for \(\beta =98\%\) and \(\beta =70\%\). The average nMPI(\(\beta \)) is calculated using five runs on the test data set, and the best values for each base point forecaster and \(\beta \) in each data set are highlighted in bold

Our approach using a cINN results in the lowest nMPI(\(\beta \)) for almost all base point forecasters, considered values of \(\beta \), and data sets, with the exceptions being \(\beta =70\%\) on the Electricity data set with the TFT and on the Solar data set with XGBoost and the TFT. Moreover, the nMPI(\(\beta \)) from the Empirical PI and Conformal PI are generally the largest for \(\beta =98\%\), and noticeably so. For example, the nMPI(\(\beta \)) for \(\beta =98\%\) for the Conformal PI is, across all data sets, often more than double the nMPI(\(\beta \)) generated with the cINN. Further, the nMPI(\(\beta \)) for \(\beta =70\%\) are generally the largest with the Gaussian PI. Finally, the nMPI(\(\beta \)) vary noticeably depending on the data set and selected point forecaster.

Prediction intervals To simultaneously consider calibration and sharpness, we analyse the prediction intervals of the considered probabilistic forecasts by comparing the average MW scores for each data set calculated over five runs in Table 9.

Table 9 Comparison of the average MW scores when generating probabilistic forecasts based on existing point forecasts. The average MW score is calculated across five runs on the test data set, and the best values for each base point forecaster in each data set are highlighted in bold
Fig. 5

An overview of the normalised evaluation metrics for our approach combining the cINN with three base point forecasters (XGBoost, N-HiTS, TFT) and the direct probabilistic benchmarks for each considered evaluation metric across all data sets. The value of each metric is normalised between 0.1 and 1 for illustrative purposes to facilitate the comparison, with lower values indicating better performance. The considered models are ranked from best to worst for each metric

We first note that the probabilistic forecasts generated with the cINN result in the lowest MW scores on all data sets and for all considered point forecasters. Furthermore, the Winkler scores from our approach are noticeably smaller than those of the benchmarks. Although all the prediction interval-based benchmarks generate probabilistic forecasts with similar Winkler scores, the Gaussian PI results in slightly lower Winkler scores on all data sets, with the difference being most noticeable on the Price data set. Finally, we observe that, similar to the CRPS results, the MW scores for the Empirical PI and Conformal PI are almost identical for every data set and each considered point forecaster, with the Conformal PI sometimes performing slightly better.

5.2.2 Direct probabilistic forecasts

An overview of the normalised evaluation metrics for our approach using a cINN combined with three base point forecasters (XGBoost, N-HiTS, TFT) and the three considered benchmarks that directly generate probabilistic forecasts for all considered evaluation metrics and data sets is shown in Fig. 5. For comparison purposes, the performance in each of the considered metrics is normalised so that the best-performing model achieves a score of 0.1 and the worst-performing model a score of 1. We observe that combining an appropriate base point forecaster with the cINN results in the best-performing model for all data sets and across all metrics in all but two cases. The exceptions are the MAQD on the Price data set, where the NNQF performs best, and the CRPS on the Solar data set, where the QRNN provides the best performance. Furthermore, regarding MW, all three of the base point forecasters, when combined with the cINN, perform better than all of the direct benchmarks on all data sets. Finally, our approach performs consistently across all metrics and data sets, only once achieving the worst ranking, whilst the performance of the direct benchmarks is far more variable, with each of them being ranked worst at least three times. To evaluate the performance in more detail, we consider the forecast quality, calibration, sharpness and prediction intervals for each of the direct benchmarks and our approach with the cINN on the four considered data sets in the following.

Quality To analyse the quality of the probabilistic forecasts, we report the average CRPS across five runs for all data sets in Table 10.

Table 10 Comparison of the average CRPS between the probabilistic forecasts from the cINN and the direct probabilistic benchmarks. The average CRPS is calculated over five runs on the test data set, and the best values for each data set are highlighted in bold

The first observation is that our approach results in the lowest CRPS on three of the four data sets. The choice of the base point forecaster is important, however, with XGBoost combined with the cINN performing best on the Electricity data set, whilst the TFT combined with the cINN performs best on the Price and Bike data sets. On the Solar data set, the QRNN benchmark outperforms all others, although our approach using XGBoost as a base point forecaster performs similarly. Furthermore, when combined with the cINN, we observe that all base point forecasters outperform all the direct benchmarks on the Price data set, and two of the three base point forecasters outperform all the direct benchmarks on the Electricity data set. In general, the performance of the direct benchmarks is also highly dependent on the considered data set. Of the direct benchmarks, the NNQF performs best for the Electricity data set, the QRNN for the Price and Solar data sets, and DeepAR for the Bike data set.

Calibration To assess the calibration of the direct probabilistic benchmarks and our approach, we report the average MAQD across five runs in Table 11.

Similar to the CRPS results, our approach using a cINN results in the lowest deviation on three of the four data sets. However, unlike the CRPS results, our approach using the cINN results in the lowest MAQD on the Electricity, Solar, and Bike data sets, whilst the NNQF achieves the lowest overall MAQD on the Price data set. With regards to the direct benchmarks, the NNQF outperforms the other direct benchmarks on all data sets except for the Solar data set, where the lowest MAQD is achieved with the QRNN.

Table 11 Comparison of the average MAQD between the theoretical and forecast quantiles from the cINN and the direct probabilistic benchmarks. The average MAQD is calculated across five runs on the test data set, and the best values for each data set are highlighted in bold

Sharpness To compare the sharpness of probabilistic forecasts generated with our approach and those from the direct benchmarks, we report the average nMPI(\(\beta \)) over five runs in Table 12.

Table 12 Comparison of the average nMPI(\(\beta \)) between forecasts from the cINN and the direct probabilistic benchmarks for \(\beta =98\%\) and \(\beta =70\%\). The nMPI(\(\beta \)) is calculated across five runs on the test data set, and the best values for each data set and \(\beta \) are highlighted in bold

With regards to the nMPI(\(\beta \)), our approach results in the smallest nMPI(\(\beta \)) for all data sets. Using the XGBoost as a base point forecaster generates the narrowest prediction intervals for the Electricity data set and the lowest nMPI(\(\beta \)) for \(\beta =70\%\) for the Solar data set. Additionally, using the TFT as a base point forecaster results in the narrowest prediction intervals for the Price data set and the lowest nMPI(\(\beta \)) for \(\beta =98\%\) for the Solar data set. Finally, using N-HiTS as a base point forecaster results in the lowest nMPI(\(\beta \)) for the Bike data set. The width of the prediction intervals for the direct probabilistic benchmark depends on the data set. For the Electricity and Price data sets, DeepAR generates probabilistic forecasts with the lowest nMPI(\(\beta \)). However, for the Solar data set, the nMPI(\(\beta \)) from the QRNN is the smallest. The Bike data set is interesting for the benchmarks since the nMPI(\(\beta \)) with \(\beta =98\%\) is the smallest for DeepAR, but the nMPI(\(\beta \)) with \(\beta =70\%\) is the smallest for the QRNN.

Prediction intervals To evaluate calibration and sharpness simultaneously, we consider the quality of the prediction intervals generated with our approach and the direct probabilistic benchmarks. For this purpose, we report the average MW score across five runs in Table 13.

We first observe that our approach results in the lowest MW scores for every data set. Furthermore, the MW scores for each point forecaster, when combined with the cINN, are lower than any of the direct benchmarks on all data sets. Regarding the direct probabilistic benchmarks, the best-performing benchmark depends on the data set considered. DeepAR results in the lowest MW scores for the Electricity, Price, and Bike data sets, whilst QRNN results in the lowest MW scores for the Solar data set.

5.2.3 Qualitative analysis

As a final comparison to the benchmarks, we qualitatively compare prediction intervals and calibration on the Price data set to gain further insight into the characteristics of the probabilistic forecasts generated by our approach and the considered benchmarks. In this analysis, we only consider the Conformal PI from the first group of benchmarks since it performs best overall among the benchmarks in that group.

Table 13 Comparison of the average MW score between the probabilistic forecasts from the cINN and the direct probabilistic benchmarks. The average MW score is calculated over five runs on the test data set, and the best values for each data set are highlighted in bold

We plot the 98%, 70%, and 40% prediction intervals for a single day in the test data set in Fig. 6. Compared to the Conformal PI, our approach generates probabilistic forecasts with narrower prediction intervals regardless of the base point forecaster used. In fact, with the N-HiTS or TFT base point forecaster, our approach using a cINN results in the narrowest prediction intervals overall. Furthermore, whilst the 40% and 70% Conformal PIs are only slightly wider than those generated by the cINN, the 98% Conformal PIs are by far the widest of all considered methods. The three direct probabilistic benchmarks generate prediction intervals that are generally wider than those generated by the cINN but narrower than the Conformal PIs.

Fig. 6 Exemplary 98%, 70%, and 40% prediction intervals for the Price data set. Probabilistic forecasts are generated by using XGBoost, N-HiTS, and the TFT as base point forecasters and either combining them with our cINN or applying Conformal PI. Further, we compare the three direct probabilistic benchmarks: DeepAR, QRNN, and NNQF

To further analyse the calibration of our forecasts, we plot the forecast quantile coverage against the theoretical quantile coverage as a calibration plot in Fig. 7. We observe that, for all base point forecasters, the Conformal PI provides the best-calibrated forecasts, with hardly any deviation from the diagonal. However, our approach using a cINN also results in forecasts that deviate only slightly from the diagonal, slightly overestimating the lower quantiles and slightly underestimating the upper quantiles. Of the direct probabilistic benchmarks, the NNQF achieves results similar to our approach using a cINN, whilst the results of DeepAR and the QRNN are noticeably worse.

Fig. 7 Exemplary calibration plots comparing the theoretical and forecast quantiles on the Price data set, with the red diagonal indicating zero deviation. We compare probabilistic forecasts generated by using XGBoost, N-HiTS, and the TFT as base point forecasters and either combining them with our cINN or applying Conformal PI. Further, we compare the three direct probabilistic benchmarks: DeepAR, QRNN, and NNQF

5.3 GEFCom2014 probabilistic price forecasting

For the final step of our evaluation, we test the proposed approach by retrospectively determining its placement in the GEFCom2014. The GEFCom2014 was a probabilistic energy forecasting competition with four tracks for load, price, wind, and solar forecasting [28]. For the evaluation, we recreate the setup of the GEFCom2014 price forecasting track, in which 14 teams competed, and compare the performance of our approach to the leading entrants from the competition. This comparison is based on the scoring mechanism from the GEFCom2014, which considers the final twelve of fifteen tasks, each requiring 24-hour quantile forecasts. Given the pinball loss improvement \({\text {PL}}_{\%}\) for each task, the final ranking is determined by the average pinball loss improvement across all tasks, weighted by the task number [28]. For our approach, we apply all previous base point forecasters, select the best-performing sampling parameter over the first three non-evaluated tasks, and use it for all remaining tasks. The final weighted pinball loss improvement and the resulting rank of our approach are shown in Table 14 (see Table 22 for the results per task).

Table 14 The overall weighted pinball loss improvement across all tasks and the final rank in the GEFCom2014 price forecasting challenge for all base forecasters from Section 4.3 combined with the cINN. The weighted pinball loss improvement indicates the improvement of a given method over the GEFCom2014 baseline forecast, weighted by the task number. The rank indicates the placement in the GEFCom2014 according to the weighted pinball loss improvement
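To make the ranking scheme concrete, the following sketch shows the standard pinball loss and one plausible reading of the weighted improvement over the competition baseline; the exact scaling used in the official GEFCom2014 scoring may differ.

```python
import numpy as np

def pinball_loss(y_true, y_quantile, q):
    # Standard pinball (quantile) loss for a single quantile level q.
    diff = np.asarray(y_true) - np.asarray(y_quantile)
    return float(np.mean(np.maximum(q * diff, (q - 1.0) * diff)))

def weighted_improvement(method_losses, baseline_losses, task_numbers):
    # Percentage improvement over the baseline per task, averaged with the task
    # number as weight (one plausible reading of the GEFCom2014 ranking).
    method = np.asarray(method_losses, dtype=float)
    baseline = np.asarray(baseline_losses, dtype=float)
    weights = np.asarray(task_numbers, dtype=float)
    improvement = 100.0 * (baseline - method) / baseline
    return float(np.sum(weights * improvement) / np.sum(weights))
```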

Overall, our approach with a cINN and various base forecasters performs well. With simple point forecasting methods such as RF regression and LR, we achieve an average weighted pinball loss improvement that would have placed these methods within the top five of the competition. Furthermore, with a more advanced TFT base forecaster, we achieve an average weighted pinball loss improvement that would have resulted in a third-place finish.

6 Discussion

In this section, we first discuss our results and their implications before highlighting some of the key insights gained from the evaluation.

6.1 Results

With regard to our results, we discuss two aspects: first, the forecasting performance of our approach, and second, the recreation of the GEFCom2014.

Forecasting performance With regard to forecasting performance, we first discuss the performance of our approach with different point forecasters before comparing our approach to other probabilistic benchmarks.

When comparing different point forecasters in our approach, we note that the quality of the point forecasts affects the quality of the probabilistic forecasts when combined with the cINN. This observation is unsurprising since the cINN in our approach includes uncertainty around the initial point forecast, and, therefore, the more accurate the point forecast is, the easier it is to include uncertainty effectively. Additionally, the quality of the point forecasts is influenced by the exogenous features considered. Therefore, it may be useful to consider additional features or factors influencing the forecast, similar to [58].

When comparing our approach to the selected benchmarks, we make several observations. First, we observe that our approach generally outperforms all benchmarks regarding the CRPS. In the three cases where our approach does not result in the lowest CRPS, the difference between our approach and the best-performing Conformal PI or QRNN is small.

Second, our approach is not optimally calibrated. Although the forecasts generated with our cINN are well calibrated compared to the direct probabilistic benchmarks, the Empirical PI and Conformal PI achieve lower MAQDs on all data sets. However, prediction interval-based approaches are specifically designed to achieve certain coverage levels, and the calibration plots in Fig. 7 suggest that the difference in calibration is less pronounced than the raw MAQD values imply.

Third, our approach consistently generates the sharpest probabilistic forecasts, i.e. those with the lowest nMPI(\(\beta \)). This observation is further highlighted by Fig. 6, where the forecasts from the N-HiTS and TFT base point forecasters combined with the cINN are far narrower than those of any other benchmark.

Fourth, our approach outperforms all considered benchmarks on all data sets with regard to MW scores. Since Winkler scores consider both calibration and sharpness, this result suggests that the slightly poorer calibration of our approach is compensated by its narrow prediction intervals. Considering Fig. 6 again, although the prediction intervals of our approach are narrow, the ground truth is almost always contained within them. In comparison, the other benchmark methods, specifically the prediction interval-based approaches, appear to overestimate the width of the prediction intervals, which adversely affects the Winkler score.

Fifth, we note that both our approach and each of the considered benchmarks have strengths and weaknesses. Our approach results in narrow prediction intervals and low CRPS values, but this comes at the cost of calibration performance. In contrast, the prediction interval-based benchmarks are well calibrated but generate far wider prediction intervals, resulting in worse Winkler scores. Therefore, the best probabilistic forecasting method may vary depending on the requirements of the application.

Finally, when considering the performance of our approach across all metrics compared to the benchmarks (see Figs. 4 and 5), we conclude that the resulting probabilistic forecasts are of high quality and generally outperform all benchmarks. Our approach's small loss in calibration performance is rewarded with far sharper forecasts, which is reflected in the generally better CRPS and MW scores. Therefore, our approach can be considered to deliver state-of-the-art probabilistic forecasts.

GEFCom2014 Not only does our approach perform competitively in the considered track of the GEFCom2014, but several factors also understate its true performance. All top-placing contestants in the competition perform specialised operations to improve forecasting performance, e.g. peak pre-processing [20], filtering methods to weight certain days higher [40], or tailored training data periods [14]. In contrast, we consider all available data for training, refrain from complex pre-processing steps, and only use the default hyperparameters for the base forecasters. Furthermore, the true value of our approach lies in its ability to enable simple base point forecasters, such as LR or RF, to rank within the top five compared to the original entrants. Finally, different base point forecasters perform better on different tasks. Therefore, we expect an ensemble method that automatically selects the best base forecaster for a given task to deliver even better performance.

6.2 Insights

In addition to the results, there are a few insights regarding the sampling in the latent space and the flexible nature of our approach, which we discuss here.

Sampling in latent space Our approach includes uncertainty in point forecasts via latent space distribution sampling. Currently, this sampling is performed by adding normally distributed random noise \(\textbf{r}_i \sim \mathcal {N}(0, \sigma )\) to the latent space representation of the point forecast. This approach has several limitations. First, the sampling parameter \(\sigma \) is manually selected to generate optimal forecasts according to the CRPS. However, by varying this sampling parameter, it is possible to generate different probabilistic forecasts that follow the same general shape but vary in the amount of uncertainty considered. Therefore, it may be interesting to investigate methods that automatically select an optimal sampling parameter given the observed data, a selected base forecaster, and a specific evaluation metric. Such methods would enable the generation of probabilistic forecasts with properties that are specifically tailored to a certain application. However, such probabilistic forecasts are only possible if the requirements of this application can be formulated via a probabilistic loss metric that can be used to select \(\sigma \).
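The sketch below outlines this sampling step under the assumption of a simple interface for the trained network; `cinn.forward` and `cinn.inverse` are hypothetical method names for the forward and backward passes, and the conditioning inputs are passed through unchanged.

```python
import numpy as np

def probabilistic_from_point(cinn, point_forecast, conditions, sigma,
                             n_samples=100,
                             quantile_levels=(0.01, 0.15, 0.5, 0.85, 0.99)):
    # Map the point forecast into the latent space, perturb the representation
    # with Gaussian noise of standard deviation sigma, and map the perturbed
    # representations back to obtain forecast samples and quantiles.
    z = cinn.forward(point_forecast, conditions)            # latent representation
    samples = []
    for _ in range(n_samples):
        z_noisy = z + np.random.normal(0.0, sigma, size=z.shape)
        samples.append(cinn.inverse(z_noisy, conditions))   # back to data space
    samples = np.stack(samples)
    return np.quantile(samples, quantile_levels, axis=0)    # quantile forecasts
```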

Second, the current approach to optimising \(\sigma \) is rather rudimentary and based on a single evaluation metric. Therefore, it would be interesting to adapt this optimisation, for example by borrowing concepts from conformal prediction to calculate nonconformity scores for the samples. Furthermore, it might be interesting to optimise the samples based on the resulting quantiles that form the output of the probabilistic forecast. With such a strategy, a Bonferroni correction could possibly be applied to improve the calibration of our approach.
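As a reference point for these extensions, the following sketch shows the rudimentary, single-metric selection discussed above: a grid search over candidate values of \(\sigma \) using a sample-based CRPS estimator. The helper `generate_samples` is a placeholder for drawing forecast samples with a given \(\sigma \).

```python
import numpy as np

def crps_from_samples(samples, y_true):
    # Sample-based CRPS estimator for a single observation:
    # E|X - y| - 0.5 * E|X - X'|.
    samples = np.asarray(samples, dtype=float)
    term1 = np.mean(np.abs(samples - y_true))
    term2 = 0.5 * np.mean(np.abs(samples[:, None] - samples[None, :]))
    return term1 - term2

def select_sigma(candidate_sigmas, generate_samples, y_true):
    # Grid search over the sampling parameter: generate_samples(sigma) is a
    # placeholder returning one array of forecast samples per observation.
    scores = []
    for sigma in candidate_sigmas:
        samples = generate_samples(sigma)   # shape: (n_observations, n_samples)
        scores.append(np.mean([crps_from_samples(s, y)
                               for s, y in zip(samples, y_true)]))
    return candidate_sigmas[int(np.argmin(scores))]
```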

Flexible nature In the present article, we evaluate the probabilistic forecasting performance of a single selected base point forecaster combined with the cINN. However, our approach is independent of the base point forecast considered, i.e. once the cINN has been trained for a given data set, we can generate probabilistic forecasts from any arbitrary point forecast without retraining. This is advantageous compared to other methods using cINNs or GANs, which require the generative model to be retrained whenever the point forecast is altered. Moreover, such an approach allows us to easily generate an ensemble of probabilistic forecasts based on different point forecasts.
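One simple way to realise the ensemble mentioned above is to pool the cINN samples obtained from several base point forecasts and derive quantiles from the pooled samples; the sketch below assumes the per-forecaster samples are already available as arrays with matching forecast horizons.

```python
import numpy as np

def ensemble_quantiles(samples_per_forecaster, quantile_levels=(0.05, 0.5, 0.95)):
    # Pool the samples generated for each base point forecast and compute
    # ensemble quantile forecasts from the pooled sample set.
    pooled = np.concatenate([np.asarray(s) for s in samples_per_forecaster], axis=0)
    return np.quantile(pooled, quantile_levels, axis=0)
```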

Furthermore, for similar data sets, it may be possible to generate probabilistic forecasts with a generalised cINN that is trained only once on all data sets or a subset thereof. Such a generalised cINN could be beneficial for global forecasting and could be applied in scenarios where no data is available, e.g. when a new building similar to existing buildings is added to a suburb. However, such a generalised cINN is only possible if the underlying distribution across the multiple data sets is similar and can be accurately mapped to a single tractable latent space distribution.

Another important aspect is that our approach is not limited to prediction intervals or specific quantiles. Whilst we choose to calculate quantiles based on samples as the output of our approach, the generative nature of the cINN enables us to generate an arbitrary number of samples and either use these directly to form an ensemble forecast or to output an empirical forecast distribution. This is advantageous compared to other probabilistic forecasting methods that are, by nature, limited to generating prediction intervals. Specifically, our approach makes it simple to apply an empirical density estimation algorithm to the generated samples and thus obtain a non-parametric density forecast. Such forecasts may be particularly useful if the entire distribution is required, for example, in a stochastic optimisation problem.
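For instance, a kernel density estimate fitted to the samples of a single time step already yields a non-parametric density forecast; the sketch below uses `scipy.stats.gaussian_kde` and random numbers as a stand-in for the cINN samples.

```python
import numpy as np
from scipy.stats import gaussian_kde

# Stand-in for the cINN samples of one forecast time step.
samples = np.random.normal(50.0, 5.0, size=500)

# Fit a simple kernel density estimate to obtain a non-parametric density
# forecast; any empirical density estimator could be substituted here.
kde = gaussian_kde(samples)
grid = np.linspace(samples.min(), samples.max(), 200)
density = kde(grid)   # forecast density evaluated on the grid
```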

7 Conclusion

In the present article, we introduce an approach to generate probabilistic forecasts from arbitrary point forecasts by using a Conditional Invertible Neural Network (cINN) to learn the underlying distribution of the time series data. Our approach maps the underlying distribution of the data to a known and tractable distribution before combining the uncertainty from this known and tractable distribution with an arbitrary point forecast to generate probabilistic forecasts. Importantly, the cINN is independent of the considered point forecast and does not need to be retrained when the point forecast is altered.

We evaluate our approach by combining multiple point forecasts with a cINN and comparing the resulting probabilistic forecasts with six probabilistic benchmarks on four data sets. We show that our approach generally outperforms all benchmarks regarding CRPS and Winkler scores. Further, our approach generates probabilistic forecasts with the narrowest prediction intervals whilst maintaining reasonable performance in calibration. Finally, we recreate the GEFCom2014 and show that our approach enables simple base point forecasters to rank within the top five.

Our approach offers a solution for generating flexible probabilistic forecasts based on arbitrary point forecasts. In future work, this flexibility should be further investigated by developing a more advanced strategy for selecting the sampling parameter to improve the calibration of our probabilistic forecasts. Furthermore, the selection of this sampling parameter should be automated, and the effect of different optimisation metrics on the resulting forecasts should be examined. It would also be interesting to extend our approach to multivariate probabilistic forecasts. Finally, it may be interesting to explore the performance of our approach with a generalised cINN that generates probabilistic forecasts for multiple data sets without retraining.