1 Introduction

The financialization of commodities [16, 28, 100] has drawn the attention of risk managers and financial institutions on the propagation of the commodity risk that arises from the fluctuations of commodity future price values [70, 79]. Commodities are actively traded in financial markets and have been largely used for hedging purposes [73]. However, market volatility makes commodity prices vulnerable to highly correlated shocks [41] creating significant business challenges that affect financial performances, exacerbate well-known spillovers effects among commodities, and tight credit availability.

Hidden severe consequences affect the economic system as well, especially in those countries where commodities are heavily employed as raw materials (Crude Oil, Gasoline, Natural Gas, Copper, Aluminium, and agricultural commodities) in the industrial sector. Indeed, UNCTAD [109] reports that over the last two decades 67% of developing countries has been relying on commodities, a percentage that rises to 80% when considering only the least developed countries. Therefore, a big concern for risk managers and policy-makers becomes the monitoring of the propagation of the commodity risk in commodity markets which requires the development of new operational approaches [8, 17, 58]. The understanding of the propagation of the commodity risk through the financialization of commodities requires to be detected designing a framework that accounts for the specific contribution of the single commodities to the market risk. This is the aim of our work.

The relevance of the issue is particularly highlighted by the role of the commodity risk within the regulatory framework. The Basel Accords establish a minimum capital standard to cover the risk of holding or taking positions in commodities and impose each bank subject to capital charges for market risk to monitor and report the level of commodity risk against which a capital requirement is to be applied [15]. Over the past decades, rich literature has flourished to propose valuable instruments for measuring and quantifying such risk. The most employed market risk measure is the Value-at-Risk (VaR), defined as the worst expected loss of an asset or a portfolio given a certain confidence level and over a specific time period [71]. It is a crucial component of risk management when designing and monitoring an appropriate modelling framework able to quantify commodity price risk exposure, avoiding unexpected large losses [8].

In this paper, we propose a framework with a twofold risk management goal: i) forecasting commodity risks and spillovers to identify and understand the factors that drive commodity markets and ii) capturing the impact of contagion of such risk on the stability of the financial system. We combine an econometric and statistical set-up where several models are compared and backtested to find the one that better estimates the VaR for each commodity futures returns. The network is built on the residuals of the models chosen according to a risk management approach. The network approach guides decision-makers in the field through the investigation of the extent to which uncertainty in commodity prices affects the practical transmissions of the commodity risks [88].

More specifically, we carry out the first task estimating the VaR of the commodities through GARCH and Markov-switching GARCH (MS-GARCH) models with different distribution of the innovations. The choice of GARCH-type models accommodates the typical stylized facts of commodity time series such as volatility clustering, skewness, kurtosis [1, 37, 38, 110], and regime changes in the conditional variance dynamics. We perform model selection relying on targeted tests procedures and evaluate the model that outperforms the others from a risk management point of view, i.e. from a VaR forecasting perspective [76]. To do this, we use three backtesting procedures: the Unconditional Coverage (UC) test of Kupiec [75], the Conditional Coverage (CC) test of Christoffersen [32], and the Dynamic Quantile (DQ) test of Engle and Manganelli [45]. We refer to Masala [80] for a recent survey about the role of backtesting procedures in commodity portfolios. To contemplate the stylized facts of commodity returns, we consider various specifications in the model selection procedure such as Normal GARCH, Skewed Normal GARCH, Student’s-t GARCH, Skewed Student’s-t GARCH, Generalized Error GARCH, Skewed Generalized Error GARCH, Normal MS-GARCH, Skewed Normal MS-GARCH, Student’s-t MS-GARCH, Skewed Student’s-t MS-GARCH, Generalized Error MS-GARCH, Skewed Generalized Error MS-GARCH, thus empowering our framework with the flexibility that the policy-maker needs.

The second task deals with the estimation of the interconnections among and within the commodity sectors considered. To explore this issue we use a graphical model approach which is an intuitive way of representing and visualizing the relationships among many variables. In particular, to manage the interconnection structure of commodity markets, we exploit the Graphical LASSO (GLASSO) methodology proposed by Friedman et al. [54] to estimate a sparse Gaussian Graphical model and detect the strongest conditional dependencies among the commodities. Since GLASSO relies on the assumption of gaussianity, we build a Gaussian copula with marginals obtained from the residuals of the best model evaluated under the backtesting criteria. Moreover, to synthesize the information contained in the graphical model, we compute the eigenvector centrality measure that shows the most relevant commodities within the network structure in terms of the influence of each node in the graph. This allows us to guide decision makers [88] ranking the commodities in the network according to their importance in the propagation of the commodity risk.

We collect the future prices of twenty-four commodities belonging to the commodity sectors Agriculture, Energy, and Metals over the sample period that spans from October 3, 2005 to March 25, 2022. Our findings show that the MS-GARCH outperforms GARCH models in 75% and 58% of the cases before and after the Covid-19 pandemic, respectively. From a graphical point of view, the network analysis reveals that commodities are overall densely connected and the Covid-19 shock hits such connectedness reducing the density of the graph by 18%. The degree of connectedness between the commodity sectors is also affected. Inter-sectorial linkages are severely weakened as opposed to those among commodities within the same sectors. In fact, before Covid-19 Coffee and Soybean Oil present the maximum rate of connections with the Energy sector coherently with Myint and El-Halwagi [82], Dunis et al. [44], and Al-Maadid et al. [3] where the spillover effect between Coffee and the energy sector results even increased after the global financial crisis in 2008. With the outbreak of Covid-19, these patterns have been disentangled. Concerning the specific roles of the commodities in the network, Soybean Oil occupies the most central node in the graph only after the pandemic while Natural Gas was the most connected until 2018 (in line with Ergen and Rizvanoghlu [46]). Remarkable changes are also encountered in the safe-haven commodities. Before Covid-19, they were Heating Oil, Soybean Meal, and Gold and after the pandemic, they become Natural Gas UK, Gold, and Heating Oil.

In a similar spirit to our paper, other studies have investigated commodity connectedness [12, 41, 115]. However, their modelling frameworks are characterized by a common distributional assumption for the commodities included in the analysis and do not simultaneously address the assessment of the market risk. Indeed, the novelty of our approach lies in the combination of backtesting procedures, Gaussian Copula, and GLASSO estimation approach for the network. Unlike traditional network estimation based on the single assumption on the vector of observations, we build the network of commodity risk basing on the residuals of GARCH-type models with underlying distribution selected for each commodity through the backtesting procedures thus accommodating for well-known stylized facts. The dependence structure of the residuals is captured by the Gaussian Copula. Centrality measures retrieved from the sparse estimated network are exploited to guide decision makers choices through commodity risk management. To the best of our knowledge, this is the first paper that addresses this issue.

The rest of the paper is organized as follows. Section 2 discusses the major strands of the literature to which this paper contributes. Section 3 provides a brief outline of the employment of the GARCH-type models and the backtesting procedures for model selection. Section 4 presents the GLASSO model. Empirical results are reported in Sect. 5 and the conclusions are in Sect. 6.

2 Literature review

The effects of commodity price fluctuations on the macro-economy have been pioneered in Hamilton [62], and the connection with economic growth has been a fruitful thematic in the financial literature [18, 21, 24, 25, 27, 40, 74, 108, 112, 114]. Despite the key informational role of commodity futures in addressing the monetary policy [9, 66] the exposure of commodity price fluctuations to macro risk has been hard to price [95] due to the strong inter-sectorial dependencies, first documented in Pindyck and Rotemberg [90] and Ciner [35]. A rich literature has then focused on the spillover effects between oil price and financial markets [72], and oil price and other commodities [10, 39] such as precious metals [47, 93], agricultural [43, 84, 85], energy [49, 92, 102], and, more recently, the impact of climate related variables on the co-movements of commodity prices that affect the stability of the financial system [50]. Besides, commodity price behavior shows small trends and big variability that affects market preferences also in the long-run [23, 30, 103].

Spillovers effects have particularly intensified since 2004, the onset of the financialization of commodity markets [16, 28, 65, 100]. From that moment onward, commodities have been considered among the likely sources of financial distress due to the centrality of the role acquired. Gradually, they have revealed to be responsive to macro-economic shocks and to investors sentiment [57, 91, 98, 105] and to be strongly connected to widely spread financial instruments [77, 83, 113].

Major operational and management implications caused by the financialization of commodities spring up in the uncertainty of the decision-making processes for the related industries. Examples concern the role of supporting the management of refinery operations and productions of oil and gas, or the management generator operations and the supply chain due to the non-storable nature of the production of energy, as discussed in Andriosopoulos and Nomikos [6], Gabrel et al. [55], Joëts [70], Aven [8], and Nguyen et al. [88].

Such relevance of commodities from various perspectives of the financial system has encouraged the assessment of the market risk for commodity markets (see, among others, [89]). Giot and Laurent [59] introduce the use of VaR to compute the contribution of the commodities to the market risk and find that for market participants trading short positions the risk arises from an increase in commodity prices, while for long positions it is given by a price drop. Marimoutou et al. [78] apply VaR to the oil market and Aloui and Mabrouk [5] study VaR estimations of energy commodities documenting financial stylized facts such as long-memory, asymmetry, and fat tails. Laporta et al. [76] investigate the selection of VaR for energy commodities, whereas Algieri and Leccadito [4] and Ji et al. [68] propose a (Co)VaR based model to study risk spillovers between energy and non-energy commodity markets. Shen et al. [97] integrate VaR estimation in a Vector Autoregression (VAR) to evaluate the risk transmission channel in energy markets. An alternative approach to address systemic risk in a VaR set-up is through copula [81]. VaR forecasting may help risk managers and regulators to evaluate the exposure to unexpected loss and consequently calibrate the overall riskiness of financial markets.

Besides VaR computation, many studies have delved into the analysis of the connectedness among commodity markets [12, 41, 106, 107] and between commodity and financial markets. Ji and Fan [69] propose a graph analysis of the evolution of the world crude oil market whereas the works of Diebold et al. [41], Zhang and Broadstock [115], and Balli et al. [12] derive the connectedness of commodities from the Diebold and Yılmaz [42] forecast-error variance decomposition matrix of a vector autoregressive (VAR) model. In Diebold et al. [41], the VAR is constructed from the range-based realized volatility of Garman and Klass [56], whereas in Balli et al. [12] it comes from the commodity uncertainty index proposed in Chuliá et al. [34] that builds on the residuals of a generalized dynamic factor model.

These studies have tackled several relevant issues in the analysis of the mechanisms of commodity markets.Footnote 1 However, the major research questions concerning the financialization of commodities, commodity connectedness, and the assessment of the market risk remain separately addressed. In particular, in the field of the study of commodity connectedness, the modelling framework of the previous literature exploits common distributional assumptions for the commodities while commodity risk management requires models that account for the structural differences between the commodities to gauge the different risk exposures. Hence, for a robust detection of the major risk transmitters in commodity markets, methodologies must be integrated with new approaches that consider the stylized facts of the single commodities. For this reason, we propose a framework that merges commodity connectedness and modelling selection according to market risk criteria.

3 Model specifications

It is well-known that the time series of commodities show most of the stylized facts detected in financial markets such as skewness, kurtosis, and volatility clustering. Moreover, recent studies have shown that the variance process often exhibits regime changes [61] and that ignoring this feature affects the precision of the volatility forecast [36].

Throughout the paper, we consider the GARCH(1,1) and MS-GARCH(1,1) models with different conditional distribution \({\mathcal {D}}_{\Theta }(\cdot )\) to account for stylized facts. Overall, our framework includes 12 model specifications, recovered as a combination of:

  • the conditional variance specification: GARCH(1, 1) and MS-GARCH(1,1);

  • the choice of the conditional distribution \({\mathcal {D}}_{\Theta } \in \) \(\lbrace \)norm, snorm, std, sstd, ged, sged\(\rbrace \).

We detail the description of the model specification in the Appendix. The choice of the conditional variance specifications are supported by the works of Bollerslev et al. [20], Sadorsky [96], Huang et al. [67] and, especially, Hansen and Lunde [63]. Moreover, GARCH(1,1) models have proved good fitting performance on commodities [76]. We remark that the specifications considered are only intended to provide an example of design of policy for commodity markets that simultaneously addresses the commodity interdependencies and the exposure to the market risk. Among the conditional distributions, we include the standardized skewed version of each model implemented via the Fernández and Steel [48] transformation. We denote the standardized Skewed Normal, the Skewed Student’s-t, and the Skewed Generalized Error Distribution by "snorm", "sstd", and "sged", respectively. For the MS-GARCH specification, we focus on double-regime MS-GARCH models accounting for low and high volatility levels, thus the scale and asymmetry parameters vary with the regimes.

In line with the first goal of the paper, we consider a wide range of models and use a risk management approach that selects the best models to accurately predict future risks, especially in the case of volatility models [33]. We select the model that provides the most reliable forecast of the VaR performing backtests [31].

4 Sparse Gaussian graphical model

The second goal of the paper is to study the interdependence among and within the commodity sectors. To achieve this purpose, we rely on the Gaussian Graphical Lasso (GLASSO) methodology proposed in Friedman et al. [54]. GLASSO allows us to build an undirected Gaussian graphical model and perform a network representation of the connections of the commodity risks where only the most relevant intra- and inter- sectorial linkages are highlighted. This is conveniently accomplished estimating a sparse conditional dependence structure among the commodities. That is, we estimate the inverse Gaussian covariance matrix, \(\Omega = \Sigma ^{-1}\), where the zero off-diagonal elements correspond to a pair of commodities returns that are conditionally independent [64]. More specifically, GLASSO builds on Tibshirani [101] where a penalized maximum likelihood problem shrinks to zero some coefficients through a \(L_1\)-norm penalty term as follows:

$$\begin{aligned} \Omega ^{*}=\text {arg max}_{\Omega } \log (\det \Omega - tr(\Sigma \Omega ) - \rho \left\| \Omega \right\| _1), \end{aligned}$$
(1)

where \(tr(\cdot )\) denotes the trace operator and \(\left\| \Omega \right\| _1\) the \(L_1\)-norm that can be calculated as the sum of the absolute values of the elements of \(\Omega \). The parameter \(\rho \) controls for the size of the penalty and it determines the number of zeros in the sparse precision matrix \(\Omega \): a higher (lower) value is responsible for a more (less) sparse matrix. Like most of the shrinking methodologies, the right choice of the penalization parameter \(\rho \) is fundamental to obtain a reliable selection. To estimate the optimal value of \(\rho \), we minimize the Extended Bayesian Information Criterion (EBIC, [26]) which has been shown to work particularly well in retrieving the true network structure [14, 53] and it is a computationally efficient alternative to cross-validation [111]. The criterion is indexed by the hyperparameter \(\gamma \in [0,1]\). Typical values of \(\gamma \) are 0, 0.5 or 1 with values closer to 1 leading a stronger penalization. For this reason, in the empirical application of this paper we choose \(\gamma = 1\).

As stated above, GLASSO relies on the assumption of Gaussianity introduced through the Gaussian copula. In particular, the copula approach provides the framework to model multivariate associations from the univariate distributions of the observed variables. In the case of the d–random vector, \(X=(X_1,\dots ,X_i,\dots ,X_d)\), with marginal cumulative distributions \(F_i(x_i)=P(X_i<x_i)\), we can define the joint cumulative distribution function (cdf) as \(F(x)=P(\cap _{i=1}^d X_i<x_i)\). In many cases, the margins of the cdf are relatively easy to describe, but an explicit expression of the joint distribution may be difficult to obtain. When \(X \sim N_d(\mu , \Sigma )\) is a Gaussian random vector then its copula is called Gaussian copula. Denoting \(u_i \equiv F_i(x_i)\), the Gaussian copula is defined by the cdf \(C^{Ga} (u_1,\dots ,u_i,\dots ,u_d)= P(\cap _{i=1}^d \Phi (X_i) \le u_i) = \mathbf {\Phi } (\cap _{i=1}^d \Phi ^{-1}(u_i))\) where \(\Phi ^{-1}(\cdot )\) is the univariate standard Gaussian quantile function and \(\mathbf {\Phi }(\cdot )\) is the d-variate Gaussian cdf with mean 0 and covariance matrix \(\Sigma \).

4.1 Network metrics

Network metrics are used to synthesize the information contained in a graphical model. In this section, we briefly introduce the definitions and the metrics that we use to detect the position of a commodity within the network.

We define an undirected graph as an ordered pair of two disjoint sets (VE), where V is the set of vertices and E is the set of edges, consisting of pair of elements taken from V. We denote the number of vertices with \(n = |V|\) and the number of edges with \(m=|E|\). The density, D, of the graph is given by the ratio between the number of edges and the number of possible edges:

$$\begin{aligned} D = \frac{2m}{n(n-1)}. \ \ \ \end{aligned}$$
(2)

Vertices i and j are adjacent if the undirected edge between i and j is in the set E, and a line connects them in the diagram of the graph. The matrix representation of such a graph is obtained via the adjacency matrix, \(A_G\), of the inverse covariance matrix, \(\Omega \). The single element in \(A_G=(a_{ij})\) is \(a_{ij}=1\) if the corresponding element of the inverse covariance matrix is positive, \(a_{ij}=0\) if the corresponding element is zero. Hence, the graph contains an edge that links two vertices i and j if and only if \(a_{ij}=1\). A simple yet fundamental metric is the degree of a node defined by \(k_i = \sum _j a_{ij} \), which measures the number of neighbors of the node.

Centrality measures are crucial metrics used in the network topology to highlight nodes that occupy critical positions in the graph. For instance, the eigenvector centrality, or Gould’s index of accessibility [60], indicates which are the most geographically central and important nodes, and it has been exploited in financial applications to capture the capacity of an agent to cause systemic risk [19]. It builds on the eigenvector assigned to the leading eigenvalue of the adjacency matrix to assign a relative score to the nodes, depending on how connected they are to the rest of the network. A metric that is strictly related to the eigenvector measure is the eigenvector community structure [87], which allows us to create a subgraph starting from a group of vertices densely connected, linked with other groups of vertices through sparse connections. The eigenvector community structure depends on the spectrum of the modularity matrix B, with elements defined as:

$$\begin{aligned} b_{ij}=a_{ij} - \frac{k_i k_j}{2m}. \ \ \ \end{aligned}$$
(3)

\(a_{ij}\) are the elements of the adjacency matrix and \(\frac{k_i k_j}{2m}\) is the number of edges between vertices i and j if edges are placed at random, where \(k_i\) and \(k_j\) are the degrees of the vertices, and \(m = \frac{1}{2}\sum _i k_i\) is the total number of edges in the network. The algorithm to compute the eigenvector community structure calculates the leading eigenvector of the modularity matrix and divides the vertices into two groups according to the signs of the elements in this vector. The values of the leading eigenvector assess the importance of each vertex in its community: a larger (smaller) value corresponds to a more (less) central member.

5 Empirical results

5.1 Data description

We collect data of twenty-four time series of commodity futures prices from Bloomberg over the period that spans from October 3, 2005 to March 25, 2022 for a total of 4300 observations. The commodities in the sample belong to the commodity sectors Agriculture, Energy, and Metals and are divided as follows:

  • Agriculture: Coffee (KC1), Oats (O1), Soybeans (S1), Wheat (W1), Cocoa (CC1), Corn (C1), Rough Rice (RR1), Cotton (CT1), Sugar (SB1), Soybean Oil (BO1), Soybean Meal (SM1), and Orange Juice (JO1).

  • Energy: Gasoline (XB1), Heating Oil (HO1), Low Sulfur Gasolio (QS1), Natural Gas (NG1), Ethanol (DL1), WTI Crude Oil (CL1), and Natural Gas UK (FN1).

  • Metals: Gold (GC1), Silver (SI1), Palladium (PA1), Copper (HG1), and Zinc (LX1)

Daily returns with continuous compounding are calculated taking the logarithm of the difference between closing prices in consecutive trading days and then multiplied by 100.Footnote 2

In Table 3 we report the summary statistics of the log-returns of the commodities as in Laporta et al. [76] before and after the outbreak of Covid-19. The distribution of the returns for each commodity displays fat tails and serial correlation. The Jarque–Bera test significantly rejects the normality behaviour of daily returns, the ARCH Lagrange Multiplier and the Augmented Dickey–Fuller tests suggest the presence of autoregressive conditional heteroskedasticity and the absence of unit roots, respectively. Ethanol and Natural Gas are the only commodities with negative average returns and show high standard deviation. The maximum, however, is reached by WTI Crude Oil with 20.7. The distributions of the commodity returns generally exhibit negative skewness with Natural Gas, Natural Gas UK, Wheat, and Coffee as the only exceptions. The returns of these four commodities are then featured by extreme positive values. The impact of Covid-19 is captured in the higher kurtosis and standard deviation which are amplified in response to the growing level of uncertainty in the market. Besides, the distribution of the returns of the commodities tends to be more negatively skewed indicating the stronger propensity to undergo high losses.

5.2 Backtesting results

We perform backtests over 2699 observations using 1600 days as estimation window. The width of the rolling window allows us to obtain significant estimates of the parameters while providing a trustworthy picture of the fluctuations in the market. Also, an amplitude of six years is sufficient to capture the evolution and major regime changes in the returns. In particular, we include the entire period of the recession in the United States that began in December 2007 and ended in June 2009 according to the National Bureau of Economic Research.Footnote 3

For each time series of commodities, we fit GARCH and MS-GARCH with conditional distribution \({\mathcal {D}}_{\Theta } \in \) \(\lbrace \)norm, snorm, std, sstd, ged, sged\(\rbrace \). The computational analysis is conducted using the software R and the package "MSGARCH". We point out that the introduction of GARCH models is only intended to accommodate the different behaviors of commodity returns within a risk management perspective based on the VaR predictability of the model.

We choose the best model out of the 12 models considered according to the backtesting procedures described in the Appendix. We backtest each GARCH-type estimated VaR using the Unconditional Coverage (UC) test of Kupiec [75], the Conditional Coverage (CC) test of Christoffersen [32], and the Dynamic Quantile (DQ) test of Engle and Manganelli [45]. We are concerned with the downside risk at two confidence levels: 95% and 99%. The best models are selected choosing among those that show a p-value higher than the 5% significance level in at least two out of three tests. To determine the final model, we pick the one with the highest p-value relative to the DQ test on the 95-th quantile level. This choice is motivated by a trade-off between the risk management perspective and the need to have an adequate number of observed violations of the estimated quantiles.

Backtesting results are represented in Tables 5, 6, and 7. Overall, many of the model specifications succeed in forecasting the returns volatility of the commodities. On the one hand, we find no prevalence for the asymmetric distributions over the symmetric ones. On the other hand, we detect a considerable prevalence of the Markov-switching specification which feature 14 commodities out of 24 (Table 1). This result contributes, among others, to the findings in Bulla and Bulla [22] and Ardia et al. [7], that show that the Markov-switching specification better captures the breaks in the dynamics of the volatility of financial returns. The effect of Covid-19 is found in the reduced number of optimal MS-GARCH models. Thus, the shock of the pandemic has affected the propensity of regime changes of the volatility of the commodities to describe the evolution of VaR exceedances.

Table 1 Best models for the commodities selected according to the backtesting criteria

5.3 The network of commodity risks

We turn the analysis to the estimation and discussion of the network structure among the commodities in the sample. To achieve the estimation of the graph, we fit the Gaussian Copula on the residuals obtained from the GARCH-type model selected according to the the backtesting results. Therefore, the marginals of the Gaussian Copula are given by the distribution of each series of residuals. Then, we estimate the tuning parameter for the GLASSO minimizing the EBIC. To quantify the degree of connectedness within the graph and among specific underlying clusters, we compute the network metrics described in Sect. 4.1. Figure 1 shows the network of commodity risks. The computational analysis is conducted using the software R and the package "GLASSO". The graph is estimated using the optimal tuning parameter \(\rho = 0.0158\) and the size of the nodes in Fig. 1 is proportional to the eigenvector centrality score. The density \(D = 0.61\) highlights the strong interconnections in the network. This result is explained in part because of the well-known spillover effects [43, 85] originated from the dependence that links commodities to cycles of production and consumption, and in part because of the effects of the financialization of commodities.

The graphical representation of the results has brought relevant information to the analysis of the network of commodity risk, making easy the interpretation of results, and strengthening previous literature in this field. For instance, the safe-haven role of Gold is immediately captured. In fact, it represents one of the least central nodes in the network. This implies that it has a poor dependence relation with the other commodities and therefore it can be a good investment in anticipation of high volatility periods. Copper is the first central metal commodity in the graph. It is a cheap and plentiful metal, among the most traded in the world markets, and in the years has seen its consumption rise, especially because of building construction and electronic products. Palladium has seen its consumption rise too. Their centrality, together with that of Coffee, provides useful information to investors in the financial markets and serves as a barometer for the stability of the financial markets of large parts of Asia and South America.

In Table 8 we report the relative connections for each commodity defined as the ratio between the number of active connections that the commodity has with a certain sector and the maximum number of possible connections with that sector. For example, Gold has 3 connections with the Metal sector, dividing by the maximum number of possible connections of the metal sector, i.e. 4, we get the value of 0.75. Most of the commodities exhibit the highest connection with the corresponding sector. In particular, this is the case of Copper and Palladium and Silver for the Metal sector, WTI Crude Oil, Low Sulfur Gasoil, and Natural Gas for the Energy sector. It is interesting to notice that some commodity shows particular rates of connection with other sectors. It is worth noting that some commodity show particular rates of connection with other sectors. For example, in the Metals sector, Palladium reports that connection rate of 0.75 with the Agriculture sector. Regarding the Energy sector, Low Sulphur Gasolio shows the maximum rate of connection with the Metals sector (0.80). Concerning the Agricultural sector, Oats, Cocoa, Coffee, and Soybean Oil have high connection rates with other sectors. The high rate of connections among Coffee and the Energy sector is coherent with Al-Maadid et al. [3] where the authors find the spillover effect between Coffee and the Energy sector to be even increased during the post-crisis period. Soybean Oil presents the maximum connection rate with the Energy sector. This confirms a result found in Myint and El-Halwagi [82] and Dunis et al. [44]. Still, Biodiesel production in the U.S is based predominantly on the use of Soybean Oil, precisely for the 82% [2].

In Table 9 the connection rate between the three considered sectors is shown. We divide the total number of connections that an entire sector has with another sector by the maximum number of possible connections between the two sectors considered. For example, there are 14 connections between metals and energy sectors, dividing by the maximum number of possible connections, i.e. 35, we get a ratio of 0.40. The highest connection rates lie on the main diagonal. Interestingly, the pandemic has strengthened the relationships between the commodities belonging to the same sector and weakened the inter-sectorial linkages. Before Covid-19, agriculture and energy commodities exhibited strong dependence also supported by other studies [13, 94, 104]. Such a link can be explained as a cause of the introduction of biofuels, that have intensified the ties between the sectors, as well as the need for efficient transport for perishable commodities like the agricultural ones. The degree of connectedness between the energy and agricultural commodity market, and between agricultural and metals, highlights the great effect produced by the financialization of commodity markets. The outbreak of Covid-19 has remarkably diminished the strength of inter-sectorial relationships. One hypothesis for this lies on the fact that during normal times trades are more liberalized while during periods of market turmoils the level of trust of investors is impaired. Moreover, in the specific event of the Covid-19 pandemic, the demand and supply economic chain has majorly hit with the abrupt interruptions of most economic activity therefore concerning the classes of commodities that are employed in the related industrial processes. Furthermore, we sort the considered commodities in terms of eigenvector centrality identifying the group of the "most connected" commodities.

Table 2 shows the eigenvector centrality scores. Soybean Oil is the most important in terms of centrality. This is explained by the fact that Soybean forms a large proportion (over 1/5-th) of the agricultural output of US farmers [2, 44], mainly because it is the most used agricultural commodity for biodiesel production in the US [82]. Cotton, Soybean Oil, and Copper are used as raw material or inputs for industries [12] and their prices are subject to demand-side shocks that are highly correlated, which can explain their prominent position in the eigenvector centrality. On the contrary, it is worth noting that Gold and Natural Gas UK lies in the lowest positions in terms of centrality. Less central nodes suggest a more stable and isolated behavior, highlighting safe-haven assets. As Balli et al. [12] point out in their study of commodity connectedness, Gold is known for his hedging abilities in crisis time and is therefore an alternative investment vehicle. We strengthen their findings ranking the position of commodities in the network with the use of eigenvector centrality and find that indeed that this commodity occupies one of the lowest positions in the Table 2.

Finally, to investigate the existence of clusters in the network of commodity risk we split the estimated graph into different clusters. We estimate three sub-graphs as presented in Fig. 2 and Table 10, which highlight the cluster for each commodity. The first cluster contains 6 nodes: it includes the entire Metal sector and one agricultural commodity (Cocoa). The second cluster contains 11 nodes: the entire agriculture sector, apart from Cocoa and Orange Juice, and Ethanol. The results confirm the strong interactions among Ethanol and agricultural commodities, which are generally higher than the interactions between oil and gas and agricultural markets [29]. The third cluster includes all the Energy commodities, apart from Ethanol, and Orange Juice. The results confirm the tendency of each node to attach preferably to a node of the same sector, with few exceptions.

Fig. 1
figure 1

Sparse Gaussian graphical model built on the residuals following the distribution selected according to the backtesting criteria and then aggregated into a Gaussian copula. The optimal tuning parameter for the implementation of the GLASSO is 0.0158. The size of each vertex is proportional to the corresponding Eigenvector Centrality coefficient. Sample period: October 3, 2005–March 25, 2022

Fig. 2
figure 2

Community structure obtained via optimizing the modularity score. Size of the vertices change accordingly to the corresponding eigenvector centrality score. Sample period: Ocotber 3, 2005–March 25, 2022

Table 2 Eigenvector centrality scores of the commodities in the sample in decreasing order

6 Conclusions

In this paper, we investigate the connectedness within commodity markets relying on a risk management perspective. Building upon the Sparse Graphical Lasso (GLASSO) methodology, we build the network of commodity risk basing on the residuals of GARCH-type models with underlying distribution selected for each commodity through a risk management approach. The risk management approach enters the model selection criteria designed to identify the best model for the assessment of the contribution of the commodities to the market risk. Such criteria exploits traditional backtesting procedures to evaluate the Value-at-Risk predictability of the set of the models considered. We apply the methodology to the sample of twenty-four commodity futures prices over the period that spans from October 3, 2005 to March 25, 2022.

Overall, we find that commodities show a moderate degree of connectedness within their network structure. The Covid-19 crisis has affected the interconnections increasing the heterogeneity within commodity markets as captured by the creation of three underlying clusters instead of the two that are estimated without including the pandemic in the sample period. However, the clusters identified mostly coincide with the three commodity sectors indicating that the additional heterogeneity is reflected in emphasized connections between commodities that belong to the same commodity sector. From the quantitative viewpoint, this finding is also endorsed by the lower eigenvector centrality scores, weakened degree of connections between the commodities and the other sectors, and density of the graph which has decreased by 18.6%. The pandemic has also influenced the proportion of optimal MS-GARCH models in the sample which has gone from 75% to 58%. The response of the network of the commodity risks to the recent persistent market uncertainty proves that global increasing connectedness does not represent a crisis stylized fact.

Our work is especially valuable to risk managers and policy-makers involved in the monitoring of the propagation of the commodity risk. Unlike previous studies on the assessment of the network of commodities, our methodology provides linkages that are determined conditionally on the ability of the models to forecast the VaR. In this way, the architecture of commodity markets depicts connections that incorporate information on the contribution to the market risk besides the underlying relationships that feature the commodities. This allows regulators and risk managers to infer more appropriate contribution of the commodity risks to the market risks since the risk exposure of each commodity has been conveniently modelled. Our findings reveal useful in this context providing information on the most relevant threats to the stability of commodity markets and, therefore, to design of tailored strategies for the mitigation of the commodity risk.

An immediate extension of our framework would consider either the application of the Gaussian Graphical model set-up to different network specifications or the implementation of non-Gaussian copulas. Another departure from our model is the consideration of long memory among the typical stylized facts of the volatility process. A third possible insight regards the employment of a more flexible class of models for the MS-GARCH, as the semi-Markov model where the sojourn-distribution is explicitly modelled rather than implicitly assumed to be geometric. This is the object of ongoing research.