Applying multivariate-fractionally integrated volatility analysis on emerging market bond portfolios

This study examines emerging market (EM) local bonds from a portfolio risk perspective and suggests methodologies for risk evaluation, on which the literature is limited. Despite the growth of EM bond funds in recent years, comprehensive studies regarding this industry have been scarce. In light of this, 203 different local bonds of EM countries—Indonesia, Brazil, India, South Africa, Mexico, and Turkey—are elaborated in terms of return, volatility, and cross-correlation features. This study focuses on an untouched field—long memory properties—and the application of fractional models to EM bond portfolios. Based on the outcomes of a dynamic conditional correlation and fractionally integrated generalized autoregressive conditional heteroscedasticity approach and related value at risk analysis, the study finds that fractional models are useful tools for risk management, as they deliver satisfactory empirical results for several static and dynamic versions of EM bond portfolios.


Introduction and literature review
In the last decade, surging capital flows to emerging market (EM) bonds and the popularity of EM funds have made research in this field valuable. In recent years, due to extraordinary shifts in the monetary policies of developed market (DM) and countryspecific macroeconomic developments, EM currencies and interest rates have fluctuated sharply. Although the composition of EM bond funds is heterogeneous because the countries involved have different economic and market structures, these countries' asset prices are correlated to each other. Unhedged bond portfolios that are affected by both currencies and interest rates have suffered, and having an effective risk management strategy has become more critical.
The literature on risk analysis of EM fixed income has been limited and scattered. Early research mostly focused on the risk-reward profile and showed the diversification benefits of EM bonds, along with other asset classes (Burik andEnnis 1990, Erb et al. 1999, etc.). Similarly, research on systemic risk, risk transmission, or financial network touched on EM bonds as part of the whole investment universe (see Kou et al. 2019). Some others considered immunization strategies in this area (see Ortobelli and Sebastiano 2018, etc.).
In the extensive literature, value at risk (VaR) and related volatility modeling are the prominent risk management concepts that have been applied frequently. Nevertheless, applications of this approach to EM bond portfolios are insufficient. Therefore, referring to studies on other asset classes, especially those dealing with the developed market bonds, is the only plausible way. Guo et al. (2007) applied quantile VaR to U.S. corporate bond indices and incorporated treasury interest rates as information variables. Although using these information variables was beneficial, the estimated confidence intervals for VaRs were wide, thus downgrading the applicability of the model. Tu and Chen (2018) evaluated U.S. bond indices with a factor-based approach. VaR estimations of the study present that market shocks (not macroeconomic developments) primarily cause the variations. Vlaar (2000) compared the out of sample performances of volatility models with different distributions and stated that generalized autoregressive conditional heteroscedasticity (GARC H) models deliver the best results under a normal distribution assumption. Yet, the analysis considered only the univariate volatilities in Dutch bond portfolios.
In the literature, there are also other advanced techniques to consider such as the novel Markov-switching (MS) models. Although EM bonds have not been part of the scope, the MS model framework has been applied to fixed income markets, as well as other asset classes, in recent years (Escobar et al. 2017, Elliott and Nishide 2014, Dimpfl and Peter 2016, Guidolin et al. 2014, Hevia et al. 2015, Kim et al. 2019. Business cycles in long-term data series and bull and bear markets are the usual concepts for the MS models. As such, the necessity of separating the impacts of the global financial crisis has been one of the motivations behind the recent popularity of MS models. Meanwhile, in fractal (self-similar) structures, MS models have some drawbacks in terms of capturing the long memory because assumed regime changes reduce the persistence impact. Furthermore, in the literature, most studies have focused on univariate or bivariate time series with two-state models. In the case of multivariate data series, with time variant probabilities and possible multi-state model selection, the process becomes cumbersome, and with too many explanatory variables, the inference could become complicated.
In the data series of bonds, especially in the volatilities, fractal features can be observed. Apart from the pricing dynamics of the bonds, as the coupon payments or the accrued interests slowly change over time, they become integrated in the returns. Under these conditions, regarding volatility, we evaluate based on a relatively new concept, that is, fractional modeling. The preliminary analysis justifies this type of approach. It considers the fractal (self-similar) features of datasets and quantifies the persistence of shocks in financial markets. The volatility model, fractionally integrated GARCH (FIGARCH), was proposed by Baillie (1996) after the introduction of the mean model alternative auto-regressive fractionally integrated moving average (ARFIMA; see Joyeux 1980 andHosking 1981).
The FIGARCH model focuses on the hyperbolic decay of the impacts of previous innovations in the volatility. It extends the application of the integrated GARCH model (IGARCH), thereby providing an opportunity to determine the true level of the hyperbolic decay at the impact of the previous shocks. Unlike regime-switching models, the number of explanatory variables increases slightly, which makes it easy to incorporate and interpret. Subsequently, various models have been proposed in this "fractional" framework. In this field, fractionally integrated exponential GARCH (FIEGARCH), hyperbolic GARCH (HYGARCH) and fractionally integrated asymmetric power ARCH (FIAPARCH) are some examples that cover additional features of volatility such as leverage or asymmetry. Tsung and Shieh (2007) conducted a VaR analysis for Treasury bond futures with fractional volatility models and showed the superior performance of FIGARCH under normal, Student-t, and skewed Student-t distributions. Martinez et al. (2016) used fluctuation analysis (i.e., detrended fluctuation analysis; DFA) to investigate the long memory in European stock and bond markets. The major finding of the study is the fractal structure of corporate bonds, which implies that modeling the data has to consider long memory properties. In similar studies, Zunino et al. (2015) and Ferreira (2018) presented evidence of long-range dependence in the returns of various sovereign and corporate bond markets as a factor of market inefficiency. Cotter (2004) applied several GARCH type models to U.K. financial markets. He stated that the highest level of long range-dependence is observed in bond futures.
On EM bonds, as one of a few examples, Jung and Kim (2012) analyzed highfrequency data of Korean Treasury bond futures and concluded that the return volatility of this asset class has persistence. As another example, Mendoza (2005) successfully revealed long memory features in Latin American sovereign bonds. Although its application is rare in fixed income assets, the efficiency of fractional volatility modeling has been demonstrated for other asset classes several times. (See Ding et al. 1993, Lardic and Mignon 1999, Serletis and Andreadis 2004, Jin and Frechette 2004, Baillie and Morana 2007, Tabak and Cajueiro 2007, Kasman 2009, Ksaier and Cristiani-D'ornano 2010, Manap and Kassim 2011, Chang et al. 2012, Wang 2013, Sensoy and Sobaci 2014 For the second point, we deploy the dynamic conditional correlation (DCC) model of Engle (2002) to examine the co-movements. Dynamic copulas or wavelet transformations are the other advanced techniques that have been applied in recent literature. Copulas focus on the interdependence of individual distributions and the linkage between individual and multivariate distributions; they have various forms including parametric and/or regime switching. Nevertheless, as copulas deal with distributions, solo applications can be used for tail dependence, risk budgeting, and related tools such as option pricing. Moreover, since model alternatives are relatively limited for multivariate cases (Archimedean and elliptical) and having too many parameter estimations is burdensome, most of the literature have focused on bivariate cases. Furthermore, optimal parametric copula model selection (related goodness-of-fit tests), which affects tail dependence, is another discussion point (see also Weiß 2013). Recent applications in the fixed income market have usually been through developed markets or credit risks (see Kim et al. 2020, Yang et al. 2020, Chao and Zou 2018, Otani and Imai 2018, Bekiros et al. 2018, Benlagha 2014, Chen et al. 2014 The other technique, wavelet transformation, also drew attention, in recent years. Especially in stock markets and commodities, the wavelet method has been frequently applied to investigate interdependence and coherence. However, there has been limited application on EM bonds (Najeeb et al. 2017). As in the case of copulas, wavelet transformation needs additional model for forecasting. Application, extracting, and model selection needs intensive computation for a meaningful analysis. Moreover, in multivariate cases, as the number of data series increases, the process becomes highly complicated. Gulerce and Unal (2016) reached, at most, five different time series for coherence of the analysis.
Meanwhile, our selection of the DCC model is not only useful for risk budgeting but also easy to implement for forecasting and portfolio optimization. Furthermore, unlike the aforementioned approaches, under this method, estimation results are easy to comprehend.
Most of the existing DCC applications to bonds are specific to regional submarkets and tackle the concept of market integration. Applying the DCC-GARCH model, Tsukuda et al. (2017) revealed that integration within the Asian bond market is shallow. Champagne et al. (2017) showed strong market interdependence between the U.S. and Canadian corporate bond markets. Kenourgios et al. (2013) investigated the contagion effects of the global financial crisis (GFC) by applying an asymmetric generalized DCC (AG-DCC) model to Brazilian long-term bond indices, along with other asset classes, and found contagion links of the bonds with U.S. stocks, real estate, and commodities. In a similar framework, Kenourgios and Padhi (2012) covered the bond markets of a wide range of EM countries and found the co-integration levels and diversification benefits of EM bonds during wellknown crises, including the late 1990s. By applying DCC-GARCH models, Sclip et al. (2016) and Bhuiyan et al. (2018) revealed the diversification benefits of sukuks (Islamic bonds) within given samples (see also Goeij 2004, Kenourgios et al. 2011, Celik 2012, Christiansen 2010, Benlagha 2014, Bessler et al. 2016, and Fang et al. 2018. Regarding other asset classes of EM, Dimitriou et al. (2013) applied a DCC-FIAPAR CH model to Brazil, Russia, India, China and South Africa stock markets and showed an increasing contagion effect during the GFC. Other studies revealed the superior performance of multivariate applications regarding volatility persistence properties (e.g., Engle and Colacito 2006;Harris and Nguyen 2013;Selmi and Hachicha 2015).
To summarize, there has been a literature gap in covering EM bonds from the portfolio management standpoint. Furthermore, despite findings of the long memory feature of bond markets (and many other markets such as stocks and commodities) it has not been taken into account in this framework. This study contributes by filling this gap. Specifically, it covers EM local bonds (as well as the currencies) of a wide range of countries and derives satisfactory model outcomes.
Because this study targets funds or portfolio investments, it deals with institutional investors. For the retail side, there are other concepts to deal with (see Wen et al. 2019). As another assumption, this study leaves out matters related to cost, as further analysis may be required regarding cost efficiency. Particularly for the credit-risk side, cost sensitive analysis may affect the strategies (see Wang et al. 2020).
This study mainly presents risk management tools that are versatile and comprehensive for EM local bond portfolios. For this purpose, modern time series approaches that consider fractal data structures and correlation dynamics are sought. The study primarily focuses on three aspects: return-volatility dynamics, correlation features, and VaR performances.
The remainder of this paper is organized as follows. The next section explains the compilation method of the bonds and the construction of the proxy time series that are suitable for evaluating the portfolios. Section 3 briefly covers specific long memory models and the DCC approach to be employed in the empirical analyses. In Section 4, together with a basic analysis of the data, the results of four different multivariate models are presented. In this section, using the selected model, out-of-sample VaR performances are examined for static and dynamic portfolio samples, including portfolio optimization. In Section 5, inferences of the analyses are compiled and the main findings of this study are presented.

Data
For the portfolio construction, the local bonds of Indonesia, Brazil, India, South Africa, Mexico, and Turkey's treasuries are examined. All bonds traded in the last 10 years are considered, and a total of 203 different bonds are selected. Every selected bond is either in the discounted form or has a fixed coupon payment. The selected bonds are in their local currencies.
The bond selection is based on maturity and liquidity criteria. Bonds that have maturities close to either 2 or 10 years are filtered. Bonds that are not liquid and not traded every working day are omitted. Coupon payments are assumed to be reinvested in the same bond. As time passes, when a bond in the portfolio fails to match the criteria, it is changed with a new one.
Although, daily prices of the selected bonds are recorded, the returns are compounded on a weekly basis. In financial markets, determination of the data frequency can affect the results of empirical analysis (see Narayan and Sharma 2015;. Nevertheless, as in the case of security selection, the main motivation for using weekly returns is to reflect the performance of an EM bond portfolio, that is, practicality. Because EM bonds are usually traded in over-the-counter platforms, and there are primary dealing advantages, higher frequencies, like daily data, are prone to distortions. Sometimes, liquidity is scarce with few transactions; as such, observed noises are cleaned only the following sessions. Although, these kinds of noises are disregarded by real investors, they can distort our volatility analysis. Furthermore, as we focus on portfolio analysis, applying portfolio weights to daily returns may not be realistic because in the real world it can be difficult and costly to rebalance EM bond portfolios on a daily basis. By contrast, lower frequencies, like on monthly basis, can lower the number of observations and lower the degrees of freedom in the analysis. Because available data are limited in EM, the number of out-of sample observations can be too small to perform a reliable VaR analysis. Again, regarding a sensible rebalancing period in the portfolios, using weekly data for analyses is considered more appropriate than using lower frequencies.
The local currency returns of bonds are converted to U.S. dollar returns based on the exchange rates of USD/IDR, USD/BRL, USD/IND, USD/ZAR, USD/MXN, and USD/ TRY, at the respective dates of the transactions. The pricing source for the bonds, as well as the currencies, is Bloomberg, and daily closing prices are used. Portfolio data are constructed to reflect the structure of USD-denominated EM local bond funds.
Bonds are grouped as short-term (ST) and long-term (LT) bonds to construct two different portfolios. In the literature or in the industry, there is no direct definition of short-term or long-term bonds. Meanwhile, to construct proxy portfolios, the holdings or classifications of available developed market funds are useful guides. In the market, bonds with less than five-year maturity (or particularly, 1-3 year maturity) are conventionally accepted as short-term bonds. 1 For long-term bonds, the interval is much wider: bonds with maturities (durations) of more than 10 years (7 years) are evaluated as long-term bonds in the major indices or funds in Europe and the United States. 2 Under these conditions, it is plausible to assign bonds that have close to a two-year maturity (approximately 1.7 years duration) to the short-term bond portfolio. For the long-term bond portfolio, bonds with maturities (approximate durations) close to 10 years (6.5 years) are chosen. The duration of the long-term bond portfolio can be evaluated with a little bit shorter maturity, but for some EMs such as in Brazil and Turkey there is an insufficient number of available long-term securities.
The portfolio data cover the period January 1, 2008 to August 31, 2018 and consist of 2784 observations. A weekly rebalancing is assumed for the portfolios. The ISIN (International Securities Identification Number) codes of the selected bonds are listed in the Appendix.
In Table 1, the average duration of Turkish bonds is up to 1 year shorter than the other bonds in the long-term bond portfolio because before 2011, the Turkish treasury did not issue long term bonds. Moreover, as can be observed below, the average yields of Mexican bonds are below average, as Brazilian and Turkish bonds have exceptionally high yields.

Methodology
In the data sample, every return series has periods that are riskier than the others (see Figures 1 and 2 in Appendix). In other words, there exist periods of sharp movements and small changes. Further statistical analyses of the data are shown in the next section. However, the marks of volatility clustering and autocorrelation (also of the squared residuals) have to be considered to ensure the assumptions of classical linear regression models. Since in the case of conditional heteroskedasticity, the efficiency rule is not ensured, checks on the parameter estimations are unreliable. In other words, in such cases, linear regression estimations will still be unbiased but less certain. Variations in the estimations are high, and standard errors are biased, which can lead to dealing with statistically insignificant coefficients. Conditional heteroscedasticity can lead to wrong hypothesis testing if the null hypothesis is mistakenly rejected (see Gauss-Markov theorem; Engle 1982 and the assumptions of the classical linear regression model). Considering related findings in the literature, the estimations are performed by applying ARIMA and ARFIMA mean models, together with GARCH and FIGARCH volatility models. Moreover, each return series has distinctive behaviors: periods of jumps and calms are different for each country, and co-movements are not stable at first glance. Again, based on relevant literature, a multivariate model is applied with a dynamic conditional correlation approach.
First, for the return estimation of each asset, the ARIMA method is defined as follows: where L is the lag operator with a(L) assuming N(0, 1); and σ 2 t is the variance (see Box and Jenkins 1976).
The ARFIMA model introduces fractional integration on the conditional mean model to measure the long memory dependence of time series. Furthermore, ARFIMA (p,D,q) is a generalized form of ARIMA, where the integration does not have to be a positive integer. The model expression is as follows: where a(L) and b(L) are usual AR(p) and MA(q) expressions, and all the roots are in the unit circle. D is the fractional differencing parameter that defines the fractional differencing filter as ð1 − LÞ D ¼ ð j − D − 1Þ! j!ð − D − 1Þ! L j , and j = 1, 2, 3… (see Granger and Joyeux 1980;Hosking 1981).
Regarding volatility, univariate volatilities and cross-correlations are to be estimated, as the portfolio variance is defined as follows: where w i is the weight, and σ i, t is the conditional volatility of the i th asset in the portfolio at time t, and ρ i, j, t is the conditional correlation of the i th asset with the j th asset at time t (see the modern portfolio theory or Markowitz 1952). The GARCH model is, in general, in the following form (Bollerslev 1986): where L is the lag operator with φ(L)(1 − L) d (y t − μ) = θ(L)u t ; β(L) ≡ L + β 2 L 2 + … + β p L p ; and u t is the residual of the mean model, with u t = σ t e t , e t~i . i. d. f(.). f(.) is assumed to be a strong white noise process. w > 0; α i ≥ 0; β i ≥ 0; P maxðp;qÞ i¼1 ðα i þ β i Þ < 1; and α i , β i ≡ 0 for i > p and j > q for a general GARCH(p, q) model.
Meanwhile, Baillie (1996) defines the FIGARCH model as follows: where d is the fractional differencing parameter, and using its properties, the equation reduces to (see Hosking 1981): λ(L) = λ 1 L 1 + λ 2 L 2 + λ 3 L 3 …, and the features of the fractional differencing filter are the same as in the case of ARFIMA.
To estimate the portfolio risk together with individual volatilities, contemporaneous correlations also need to be estimated. In this step, with the DCC model, pairwise correlations are modeled in a way similar to the modeling of volatilities. Let u t = r t − μ t be the residuals vector, with r t as the vector of individual returns and μ t as the expected returns.
In the DCC model, the covariance matrix of the residuals vector can be split as Σ t ≡ D t R t D t , where R t is the conditional correlation matrix, and D t is the diagonal matrix of individual volatilities. Bollerslev (1990) defines the estimator of the constant conditional correlation as follows: The dynamic correlation generalization is similar to the GARCH approach: where the elements of Q t are from the estimated pairwise correlations at time t. The estimation of a DCC (1,1) model is performed through the log-likelihood function where θ = (ω, α, β, φ, γ, δ), and the parameters of univariate GARCH models and pairwise correlations are estimated separately (see Sheppard 2001 andEngle 2002).

Preliminary analysis
For both short-and long-term bonds, Brazil's bonds have the best returns, in line with their average yields. From Table 2, regarding the means and the standard deviations, it is difficult to observe the expected risk-return relationship. For the short-term portfolio, Turkish bonds are the riskiest assets, but they do not offer the best return. For the long-term portfolio, although Indonesian bonds have the highest risk, with a maximum of 13.5% weekly loss and 1.95% standard deviation, it ranks third in terms of mean returns.
Regarding the normality checks, except for the short-term bonds of Turkey and Indonesia and the long-term bonds of South Africa, all bonds are negatively skewed. As almost all bonds are fat-tailed with high Kurtosis levels, the Jarque-Bera test results reject the null hypothesis that distributions are in Gaussian form. Similarly, Figures 1 and 2 in Appendix show that distributions are fat-tailed; thus, it is more convenient to use Student t-distribution for further analysis.
The Q-statistics of the returns and squared returns also confirm the model selection process mentioned in the previous section. The tests of squared returns are all significant, which implies the existence of autoregressive conditional heteroskedasticity for all data series. This situation indicates the need for ARIMA and GARCH models.
The Hurst exponents of the volatility series are also listed in Table 2. Since the exponents are higher than 0.5 and especially for some series they are close to 1, it can be deduced that there exist strong fractal features that imply long memory in the volatilities  (see Mandelbrot 1977). Under these conditions, long memory models can be evaluated as proper approaches. Augmented Dickey-Fuller and Philips-Perron tests reject the null hypothesis and show that the series are stationary and thus, are appropriate for time series modeling.
As can be seen in Appendix Fig. 1, all returns have very high volatility at the initial 100 observations. EM assets started in 2013 with solid performances. However, in May 2013, with the Fed announcement implying a policy normalization process, EM assets got under pressure. As the core rates went up, both currencies and bonds depreciated dramatically. This situation was common for almost all EMs.
However, for the rest of the observation, there is decoupling. The periods when the volatility levels have risen or declined differ from one country to another. In particular, Turkish bonds are seen to move in a distinct form at the latest part of the observation.

In-sample analysis
The sample data for empirical analysis is formed by the total return series of the bonds for the period January 1, 2008 to September 1, 2017. The return series of short-term bonds and long-term bonds are separated to construct the short-and long-term bond portfolios. In the tables below, the short-and long-term bonds are indicated as ST and LT, respectively. The data from September 1, 2017 to October 31, 2018 (52 weeks) are left for the out-of-sample performance analysis. First, the estimations of the benchmark model, DCC-ARIMA GARCH, are presented in Table 3. In the mean model, seven out of 12 return series exhibit the features of auto-regression, as their related estimations are statistically significant. Notably, the auto-regression terms for both the short-and long-term bonds of India, South Africa, and Mexico are statistically significant. For the long-term bonds, almost all autoregression estimations are negative, and except for Brazil, there also seems to be a significant level of moving average effect.
Regarding volatility, the β parameters are statistically significant for all series, providing clues about the memory in the volatilities. Since the α coefficients of the volatility modeling are statistically significant, except for the short-term bonds of Indonesia and India and the long-term bonds of Brazil, the impacts of the most recent innovations are also noteworthy.
Moreover, in the diagnostic checking, in line with the QQ-plots and the preliminary analysis of the raw data, the models based on a Student's t-distribution deliver way better information criteria (IC) and residual test results than the normal distribution for each data series. Thus, the results from the Student's t-distribution are shown in this section. In Table 3, the values of AIC (Akaike information criterion) and Bayesian IC are below −  30.000. Although the number of parameters is the same for both portfolios, the IC for the short-term bond portfolio are lower, implying a higher level of goodness of fit. Meanwhile, the multivariate residual tests of the long-term bond portfolio deliver higher values. 3 These results imply that the model has limitations in solving the residual autocorrelation and heteroscedasticity.
The empirical results of the next application are shown in Table 4. Here, without modeling the return series, FIGARCH is directly applied to all bonds but in the DCC framework again. For 10 out of the 12 data series, the long memory coefficient d is statistically significant.
For the fractional based models, the level of the d parameter is also critical. If the d coefficient is close to 1, then the shocks will diminish very slowly, and the model will look like an IGARCH model. However, if the level of this parameter is close to 0, it will be difficult to see the memory impact, and the model will approach to the plain GARC H model. Nevertheless, in Table 4, statistically significant long-memory parameters lie in the 0.31-0.66 interval. The d parameters, which are close to 0.5, also justifies the application of long memory models to this kind of data structure.
Meanwhile, we observe fewer statistically significant β parameters in Table 4, compared with that of the previous application: DCC-ARIMA GARCH. For example, for the long-term bonds, only Indonesia and Turkey have these statistically significant parameters. In the previous model, all the β coefficients were statistically significant. Two factors are effective in this situation. First, in the previous estimation, we applied the volatility model to the residuals of a mean model that provided healthier volatility estimations. Nevertheless, here in the DCC-FIGARCH estimation the model is directly applied to the data, and the expected return is assumed constant. The second factor relates to the role of d. In straightforward GARCH models, the level of α reflects the impact of the recent innovation, as β carries the memory that shows the impact of past volatility. However, in FIGA RCH, the impact of previous shocks is exhibited by an adjustable hyperbolic rate d. As memory is partially held by d, we come up with a lower number of significant β parameters.
When we examine the values of d in Table 4 and in the following models (see Tables 5 and 6), we find that Indonesian bonds have the highest numbers, which means that the highest level of persistence is observed in the Indonesian bonds (this situation is also observed in the Hurst exponents, see Preliminary Analysis). Mexico and India are the other two countries with high persistence figures.
Particularly during the post-GFC period, foreign capital flows have been one of the major determinants of EM assets. At the initial stage, unprecedented liquidity actions from the central banks of DMs, particularly the Fed, and low GDP growth rates in the mature economies induced historical amounts of capital flows to EM. However, this situation made the countries that have high external deficits vulnerable to liquidity conditions. From this viewpoint, the current account balances of Indonesia and Mexico have been much more stable compared with the other countries. For the observation period, the average current account deficit (CAD) to GDP figures of Indonesia and Mexico were 1.1% and 1.7%, respectively. The deterioration in India had been  limited, and its recovery was fast. However, Brazil, South Africa, and Turkey have been evaluated as the weakest members of the major EMs, with sharply widening external deficits.
The events that changed the conditions of easy liquidity, such as the taper tantrum or the lift-off decision, 4 significantly affected the fund flows to these countries since their economies had become much more dependent on external capital. Particularly, the South African and Turkish economies sharply oscillated during this period. The CAD figures of South Africa had tested 6% of GDP, and as of 2011 year end, Turkish figures had reached all-time-high levels of 9%. Moreover, apart from these countries' political developments (like election cycles) were other events that affected the volatility structure.
To summarize, vulnerabilities to global liquidity conditions had partially diminished the impact of long memory in the volatilities, as the liquidity conditions sharply changed during this period. As a result of the economic situation, even the d parameters of Turkey's short-term bonds and South Africa's long-term bonds failed to be statistically significant in the fractional models below.
In terms of goodness of fit, the Schwarz and Hannan-Quinn IC are lower than those in the previous model. Because the number of observations is large enough, the better results of these Bayesian approaches are noteworthy. However, as in the previous case, residual portmanteau test results are high for the short-term bond portfolio; thus, further modeling is needed to lower the autocorrelation and heteroscedasticity values.
In the third example, fractional models are applied to both the mean and the volatility. A single-step ARFIMA is applied on the mean, as FIGARCH is used for the volatility modeling. The parameter estimations and t-statistics are listed in Table 5. In the mean equation, the long memory parameters are statistically significant for the shortterm and long-term bonds of South Africa. The auto-regression and moving average parameters of the mean models are statistically significant only for one-third of the data series. The short-term bonds of South Africa and Mexico and the long-term bonds of India and Mexico are some of the examples.
In the volatility modeling, similar to the previous application, most of the d coefficients are statistically significant. In Table 5, only the volatilities of Turkish bonds and the long-term bonds of South Africa do not exhibit long memory impact. Meanwhile, the modeling could not be performed for the Indonesian long-term bonds because the d coefficient is negative, and all of the mean and volatility parameters are statistically insignificant.
The DCC-ARFIMA FIGARCH model has a lower Shibata IC for the short-term bond portfolio, and the Akaike IC for both short-term and long-term portfolios. The residual diagnostics here are also better, as compared with the previous applications.
Considering the test results of the fractional parameters in Table 5, the long-memory impact in the mean model is questionable. Thus, plain and fractional models are combined in the last application below. The empirical results of the fourth application are listed in Table 6. In this last step, the mean model is estimated under the plain ARIMA approach, and the fractional modeling is performed only for the volatility.
In Table 6, most of the mean model parameters (especially for the long-term bond portfolio) show statistically significant auto-regressive and moving average coefficients.
Meanwhile, the volatilities show a long memory feature, as the d coefficients are statistically significant for 10 out of the 12 samples. These parameters are estimated at around 0.5 levels; the definite necessity of the fractional modeling is observed again. Furthermore, six out of the 12 data series still have statistically significant β coefficients, together with long-range dependence.
The model's diagnostic check also shows sufficient IC and residual diagnostics. The residual diagnostics imply that fractional volatility modeling is better in terms of overcoming heteroskedasticity. For each case, the fractional based models deliver better results compared with the benchmark model. However, DCC ARIMA-FIGARCH has the lowest values of the Akaike and Shibata IC for both portfolios. Among the eight goodness of fit tests for each model the DCC ARIMA-FIGARCH model delivers the lowest level of IC for four times. Applying the parameter tests and IC contribute to the robustness of the model. Specifically, the levels of the statistically significant fractional parameter d under various conditions (for two different distribution assumptions and various samples) are also supportive of the model selection. Moreover, the residual tests show that the autocorrelation and the heteroscedasticity are untangled for both portfolios.
In the framework of this study, the correlations are evaluated as being dynamic, and the estimations of each model are listed in each of the tables. Apart from standard correlations, DCC estimation is composed of three parts, and two of them are time-varying. All coefficients add up to 1 (i.e., constitute the estimated correlation matrix), and in the table, the time-varying α and β coefficients are listed. β shows the level of dependence of the correlation matrix to the most recent estimated matrix, while α determines the contribution of the most recent residuals of the GARCH model. In other words, α adjusts the estimation with the most recent innovations. In the DCC, the remaining part, the covariance matrix of the error terms (which is the average product of the standardized residuals of the model) is constant.
The outputs of the short-and long-term bonds are listed separately in Tables 3, 4, 5 and 6. In the tables, the left-(right-) hand side of each headline refers to the short-(long-) term bond portfolio. The β and pairwise coefficients are statistically significant for all data series. For the short-term bond series, the α parameters are also significant at the 1% level. However, the α parameters in the DCC estimations of the ARIMA-GARCH and ARFIMA-FIGARCH models are not statistically significant under the 5% threshold.
The parameter estimations of the DCC models are close to each other (see Tables 3,  4, 5 and 6). As the β levels are above 90%, and the α levels are around 2% for all models, we can infer that the correlations are persistent; the change caused by the recent innovations is slow. The values and related tests exhibit signs of long memory in the correlation part as well.
In the ρ parameters, the highest figures are observed in the cross-correlation estimations of South Africa's and Turkey's bonds. Since these parameters are constant, this adds additional persistence impact on the estimations. In line with our economic interpretation about the volatility estimations above, this situation seems plausible. During the observation period, the macroeconomic dynamics of Turkey and South Africa were similar, making their bond markets behave similarly.
Meanwhile, from the perspective of forecasting, the parameters for each model are very similar. This implies that the level of individual volatilities, not the conditional correlations, is the determinant of the difference in the conditional covariance matrices. In other words, the risk estimation is mostly based on the forecasting of the individual volatilities.

Out-of-sample value at risk analysis
Out-of-sample performances are examined for a period of 1 year, based on a rolling window analysis. Conditional mean, conditional volatility, and dynamic correlation forecasts of a model based on rolling samples are generated for 52 weeks, covering the period September 1, 2017 to August 31, 2018. From these forecasts, the expected returns of the portfolios are calculated; correlation matrices and individual volatilities are used to construct manually the portfolio variances.
For each week, 54 parameters (12 individual returns, 12 volatilities, and 30 pairwise correlations) are forecasted and taken for further calculation. This burdensome process is repeated for each of the 52 weeks. As further computations are required for the portfolio weighting strategies at each step, the process becomes even more complicated.
As a broader analysis, the process can be performed under different distribution conditions such as the normal and t-distributions for all of the models. However, in this section, due to the intensive process involved, one model is examined under a selected distribution condition, and the rest is left for future studies. The model and distribution selections are based on the in-sample test results. In line with the outcomes of the previous section for the models and particularly the IC results under the normal and t-distributions, DCC ARIMA-FIGARCH is chosen for further analysis. The statistical analysis of the individual assets and the model outcomes that are examined for both distributions particularly pointed to the selection of a t-distribution.
The VaR analysis of these forecasts is performed with respect to five different weighting methods, as shown in Table 7. The weighting strategies consider both static and dynamic approaches. As examples of static approaches, equally weighted, GDP-weighted, and market cap-(MCAP) weighted strategies are adopted. These strategies can reflect (or develop) the structures of the related passive investments.
For the dynamic portfolios, biweekly-changing efficient portfolios are considered. The mean-variance method of the modern portfolio theory is a common approach that covers the dynamic hedging ratios among different asset classes (see Markowitz 1952). Each week, model forecasts of the mean, volatility, and dynamic correlation variables are evaluated for portfolio optimization. The optimizations are performed with the objective of mean-variance (MV) efficiency, thereby maximizing the portfolio return per additional risk. The portfolios comprise only long positions; the possibility of short sales and leverage are restricted. There are no individual constraints on assets. In this way, for every period, asset weights for the short-and long-term bond portfolios are calculated. In Table 7, the average values from the weights of 52 different portfolios for short-and long-term bonds (total of 104) are listed.
In Table 8, for all portfolios, there is supposed to be five and two failures out of 52 observations in the 90% VaR and 95% VaR, respectively.
For the short-term bond portfolios, the model estimated six failures in the 90% VaR and four failures in the 95% VaR (three for the MCAP weighted portfolio). The numbers are close to actual failures, and some of the exceedances occurred with tiny breaches, for example, 2 bps. Moreover, we can conclude that the estimations are relatively cautious.
For the long-term bond portfolios, the model's results are better: the 95% VaR is crossed two times, as it is supposed to be in the GDP-weighted, MCAP-weighted, and  In the Kupiec test, the log-likelihood function, is asymptotically chi-squared, where s is the number of failures and N is the number of observations. The (1 − α) VaR method fails if the function exceeds the critical value (see Kupiec 1995). The conditional coverage test of Christoffersen (1998) elaborates on the independence of failures. The log-likelihood function, Þ s 11 g , similarly follows a chi-squared distribution with special counting parameters. s 00 refers to the number of two consecutive periods without failure, and s 11 refers to the number of two consecutive periods with failure. s 01 and s 10 represents the number of the periods without failure followed by a period of failure, and the other way round, respectively. p 0 , p 1 , p are the probabilities or the share of the counts; p 0 ¼ s 01 s 00 þs 01 ; p 1 ¼ s 11 s 10 þs 11 ; and p ¼ ðs 01 þ s 11 Þ=ðs 00 þ s 01 þ s 10 þ s 11 Þ. The test results are presented in Table 8. The number of failures and their independence are in-line with the VaR assumptions. The model performance for each one of the static and dynamic portfolios is sufficient. With high p-values, these results imply that the DCC ARIMA-FIGARCH is a good application from the VaR perspective.
As an additional assessment, the results are also compared with the estimations of the plain approaches: delta-normal and historical simulation. Although it is not presented here, especially for the long-term bond portfolios, the time series model delivers performance that is significantly better than both of these basic approaches.

Conclusion
Bonds and bills have been one of the major asset classes in almost all portfolios. In the post-millennium era with the yield-hunting motivation becoming prevalent, EM bonds have received the attention of global investors. These days, all of the major asset management houses provide EM bond funds for their investors. However, there is a lack of literature about the management of EM bond portfolios, especially from the risk perspective.
This study covers all the local bonds of six major EM economies that have been actively traded in the last 10 years. We filter 203 different bonds and constructed hypothetical portfolios to study the performance of certain risk management tools.
Volatility modeling is a critical part of this study, and relatively new approaches are applied, together with common methods. Fractionally integrated models are evaluated in this manner. In the academic literature, these models have been used in a univariate framework and applied to specific asset classes such as the commodity and stock markets. For the other asset classes, especially those concerning the fixed income market, there is a limited number of univariate applications. In the academic literature, this study is the first to tackle the multivariate analysis of EM bonds and apply the fractional modeling concept.
We deploy four models in the DCC framework: ARIMA-GARCH, FIGARCH, ARFI MA-FIGARCH, and ARIMA-FIGARCH. The parameter estimations and in-sample performances of these four models are examined. All analyses are made under the normal and t-distributions.
First, for the mean models, more than half of the data series show auto-regressive features. The AR coefficients of both the short-and long-term bonds of India, South Africa, and Mexico and the long-term bonds of Indonesia are statistically significant. Moreover, except for Brazil, the returns of all long-term bonds have moving average impacts at statistically significant levels.
Meanwhile, regarding the return series, it is difficult to determine the impact of long memory. Unlike in the case of the plain ARIMA model, for most of the other cases, the parameters of the ARFIMA model have no statistical significant.
Regarding volatility, the GARCH model have statistically significant β coefficients, which indicates the impact of memory. Furthermore, except for three data series, the α parameters are also significant. The fractionally integrated volatility models are applied to the residuals of the approaches: unconditional mean, ARIMA, and an ARFIMA model. In the applications, evidence of long memory is detected, at least once for almost all the assets. Most of the time d coefficients are statistically significant. Furthermore, for each case, the level of d lies at around 0.5; this indicates the need for fractionally integrated modeling.
Third, for the DCC part of each mean and volatility modeling, almost all model parameters of conditional correlations are statistically significant. For almost all models, the individual coefficients, as well as the α and β parameters of the DCC models, are positive at the 1% significance levels, which signifies dependence on the previous innovations.
From the diagnostic checking results, the IC and residual test outputs of the fractionally integrated models are better than those of the benchmark ARIMA-GARCH model. In conclusion, the DCC ARIMA-FIGARCH model have better goodness of fit results for both the short-and long-term bonds, in line with the results of the tests of parameter estimations.
For the out-of-sample performance evaluation, the VaR analysis is conducted based on one-year rolling window forecasts. The portfolios are populated in static and dynamic forms. Homogeneously weighted, GDP weighted, and market-capweighted portfolios are considered as static proxies. Furthermore, mean-variance efficient portfolios (optimal weighting strategy) for each week are also taken into consideration.
The Kupiec and Christoffersen tests indicate that DCC ARIMA-FIGARCH is a sufficient VaR model for various static and dynamic EM portfolios. The VaR analysis is performed only for the DCC ARIMA-FIGARCH model due to the intensiveness of the process.
This study provides implications for both investors and policymakers. The analysis results suggest that the risk management of EM bond investments should consider the long memory concept, as the empirical findings are from direct applications. Not only index-tracking static portfolios but also dynamically managed active portfolios should consider the fractional methods when conducting risk analysis.
Second, in the academic literature, persistence of shocks is usually observed in high-risk assets such as commodities or equities. Meanwhile, government bonds, especially the short-term ones, are usually perceived as low-risk assets, and their yields are considered as the benchmarks of local interest rates. In most countries, the volatility of bonds and their yields affecthe policies of the governments and the central banks. As a liquidity tool, it is common for central banks to intervene actively in the bond markets. Although the situation is not the same for emerging countries and their assets, the evidence of volatility persistence in government bonds is noteworthy from this viewpoint. Once a shock in government bonds takes place, it takes a long time for its effects to diminish, and this may require specific approaches. The application of fractional models can be useful for this core asset class, which has a significant role in liquidity policies.
For future work, the application of the other fractional models, which cover an asymmetric power or leverage effects, can be evaluated. In addition, risk analysis can be conducted using different approaches such as risk metrics and variance-covariance. Regarding correlation, this study only dealt with dynamic correlation. An analysis of constant correlation models and other multivariate models can also be performed in the EM bonds universe.