The effect of intraday periodicity on realized volatility measures

Dette, Holger; Golosnoy, Vasyl; Kellermann, Janosch

doi:10.1007/s00184-022-00875-0

The effect of intraday periodicity on realized volatility measures

Open access
Published: 16 July 2022

Volume 86, pages 315–342, (2023)
Cite this article

Download PDF

You have full access to this open access article

Metrika Aims and scope Submit manuscript

The effect of intraday periodicity on realized volatility measures

Download PDF

3025 Accesses
1 Citation
Explore all metrics

Abstract

We focus on estimating daily integrated volatility (IV) by realized measures based on intraday returns following a discrete-time stochastic model with a pronounced intraday periodicity (IP). We demonstrate that neglecting the IP-impact on realized estimators may lead to invalid statistical inference concerning IV for a common finite number of intraday returns. For a given IP functional form, we analytically derive robust IP-correction factors for realized measures of IV as well as their asymptotic distributions. We show both in Monte Carlo simulations and empirically that the proposed bias corrections are the robust way to account for IP by computing realized estimators.

Volatility in the Cryptocurrency Market

Article 24 August 2019

Momentum: what do we know 30 years after Jegadeesh and Titman’s seminal paper?

Article Open access 02 August 2022

Functional central limit theorems for rough volatility

Article Open access 16 April 2024

1 Introduction

For the majority of financial markets, the pervasive intraday periodicity (IP), which often takes a U- or mirrored J-form during the daily trading time, is a well documented empirical feature of intraday absolute returns (Wood et al. 1985; Harris 1986). As it appears to be highly correlated with intraday variation of trading volume, Admati and Pfleiderer (1988) propose to explain daily IP-shape by strategic interaction of traders around market openings and closures, whereas the periodicity at weekly or monthly horizons could be attributed to the impact of slowly varying macroeconomic fundamentals (Andersen and Bollerslev 1998b; Andersen et al. 2001, 2003).

The availability of high-frequency data allows for the construction of precise estimators of daily integrated volatility (IV) for risky asset returns. The realized volatility (RV) defined as a sum of squared intraday returns is known to be a consistent estimator of daily IV in absence of jumps. Other realized measures such as the bipower variation (BV) should be used for IV estimation in presence of finite-activity jumps during the day (cf. Aït-Sahalia and Jacod 2014). Barndorff-Nielsen and Shephard (2004) derive the asymptotic properties of these quantities for the number of intraday returns $M\rightarrow \infty $ under mild assumptions on the corresponding pricing process. In practice, however, the number of intraday returns available for computing realized estimators often remains limited due to insufficient liquidity and/or irregular trading activity. Even for highly liquid stocks a practitioner could prefer to rely on simple realized estimators based e.g. on 5 min returns primarily in order to escape from adverse effects of market microstructure noise (MMN) which are particularly pronounced at (ultra) high sampling frequencies (cf. Aït-Sahalia and Jacod 2014).

In this paper we analyze and quantify the impact of IP on the finite M properties of RV and BV estimators by providing corresponding formal statements given the IP functional form. To the best of our knowledge, this research agenda has not been explored yet, although there is a vast amount of literature devoted to modeling and estimating IP (cf. Engle et al. 1990; Andersen et al. 2019; Christensen et al. 2018). Thus, our investigation provides useful insights for exploring differences between the asymptotic theory and the practical finite sample performance of realized measures based on intraday data.

To model intraday returns, we presume a discrete time stochastic specification as in Andersen and Bollerslev (1997), where the variance of intraday returns is written as a product of the deterministic periodic and stochastic volatility (SV) components. The IP is assumed to be constant for all days, whereas the SV part is slowly changing over time. Our framework is motivated by the empirical evidence that the IP captures a vast part of intraday volatility variation whereas the SV impact is of a smaller order (cf. Christensen et al. 2018). We show that for the commonly available finite number of intraday returns M neglecting the impact of IP would lead to non-valid statistical inference concerning daily IV. For a given IP, we compute the first and the second moments of RV and BV, moreover, we establish the asymptotic bivariate distribution of these measures as $M\rightarrow \infty $. We also quantify the impact of IP on realized tri-power (TP) and quad-power (QP) estimators of daily integrated quarticity (IQ) required for statistical inference about IV.

Our major finding is that for the commonly available finite number of intraday returns the impact of IP should be explicitly addressed when making statistical inference concerning IV. While the RV estimator of IV is unaffected by IP, BV has a finite sample bias negligible only for large sample sizes which are not always available in practice. Moreover, by estimating IQ one should account – at least for small M values – for scaling factors, which depend on the functional form of the IP. We derive the explicit expressions for these IP-correction factors and provide the asymptotic distribution for their estimators.

Our theoretical results are illustrated in a Monte Carlo study, where we investigate the impact of IP on various realized measures. We find that the IP-correction procedures proposed in this paper are helpful against the adverse impact of IP on realized measures in finite samples. Moreover, our IP-corrected estimates are advantageous compared to immediate removing of the estimated IP due to their robustness with respect to IP misspecifications at days with unusual pattern of intraday volatility (cf. Gabrys et al. 2013; Kokoszka and Reimherr 2013). In such situations our approach is preferable in terms of relative bias and mean squared error (MSE) compared to the standard procedure with immediate scaling of intraperiod returns by the estimated IP profile as in Boudt et al. (2011), for example. In the empirical application we estimate the IP for the daily volatility of the Dow Jones Industrial Average Index with and without IP bias corrections.

The remaining part of the paper is organized as follows. In Sect. 2, we introduce the model for intraday returns and discuss the realized estimators of daily IV and IQ. The theoretical results are derived in Sect. 3 where we establish both finite sample and asymptotic stochastic properties of commonly applied realized estimators for a given IP form. Moreover, we provide expressions for IP correction factors and derive the asymptotic distributions of their estimators. Our approach is illustrated in Sect. 4 by means of a simulation study and in Sect. 5 by an empirical application. Sect. 6 concludes, whereas the proofs are placed in the Appendix.

2 Measuring daily volatility based on intraday information

Before we introduce our model in Sect. 2.2, we provide definitions of the objects which are of importance for our analysis. For this purpose we start from a general jump-diffusion model for log price increments in order to define the objects of our interest, namely the daily IV and daily IQ. Next, we present the corresponding realized estimators which are based on M intraday returns. These realized measures are consistent estimators of IV for $M\rightarrow \infty $, however, in practice we often face $M\le 10^2$ primarily because ultra high frequency returns are contaminated by MMN. Thus, it is of importance to explore the finite sample stochastic properties of realized estimators. For this purpose we then consider a discrete time model for intraday returns with an explicit functional specification of intraday periodicity (IP) and study the impact of IP on the realized measures of IV and IQ for finite M.

2.1 Model for intraday returns and realized measures

Assume that log-prices of risky assets $p(t)=\ln P_t$ follow a continuous time process with (possible) additive jump components. We consider a day t as the period of interest with the daily return $r_t=p(t)-p(t\!-\!1)$ and focus on the integrated volatility (IV), which is defined for day t as

$$\begin{aligned} IV_t=\sigma ^2_t=\int _{t-1}^t \sigma ^2(u)du, \end{aligned}$$

where $\sigma (u)$ is a spot volatility. In order to make statistical inference about IV measures one also needs statements concerning the daily integrated quarticity (IQ) defined by

$$\begin{aligned} IQ_t=\int _{t-1}^t \sigma ^4(u)du. \end{aligned}$$

The availability of intraday returns allows to construct precise realized estimators (Andersen and Bollerslev 1998a) for the daily IV which are of immense practical importance for estimation and inferences concerning daily volatility. Assume that M equally spaced intraday returns are available for day t. We denoted them by $r_{t,m}=p(t\!-\!1+m/M)-p(t\!-\!1+(m\!-\!1)/M)$ for $m=1,\ldots ,M$. Then the daily return $r_t$ is the sum of intraday returns with $r_t=\sum _{m=1}^M r_{t,m}$. The most popular IV estimator is the realized volatility (RV) measure given as

$$\begin{aligned} RV_t=\sum _{m=1}^M r^2_{t,m}. \end{aligned}$$

Barndorff-Nielsen and Shephard (2004) show that $RV_t$ is a consistent estimator of $IV_t$ without jumps at day t, i.e. $RV_t\overset{p}{\longrightarrow }IV_t$ as $M\rightarrow \infty $. Although the estimator $RV_t$ possesses a set of appealing properties, it is not appropriate in the presence a non-zero jump component.

The bipower variation (BV) proposed by Barndorff-Nielsen and Shephard (2004)

$$\begin{aligned} BV_t = \frac{M}{M-1} \frac{\pi }{2} \sum \limits _{m=2}^{M} |r_{t,m}||r_{t,m-1}| \end{aligned}$$

(1)

is a jump-robust estimator of IV. It is consistent even in presence of jumps, i.e. $BV_t\overset{p}{\longrightarrow }IV_t$ as $M\rightarrow \infty $. However, RV has a smaller variance than BV if there is no jumps, so the common practice is first to test for a jump during day t. Then, in case of a significantly large positive distance between RV and BV indicating jumps, one should apply BV; otherwise RV is to use.

Intraday returns are also suitable for the purpose of estimating the unknown daily IQ required for computing variances of RV and BV measures. The realized quarticity (RQ)

$$\begin{aligned} RQ_t=\frac{M}{3}\sum _{m=1}^{M} r^4_{t,m}, \end{aligned}$$

is a consistent estimator of IQ in the case of no jumps with $RQ_t \overset{p}{\longrightarrow }IQ_t$ as $M\rightarrow \infty $. However, as $RQ_t$ measure is not robust (cf. Andersen et al. 2014), Barndorff-Nielsen and Shephard (2004) suggest to use the realized tri-power (TP) and quad-power (QP) measures defined by

$$\begin{aligned} QP_t= & {} \frac{M^2}{M-3} \cdot \frac{\pi ^2}{4} \cdot \sum _{m=4}^M |r_{t,m-3}||r_{t,m-2}||r_{t,m-1}||r_{t,m}|, \end{aligned}$$

(2)

$$\begin{aligned} TP_t= & {} \frac{M^2}{M-2} \cdot \mu _{4/3}^{-3} \sum \limits _{m=3}^M |r_{t,m-2}|^{\frac{4}{3}}|r_{t,m-1}|^{\frac{4}{3}}|r_{t,m}|^{\frac{4}{3}}, \quad \text{ with } \quad \mu _{4/3}=0.8309.\nonumber \\ \end{aligned}$$

(3)

Although both RV and BV have appealing stochastic properties as $M\rightarrow \infty $, their practical implementation is often based on (say) 5 min intraday returns which makes $M=78$ intraday observations for a usual 6.5-hours trading day, because of MMN which hinders the use of ultra high frequency data for construction of realized estimators (McAleer and Medeiros 2008). To overcome these problems, the recent research has been focused on making realized estimators more robust to these features. However, in many situations the common practice still remains to sample returns at a lower frequency, i.e. to consider 5–, 10–, or even 15-min intraday returns (Andersen et al. 2011). We follow this strand of literature and concentrate on profound understanding of stochastic properties of realized estimators for comparatively small values M. Although we focus in our analysis primarily on the classical RV, BV, QP, and TP measures, we also provide a discussion of recently proposed further realized measures in Sect. 5.2 and in Dette et al. (2022).

2.2 Discrete time model for intraday returns

The IP in absolute intraday returns is one of the most important stylized facts characterizing high frequency data. In order to investigate the IP impact on realized measures for a fixed number of intraday returns M, next we introduce a discrete time model in (4) which is central for study. There is a substantial scope of recent literature concerning discrete-time modeling of intraday returns whereas the IP is assumed to be a multiplicative scaling component (Boudt et al. 2011; Engle and Sokalska 2012; Bekierman and Gribisch 2021). Following Andersen and Bollerslev (1997), we focus on a discrete stochastic model for intraday return without jumps, which is given as

$$\begin{aligned} r_{t,m}= & {} \sigma _{t,m} \cdot u_{t,m}, \qquad \text{ with } \qquad u_{t,m}\, \sim \; \text{ iid }~{\mathcal {N}}(0,1), \nonumber \\ \sigma ^2_{t,m}= & {} 1/M \cdot s^2_{t,m} \cdot \gamma ^2_{t,m}, \end{aligned}$$

(4)

where $s_{t,m}>0$ is the deterministic IP component and $\gamma ^2_{t,m}>0$ is the stochastic part. Note that there is no leverage effect in (4), as it is mostly of importance for daily returns but much less pronounced for high-frequency intraday returns, see e.g. Bollerslev et al. (2006).

For our theoretical derivations we presume that the stochastic part remains constant within day t (Andersen and Bollerslev 1998b; Hecq et al. 2012), i.e. $\gamma _{t,m}=\sigma _t$ for all $m=1,\ldots ,M$, but may change from one day to another. This assumption is justified by the empirical evidence that IP commonly accounts for a vast part of intraday volatility variation (cf. Christensen et al. 2018). In Sect. 4 we relax this assumption in the Monte Carlo simulation study by considering the intraday SV which follows a diffusion process as e.g. in Goncalves and Meddahi (2009). Based on the results of our Monte Carlo simulations, we conclude that our major findings also hold in the SV setting.

In line with the current literature (Hecq et al. 2012), we set the IP as constant at different days so that we further skip the time index with $s_{t,m}=s_m$. Moreover, the periodic component is standardized such that it sums up to M over the day with $\sum _{m=1}^M s^2_{m}=M$. Of course, a special case $s_m=1$ for all $m=1,\ldots ,M$ corresponds to no IP. Putting all together, we separate the intraday periodic component $s_m$ which is solely responsible for intraday heteroskedasticity, and interday stochastic component $\sigma _t$ which could change from one day to another by writing

$$\begin{aligned} \sigma ^2_{t,m}=1/M \cdot s^2_m \cdot \sigma ^2_t \qquad \text{ with } \qquad \sum _{m=1}^M s^2_{m}=M. \end{aligned}$$

(5)

Although the model in (4) and (5) is fairly simple, it allows a detailed analysis of the IP impact on popular realized measures of the objects of our interest, which are the IV for day t

$$\begin{aligned} IV_t = Var(r_t) = \sum \limits _{m=1}^M Var(r_{t,m})= \frac{1}{M}\sum \limits _{m=1}^M s_m^2 \sigma ^2_{t} = \sigma ^2_t, \end{aligned}$$

(6)

as well as the IQ written by (Andersen et al. 2014)

$$\begin{aligned} IQ_t = M/3 \cdot \sum _{m=1}^M E(r^4_{t,m})=\sigma ^4_t/M \, \cdot \, \sum _{m=1}^M s^4_{m}. \end{aligned}$$

(7)

Note that as in general it holds that $\sum _{m=1}^M s^4_{m} \ge M$, the IQ is directly influenced by IP. We aim to investigate the impact of IP on realized measures of IV and IQ.

3 The impact of intraday periodicity on RV and BV

To gain results on the IP impact on realized measures, re-write the normalized IP $\{s_m\}_{m=1}^M$ as

$$\begin{aligned} s_m^2=g\left( \frac{m}{M}\right) \, \big /\, g_M, \qquad \text{ with } \qquad g_M=\frac{1}{M}\sum _{m=1}^M g\left( \frac{m}{M}\right) , \end{aligned}$$

(8)

where $g:[0,1]\mapsto {\mathbb {R}}^+$ with ${\mathbb {R}}^+ :=(0,+\infty )$ is a given non-normalized function. This normalization is very common in IP-literature (cf. Andersen and Bollerslev 1997, p. 153). The functional form of $g(\cdot )$ could be quite flexible and is subject to very general regularity conditions specified in the following propositions.

3.1 Bias and variance of realized estimators

For the discrete model of intraday returns (4)–(5) and the IP from $\{g(m/M)\}_{m=1}^M$, we derive expectation, bias and variance of RV and BV estimators of daily IV in the next proposition.

Proposition 1

Assume that the IP component is given by (8) for some function $g:[0,1]\mapsto {\mathbb {R}}^+$.

$\text{(A) }$ The estimator $RV_t$ for daily IV is unbiased so that $E[RV_t]=IV_t$. The estimator $BV_t$ is biased, that is $E[BV_t]= M/(M-1)\sigma ^2_t(1-R_M)=M/(M-1) IV_t(1-R_M)$, where the factor $R_M\in (0,1)$ is given by

$$\begin{aligned} R_M = \left( g\left( \frac{1}{M}\right) +\sum \limits _{m=2}^M g\left( \frac{m}{M}\right) ^{1/2}\left[ g\left( \frac{m}{M}\right) ^{1/2}- g\left( \frac{m-1}{M}\right) ^{1/2}\right] \right) /\sum \limits _{m=1}^M g\left( \frac{m}{M}\right) . \end{aligned}$$

If $g(\cdot )$ is continuously differentiable on interval [0, 1], it holds as $M\rightarrow \infty $

$$\begin{aligned} M \cdot R_M = \left[ \frac{1}{2} \int _{0}^1 g'(x)dx\Big /\int _{0}^1 g(x)dx + g(0 )\Big /\int _{0}^1 g(x)dx \right] \cdot \left( 1+ o(1) \right) , \end{aligned}$$

so that $\lim _{M\rightarrow \infty } R_M=0$, i.e. $BV_t$ is an asymptotically unbiased estimator of IV.

$\text{(B) }$ The (co)variances of $RV_t$ and $BV_t$ are given as

$$\begin{aligned} Var(RV_t)= & {} \frac{2\sigma ^4_t}{M^2 g^2_M}\sum _{m=1}^M g^2\left( \frac{m}{M}\right) ,\\ Var(BV_t)= & {} \frac{\sigma ^4_t}{(M-1)^2g_M^2} \left\{ \left( \frac{\pi ^2}{4}-1\right) \sum \limits _{m=2}^{M} g\bigg (\frac{m}{M}\bigg )g\left( \frac{m-1}{M}\right) \right. \\&\quad \left. + (\pi -2)\sum \limits _{m=3}^{M}g\left( \frac{m}{M}\right) ^{1/2}g\left( \frac{m-1}{M}\right) g\left( \frac{m-2}{M}\right) ^{1/2}\right\} ,\\ Cov(RV_t,BV_t)= & {} \frac{\sigma _t^4}{M(M-1) g^2_M}\left[ \sum \limits _{m=2}^M g\left( \frac{m-1}{M}\right) ^{1/2}g\left( \frac{m}{M}\right) ^{3/2}\right. \\&\quad \left. +\sum \limits _{m=1}^{M-1} g\left( \frac{m+1}{M}\right) ^{1/2}g\left( \frac{m}{M}\right) ^{3/2}\right] . \end{aligned}$$

For $g(\cdot )$ continuously differentiable on interval [0, 1] and for $M\rightarrow \infty $ it holds that $Var(RV_t)= (1/M) \cdot 2 \sigma ^4_t \cdot \xi \ (1+o(1))$,

$$\begin{aligned} Var(BV_t) = (1/M) \cdot \sigma ^4_t \cdot \xi \left( \frac{\pi ^2}{4} - 3 + \frac{\pi }{4} \right) \ (1+o(1)), \end{aligned}$$

(9)

and $Cov(RV_t,BV_t) = (1/M) \cdot 2 \sigma ^4_t\cdot \xi (1+ o(1))$. The asymptotic scaling factor $\xi $ is defined by

$$\begin{aligned} \xi = \int ^1_0 g^2 (x) dx \big /\left( \int ^1_0 g(x) dx\right) ^2. \end{aligned}$$

(10)

It holds that $\xi \ge 1$ with $\xi =1$ if and only if $g(\cdot )$ is almost everywhere constant, i.e. there is no IP.

The property $R_M \in {(0,1)}$ follows from the proof of Proposition 1 in the Appendix. More precisely, by the Cauchy-Schwarz inequality, we have

$$\begin{aligned} 0< 1 - R_M&= \frac{\sum _{m=2}^M {\left[ g\left( \frac{m}{M}\right) g \left( \frac{m-1}{M}\right) \right] }^{1/2}}{\sum _{m=1}^M g \left( \frac{m}{M}\right) } \\&\le \frac{\left( \sum _{m=2}^M g \left( \frac{m}{M}\right) \right) ^{1/2} \left( \sum _{m=2}^M g \left( \frac{m-1}{M}\right) \right) ^{1/2}}{\sum _{m=1}^M g \left( \frac{m}{M}\right) } < 1. \end{aligned}$$

Thus, in the case of IP, RV is an unbiased estimator for IV but BV has a finite M bias which should be corrected in applications. Since the expectation of BV is given by

$$\begin{aligned} E[BV_t]=\frac{\sigma _t^2}{(M-1)\cdot g_M} \cdot \sum \limits _{m=2}^M g\left( \frac{m}{M}\right) ^{1/2}g\left( \frac{m-1}{M}\right) ^{1/2}= \frac{\sigma _t^2}{M-1} \cdot \sum \limits _{m=2}^M s_m s_{m-1}, \end{aligned}$$

we suggest the following bias-corrected measure

$$\begin{aligned} {\widetilde{BV}}_{t}=\frac{\pi }{2} \cdot M \cdot \left( \sum \limits _{m=2}^M s_m s_{m-1}\right) ^{-1} \cdot \sum _{m=2}^M |r_{t,m}||r_{t,m-1}|. \end{aligned}$$

(11)

Hence, it holds that ${\widetilde{BV}}_{t}=(M-1)\left( \sum \limits _{m=2}^M s_m s_{m-1}\right) ^{-1} BV_t$ leading the correction factor $\zeta _M$ for BV:

$$\begin{aligned} \zeta _M=\zeta _{M,BV}=BV_t/{\widetilde{BV}}_{t} = \frac{1}{M-1} \cdot \sum \limits _{m=2}^M s_m s_{m-1}, \end{aligned}$$

(12)

which should be replaced by its empirical counterpart ${\hat{\zeta }}_M$ based on IP estimates ${\hat{s}}_m$ in practice. By the same principle, we define the IP factor $\xi _M$ in RQ for finite M values as

$$\begin{aligned} \xi _M=\xi _{M,RQ}=\frac{1}{M} \cdot \sum \limits _{m=1}^M s_m^4, \end{aligned}$$

(13)

with $\lim _{M\rightarrow \infty } \xi _M=\xi $ as in (10) of Proposition 1. Note that in case of no IP it holds that $\xi _M=\xi _{M,RQ}=1$. Of course, in applications we replace $\xi _{M,RQ}$ by its estimator ${\hat{\xi }}_M={\hat{\xi }}_{M,RQ}$ which is discussed below in Sect. 3.3. Hence, we show analytically that although for BV it holds that $\lim _{M\rightarrow \infty } \zeta _M=1$, its counterpart $\xi $ for the realized measures of IQ is still present and could be (depending on the data) rather substantial. In the context of the IP-bias corrections for further realized measures, we investigate by means of numerical analysis several popular MMN-robust realized estimators such as min RV or med RV in the follow-up paper of Dette et al. (2022).

In the next proposition we provide the expectations of realized estimators RQ, TP and QP serving as the measures for IQ under the model defined in (4) and (5) with the function $g(\cdot )$.

Proposition 2

Assume that the IP is given by (8), then the expectations of $RQ_t$, $QP_t$ and $TP_t$ are given as $E[RQ_t]= \frac{\sigma ^4_t}{M \cdot g^2_M} \sum _{m=1}^M g(\frac{m}{M})^2$, i.e. $RQ_t$ is unbiased, and

$$\begin{aligned} E[QP_t]= & {} \frac{\sigma ^4_t}{\left( M-3\right) \cdot g^2_M} \sum _{m=4}^M \left[ g\left( \frac{m-3}{M}\right) g\left( \frac{m-2}{M}\right) g\left( \frac{m-1}{M}\right) g\left( \frac{m}{M}\right) \right] ^{\frac{1}{2}},\\ E[TP_t]= & {} \frac{\sigma ^4_t}{\left( M-2\right) \cdot g^2_M} \sum _{m=3}^M \left[ g\left( \frac{m-2}{M}\right) g\left( \frac{m-1}{M}\right) g\left( \frac{m}{M}\right) \right] ^{\frac{2}{3}}. \end{aligned}$$

Moreover, $\lim _{M\rightarrow \infty } E[RQ_t]=E[TP_t]= E[QP_t]=\sigma ^4_t \cdot \xi =IQ_t$ if $g(\cdot )$ is square integrable, where the asymptotic scaling factor $\xi $ which comprises the impact of IP is given by (10).

Proposition 2 suggests the following factorization of QP and TP measures of IQ in finite samples:

$$\begin{aligned} {\widetilde{QP}}_{t}= & {} QP_t/\xi _{M,QP}, \qquad \xi _{M,QP} = \frac{1}{M-3}\cdot \sum \limits _{m=4}^M s_m s_{m-1} s_{m-2} s_{m-3},\\ {\widetilde{TP}}_{t}= & {} TP_t/\xi _{M,TP}, \qquad \xi _{M,TP} = \frac{1}{M-2} \cdot \sum \limits _{m=3}^M (s_m s_{m-1} s_{m-2})^{4/3}, \end{aligned}$$

whereby $\xi _{M,QP}$ and $\xi _{M,TP}$ are the IP scaling factors for QP and TP, respectively. This factorization appears to be useful both in the simulations in Sect. 4 and in the empirical study in Sect. 5.

3.2 Asymptotic distribution in case of intraday periodicity

Next, we provide the corresponding bivariate limit distribution for RV and BV as $M\rightarrow \infty $ for our discrete time model of intraday returns with IP.

Theorem 1

Consider model (4) and (5) without jumps and assume that the IP component is given by (8) with a continuously differentiable function $g:[0,1]\mapsto {\mathbb {R}}$. Then, as $M\rightarrow \infty $,

$$\begin{aligned} M^{1/2}\cdot IQ^{-1/2}_t \cdot \begin{pmatrix} RV_t-\sigma ^{2}_t \\ BV_t-\sigma ^{2}_t \\ \end{pmatrix} \overset{L}{\longrightarrow } {\mathcal {N}}\left( \begin{bmatrix} 0 \\ 0 \\ \end{bmatrix}, \begin{bmatrix} 2 &{} 2\\ 2 &{} \frac{\pi ^2}{4}+\pi -3 \\ \end{bmatrix}\right) . \end{aligned}$$

The integrated quarticity $IQ_t=\xi \cdot \sigma ^{4}_t$ can be consistently estimated by $RQ_t$, $QP_t$, or $TP_t$.

Thus, a pronounced IP with $\xi > 1$ causes more variability of IV estimators compared to the case of no IP where $\xi =1$. The asymptotic $(1-\alpha )$-confidence interval for daily IV and IQ based on RV and RQ measures is given according to Theorem 1 as

$$\begin{aligned} CI_t(1-\alpha )= & {} \left[ RV_t+z_{\alpha /2}\left[ \frac{2}{M}\right] ^{1/2} \cdot RQ_t^{1/2}, \quad RV_t- z_{\alpha /2}\left[ \frac{2}{M}\right] ^{1/2} \cdot RQ_t^{1/2}\right] , \end{aligned}$$

where $z_{\alpha /2}$ is the $(\alpha /2)$-quantile of the standard normal distribution. The asymptotic confidence intervals based on BV, TP or QP measures are constructed similarly.

The results in Proposition 2 and Theorem 1 are useful for statistical inference on IV. As RQ is not robust, one could estimate IQ in presence of jumps by either TP or QP directly as in (2) or (3), i.e. without estimating $\xi $ separately. However, as we show in the Monte Carlo simulation, the approximation $TP_t\approx IQ_t$ is only precise for fairly large M. For this reason, for finite M we recommend to use the IP-scaled estimators ${\widetilde{QP}}_{t}$ or ${\widetilde{TP}}_{t}$ as well as the estimated scaling factor ${\hat{\xi }}_{M,RQ}$ from Eq. (15) below for the construction of confidence intervals as

$$\begin{aligned} {\widetilde{CI}}_t(1\!-\!\alpha )= & {} \left[ {\widetilde{BV}}_t\!+\! z_{\alpha /2}\left[ \frac{\pi ^2/4+\pi -4}{M}\right] ^{1/2} \cdot {\hat{\xi }}^{1/2}_{M,RQ} \cdot {\widetilde{QP}}_t^{1/2}, \right. \\&\left. \quad {\widetilde{BV}}_t- z_{\alpha /2}\left[ \frac{\pi ^2/4+\pi -4}{M}\right] ^{1/2} \cdot {\hat{\xi }}^{1/2}_{M,RQ} \cdot {\widetilde{QP}}_t^{1/2}\right] , \end{aligned}$$

because of $E[ {\hat{\xi }}_{M,RQ} \cdot {\widetilde{QP}}_t]=E[ {\hat{\xi }}_{M,RQ} \cdot {\widetilde{TP}}_t]=E[TP_t]=E[QP_t]=E[RQ_t]$.

3.3 Estimation of IP correction factors

For a given number of intraday returns M, one needs consistent estimators of the IP functions $\{{\hat{s}}^2_m\}_{m=1}^M$ in order to obtain the estimators of the quantities $\xi _M$ and $\zeta _M$ defined in (13) and (12), respectively. For this purpose we exploit the SD estimator (cf. Boudt et al. 2011) given as

$$\begin{aligned} {\widehat{SD}}^{2}_{m,T}={1\over T} \sum _{t=1}^T r^2_{t,m}~,~~ \text{ and } ~~~ {\hat{s}}_m=\frac{{\widehat{SD}}_{m,T}}{\left( (1/M)\sum _{m=1}^M {\widehat{SD}}^{2}_{m,T}\right) ^{1/2}}, \end{aligned}$$

(14)

and use the statistics

$$\begin{aligned} {\hat{\xi }}_M&= {\hat{\xi }}_{M,RQ} = (1/M) \cdot \sum \limits _{m=1}^M {\hat{s}}_m^4 ~, ~~~~ \quad {\hat{\zeta }}_M&={\hat{\zeta }}_{M,BV}= (1/M) \cdot \sum \limits _{m=2}^M {\hat{s}}_m {\hat{s}}_{m-1} \end{aligned}$$

(15)

as the estimators of $\xi _M$ and $\zeta _M$, respectively. If the variance

$$\begin{aligned} \sigma ^{2} := \lim _{T\rightarrow \infty } \sigma _{T}^{2} := \lim _{T\rightarrow \infty } {1 \over T} \sum _{t=1}^{T} \sigma _{t}^{2} >0 \end{aligned}$$

(16)

exists, it follows from (5) that $\lim _{T\rightarrow \infty } E [ {\widehat{SD}}^{2}_{m,T} ] =\sigma ^{2}s_m^{2}/M$ which motivates the definition (14). As a consequence, we expect that the estimators in (14) and (15) are consistent for ${s}_m$, $m=1,\ldots ,M$, as well as for $\xi _M$ and $\zeta _M$, respectively. Note that as long as the ergodicity condition (16) is met, the assumption of constant intraday volatility as in Sect. 2 could be relaxed for the following analysis in Sect. 3.3, e.g. by allowing intraday stochastic volatility.

In order to make the intuitive arguments above more precise we investigate in the following the asymptotic distribution of the statistics ${\hat{\zeta }}_M$ and ${\hat{\xi }}_M$ for finite M and $T\rightarrow \infty $. Note that under common assumptions, such as mixing conditions (see, for example Dehling et al. 1986, among many others) or physical dependence conditions (cf. Wu 2005), the vector $\widehat{\mathbf {SD}}_{M,T} = ({\widehat{SD}}^{2}_{1,T}, \ldots , {\widehat{SD}}^{2}_{M,T})^\top $ is asymptotically normal distributed if $T \rightarrow \infty $, that is

$$\begin{aligned} \sqrt{T} \left( \widehat{\mathbf {SD}}_{M,T} - {\mathbf {SD}}_{M,T} \right) \ {{\mathop {\longrightarrow }\limits ^{L}}}\ {\mathcal {N}} (\mathbf {0}, \sigma ^{4}\Sigma ) \quad \text{ with } \quad {\mathbf {SD}}_{M,T} = ({SD}^{2}_{1,T}, \ldots , {SD}^{2}_{M,T})^\top ~,\nonumber \\ \end{aligned}$$

(17)

where ${SD}^{2}_{m,T} = E ({\widehat{SD}}^{2}_{m,T} ) = \sigma _{T}^{2} s_{m}^{2}/M$ for $m=1, \ldots , M$, $ \sigma ^{4} \Sigma \in {\mathbb {R}}^{M \times M}$ denotes a covariance matrix reflecting the underlying dependence structure and $ \sigma ^{2}$, $ \sigma ^{2}_{T}$ are defined in (16).

Now we provide the theoretical result concerning the estimators ${\hat{\xi }}_M $ and ${\hat{\zeta }}_M$ for finite number of intraday observations M and the estimation period $T\rightarrow \infty $, which are based on the SD-estimator of the IP components $s_m$, $m=1,\ldots ,M$, although similar type of result could be also obtained for other IP-estimators. We denote by $\langle x,y \rangle = x^{\top }y$ the common inner product on ${\mathbb {R}}^{M}$ with the corresponding norm $\Vert x\Vert = \langle x,x \rangle ^{1/2}$ and by ${\mathbf {1}}_{M}$ an M-dimensional vector of unit entries.

Proposition 3

Assume that (16) and (17) hold, then, as $T \rightarrow \infty $, that

$$\begin{aligned}&\sqrt{T} \; ({\hat{\xi }}_M - \xi _M) \ {{\mathop {\longrightarrow }\limits ^{L}}} \ {\mathcal {N}} (0, \tau _{1}^{2}) \end{aligned}$$

(18)

$$\begin{aligned}&\sqrt{T} \; ({\hat{\zeta }}_M - \zeta _M) \ {{\mathop {\longrightarrow }\limits ^{L}}}\ {\mathcal {N}} (0, \tau _{2}^{2})~, \end{aligned}$$

(19)

where the asymptotic variances are given by

$$\begin{aligned} \tau _{1}^{2}= & {} 4 \left( {\mathbf {S}}_{M}^{\top } \Sigma {\mathbf {S}}_{M} - {2 \over M} \Vert {\mathbf {S}}_{M} \Vert ^{2 }{\mathbf {1}}_{M}^{\top } \Sigma {\mathbf {S}}_{M} + {{\varvec{1}}_{M}^{\top } \Sigma {\varvec{1}}_{M} \over M^{2}} \Vert {\mathbf {S}}_{M} \Vert ^{4 } \right) , \end{aligned}$$

(20)

$$\begin{aligned} \tau _{2}^{2}= & {} { 1 \over 4} \left( {\mathbf {R}}_{M} ^{\top } \Sigma {\mathbf {R}}_{M} - {2 \over M} \langle {\mathbf {R}}_{M} , {\mathbf {S}}_{M} \rangle {\varvec{1}}_{M}^{\top } \Sigma {\mathbf {R}}_{M} + {{\varvec{1}}_{M}^{\top } \Sigma {\varvec{1}}_{M} \over M^{2}} \langle {\mathbf {R}}_{M} , {\mathbf {S}}_{M} \rangle ^{2 } \right) ~, \end{aligned}$$

(21)

and the vectors ${\mathbf {S}}_{M} $ and $ {\mathbf {R}}_{M} $ are defined by

$$\begin{aligned} {\mathbf {S}}_{M} := \left( s_{1}^{2} , \ldots , s_{M}^{2} \right) ^{\top }~,~~\quad {\mathbf {R}}_{M} := \left( { s_{2} \over s_{1}} ~,~ {s_{1} + s_{3} \over s_{2}} , \ldots , {s_{M-2} + s_{M} \over s_{M-1}} , {s_{M-1} \over s_{M}} \right) ^\top , \end{aligned}$$

respectively. In particular, if $\Sigma = \text{ diag } (\vartheta ^2_1, \ldots , \vartheta ^2_M)$ is a diagonal matrix, we have

$$\begin{aligned} \tau _{1}^{2}= & {} 4 \left\{ \sum _{m=1}^{M} \vartheta ^{2}_{m}s_{m}^{4} -{2\over M} \sum _{m=1}^{M} \vartheta ^{2}_{m} s_{m}^{2} \sum _{m=1}^{M} s_{m}^{4} + {1 \over M^{2}} \left( \sum _{m=1}^{M} s_{m}^{4} \right) ^{2} \sum _{m=1}^{M} \vartheta ^{2}_{m} \right\} , \end{aligned}$$

(22)

$$\begin{aligned} \tau _{2}^{2}= & {} { 1 \over 4} \left\{ \sum _{m=1}^{M} {\vartheta ^{2}_{m} \over s_{m}^{2}} (s_{m-1} + s_{m+1})^{2} -{2\over M} \sum _{m=1}^{M} s_{m} (s_{m-1} + s_{m+1}) \sum _{m=1}^{M}\vartheta ^{2}_{m} {s_{m-1} + s_{m+1 } \over s_{m} } \right. \nonumber \\&\quad \left. + {1 \over M^{2}} \sum _{m=1}^{M}\vartheta ^{2}_{m} \left( \sum _{m=1}^{M} s_{m}(s_{m-1} + s_{m+1}) \right) ^{2} \right\} ~, \end{aligned}$$

(23)

where we put $s_0=s_{M+1}=0$.

When estimating IP elements $s_m$, we require for consistency that the number of days $T\rightarrow \infty $, whereas the number of intraday observations M is here fixed. Hence, we derive the distributions of the correction factors ${\hat{\zeta }}_M$ and ${\hat{\xi }}_M$ for fixed M and $T\rightarrow \infty $ in Proposition 3. Differently, inference about the asymptotic correction factor $\xi $ in Theorem 1 would require that $M \rightarrow \infty $ because then ${\hat{\xi }}_M$ is a consistent estimator of $\xi $. These results allow to make statistical inferences and conduct tests for IP correction factors $\xi _M$ and $\zeta _M$ with $T\rightarrow \infty $. Of course, an extension of our theoretical findings for IP estimated from a finite sample T is also of interest. However, this is also a quite challenging task, for which Christensen et al. (2018) present with some theoretical considerations in their Proposition 3.1. We investigate this issue in the Monte Carlo simulations in Sect. 4.2.

In Proposition 3 we provide the explicit asymptotic results for IP scaling factors given the SD estimator. In general, our results for the SD estimator could be extended for any consistent estimator of IP. In this paper both in our simulations and empirical studies we use a more robust WSD estimator of Boudt et al. (2011) which is described in the Appendix. However, it is much more difficult to get analytical results such as in Proposition 3 for this more complicated WSD estimator which is based on order statistics. For this reason we recommend to make statistical inferences for the WSD approach by using bootstrap procedures, as e.g. in Goncalves and Meddahi (2009) or in Dette et al. (2022).

4 Simulation study

We illustrate our theoretical findings by means of an extensive Monte Carlo simulation study which is structured as follows. First, we introduce the U-shaped IP functional form and discuss the parameter choice for the model in (4)–(5). We generate intraday returns for estimation of IP shape and construction of various realized measures. In Sect. 4.1 we study the impact of IP on various realized estimators. In Sect. 4.2 we investigate our IP-corrections in terms of MSE whereby we compare our approach with those of Boudt et al. (2011).

We consider $M=26$, $M=78$, or $M=390$ intraday returns, which roughly correspond to sampling at 15 min, 5 min, or 1 min for 6.5-hour trading days, respectively. For constant intraday volatility, we fix $IV=\sigma ^2=1$ and $IQ=\xi \cdot \sigma ^4 = \xi $. We generate M intraday returns for each of $T=10^4$ days with $r_{t,m} \sim {\mathcal {N}}(0,\gamma ^2_{t,m} s_m^2)$ where $\gamma ^2$ and $s_m^2$ are the respective SV and IP components. As a baseline, we set $\gamma ^2_{t,m} = 1/M$ which corresponds to $IV=1$ and $IQ=\xi $.

Later we also consider a SV model where we assume that $\gamma ^2_{t,m}$ is governed by the process

$$\begin{aligned} \Delta \!\gamma _{t,m}^2= & {} 0.035(0.636 - \gamma _{t,m}^2) \Delta t + 0.144\gamma _{t,m}^2 (\Delta t)^{1/2} u_{t,m}, \end{aligned}$$

(24)

where $\Delta t=1/M$ and $u_{t,m} \sim \text{ iid } \, {\mathcal {N}}(0,1)$, with $\gamma _{{1,1}}^2 = 1$. This model is a discretized version of the GARCH (1,1) diffusion used by Andersen and Bollerslev (1998a), Goncalves and Meddahi (2009) with parameters implying an autoregressive persistence; in the time series context it is related to state-space models for realized volatilities (cf. Golosnoy et al. 2021). In presence of intraday SV there is no exact analytical expression for the impact of IP on realized measures. However, since the SV in (24) follows a highly persistent process, the empirical contribution of $\gamma ^2_{t,m}$ to intraday heteroskedasticity is of a smaller order compared to the impact of IP (see e.g., Christensen et al. 2018; Bekierman and Gribisch 2021). Moreover, from the empirical perspective, one could precisely estimate the SV components $\gamma _{t,m}$ ex post, see Bekierman and Gribisch (2016). Then one could calculate the product components ${\hat{s}}_{t,m}={\hat{s}}_m {\hat{\gamma }}_{t,m}$ and compute the IP correction factors for this day t based on the obtained estimates ${\hat{s}}_{t,m}$.

The IP components follow a quadratic convex U-shaped given by

$$\begin{aligned} g(m/M) = \left[ c_1 + c_2(m-M/2)^2\right] ^2, \qquad \qquad c_1,c_2>0, \end{aligned}$$

(25)

so that the standardized IP values are $s^2_m = g(m/M)/g_M$. The choice of the functional form is (25) is motivated by our empirical findings, see Fig. 5. Note that our theoretical results are applicable for any functional form of IP including asymmetric mirrored J-shape specifications, as e.g. a more flexible asymmetric U-shaped IP specification as in Hasbrouck (1999) and Andersen et al. (2012). Because of $\sum _{m=1}^M s^2_m=M$, it holds for (25) that $c_2= 12 \, (1-c_1)\,/\,(M^2+2)$. We select $c_1 \in $ $\{0.01,0.11,\dots ,0.91,1\}$ with the most IP curvature for $c_1\rightarrow 0^+$ whereas $c_1=1$ is the no-IP case. Next we focus on the value $c_1=0.71$ corresponding to the evidence from U.S. stock market. Note that the IP form could be very distinct at ‘special’ days characterized by specific announcements and/or unexpected events where the IP curvature could be much more (or less) pronounced.

The asymptotic scaling factor $\xi $ defined in (10) is plotted as a function of $c_1$ in Fig. 1 where we observe that $\xi $ is substantially larger than one even for $c_1$ close to one. As the IP is unknown in practice, we construct estimators ${\hat{g}}(\cdot )$ and ${\hat{s}}^2(\cdot )$ by applying the WSD estimator of Boudt et al. (2011) which does not require a-priori specification of the IP functional form; the implementation details for the WSD estimator are provided in the Appendix. Then we compute the average estimates of the scaling factors ${\hat{\xi }}_{M,RQ}$, ${\hat{\xi }}_{M,TP}$, ${\hat{\xi }}_{M,QP}$ which are plotted in Fig. 1. We observe that the factor ${\hat{\xi }}_{M,RQ}$ appears to be very close to $\xi _M$ even for a fairly small value $M=26$.

4.1 The impact of IP on realized measures

As RV measure is not affected by IP, we study the IP impact on other measures, such as BV for IV, and TP, QP for IQ. After generating IID intraday returns as specified above, we calculate $BV_t$, $TP_t$, and $QP_t$ as well as the IP-corrected estimators denoted by ${\widetilde{BV}}_{t}$, etc. for each day $t=1,\ldots ,T$. Additionally, we compute the jump-robust medRV and minRV measures of Andersen et al. (2012). Then we build time averages, e.g. ${\overline{BV}}=(1/T) \cdot \sum _{t=1}^T BV_t$, for all measures. These averages of BV, ${\widetilde{BV}}$, medRV, and minRV are shown in Fig. 2; whereas of TP, QP, ${\widetilde{TP}}$, and ${\widetilde{QP}}$ in Fig. 3 for different M and $c_1$ values. All measures should be equal to one for no IP with $c_1=1$.

In Fig. 2 the IP-bias in BV is quite pronounced for $M=26$ and $M=78$ for large and medium curvatures and is still visible even for $M=390$. Remarkably, the biases in medRV and minRV are even stronger than in BV. As expected, the bias-corrected mean of ${\widetilde{BV}}$ is close to the true IV for all M, so the suggested correction functions properly.

The averages of the original TP, QP and scaled ${\widetilde{TP}}$, ${\widetilde{QP}}$ are reported in Fig. 3. The original measures are downward biased for finite $M=26,78$, compared to almost unbiased measures for $M=390$ which is close to the asymptotic value $\xi $, see Fig. 1. Remarkably, in case of 5 minute returns with $M=78$ the bias is still quite substantial for empirical relevant values of IP curvature parameter $c_1\in [0.6,0.8]$. As expected, QP is more biased than TP for finite M due to longer lags involved in its computing. Our scaled ${\widetilde{TP}}$ and ${\widetilde{QP}}$ measures are equal to the value $\sigma ^4=1$ for all considered values of $c_1$ and M which is an appealing property. Summarizing, the IP has a substantial influence on BV, minRV, medRV, so that IP-corrections are needed to get valid statistical inference on IV.

In order to shed light on the finite sample validity of the result in Proposition 3, we provide the exemplary QQ-plots in Fig. 4 for the statistics ${\hat{\zeta }}_M$ and ${\hat{\xi }}_M$ for the case of $M=78$ and $c_1 = 0.6$ which are based on the WSD estimator of IP (cf. Boudt et al. 2011). Being standardized properly, both statistics seem to approach normality with the increase of estimation period T, in particular, even $T=250$ the QQ plots show a decent fit.

4.2 The comparison of IP-bias corrected estimators

In Proposition 1 we show analytically that the original BV is downward biased so that the true level of risk measured by IV is underestimated. This is a rather undesired scenario from the risk management point of view making an MSE comparison of biased and unbiased measures not reasonable because the MSE is symmetric for upward and downward biases. For this reason, we provide an MSE comparison only for IP-bias corrected estimators.

In particular, we contrast the relative bias and MSE of our corrected estimator ${\widetilde{BV}}_t$ in (11) with those of Boudt et al. (2011) where the estimated IP component ${\hat{s}}_m$ is immediately removed from intraday returns by computing $r^*_{t,m}=r_{t,m}/{\hat{s}}_m$. Then the BCL-estimator $BV^*_t$ is given as

$$\begin{aligned} BV^*_t = \frac{M}{M-1} \cdot \frac{\pi }{2}\cdot \sum _{m=2}^M |r^*_{t,m}||r^*_{t,m-1}|, \qquad \text{ with } \qquad E[BV^*_t]=IV_t. \end{aligned}$$

(26)

This immediate removing of IP appears to be a common approach in the current literature (cf. Golosnoy et al. 2012; Bekierman and Gribisch 2021; Christensen et al. 2018), whereby the IP estimates ${\hat{s}}_m$ are based on historical data.

To contrast the relative biases and MSEs of $BV^*_t$ and ${\widetilde{BV}}_t$, we generate intraday returns for $T=250$ pre-sample days with the IP parameter value $c^h_1=0.5$ as in (4) and (25) and use them to get IP estimates ${\hat{s}}_m$. We denote by $c^h_1$ the ‘historical’ value of $c_1$ assumed to be constant during the pre-sample period. Then, we focus on the next day’s IP which functional form is described by the ‘current value’ of $c_1$ denoted by $c^c_1$. That means, for this next (single) day of our interest, intraday returns are generated with either unchanged current IP parameter $c^c_1=0.5=c^h_1$ or changed parameter $c^c_1\ne c^h_1$. The latter case could occur at some ‘special’ days characterized by announcements, unexpected events etc. Note that the values $c^c_1=0.1$ (extremely pronounced IP) and $c^c_1=0.9$ (almost no IP) are not empirically relevant but considered for the illustration purposes only. We calculate $BV^*$ and ${\widetilde{BV}}$ for this day of interest using the historical IP estimates ${\hat{s}}_m$, so there is an IP misspecification when $c^c_1\ne c^h_1$. We repeat the procedure (generating $T=250$ pre-sample days and an additional ‘day of interest’) $10^4$ times and put the computed relative biases and MSEs in Table 1, where we show results for constant intraday volatility in Block A and intraday SV is Block B.

Hence, we study both the effect of estimation risk in case of unchanged IP and the effect of a change in the IP form. The latter is of much practical importance, as there are many empirical confirmations for time variability of IP even after excluding days with important macroeconomic announcements, see Andersen et al. (2001), Hecq et al. (2012), or Andersen et al. (2019).

Table 1 Relative bias and MSE of $BV^*$ and ${\widetilde{BV}}$. The IP parameter $c_1$ equals to $c^h_1$ during estimation period of the IP components and to $c^c_1$ when realized measures are calculated

Full size table

We observe in Table 1 that in case of no IP change with $c^h_1=c^c_1$, the realized measure $BV^*$ of Boudt et al. (2011) is preferable in terms of relative bias and MSE. These findings are in line with the recent theoretical results of Ghysels et al. (2021) who show under similar assumptions that the most efficient quarticity estimators (in terms of MSE) can be obtained by an immediate adjustment of intraday returns for the IP as e.g. by Boudt et al. (2011) which is also confirmed by our evidence. However, even a comparatively small change in IP, e.g. from $c^h_1=0.5$ to $c^c_1=0.7$, leads to a substantial increase in the relative bias and MSE of $BV^*$. This evidence remains also for intraday SV in Block B. Hence, our IP-correction approach is more robust compared to the procedure of Boudt et al. (2011) which is rather sensible to even small changes in the IP form.

5 Empirical study

In our application we work with intraday returns for the Dow Jones Industrial Average Index with the focus on measuring daily IV. Our dataset consists of intraday observations from January 1996 to December 2010 with non-regular trading time days skipped. Days with non-regular trading times are those where the trading time have been substantially shorter than the common 6.5 hours because of these or that reasons. We consider 15 min, 10 min, 5 min and 2 min intraday returns; 5 min returns is the most popular choice in practice. To avoid the opening bias effects, we skip the first daily observation which is the common practice to escape from the impact of noisy overnight quotes. The final sample consists of 3329 days with $M=24, 37, 76$ or 193 observations for 15, 10, 5 or 2 min frequency, respectively.

We estimate the intraday IP $s_m$, and calculate the IP correction factors $\zeta $ and $\xi $. Then we present descriptive statistics for both uncorrected and IP-corrected realized measures. Finally, we discuss the impact of IP bias in some further realized measures which are proposed in the literature.

5.1 Estimation of intraday pattern and descriptive statistics

We estimate the IP with the non-parametric WSD estimator of Boudt et al. (2011) and show them in Fig. 5 with components ${\hat{s}}_m$ normalized such that $\sum _{m=1}^M {\hat{s}}^2_m=M$. The IP pattern has a convex U-shape for all considered sampling frequencies. It is high during morning and afternoon hours and low during the lunch break. In numerical terms, it is about twice as high during the peak in the morning compared to the trough in the middle of the day. Hence, in our empirical application for Dow Jones index the IP estimate approximately corresponds to $c_1\approx 0.7$ in our simulation study. However, the IP curvature could be more pronounced for individual stocks. For example, in Christensen et al. (2018) the IP curvature parameter takes values around $c_1=0.60$ for some assets from the Dow Jones index.

Given these IP estimates, we calculate the finite M scaling factors for IP bias correction, all reported in Table 2 with 95% confidence intervals obtained by the bootstrap procedure with $10^3-1$ replications outlined in Dette et al. (2022). The estimated factor ${\hat{\xi }}_{M,RQ}$ is larger than one and numerically similar for all sampling frequencies which corresponds to the evidence of similar patterns in Fig. 5. We also estimate finite sample corrections for BV, TP, and QP. The correction estimate ${\hat{\zeta }}_M$ for BV gets closer to one with increasing sampling frequencies. The same holds for other finite M scaling factors so that BV is biased downward. For example, we get ${\hat{\xi }}_{M,QP}\approx $1.23 for 15 min frequency but only ${\hat{\xi }}_{M,QP}\approx $1.03 for 5 min frequency returns. This evidence indicates that the IP bias problem should not be very acute for the U.S. stock market during the considered period of time.

Table 2 Estimated empirical IP factors with bootstrapped 95% confidence intervals in parentheses

Full size table

Next, we provide descriptive statistics of realized estimators. In Table 3 we report the full sample averages of both uncorrected and IP-corrected realized measures of daily IV and IQ. The average RV is stable for all sampling frequencies, whereas the average BV is biased downwards compared to RV even after the IP correction, however, the corrected BV gets numerically closer to RV. The average values for TP and QP are almost the same for 15 and 10 min frequencies, but are much higher for 5 min and 2 min returns. Note that there are reported difficulties in estimating IQ based on frequencies of 5 min or higher (Andersen et al. 2014). Another important quantity is the average relative component $(RV-BV)/RV$ which, as expected, gets substantially smaller with the IP-bias corrected ${\widetilde{BV}}$ both for all sampling frequencies.

Table 3 Empirical averages of both uncorrected and IP-corrected realized measures

Full size table

5.2 Discussion and further extensions

Our study is primarily focused on the popular realized measures which are widely used in practice. Recently, there have been much developments in estimation of IV and IQ based on high frequency observations. In particular, we would like to mention the robust threshold power variations (Corsi et al. 2010), the minRV and medRV estimators (Andersen et al. 2012), and the nearest neighborhood generalizations (Andersen et al. 2014). Moreover, in order to mitigate MMN-problems which arise by using ultra high frequency returns, several noise robust methods have been developed, as e.g. the pre-averaged BV (Podolskij and Vetter 2009), pre-averaged threshold RV (cf. Aït-Sahalia and Jacod 2014), or pre-averaged threshold BV (Christensen et al. 2014). The IP bias problem is also present for some of these more advanced realized measures as it could be shown by a simple Monte Carlo simulation exercise. To illustrate this point, we plot the IP-biases for minRV and medRV measures of Andersen et al. (2012) in Fig. 2 and observe that they are even larger than for the original BV estimator. The same argument applies to the minRQ and medRQ measures of IQ which are elaborated by Andersen et al. (2014). Of course, the IP-correction factors should be newly derived for each particular measure, however, this challenging task is beyond the scope of our paper and is left for future research, as e.g. the task of deriving IP correction factors for realized portfolio weights (Golosnoy et al. 2019, 2020, or Golosnoy and Gribisch 2022). The impact of IP on these additional estimators is investigated in the follow-up paper of Dette et al. (2022).

6 Summary

Availability of intraday high frequency returns on risky assets allows construction of precise realized estimators such as realized volatility (RV) and bipower variation (BV) which are commonly applied for estimation of daily integrated volatility (IV), or tri-power (TP) and quad-power (QP) variations which are serving as measures for daily integrated quarticity (IQ).

In this paper we investigate the impact of intraday periodicity (IP) on the finite sample properties of these realized measures. For our analysis we assume a discrete time model for intraday returns on risky assets and postulate a multiplicative deterministic IP component which is often of U-shape empirically. For a number of intraday returns $M\rightarrow \infty $ the impact of IP is asymptotically negligible, however, we show that the IP-impact should be taken into account for a practically relevant situation with finite M. In particular, we prove that finite sample corrections of BV as well as of TP and QP measures are necessary to obtain valid statistical inferences concerning daily IV. We derive analytically the factors for IP-correction and analyse their stochastic properties. Our results are illustrated by means of a Monte Carlo simulation study for both constant and stochastic intraday volatility models. Finally, we evaluate IP correction factors empirically for daily IV of the Dow Jones Industrial Average Index.

References

Admati A, Pfleiderer P (1988) A theory of intraday patterns: volume and price variability. Rev Financ Stud 1:3–40
Article Google Scholar
Aït-Sahalia Y, Jacod J (2014) High-Frequency Financial Econometrics. Princeton University Press, New Jersey
Book MATH Google Scholar
Andersen T, Bollerslev T (1997) Intraday periodicity and volatility persistence in financial markets. J Empir Financ 4:115–158
Article Google Scholar
Andersen T, Bollerslev T (1998) Answering the skeptics: yes, standard volatility models do provide accurate forecasts. Int Econ Rev 39:885–905
Article Google Scholar
Andersen T, Bollerslev T (1998) Deutsche Mark-Dollar volatility: intraday activity patterns, macroeconomic announcements, and longer run dependencies. J Financ 53:219–265
Article Google Scholar
Andersen T, Bollerslev T, Das A (2001) Variance-ratio statistics and high-frequency data: testing for changes in intraday volatility patterns. J Financ 56:305–327
Article Google Scholar
Andersen T, Bollerslev T, Diebold F, Vega C (2003) Micro effects of macro announcements: real-time price discovery in foreign exchange. American Econ Rev 93:38–62
Article Google Scholar
Andersen T, Bollerslev T, Huang X (2011) A reduced form framework for modeling volatility of speculative prices based on realized variation measures. J Econom 160:176–189
Article MathSciNet MATH Google Scholar
Andersen T, Dobrev D, Schaumburg E (2012) Jump-robust volatility estimation using nearest neighbor truncation. J Economet 138:125–180
Article MathSciNet MATH Google Scholar
Andersen T, Dobrev D, Schaumburg E (2014) A robust neighborhood truncation approach to estimation of integrated quarticity. Economet Theor 30:3–59
Article MathSciNet MATH Google Scholar
Andersen T, Thyrsgaard M, Todorov V (2019) Time varying periodicity in intraday volatility. J Am Stat Assoc 114:1695–1707
Article MathSciNet MATH Google Scholar
Barndorff-Nielsen O, Shephard N (2004) Power and bipower variation with stochastic volatility and jumps. J Financ Economet 2:1–37
Article Google Scholar
Bekierman J, Gribisch B (2016) Estimating stochastic volatility models using realized measures. Stud Nonlinear Dyn Econom 20:279–300
MathSciNet Google Scholar
Bekierman J, Gribisch B (2021) A mixed frequency stochastic volatility model for intraday stock market returns. J Financ Economet 19(3):496–530
Article Google Scholar
Bollerslev T, Litvinova J, Tauchen G (2006) Leverage and volatility feedback effects in high-frequency data. J Financ Economet 4:353–384
Article Google Scholar
Boudt K, Croux C, Laurent S (2011) Robust estimation of intraweek periodicity in volatility and jump detection. J Empir Financ 18:353–367
Article Google Scholar
Christensen K, Hounyo U, Podolskij M (2018) Is the diurnal pattern sufficient to explain the intraday variation in volatility: A nonparametric assessment. J Economet 205:336–362
Article MathSciNet MATH Google Scholar
Christensen K, Oomen R, Podolskij M (2014) Fact or friction: Jumps at ultra high frequency. J Financ Econ 114:576–599
Article Google Scholar
Corsi F, Pirino D, Reno R (2010) Threshold bipower variation and the impact of jumps on volatility forecasting. J Economet 114:576–599
MathSciNet MATH Google Scholar
Dehling H, Denker M, Philipp W (1986) Central limit theorems for mixing sequences of random variables under minimal conditions. Ann Probab 14(4):1359–1370
Article MathSciNet MATH Google Scholar
Dette H, Golosnoy V, Kellermann J (2022) Correcting intraday periodicity bias in realized volatility measures. Economet Stat 23:36–52
Article MathSciNet Google Scholar
Engle R, Ito T, Lin W-L (1990) Meteor showers or heat waves? Heteroskedastic intra-daily volatility in the foreign exchange market. Econometrica 58:525–542
Article Google Scholar
Engle R, Sokalska M (2012) Forecasting intraday volatility in the US equity market. Multiplicative component GARCH. J Financ Economet 10:54–83
Article Google Scholar
Gabrys R, Hörmann S, Kokoszka P (2013) Monitoring the intraday volatility pattern. J Time Series Economet 5:87–116
Article MathSciNet MATH Google Scholar
Ghysels E, Mykland P, Renault E (2021) In-sample asymptotics and across-sample efficiency gains for high frequency data statistics. Economet Theor. https://doi.org/10.1017/S0266466621000359
Article MATH Google Scholar
Golosnoy V, Gribisch B (2022) Modeling and forecasting realized portfolio weights. J Bank Finance 138:106404
Article Google Scholar
Golosnoy V, Gribisch B, Seifert MI (2019) Exponential smoothing of realized portfolio weights. J Empir Financ 53:222–237
Article Google Scholar
Golosnoy V, Köhler S, Schmid W, Seifert MI (2021) Tests for validity of linear state space representations. Appl Stoch Model Bus Ind 37:1060–1079
Article Google Scholar
Golosnoy V, Okhrin I, Schmid W (2012) Statistical surveillance of volatility forecasting models. J Financ Economet 10:513–545
Article Google Scholar
Golosnoy V, Schmid W, Seifert MI, Lazariv T (2020) Statistical inferences for realized portfolio weights. Economet Stat 14:49–62
Article MathSciNet Google Scholar
Goncalves S, Meddahi N (2009) Bootstrapping realized volatility. Econometrica 77:283–306
Article MathSciNet MATH Google Scholar
Harris L (1986) A transaction data study of weekly and intradaily patterns in stock returns. J Financ Econ 16:99–117
Article Google Scholar
Hasbrouck J (1999) The dynamics of discrete bid and ask quotes. J Financ 54:2109–2142
Article Google Scholar
Hecq A, Laurent S, Palm F (2012) Common intraday periodicity. J Financ Economet 10:325–353
Article Google Scholar
Kokoszka P, Reimherr M (2013) Predictability of shapes of intraday price curves. Economet J 16:285–308
Article MathSciNet MATH Google Scholar
McAleer M, Medeiros M (2008) Realized volatility: a review. Economet Rev 27:10–45
Article MathSciNet MATH Google Scholar
Podolskij M, Vetter M (2009) Estimation of volatility functionals in the simultaneous presence of microstructure noise and jumps. Bernoulli 15:634–658
Article MathSciNet MATH Google Scholar
Romano J, Wolf M (2000) A more general central limit theorem for $m$-dependent random variables with unbounded $m$. Stat Prob Letters 47:115–124
Article MathSciNet MATH Google Scholar
Wood R, McInish T, Ord J (1985) An investigation of transaction data for NYSE stocks. J Financ 25:723–739
Article Google Scholar
Wu WB (2005) Nonlinear system theory: Another look at dependence. Proc Natl Acad Sci USA 102(40):14150–14154
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

this work was partly supported by the Collaborative Research Center ‘Statistical modeling of nonlinear dynamic processes’ (SFB823, projects A1,C1) of German Research Foundation (DFG).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Mathematics, Ruhr-Universität Bochum, Bochum, Germany
Holger Dette
Faculty of Management and Economics, Ruhr-Universität Bochum, Universitätsstr. 150, 44801, Bochum, Germany
Vasyl Golosnoy & Janosch Kellermann

Authors

Holger Dette
View author publications
You can also search for this author in PubMed Google Scholar
Vasyl Golosnoy
View author publications
You can also search for this author in PubMed Google Scholar
Janosch Kellermann
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vasyl Golosnoy.

Ethics declarations

Conflict of interest statement

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix

Proofs

Proof of Proposition 1

Note that the statement for $RV_t$ is obvious and consider only ${\overline{BV}}_t=\frac{\pi }{2} \sum _{m=2}^M |r_{t,m}| |r_{t,m-1}|$ to simplify calculations and noting that $E[BV_t] = \frac{M}{M-1}E[{\overline{BV}}_t]$. We use the fact that $E[|X|]=\sqrt{2/\pi } \, \sigma $ if X is a centered normal distributed random variable with variance $\sigma ^2$, which implies

$$\begin{aligned} E[{\overline{BV}}_t]=\frac{\sigma ^2_t}{M} \sum _{m=2}^M s_m s_{m-1}=\sigma ^2_t \frac{\sum _{m=2}^M \left[ g\left( \frac{m}{M}\right) g\left( \frac{m-1}{M}\right) \right] ^{1/2}}{\sum _{m=1}^M g\left( \frac{m}{M}\right) } =\sigma ^2_t(1-R_M), \end{aligned}$$

where the term $R_M$ is given by

$$\begin{aligned} R_M=\frac{g\left( \frac{1}{M}\right) +\sum _{m=2}^M g\left( \frac{m}{M}\right) ^{1/2}\left( g\left( \frac{m}{M}\right) ^{1/2}-g\left( \frac{m-1}{M}\right) ^{1/2}\right) }{\sum _{m=1}^M g\left( \frac{m}{M}\right) }. \end{aligned}$$

We assume that g is continuously differentiable, therefore g and $g^\prime $ are continuous and Riemann integrable on the interval [0, 1], and also Lebesgue-integrable. Hence, the squares of g and $g^\prime $ are Riemann integrable as well because the squares are also continuous. Then we obtain by the mean value theorem

$$\begin{aligned} R_M = \frac{\frac{1}{M}\sum ^M_{m=2}g\left( \frac{m}{M}\right) ^{1/2}\left( g^{1/2}\right) ^\prime \left( \psi _m\right) + g\left( \frac{1}{M}\right) }{\sum ^M_{m=2}g\left( \frac{m}{M}\right) }~, \end{aligned}$$

where $\psi _m \in (\frac{m-1}{M}, \frac{m}{M})$, $m=2,\ldots , M$. Consequently, using the fact that $(g^{1/2}(x))^\prime = \frac{1}{2} \frac{g^\prime (x)}{g^{1/2}(x) }$ and an approximation of the sums by a Riemann integral it follows that

$$\begin{aligned} M \cdot R_M = \left( \frac{1}{2} \frac{\int ^1_0 g^\prime (x) dx}{\int ^1_0 g(x)dx} + \frac{g(0)}{\int ^1_0 g(x)dx} \right) \cdot (1+o(1))~, \end{aligned}$$

which proves the statement (A) of Proposition 1.

For the statement (B) in order to compute $Var(BV_t)$ we first look at $E(BV_t)^2$, setting $r_{t,m} = r_{m}$ for notation simplification. We have

$$\begin{aligned} E[({\overline{BV}}_t)^2]= & {} \frac{\pi ^2}{4}E\left[ \sum \limits _{m_1=2}^{M} \sum \limits _{m_2=2}^{M} |r_{m_1}||r_{m_1-1}|]|r_{m_2}||r_{m_2-1}|\right] \\ (E[{\overline{BV}}_t])^2= & {} \left[ \frac{\sigma _t^2}{M} \sum \limits _{m=2}^{M} s_m s_{m-1}\right] ^2 = \frac{\sigma ^4_t}{M^2}\left[ \sum \limits _{m=2}^{M} s_m s_{m-1}\right] ^2. \end{aligned}$$

As the random variables $r_m$ are independent, the double sum contains three types of non-vanishing expectations

$$\begin{aligned}&E(|r_{m_1}^2||r_{m_1-1}^2|) = \frac{\sigma ^4_t}{M^2} s_m^2s_{m-1}^2,\\&E(|r_{m_1}||r_{m_1-1}^2||r_{m_1-2}|) = \sqrt{\frac{2\sigma ^2_t}{\pi M} s_{m_1}^2} \frac{\sigma ^2_t}{M} s_{m_1-1}^2 \sqrt{\frac{2\sigma ^2_t}{\pi M} s_{m_1-2}^2} = \frac{2\sigma ^4_t}{\pi M^2} s_{m_1}s_{m_1-1}^2s_{m_1-2},\\&E(|r_{m_1}||r_{m_1-1}||r_{m_2}||r_{m_2-1}|) = \frac{4\sigma ^4_t}{\pi ^2 M^2} s_{m_1}s_{m_1-1}s_{m_2}s_{m_2-1}. \end{aligned}$$

This yields

$$\begin{aligned} Var({\overline{BV}}_t)&= \frac{\pi ^2}{4} \frac{\sigma ^4_t}{M^2}\left\{ \sum \limits _{m_1=2}^{M} s_m^2s_{m-1}^2 + \frac{4}{\pi } \sum \limits _{m_1=3}^{M} s_{m_1}s_{m_1-1}^2s_{m_1-2}\right. \\&\quad \left. + \frac{4}{\pi ^2} \underbrace{\sum \limits _{m_1=2}^{M} \sum \limits _{m_2=2}^{M}}_{|m_1-m_2|>1}s_{m_1}s_{m_1-1}s_{m_2}s_{m_2-1}\right\} - \frac{\sigma ^4_t}{M^2}\left[ \sum \limits _{m=2}^{M} s_m s_{m-1}\right] ^2 . \end{aligned}$$

Observing the definition of $s^2_m$ in (8), we get

$$\begin{aligned} Var({\overline{BV}}_t)&= \frac{\pi ^2}{4} \frac{\sigma ^4_t}{M^2g_M^2}\left\{ (1-\frac{4}{\pi ^2}) \sum \limits _{m_1=2}^{M} g\left( \frac{m_1}{M}\right) g\left( \frac{m_1-1}{M}\right) \right. \\&\quad \left. + \frac{4}{\pi }\left( 1-\frac{2}{\pi } \right) \sum \limits _{m_1=3}^{M} \left\{ {g\left( \frac{m_1}{M}\right) } \right\} ^{1/2} g\left( \frac{m_1-1}{M}\right) \left\{ {g\left( \frac{m_1-2}{M}\right) } \right\} ^{1/2} \right\} , \end{aligned}$$

which proves the first assertion regarding the variance of ${\overline{BV}}_t$. The approximation (9) finally follows by interpreting the sum as approximations of a Riemann integral. The expressions for the variance $Var(RV_t)$ and the covariance $Cov(RV_t,{\overline{BV}}_t)$ are obtained by similar arguments. However, we omit them for the sake of brevity. Therefore, the statement (B) of Proposition 1 follows. $\square $

Proof of Proposition 2

Recall the definition of $QP_t$ in (2) and (8), then

$$\begin{aligned} E[QP_t]&= M\frac{\pi ^2}{4} \left\{ \sum \limits _{m=4}^M E\left[ |r_{m-3}|\right] E\left[ |r_{m-2}|\right] E\left[ |r_{m-1}|\right] E\left[ |r_{m}|\right] \right\} \\&= \frac{\sigma ^4_t}{M} \sum \limits _{m=4}^M s_{m-4}s_{m-2}s_{m-1}s_{m} \\&= \frac{\sigma ^4_t}{M} (g_M)^{-2} \sum _{m=4}^M \left[ g\left( \frac{m-3}{M}\right) g\left( \frac{m-2}{M}\right) g\left( \frac{m-1}{M}\right) g\left( \frac{m}{M}\right) \right] ^{1/2}. \end{aligned}$$

Then we get $\lim _{M\rightarrow \infty } E[QP_t]=\sigma _t^4\xi /M \ (1+o(1))$ which follows again by a Riemann integral. The statement for RQ is obtained by analogy.

Next, consider $TP_t$ defined in (3) and observe that $E[|Z|^{4/3}]/\sigma ^{4/3} = \mu _{4/3}= 2^{2/3} \pi ^{-1/2} \Gamma (7/6)$ if Z is a centered normal distributed random variable with variance $\sigma ^2$. This yields observing (8)

$$\begin{aligned} E[TP_t]&\!= \!M\mu _{\frac{4}{3}}^{-3} \sum \limits _{m=3}^M \left\{ E\left[ |r_{m-2}|^{\frac{4}{3}}\right] E\left[ |r_{m-1}|^{\frac{4}{3}}\right] E\left[ |r_{m}|^{\frac{4}{3}}\right] \right\} \!=\! \frac{\sigma ^4_t}{M}\sum \limits _{m=3}^M s_{m-2}^{\frac{4}{3}}s_{m-1}^{\frac{4}{3}}s_{m}^{\frac{4}{3}} \\&= \frac{\sigma ^4_t}{M} (g_M)^{-2} \sum _{m=3}^M \left[ g\left( \frac{m-2}{M}\right) g\left( \frac{m-1}{M}\right) g\left( \frac{m}{M}\right) \right] ^{2/3}. \end{aligned}$$

The statement $\lim _{M\rightarrow \infty }E[TP_t]= \sigma ^4_t \xi /M \ (1+o(1))$ completes the proof of Proposition 2. $\square $

Proof of Theorem 1

It follows from Proposition 2 that

$$\begin{aligned} V(BV_t)= & {} \frac{\sigma ^4_t}{M} \left( \frac{\pi ^2}{4} + \pi -3 \right) \xi \cdot (1+o(1)),\\ V(RV_t)= & {} 2\frac{\sigma ^4_t}{M} \xi \cdot (1+o(1)),\\ Cov(RV_t,BV)= & {} 2\frac{\sigma ^4_t}{M} \xi \cdot (1+o(1)), \end{aligned}$$

where $\xi =\int _0^1 g^2(x)dx \Big /\left( \int _0^1 g(x)dx\right) ^2\ge 1$ by the Cauchy–Schwarz inequality. Therefore, Theorem 1 is a consequence of a straightforward application of a central limit theorem for triangular arrays of m-dependent random variables (Romano and Wolf 2000) and the Cramer–Wold device. $\square $

Proof of Proposition 3

Recall the definition of $ {\hat{s}}_m$ in (14), then we obtain the representation

$$\begin{aligned} \widehat{{\mathbf {S}}}_{M} = \left( {\hat{s}}_1^{2} , \ldots , {\hat{s}}_M^{2} \right) ^{\top } = g_{1} (\widehat{\mathbf {SD}}_{M,T})~, \end{aligned}$$

with $\widehat{\mathbf {SD}}_{M,T} = ({\widehat{SD}}^{2}_{1,T}, \ldots , {\widehat{SD}}^{2}_{M,T} )^{\top }$ defined by (14), and the function $g_{1}: { {\mathbb {R}}^M \rightarrow {\mathbb {R}}^{M}}$ is given by

$$\begin{aligned} g_{1} (x) = (g_{11} (x) , \ldots , g_{1M} (x) )^{\top }= {M \over \sum _{m=1}^{M} x_{m}} (x_{1}, \ldots , x_{M})^{\top }~. \end{aligned}$$

(27)

Let

$$\begin{aligned} g_{1}^{\prime }(x) = {M \over \sum _{m=1}^{M} x_{m}} \left\{ {\varvec{I}}_{M} - {1 \over \sum _{m=1}^{M} x_{m} } (x_{1}, \ldots , x_{M})^{\top } {\varvec{1}} _{M}^{\top } \right\} \end{aligned}$$

denote the derivative of $g_{1}$ whereby ${\varvec{I}}_{M}$ is the $M\times M$ identity matrix, then, by (16), it follows that

$$\begin{aligned} \lim _{T\rightarrow \infty } g_{1}^{\prime }( {\mathbf {SD}}_{M,T} ) = {M \over \sigma ^{2}} {\varvec{I}}_{M} - {1 \over \sigma ^{2}} {\varvec{S}}_{M} {\varvec{1}}_{M}^{\top }~, \end{aligned}$$

(28)

where ${\varvec{S}}_{M} = (s_{1}^{2}, \ldots , s_{M}^{2})^{\top }$ and we have used the fact that $\sum _{m=1}^{M} SD_{m,T}^{2} = \sigma _{T}^{2} \rightarrow \sigma ^{2}$. Now a componentwise Taylor expansion gives

$$\begin{aligned} \sqrt{T} \left( g_{1} (\widehat{\mathbf {SD}}_{M,T}) - g_{1} ({\mathbf {SD}}_{M,T} ) \right) = g_{1}^{\prime } ({\mathbf {SD}}_{M,T} ) \left( \widehat{\mathbf {SD}}_{M,T} - {\mathbf {SD}}_{M,T} \right) + \varvec{\Delta }_{M,T}~. \end{aligned}$$

(29)

The components of the vector $ \varvec{\Delta }_{M,T} = (\Delta _{1,T} , \ldots , \Delta _{M,T} )^{\top }$ are given by

$$\begin{aligned} \Delta _{m,T}= & {} \sqrt{T} \left( \widehat{\mathbf {SD}}_{M,T} ) - {\mathbf {SD}}_{M,T} \right) ^{\top } \nabla ^{2} g_{1m} \\&\quad \left( (1-\psi _{m})\widehat{\mathbf {SD}}_{M,T} + \psi _{m} {\mathbf {SD}}_{M,T} \right) \left( \widehat{\mathbf {SD}}_{M,T} - {\mathbf {SD}}_{M,T} \right) ~, \end{aligned}$$

where $\nabla ^{2} g_{1m}$ denotes the Hessian matrix of the m-th component $g_{1m}$ of the function $g_{1}$ and $\psi _{m}\in (0,1)$ is an intermediate point, $m=1, \ldots ,M$. By (17) and (16) we have

$$\begin{aligned} \lim _{T\rightarrow \infty } {\mathbf {SD}}_{M,T} = {\sigma ^{2} \over M} {\mathbf {S}}_{M} \quad \text{ and } \quad \lim _{T\rightarrow \infty } \widehat{\mathbf {SD}}_{M,T} = {\sigma ^{2} \ \over M} {\mathbf {S}}_{M} ~~~ \text{ in } \text{ probability } , \end{aligned}$$

and $\Delta _{m,T} = O_{P}(1/\sqrt{T})$. Note that $g_{1}^{\prime } $ and $\nabla ^{2} g_{1m}$ are continuous at the point ${\sigma ^{2} \over M} {\mathbf {S}}_{M} $, $m=1, \ldots , M$. Consequently, it follows from (28), (29) and (17) that

$$\begin{aligned}&\sqrt{T} \left( \left( {\hat{s}}_1^{2} , \ldots , {\hat{s}}_M^{2} \right) ^{\top } - \left( {s}_1^{2} , \ldots , {s}_M^{2} \right) ^{\top } \right) \nonumber \\&= \sqrt{T} \left( g_{1} (\widehat{\mathbf {SD}}_{M,T}) - g_{1} ({\mathbf {SD}}_{M,T} ) \right) \overset{L}{\longrightarrow } \mathcal{N} (\mathbf {0}, \Sigma _{1})~, \end{aligned}$$

(30)

where the $M \times M$ matrix $\Sigma _{1}$ is given by

$$\begin{aligned} \Sigma _{1} = M^{2} \Sigma - M \left( {\mathbf {S}}_{M} {\varvec{1}}_{M}^{\top } \Sigma + \Sigma {\varvec{1}}_{M} {\mathbf {S}}_{M} ^{\top } \right) + {\varvec{1}}_{M}^{\top } \Sigma {\varvec{1}}_{M} {\mathbf {S}}_{M} {\mathbf {S}}_{M} ^{\top } ~. \end{aligned}$$

Now note that ${\hat{\xi }}_M = g_{2} (\hat{{\varvec{S}}}_{M} )$, where the function $g_{2}: {\mathbb {R}}^M \rightarrow {\mathbb {R}}$ is defined by

$$\begin{aligned} g_{2}(x) ={1 \over M} \sum _{m=1}^{M} x^2_m ~. \end{aligned}$$

Observing that $\nabla g_{2}(x) = {2 \over M} (x_1 \ldots , x_M)$ assertion (18) follows by the Delta-method, where a straightforward calculation yields the limiting variance in (20). Similarly, (19) is obtained observing that ${\hat{\zeta }}_M = g_{3} (\hat{{\varvec{S}}}_{M} )$, where the function $g_{3}: {\mathbb {R}}^M \rightarrow {\mathbb {R}}$ is defined by

$$\begin{aligned} g_{3}(x)= & {} {1 \over M} \sum _{m=2}^{M} \sqrt{x_m x_{m-1} }~, \\ \nabla g_{3}(x)= & {} {1 \over 2M} \left( {\sqrt{x_2} \over \sqrt{x_1}} , {\sqrt{x_1} + \sqrt{x_3} \over \sqrt{x_2}} , ~\ldots ~, {\sqrt{x_{M-2}} + \sqrt{x_M} \over \sqrt{x_{M-1}}} , {\sqrt{x_{M-1}} \over \sqrt{x_M} } \right) ^{\top }~. \end{aligned}$$

Finally, if $\Sigma $ is a diagonal matrix as specified in Proposition 3 the formulas (22) and (23) follow by a direct calculation which completes the proof of Proposition 3.

Moreover, the arguments provided in this proof show that the result of Proposition 3 would remain correct for any estimator $\left( {\hat{s}}_1^{2} , \ldots , {\hat{s}}_M^{2} \right) ^{\top } $, for which an asymptotic normality in the sense (30) can be established. However, the limiting variances $\tau _1^2$ and $\tau _2^2$ may change as they depend on the specific form of the estimator. $\square $

1.1 Implementation of the WSD estimator

In the following we outline the implementation of the WSD estimator as in Boudt et al. (2011). Let $m \in \{1,\dots ,M\}$ be a fixed intraday interval and compute the returns $x_{t,m}=r_{t,m}/{\hat{\sigma }}_t$ where ${\hat{\sigma }}^2_t$ is the BV measure. Next, consider the T order statistics $x_{(1),m} \le x_{(2),m} \le \dots \le x_{(T),m}$. Let $h_T = \lfloor T/2 \rfloor +1.$ We first calculate $\text {SH}_m = 0.741\cdot \min (x_{(h_T),m}-x_{(1),m},\dots ,x_{(T),m}-x_{(T-h_T+1),m})$ and then

$$\begin{aligned} {\hat{s}}_m^{\text {SH}} = \frac{\text {SH}_m}{\sqrt{\frac{1}{M} \sum _{j=1}^{M} \text {SH}^2_j}}. \end{aligned}$$

The WSD estimates are then given by

$$\begin{aligned} {\hat{s}}_m = \frac{\text {WSD}_m}{\sqrt{\frac{1}{M} \sum _{j=1}^{M} \text {WSD}^2_j}} \quad \text{ with } \quad WSD_j = \sqrt{1.081 \cdot \frac{\sum _{t=1}^T w_{t,j} x^2_{t,j}}{\sum _{t=1}^T w_{t,j} }}. \end{aligned}$$

The weights are given by $w_{t,j} = \theta (x_{t,j} / {\hat{s}}_m^{\text {SH}})$ and the weight function is $ \theta (z) = \mathbbm {1}(z^2 \le 6.635)$ where $\mathbbm {1}(\cdot )$ denotes the indicator function.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dette, H., Golosnoy, V. & Kellermann, J. The effect of intraday periodicity on realized volatility measures. Metrika 86, 315–342 (2023). https://doi.org/10.1007/s00184-022-00875-0

Download citation

Received: 17 August 2021
Accepted: 05 July 2022
Published: 16 July 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s00184-022-00875-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The effect of intraday periodicity on realized volatility measures

Abstract

Similar content being viewed by others

Volatility in the Cryptocurrency Market

Momentum: what do we know 30 years after Jegadeesh and Titman’s seminal paper?

Functional central limit theorems for rough volatility

1 Introduction

2 Measuring daily volatility based on intraday information

2.1 Model for intraday returns and realized measures

2.2 Discrete time model for intraday returns

3 The impact of intraday periodicity on RV and BV

3.1 Bias and variance of realized estimators

Proposition 1

Proposition 2

3.2 Asymptotic distribution in case of intraday periodicity

Theorem 1

3.3 Estimation of IP correction factors

Proposition 3

4 Simulation study

4.1 The impact of IP on realized measures

4.2 The comparison of IP-bias corrected estimators

5 Empirical study

5.1 Estimation of intraday pattern and descriptive statistics

5.2 Discussion and further extensions

6 Summary

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest statement

Additional information

Publisher's Note

Appendices

Appendix

Proofs

Proof of Proposition 1

Proof of Proposition 2

Proof of Theorem 1

Proof of Proposition 3

1.1 Implementation of the WSD estimator

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation