Identification and validation of stable ARFIMA processes with application to UMTS data

doi:10.1016/j.chaos.2017.03.059

Chaos, Solitons & Fractals

Volume 102, September 2017, Pages 456-466

https://doi.org/10.1016/j.chaos.2017.03.059

Abstract

In this paper we present an identification and validation scheme for stable autoregressive fractionally integrated moving average (ARFIMA) time series. The identification part relies on a recently introduced estimator, which generalizes that of Kokoszka and Taqqu, and on a new fractional differencing algorithm. It also incorporates a low-variance estimator for the memory parameter based on the sample mean-squared displacement. The validation part includes standard noise diagnostics and a backtesting procedure. The scheme is illustrated on Universal Mobile Telecommunications System (UMTS) data collected in an urban area. We show that the stochastic component of the data can be modeled by a long memory ARFIMA process. This can help to monitor possible hazards related to electromagnetic radiation.

Introduction

The concepts of anomalous diffusion and fractional dynamics have deeply penetrated the statistical and chemical physics communities, and the subject has also become a major field in mathematics [1], [2]. Historically, fractional dynamical systems are related to the concept of fractional dynamic equations. This is an active field of study in physics, mechanics, mathematics, and economics, investigating the behavior of objects and systems that are described using differentiation of fractional orders. The celebrated fractional Fokker–Planck equation (FFPE), describing anomalous diffusion in the presence of an external potential, was derived explicitly in [3], where methods of its solution were introduced and exact solutions were calculated for some special cases.

Derivatives and integrals of fractional orders can be used to describe random phenomena that are characterized by long (power-like) memory or self-similarity [1], [2]. Long memory (or long-range dependence) is a property of certain stationary stochastic processes in which events that are arbitrarily distant in time still influence each other exceptionally strongly. It has been associated historically with slow decay of correlations and a certain type of scaling that is connected to self-similar processes [4], [5].

Recently, there has been great interest in long-range dependent and self-similar processes, in particular fractional Brownian motion (FBM), fractional stable motion (FSM) and autoregressive fractionally integrated moving average (ARFIMA) processes, the last also being called fractional autoregressive integrated moving average (FARIMA) [6], [7]. Their importance can be judged, for example, by the very large number of publications having one of these notions in the title, in areas such as finance and insurance [8], [9], [10], [11], [12], [13], [14], [15], telecommunication [16], [17], [18], [19], [20], [21], hydrology [22], climate studies [23], linguistics [24], DNA sequencing [25] or medicine [26]. Long-range dependent and self-similar processes also appear widely in other areas like biophysics [7], [27], [28], [29], [30], [31], [32] or astronomy [33]. These publications address a great variety of issues: detection of long memory and self-similarity in the data, statistical estimation of parameters of long-range dependence and self-similarity, limit theorems under long-range dependence and self-similarity, simulation of long memory and self-similar processes, relations to ergodicity and many others [6], [7], [34], [35], [36], [37].

FBM, FSM and ARFIMA serve as basic stochastic models for fractional anomalous dynamics [7]. The former two models are self-similar and their increments form long-range dependent processes. The discrete-time ARFIMA process is stationary and generalizes both models since, when aggregated, it converges in the limit to either fractional Brownian or fractional stable motion. As a consequence, a partial sum ARFIMA process can be considered as a unified model for fractional anomalous diffusion in experimental data [38]. The type of anomaly of the process is controlled only by its memory parameter, regardless of the underlying distribution [28]. We also note that there is a relationship between ARFIMA and the continuous time random walk (CTRW), which is a classical model of anomalous diffusion [3], [39]. The latter can be obtained by subordination of the Ornstein–Uhlenbeck process, whose discrete version is an autoregressive (AR) process and hence a special case of ARFIMA [40], [41].

In contrast to FBM and FSM, ARFIMA allows for different light- and heavy-tailed distributions, and both long (power-like) and short (exponential) dependencies [38]. Moreover, as a stationary process, it provides prediction tools.

For ARFIMA with Gaussian noise and memory parameter d greater than 0, the autocovariance function decays so slowly that it is not absolutely summable. This behavior serves as a classical definition of long-range dependence. However, it is also well known that heavy-tailed probability distributions with diverging variance are ubiquitous in nature and finance [42], [43], [44], [45], [46], [47].
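To make the non-summability of the autocovariance concrete, consider the simplest case, ARFIMA(0, d, 0) with finite-variance noise and 0 < d < 1/2. The closed form below is the standard textbook expression for its autocorrelation function, written here with the symbol ρ(k) (a notation introduced only for this illustration); it is not a result specific to this paper:

$$\rho(k) = \frac{\Gamma(1-d)\,\Gamma(k+d)}{\Gamma(d)\,\Gamma(k+1-d)} \sim \frac{\Gamma(1-d)}{\Gamma(d)}\, k^{2d-1} \quad \text{as } k \to \infty.$$

Since $2d - 1 > -1$ whenever d > 0, the series $\sum_{k} |\rho(k)|$ diverges. For a short-memory ARMA process, by contrast, the correlations decay exponentially and the sum is finite.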

Stable probability densities with index of stability α < 2 decay at infinity as $|x|^{-1-\alpha}$, where α varies between 0 and 2. They attract distributions having the same law of decay. In contrast, the Gaussian distribution has index of stability 2 and attracts all distributions with lighter tails [42], [48], [49].

Stably distributed random noises are observed in such diverse applications as plasma turbulence (density and electric field fluctuations [49], [50], [51]), stochastic climate dynamics [52], [53], [54], physiology (heartbeats [55]), electrical engineering [56], biology [28], [30], and economics [57], [58]. Heavy-tailed distributions govern circulation of dollar bills [59] and behavior of the marine vertebrates in response to patchy distribution of food resources [60].

In this paper we propose an identification and validation scheme for ARFIMA processes with noise in the domain of attraction of a stable law, based on the estimation algorithm introduced in [61]. The scheme is illustrated on electromagnetic radiation data, which show long memory behavior of the kind also observed for telecommunication data in [19].

The paper is organized as follows: in Section 2 we recall basic facts about a prominent example of long memory processes, namely ARFIMA time series. In Section 3 we introduce a step-by-step procedure for identification of an ARFIMA process. The procedure involves (i) a method of preliminary estimation of the memory parameter based on the mean-squared displacement, (ii) a new method of fractional differencing which leads to model order estimation and (iii) the estimation formula for stable ARFIMA time series introduced in [61]. Section 4 is devoted to validation of the fitted model. It consists of analysis of residuals (testing their randomness and fitting a distribution, both done by standard statistical tests) and backtesting, which involves the prediction formula for ARFIMA time series. The identification and validation procedure is illustrated in Section 5 on electromagnetic field data collected in the vicinity of a Universal Mobile Telecommunications System (UMTS) station in Wroclaw. After removing deterministic seasonality and volatility from the data, a long memory ARFIMA process is identified and validated. In Section 6 a summary of the results is given.

Section snippets

ARFIMA process

In this section we briefly present the main facts about ARFIMA time series, which were introduced in [62] and [63]. Such a process $\{X_t\}$, denoted by ARFIMA(p, d, q), is defined by $\Phi_p(B) X_t = \Theta_q(B) (1 - B)^{-d} Z_t$, where the innovations (noise sequence) $Z_t$ are i.i.d. random variables with either finite or infinite variance. We also assume that the innovations belong to the domain of attraction of an α-stable law with 0 < α ≤ 2. For the infinite variance case (α < 2) this means that $P(|Z_t| > x) = x^{-\alpha} L(x)$ as $x \to \infty$, where L
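As an illustration of the defining equation, the following is a minimal sketch of the pure fractional case p = q = 0, where the definition reduces to X_t = (1 − B)^{−d} Z_t with B the backshift operator (B X_t = X_{t−1}). It expands (1 − B)^{−d} into a power series and truncates it. The function names, the truncation length `trunc`, and the default Gaussian innovations are assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def frac_integration_weights(d, n):
    """Return the first n coefficients b_j of (1 - B)^(-d) = sum_j b_j B^j."""
    b = np.empty(n)
    b[0] = 1.0
    for j in range(1, n):
        # b_j = Gamma(j + d) / (Gamma(j + 1) Gamma(d)), computed recursively
        b[j] = b[j - 1] * (j - 1 + d) / j
    return b


def simulate_arfima_0d0(d, n, noise=None, trunc=1000, seed=0):
    """Approximate X_t = sum_{j < trunc} b_j Z_{t-j} for n time points.

    `noise` must hold n + trunc - 1 innovations; if omitted, standard
    Gaussian innovations are drawn (an assumption made for illustration).
    """
    if noise is None:
        noise = np.random.default_rng(seed).standard_normal(n + trunc - 1)
    noise = np.asarray(noise, dtype=float)
    if noise.size != n + trunc - 1:
        raise ValueError("noise must have length n + trunc - 1")
    b = frac_integration_weights(d, trunc)
    # 'valid' keeps only outputs for which all trunc past innovations exist
    return np.convolve(noise, b, mode="valid")


x = simulate_arfima_0d0(d=0.3, n=500)
```

Heavy-tailed innovations can be supplied through the `noise` argument, for example drawn with `scipy.stats.levy_stable.rvs`; the truncated expansion is only an approximation, and its accuracy depends on `trunc` because the weights decay slowly for d > 0.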

ARFIMA identification

In this section we describe the identification algorithm for ARFIMA processes. In this procedure we assume that the data are stationary.
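The first two ingredients of the identification procedure outlined in the Introduction (a preliminary memory estimate from the mean-squared displacement, followed by fractional differencing) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' algorithm: it assumes that the sample mean-squared displacement of the partial-sum process scales like τ^{2d+1} in the lag τ, and it uses the classical truncated binomial expansion of (1 − B)^d rather than the new differencing method proposed in the paper. All function names and the choice `max_lag=10` are hypothetical.

```python
import numpy as np


def sample_msd(y, lags):
    """Sample MSD of trajectory y: mean of (y[k + lag] - y[k])^2 over k."""
    y = np.asarray(y, dtype=float)
    return np.array([np.mean((y[lag:] - y[:-lag]) ** 2) for lag in lags])


def estimate_memory_msd(x, max_lag=10):
    """Estimate d from the log-log slope of the MSD of the partial sums of x."""
    y = np.cumsum(x)
    lags = np.arange(1, max_lag + 1)
    slope, _ = np.polyfit(np.log(lags), np.log(sample_msd(y, lags)), 1)
    return (slope - 1.0) / 2.0  # assumed scaling: MSD(lag) ~ lag^(2d + 1)


def frac_diff(x, d):
    """Apply (1 - B)^d to x, truncating the expansion at the sample start."""
    x = np.asarray(x, dtype=float)
    w = np.empty(x.size)
    w[0] = 1.0
    for j in range(1, x.size):
        # coefficients of (1 - B)^d, computed recursively
        w[j] = w[j - 1] * (j - 1 - d) / j
    # out[t] = sum_{j <= t} w[j] * x[t - j]
    return np.convolve(x, w)[: x.size]


rng = np.random.default_rng(1)
white = rng.standard_normal(5000)
d_hat = estimate_memory_msd(white)   # should be close to 0 for white noise
filtered = frac_diff(white, d_hat)   # series passed on to order selection
```

After differencing with the estimated d, the remaining short-memory structure would typically be inspected (for instance through sample autocorrelations) to choose the orders p and q; that step is not shown here.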

ARFIMA validation

In this section we describe statistical tools that can be applied to justify the hypothesis of ARFIMA time series as an underlying model for empirical data.
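The snippet above does not name the specific diagnostics. As one common example of a randomness check on model residuals, the Ljung-Box portmanteau statistic can be computed as sketched below. The choice of this test, the function name and the default `max_lag=20` are assumptions for illustration. The chi-square reference distribution used for the p-value assumes finite variance, so for heavy-tailed residuals the p-value should be read with caution; when the test is applied to residuals of a fitted model, the degrees of freedom are usually reduced by the number of fitted ARMA coefficients, which this sketch does not do.

```python
import numpy as np
from scipy.stats import chi2


def ljung_box(residuals, max_lag=20):
    """Return the Ljung-Box statistic Q and its chi-square p-value."""
    r = np.asarray(residuals, dtype=float)
    r = r - r.mean()
    n = r.size
    denom = np.dot(r, r)
    lags = np.arange(1, max_lag + 1)
    # sample autocorrelations at lags 1, ..., max_lag
    acf = np.array([np.dot(r[k:], r[:-k]) / denom for k in lags])
    q = n * (n + 2) * np.sum(acf ** 2 / (n - lags))
    p_value = chi2.sf(q, df=max_lag)
    return q, p_value


rng = np.random.default_rng(0)
q_stat, p_value = ljung_box(rng.standard_normal(1000))
```

A small p-value indicates remaining serial correlation in the residuals, which would argue against the fitted model.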

UMTS data

In this section we analyze a set of UMTS data, see Fig. 2. The electromagnetic field intensity was measured in Wroclaw in an urban area every minute from 12.01.2011 22:40 to 19.01.2011 21:18 (9999 observations).
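The stated sample size is consistent with the sampling scheme: one reading per minute over the stated interval, counting both endpoints, gives 9999 values. A quick check, assuming the dates are written in day.month.year format:

```python
from datetime import datetime, timedelta

start = datetime(2011, 1, 12, 22, 40)  # 12.01.2011 22:40
end = datetime(2011, 1, 19, 21, 18)    # 19.01.2011 21:18

# 9998 whole minutes between the endpoints, plus the first observation
n_obs = (end - start) // timedelta(minutes=1) + 1
print(n_obs)  # 9999
```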

Conclusions

The ARFIMA process can serve as a universal and simple discrete time model for fractional dynamics of empirical data and the celebrated FBM and FSM form the limiting case of ARFIMA [7]. It offers a lot of flexibility in modeling of long (power-like) and short (exponential) dependencies by choosing the memory parameter d and appropriate autoregressive and moving average coefficients. Modeling with ARFIMA processes also allows for taking into account different light and heavy-tailed

Acknowledgment

The authors would like to acknowledge the support of NCN Maestro Grant No. 2012/06/A/ST1/00258. We also thank Prof. Bieńkowski from the Electromagnetic Environment Protection Lab of the Wroclaw University of Science and Technology for providing us with the data.

References (87)

R. Metzler et al. The random walks guide to anomalous diffusion: a fractional dynamics approach. Phys Rep (2000)
D.O. Cajueiro et al. Time-varying long-range dependence in US interest rates. Chaos Solitons Fractals (2007)
D.O. Cajueiro et al. Testing for long-range dependence in the Brazilian term structure of interest rates. Chaos Solitons Fractals (2009)
J.T. Barkoulas et al. Long-memory exchange rate dynamics in the euro era. Chaos Solitons Fractals (2016)
R.T. Baillie. Long memory processes and fractional integration in econometrics. J Econ (1996)
T. Graves et al. Efficient Bayesian inference for natural time series using ARFIMA processes. Nonlin Processes Geophys (2015)
J. Janczura et al. Ergodicity testing for anomalous diffusion: small sample statistics. J Chem Phys (2015)
H. Li et al. Fractional-moment capital asset pricing model. Chaos Solitons Fractals (2009)
K. Burnecki et al. Discriminating between light- and heavy-tailed distributions with limit theorem. PLoS ONE (2015)
P.D. Ditlevsen. Observation of alpha-stable noise induces millennial climate changes from an ice record. Geophys Res Lett (1999)
D. Brockmann et al. The scaling laws of human travel. Nature (2006)
J. Geweke et al. The estimation and application of long memory time series models. J Time Ser Anal (1983)
I. Bronstein. Transient anomalous diffusion of telomeres in the nucleus of mammalian cells. Phys Rev Lett (2009)
M.M. Meerschaert et al. Stochastic models for fractional calculus. De Gruyter Studies in Mathematics (2012)
J. Beran. Statistics for long-memory processes (1994)
P. Doukhan et al. Theory and applications of long range dependence (2003)
G. Samorodnitsky. Long range dependence. Found Trends Stoch Syst (2006)
K. Burnecki et al. Algorithms for testing of fractional dynamics: a practical guide to ARFIMA modelling. J Stat Mech (2014)
S.R. Souza et al. Long memory testing for Fed funds futures contracts. Chaos Solitons Fractals (2008)
A.W. Lo. Long-term memory in stock market prices. Econometrica (1991)
A.W. Lo. Fat tails, long memory, and the stock market since the 1960s. Econ Notes (2001)
K. Burnecki. Self-similar processes as weak limits of a risk reserve process. Probab Math Statist (2000)
J. Beran et al. Long-range dependence in variable-bit-rate video traffic. IEEE Trans Commun (1995)
I. Norros. On the use of fractional Brownian motion in the theory of connectionless networks. IEEE J Sel Areas Commun (1995)
W. Willinger et al. Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level. IEEE/ACM Trans Net (1997)
T. Karagiannis et al. Long-range dependence: ten years of Internet traffic modeling. IEEE Internet Comput (2004)
M. Coulon et al. Detection of multiple changes in fractional integrated ARMA processes. IEEE Trans Signal Process (2009)
S.A. Stoev et al. Estimating heavy-tail exponents through max self-similarity. IEEE Trans Inf Theory (2011)
S. Painter. Long-range dependence in the subsurface: Empirical evidence and simulation methods. Invited paper at the...
C. Varotsos et al. Long-memory processes in ozone and temperature variations at the region 60° S-60° N. Atmos Chem Phys (2006)
E. Alvarez-Lacalle et al. Hierarchical structures induce long-range dynamical correlations in written texts. PNAS (2006)
D. Karmeshu et al. Sequence variability and long-range dependence in DNA: an information theoretic perspective
C.K. Peng et al. Long-range anticorrelations and non-Gaussian behaviour of the heartbeat. Phys Rev Lett (1993)
J. Szymanski et al. Elucidating the origin of anomalous diffusion in crowded fluids. Phys Rev Lett (2009)
K. Burnecki et al. Fractional Lévy stable motion can model subdiffusive dynamics. Phys Rev E (2010)
E. Kepten et al. Ergodicity convergence test suggests telomere motion obeys fractional dynamics. Phys Rev E (2011)
K. Burnecki. FARIMA processes with application to biophysical data. J Stat Mech (2012)
K. Burnecki et al. Statistical modelling of subdiffusive dynamics in the cytoplasm of living cells: a FARIMA approach. EPL (2012)
K. Burnecki et al. Estimating the anomalous diffusion exponent for single particle tracking data with measurement errors - an alternative approach. Sci Rep (2015)
A. Stanislavsky et al. FARIMA modeling of solar flare activity from empirical time series of soft X-ray solar emission. Astrophys J (2009)
S. Stoev et al. Simulation methods for linear fractional stable motion and FARIMA using the fast Fourier transform. Fractals (2004)
H. Guo et al. Local Whittle estimator for anisotropic random fields. J Multivariate Anal (2009)

