Elsevier

Chaos, Solitons & Fractals

Volume 102, September 2017, Pages 456-466
Chaos, Solitons & Fractals

Frontiers
Identification and validation of stable ARFIMA processes with application to UMTS data

https://doi.org/10.1016/j.chaos.2017.03.059Get rights and content

Abstract

In this paper we present an identification and validation scheme for stable autoregressive fractionally integrated moving average (ARFIMA) time series. The identification part relies on a recently introduced estimator which is a generalization of that of Kokoszka and Taqqu and a new fractional differencing algorithm. It also incorporates a low-variance estimator for the memory parameter based on the sample mean-squared displacement. The validation part includes standard noise diagnostics and backtesting procedure. The scheme is illustrated on Universal Mobile Telecommunications System (UMTS) data collected in an urban area. We show that the stochastic component of the data can be modeled by the long memory ARFIMA. This can help to monitor possible hazards related to the electromagnetic radiation.

Introduction

The concept of anomalous diffusion and fractional dynamics has deeply penetrated the statistical and chemical physics communities, yet the subject has also become a major field in mathematics [1], [2]. Historically, fractional dynamical systems are related to the concept of fractional dynamic equations. This is an active field of study in physics, mechanics, mathematics, and economics investigating the behavior of objects and systems that are described by using differentiation of fractional orders. The celebrated fractional Fokker–Planck equation (FFPE), describing anomalous diffusion in the presence of an external potential was derived explicitly in [3], where methods of its solution were introduced and for some special cases exact solutions were calculated.

Derivatives and integrals of fractional orders can be used to describe random phenomena that can be characterized by long (power-like) memory or self-similarity [1], [2]. Long memory (or long-range dependence) is a property of certain stationary stochastic processes describing phenomena, which concern the events that are arbitrarily distant still influence each other exceptionally strong. It has been associated historically with slow decay of correlations and a certain type of scaling that is connected to self-similar processes [4], [5].

Recently, there has been a great interest in long-range dependent and self-similar processes, in particular fractional Brownian motion (FBM), fractional stable motion (FSM) and autoregressive fractionally integrated moving average (ARFIMA), which are also called fractional autoregressive integrated moving average (FARIMA) [6], [7]. This importance can be judged, for example, by a very large number of publications having one of these notions in the title, in areas such as finance and insurance [8], [9], [10], [11], [12], [13], [14], [15], telecommunication [16], [17], [18], [19], [20], [21], hydrology [22], climate studies [23], linguistics [24], DNA sequencing [25] or medicine [26]. Long-range dependent and self-similar processes also appear widely in other areas like biophysics [7], [27], [28], [29], [30], [31], [32] or astronomy [33]. These publications address a great variety of issues: detection of long memory and self-similarity in the data, statistical estimation of parameters of long-range dependence and self-similarity, limit theorems under long-range dependence and self-similarity, simulation of long memory and self-similar processes, relations to ergodicity and many others [6], [7], [34], [35], [36], [37].

The FBM, FSM and ARFIMA serve as basic stochastic models for fractional anomalous dynamics [7]. The former two models are self-similar and their increments form long-range dependent processes. The discrete-time ARFIMA process is stationary and generalizes both models since aggregated, in the limit, it converges to either fractional Brownian or stable motion. As a consequence, a partial sum ARFIMA process can be considered as a unified model for fractional anomalous diffusion in experimental data [38]. A type of anomaly of the process is controlled only by its the memory parameter regardless of the underlying distribution [28]. We also note that there is a relationship between the ARFIMA and continuous time random walk (CTRW) which is a classical model of anomalous diffusion [3], [39]. The latter can be obtained by subordination of the Ornstein–Uhlenbeck process which discrete version is an autoregressive (AR) process, so a special case of the ARFIMA [40], [41].

In contrast to FBM and FLSM, ARFIMA allows for different light- and heavy-tailed distributions, and both long (power-like) and short (exponential) dependencies [38]. Moreover, as a stationary process, it provides prediction tools.

It appears that the values of ARFIMA with Gaussian noise, for the memory parameter d greater than 0, have so slowly decaying autocovariance function that it is not absolutely summable. This behavior serves as a classical definition of the long-range dependence. However, it is also a well-known fact that the heavy-tailed probability distributions with diverging variance are ubiquitous in nature and finance [42], [43], [44], [45], [46], [47].

The stable probability densities have the asymptotics decaying at infinity as |x|1α, where α is the index of stability varying between 0 and 2. They attract distributions having the same law of decay. On the contrary, the Gaussian distribution has the index of stability 2 and attracts all distributions with lighter tails [42], [48], [49].

Stably distributed random noises are observed in such diverse applications as plasma turbulence (density and electric field fluctuations [49], [50], [51]), stochastic climate dynamics [52], [53], [54], physiology (heartbeats [55]), electrical engineering [56], biology [28], [30], and economics [57], [58]. Heavy-tailed distributions govern circulation of dollar bills [59] and behavior of the marine vertebrates in response to patchy distribution of food resources [60].

In this paper we propose an identification and validation scheme for ARFIMA processes with noise in the domain of attraction of the stable law which is based on estimation algorithm introduced in [61]. The scheme is illustrated on the electromagnetic radiation data which shows long memory behavior which is also observed for telecommunication data in [19].

The paper is organized as follows: in Section 2 we recall basic facts about a prominent example of long memory processes, namely ARFIMA time series. In Section 3 we introduce a step by step procedure for identification of a ARFIMA process. The procedure involves (i) a method of preliminary estimation of the memory parameter based on the mean-squared displacement, (ii) a new method of fractional differencing which leads to model order estimation and (iii) the estimation formula for stable ARFIMA times series introduced in [61]. Section 4 is devoted to validation of the fitted model. It consists of analysis of residuals: testing their randomness and fitting a distribution which is done by standard statistical tests, and backtesting which involves prediction formula for ARFIMA time series. The identification and validation procedure is illustrated in Section 5 on electromagnetic field data collected in the vicinity of an Universal Mobile Telecommunications System (UMTS) station in Wroclaw. After removing deterministic seasonality and volatility from the data, a long memory ARFIMA process is identified and validated. In Section 6 a summary of the results is given.

Section snippets

ARFIMA process

In this section we briefly present the main facts about ARFIMA time series which were introduced in [62] and [63]. Such process {Xt}, denoted by ARFIMA(p, d, q), is defined by Φp(B)Xt=Θq(B)(1B)dZt,where innovations (noise sequence) Zt are i.i.d. random variables with either finite or infinite variance. We also assume that the innovations belong to the domain of attraction of an α-stable law with 0 < α ≤ 2. For the infinite variance case (α < 2) this means that P(|Zt|>x)=xαL(x),asx,where L

ARFIMA identification

In this section we describe the identification algorithm of ARFIMA processes. In this procedure we assume that data that are stationary.

ARFIMA validation

In this section we describe statistical tools that can be applied to justify the hypothesis of ARFIMA time series as an underlying model for empirical data.

UMTS data

In this section we analyze a set of UMTS data, see Fig. 2. The electromagnetic field intensity was measured in Wroclaw in an urban area every minute from 12.01.2011 22:40 to 19.01.2011 21:18 (9999 observations).

Conclusions

The ARFIMA process can serve as a universal and simple discrete time model for fractional dynamics of empirical data and the celebrated FBM and FSM form the limiting case of ARFIMA [7]. It offers a lot of flexibility in modeling of long (power-like) and short (exponential) dependencies by choosing the memory parameter d and appropriate autoregressive and moving average coefficients. Modeling with ARFIMA processes also allows for taking into account different light and heavy-tailed

Acknowledgment

The authors would like to acknowledge a support of NCN Maestro Grant No. 2012/06/A/ST1/00258. We also thank prof. Bieńkowski from the Electromagnetic Environment Protection Lab of the Wroclaw University of Science and Technology for providing us the data.

References (87)

  • D. Brockmann et al.

    The scaling laws of human travel

    Nature

    (2006)
  • J. Geweke et al.

    The estimation and application of long memory time series models

    J Time Ser-Anal

    (1983)
  • I. Bronstein

    Transient anomalous diffusion of telomeres in the nucleus of mammalian cells

    Phys Rev Lett

    (2009)
  • M.M. Meerschaert et al.

    Stochastic models for fractional calculus

    De gruyter studies in mathematics

    (2012)
  • J. Beran

    Statistics for long-memory processes

    (1994)
  • P. Doukham et al.

    Theory and applications of long range dependence

    (2003)
  • G. Samorodnitsky

    Long range dependence

    Found Trends Stoch Syst

    (2006)
  • K. Burnecki et al.

    Algorithms for testing of fractional dynamics: a practical guide to ARFIMA modelling

    J Stat Mech

    (2014)
  • S.R. Souzo et al.

    Long memory testing for fed funds futures contracts

    Chaos Solitons Fractals

    (2008)
  • A.W. Lo

    Long-term memory in stock market prices

    Econometrica

    (1991)
  • A.W. Lo

    Fat tails, long memory, and the stock market since the 1960s

    Econ Notes

    (2001)
  • K. Burnecki

    Self-similar processes as weak limits of a risk reserve process

    Probab Math Statist

    (2000)
  • J. Beran et al.

    Long-range dependence in variable-bit-rate video traffic

    IEEE Trans Commun

    (1995)
  • I. Norros

    On the use of fractional brownian motion in the theory of connectionless networks

    IEEE J Sel Areas Commun

    (1995)
  • W. Willinger et al.

    Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level

    IEEE/ACM Trans Net

    (1997)
  • T. Karagiannis et al.

    Long-range dependence ten years of internet traffic modeling

    IEEE Internet Comput

    (2004)
  • M. Coulon et al.

    Detection of multiple changes in fractional integrated ARMA processes

    IEEE Trans Signal Process

    (2009)
  • S.A. Stoev et al.

    Estimating heavy-tail exponents through max self “similarity”

    IEEE Trans Inf Theory

    (2011)
  • Painter S.. Long-range dependence in the subsurface: Empirical evidence and simulation methods. Invited paper at the...
  • C. Varotsos et al.

    Long-memory processes in ozone and temperature variations at the region 60° S-60° N

    Atmos Chem Phys

    (2006)
  • E. Alvarez-Lacalle et al.

    Hierarchical structures induce long-range dynamical correlations in written texts

    PNAS

    (2006)
  • D. Karmeshu et al.

    Sequence variability and long-range dependence in DNA: an information theoretic perspective

  • C.K. Peng et al.

    Long-range anticorrelations and non-Gaussian behaviour of the heartbeat

    Phys Rev Lett

    (1993)
  • J. Szymanski et al.

    Elucidating the origin of anomalous diffusion in crowded fluids

    Phys Rev Lett

    (2009)
  • K. Burnecki et al.

    Fractional lévy stable motion can model subdiffusive dynamics

    Phys Rev E

    (2010)
  • E. Kepten et al.

    Ergodicity convergence test suggests telomere motion obeys fractional dynamics

    Phys Rev E

    (2011)
  • K. Burnecki

    FARIMA processes with application to biophysical data

    J Stat Mech

    (2012)
  • K. Burnecki et al.

    Statistical modelling of subdiffusive dynamics in the cytoplasm of living cells: a FARIMA approach

    EPL

    (2012)
  • K. Burnecki et al.

    Estimating the anomalous diffusion exponent for single particle tracking data with measurement errors - an alternative approach

    Sci Rep

    (2015)
  • A. Stanislavsky et al.

    FARIMA modeling of solar flare activity from empirical time series of soft X-ray solar emission

    Astrophys J

    (2009)
  • S. Stoev et al.

    Simulation methods for linear fractional stable motion and FARIMA using the fast fourier transform

    Fractals

    (2004)
  • H. Guo et al.

    Local whittle estimator for anisotropic random fields

    J Multivariate Anal

    (2009)
  • Cited by (21)

    • Rényi entropy and divergence for VARFIMA processes based on characteristic and impulse response functions

      2022, Chaos, Solitons and Fractals
      Citation Excerpt :

      The main advantages of the generalized IRF is that it is unique and invariant to different orderings of variables in the system and provides a good alternative to the orthogonalized approach in case of an investigation of a high-dimensional system [12]. Long-memory models have been considered in fields such as economics [13,14], biomedicine [15], meteorology [16–18], social science [19], forestry [20], geophysics [21], and telecommunications [22]. Shannon entropy [23] was a pioneer information-theoretical measure to quantify aleatory aspects on random variables, which define an information quantity contained in a univariate/multivariate probability density function (pdf), e.g., the normal distribution [24].

    • Order flow in the financial markets from the perspective of the Fractional Lévy stable motion

      2022, Communications in Nonlinear Science and Numerical Simulation
      Citation Excerpt :

      These results, as previous findings in [34–36] showed that the limit order flow exhibits strong positive autocorrelation. Here we admit that many widely used estimators of long-range memory were developed with the assumption of Gaussian noise distribution [40] thus, the more general approach based on the FLSM or ARFIMA models has to be implemented for the order flow analysis in the financial markets, and other social systems [30,32]. More careful investigation of order flow sizes (tick sizes) revealed that PDFs of tick sizes are specific for each stock with some power-law tail.

    • Modeling of water usage by means of ARFIMA–GARCH processes

      2018, Physica A: Statistical Mechanics and its Applications
      Citation Excerpt :

      Another time series approach based on regression model was considered in [35] in the context of forecasting electricity load demand. We note that ARIMA or more general autoregressive fractionally integrated moving average (ARFIMA) models are frequently used in modeling of many real life phenomena which reflect the property of long memory (power-like decay of the autocorrelation function), see, e.g. [5] where telecommunication [16], astrophysical [36] and biological data [37] were studied, [38] for applications in climate science, and the monograph by Beran [6] and references therein. In the biophysical context ARFIMA processes with a negative memory parameter are closely related to the notion of subdiffusion [5,37].

    • Jeffrey's divergence between ARFIMA processes

      2018, Digital Signal Processing: A Review Journal
      Citation Excerpt :

      They have become very popular in statistics, econometrics and finance [3] [7]. Furthermore, they have been recently used to model the stochastic component of the UMTS data [6] or to model the heart rate variability in biomedical applications [32]. Our purpose is hence to analyze if the interpretation of the JD based on inverse filtering is still valid for ARFIMA processes.

    View all citing articles on Scopus
    View full text