1 Introduction

General statistical properties of nested regional climate model (RCM)-generated simulations may seem at first sight similar to those produced by coupled global climate models (CGCMs). Similarities in fact exist when RCM-simulated fields are studied without consideration of the chronology of events, such as climatological means or frequency distributions. But due to the control exerted by the lateral boundary conditions upon nested RCM simulations, the RCM-simulated fields do exhibit some synchronicity with their driving data, whether GCM-simulated fields or reanalyses. When driven by reanalyses, RCM-simulated fields are also correlated with observations to some extent.

For several applications such as projected future climate changes, synchronicity is not an issue as only the overall trend matters. On the other hand, synchronicity is paramount in applications such as dynamical downscaling of reanalyses aiming at obtaining high-resolution pseudo observations. This last application was first suggested by Anthes (1983) and has become an active area of research over the last few years (e.g. Li et al. 2012; Stefanova et al. 2011; Weisse et al. 2009; Kanamaru and Kanamitsu 2007). A simple proof-of-concept exercise of any of these applications is to verify whether and under which conditions RCMs’ anomalies correlate well with those of the driving data. von Storch et al. (2000) have shown that the application of large-scale spectral nudging greatly improves the time correlation between a reanalysis-driven simulation and observations, although the match is not perfect (e.g. Alexandru et al. 2009).

In applications of RCM downscaling for which synchronicity matters, an important issue relates to the way chronology-related statistical properties vary with time scale. For example, how does the time correlation between the RCM-simulated fields and the corresponding driving fields or observations change when considering the time series on daily, weekly, monthly, seasonal or annual basis?

A naïve view might lead one to think that time upscaling (aggregation) should necessarily improve time correlation and apparent synchronicity, analogous to temporal or spatial upscaling that generally improves scores such as Root-Mean-Square error or bias by reducing the unpredictable noise. A typical example is “hedging” in weather forecast, operation that uses low-pass filters or horizontal diffusion to remove poorly predicted finer scales (e.g. Jakimow et al. 1992) in order to avoid the “double penalty” problem typical of high-resolution simulations (e.g. Mass et al. 2002; Bougeault 2002). The objective of this work is to investigate in a simple framework whether this conjecture is also correct for time correlation of upscaled RCM-generated data.

This question can be generalized in the following way: How are weather anomalies in the driving data or observations reproduced by the RCM as a function of time scale? If the RCM and driving data are found to be somewhat asynchronous at some short time scale, can they be more synchronous on longer time scales? As simple as it may sound, the problem just stated is not trivial, and as far as the authors are aware, there exists no general solution without some strong assumptions. In what follows we explore analytically four particular cases that help us partially answer this question. In addition, we use observation datasets and RCM-produced data to illustrate our findings.

2 Analytical approaches

2.1 A very simple case

Here we discuss the simplest possible case when both the RCM-simulated and the observations time series have zero time autocorrelation and identical time variance. Despite its simplicity, this assumption can be somewhat realistic in some cases. Here m t will refer to a reanalysis-driven RCM-simulation seasonal anomaly time series and o t the corresponding observations anomaly time series, and their variance \(\text{var} \left( {m_{t} } \right) = \text{var} \left( {o_{t} } \right) = \sigma^{2}\). In the following we will refer to ‘observations’ only, but all developments would equally apply to driving data as well.

Consider the case when the temporal correlation between these two time series is \(corr\left( {m,o} \right) = r\) (correlation hereafter refers to Pearson correlation). It is shown in Appendix 1 that upscaling these time series by averaging two successive time steps or archival times as

$$A_{m} = \frac{{m_{t} + m_{t + 1} }}{2},\,A_{o} = \frac{{o_{t} + o_{t + 1} }}{2}$$
(1)

yields a correlation

$$corr\left( {A_{m} ,A_{o} } \right) = r$$
(2)

which is identical to that of the original time series.

This may seem a counterintuitive result at first as one might have expected that time averaging would filter-out noise, resulting in an increased correlation (Sect. 2.4 will further illuminate this result). The covariance is indeed decreased, but the variances of the two variables are decreased in the same proportion, keeping the correlation unchanged. This is in fact a well-known result of multivariate statistics (see for example Johnson and Wichern 2007). Although theoretically the correlation is the same for the average, it should be noted that when estimating the correlation with data, the error of the estimation will increase as the sample size decreases with averaging (see Appendix 5). We will explore this with more detail later.

One may wonder if this rule could be generalized to all scales, that is, whether the following rule is valid: lack of correlation at short time scales implies lack of correlation at longer scales. But of course this generalization would violate one of the initial assumptions since daily values are strongly correlated in time. We will consider in Sect. 2.3 a case when autocorrelation is taken into account.

2.2 A case with different variances and correlations

In this example we discuss a somewhat more general case than the previous one. Let us consider a year composed of two seasons for simplicity, summer (S) and winter (W), each one with different variances, both for the simulated (m) and observed (o) variables, \(\sigma_{{S_{m} }}^{2} ,\sigma_{{S_{o} }}^{2} ,\sigma_{{W_{m} }}^{2} ,\sigma_{{W_{o} }}^{2}\), respectively, and with different correlations between simulations and observations anomalies, r S and r W , for summer and winter, respectively.

We may now ask about the correlation of the annual-mean anomaly r A , where A = (S + W)/2. Assuming that successive seasons are uncorrelated, Appendix 2 shows that the correlation of the annual anomaly r A can be written as a function of seasonal values as

$$r_{A} = \frac{{r_{S} \sigma_{{S_{m} }} \sigma_{{S_{o} }} + r_{W} \sigma_{{W_{m} }} \sigma_{{W_{o} }} }}{{\sqrt {\left( {\sigma_{{S_{m} }} \sigma_{{S_{o} }} + \sigma_{{W_{m} }} \sigma_{{W_{o} }} } \right)^{2} + \left( {\frac{{\sigma_{{S_{m} }} }}{{\sigma_{{S_{o} }} }} - \frac{{\sigma_{{W_{m} }} }}{{\sigma_{{W_{o} }} }}} \right)^{2} \sigma^{2}_{{S_{o} }} \sigma^{2}_{{W_{o} }} } }}.$$
(3)

Expression (2) is recovered in the special case of identical variances and identical correlations. An interesting case occurs when the second term in the denominator vanishes, that is, when the modeled variance differs from the observed one but this relative error is similar in both seasons. In this case annual anomaly correlation r A is then a weighted average of the seasonal anomaly correlations

$$r_{A} = \frac{{ar_{s} + br_{w} }}{a + b}.$$
(4)

where \(a = \sigma_{{S_{m} }} \sigma_{{S_{o} }}\) and \(b = \sigma_{{W_{m} }} \sigma_{{W_{o} }}\). This last expression can be easily generalized to more than two seasons or sub-seasons. The weights are a function of the simulated and observed time variability, and the annual anomaly correlation is dominated by the season with the largest variance. Contrary to what could have been intuitively expected, decreasing the time resolution of the time series from seasonal to annual does not increase the correlation over that of the individual values. Furthermore, if we now note that the neglected term on the denominator is positive, we see that r A is in fact smaller than the seasonal weighted average. Hence, upscaling the time series not only does not increase the correlation but in fact may reduce it; this condition would occur for instance when the model fails to simulate well seasonal variances.

2.3 Autoregressive case

Cases treated thus far have assumed no autocorrelation within each time series, making them unrealistic in many instances. To allow for autocorrelation, we will now use a representation of simulated and observed time series by first-order autoregressive processes. Assuming that the simulated time series m and that of the observations o follow first order autoregressive processes, and knowing \(corr(o_{t} ,m_{t} )\), we want to know the upscaled value for \(corr(A_{o} ,A_{m} )\), where \(A_{0} = \left( {o_{t + 1} + o_{t} } \right)/2\) and \(A_{m} = \left( {m_{t + 1} + m_{t} } \right)/2\).

In Appendix 3 it is shown that the solution is

$$corr(A_{o} ,A_{m} ) = corr(o_{t} ,m_{t} )\tfrac{1}{2}\left( {\frac{{2 + \rho_{m} + \rho_{o} }}{{\sqrt {1 + \rho_{o}^{{}} } \sqrt {1 + \rho_{m}^{{}} } }}} \right)$$
(5)

where \(\rho_{o}\) and \(\rho_{m}\) are the autoregressive parameters of each time series for a first order autoregressive process (these parameters need to be constrained between −1 and 1 for the process to be second-order stationary, and are equal to the autocorrelations at lag 1). The simplest case result presented in (2) is recovered when the autocorrelations vanish. The more interesting cases occur when the multiplying factor on the right-hand side differs from unity. As shown in Appendix 3, its value is always greater than or equal to 1, but only marginally so. For the reasonable case when the autoregressive parameters of simulations and of observations time series do not differ much, this factor exceeds unity by about 1 %, and it can reach 6 % when autocorrelations are the most different from each other but remaining positive (which means model and observations have quite different behaviours). These results suggest that

$$corr(A_{o} ,A_{m} ) \approx corr(o_{t} ,m_{t} )$$
(6)

is a very good approximation. So this result also indicates that time-upscaling an RCM time series does not improve noticeably its time correlation with observations under this approximation.

2.4 Noise plus sinusoidal signal

For the next case we assume that both time series—that of the observations and that of simulated data—consist in a random noise superimposed upon a slowly evolving sinusoidal signal. To simplify we will postulate that parameters of both time series are equal, differing only in the instantaneous values of the random noise, which are uncorrelated here. This corresponds to the case in which the model is able to reproduce the long time-scale signal in both amplitude and phase, but unable to reproduce the phase of the random noise features.

Under these assumptions, it is shown in Appendix 4 that the correlation of the upscaled time series is

$$corr(A_{o} ,A_{m} ) = corr(o_{t} ,m_{t} )\left[ {1 + \frac{{{\raise0.7ex\hbox{${\sigma_{R}^{2} }$} \!\mathord{\left/ {\vphantom {{\sigma_{R}^{2} } {\sigma_{S}^{2} }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sigma_{S}^{2} }$}}\cos (\omega\Delta t)}}{{{\raise0.7ex\hbox{${\sigma_{R}^{2} }$} \!\mathord{\left/ {\vphantom {{\sigma_{R}^{2} } {\sigma_{S}^{2} }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sigma_{S}^{2} }$}} + 1 + \cos (\omega\Delta t)}}} \right]$$
(7)

where \(\sigma_{R}^{2}\) and \(\sigma_{S}^{2}\) represent the variances of the random terms and of the sinusoidal signal, respectively. The factor \(\omega\) in the cosine function represents the angular frequency, which is inversely proportional to the period. For the sake of shortening the expressions we write it as a function of the angular frequency ω = 2π/T, where T is the period of the signal.

The factor on the right-hand side is greater than 1 for a short time interval and/or long time-scale signal, \(2\pi\Delta t/T < {\pi \mathord{\left/ {\vphantom {\pi 2}} \right. \kern-0pt} 2}\) or when \(\Delta t < {T \mathord{\left/ {\vphantom {T 4}} \right. \kern-0pt} 4}\). If in keeping with the notation used in the previous sections we assume \(\Delta t = 1\), we obtain that the period should be larger than the time covered by four successive time intervals. In this case the two successive data points to be averaged in the upscaling will have in most cases the same anomaly sign. For longer time periods this property becomes dominant. Table 1 gives a sense of the importance of this factor for a very long period signal fully encompassed in the time series.

Table 1 The first column indicates the ratio of variances between noise R and sinusoidal signal S

When the noise term is small compared with the sinusoidal signal amplitude, \(\sigma_{R}^{2} < \sigma_{S}^{2}\), the correlation of the original time series is high and averaging increases it further. When noise is dominant, \(\sigma_{R}^{2} > \sigma_{S}^{2}\), the original correlation is low and averaging increases this correlation substantially, but correlation remains modest.

This result sheds some light upon what our intuition may suggest: that when there is a distinct signal, reduction of noise by averaging does indeed increase the temporal correlation. The conflicting results obtained thus far indicate that analytical developments cannot reach the bottom of all our questions. One may wonder whether real cases have properties that resemble more one case or another.

3 Some results from RCM simulations and observations

In order to illustrate the previous developments with actual data, in this section some results for precipitation and temperature obtained from RCM simulations, weather stations, reanalysis, and gridded observation datasets are presented. The RCM used is the fifth-generation Canadian Regional Climate Model (CRCM5), described in Martynov et al. (2013) driven by Era Interim (Dee et al. 2011). For reasons of availability, data coming from different simulations will be discussed: The two simulations used in the first subsection were performed on a subcontinental domain over eastern Canada on a 11-km grid mesh, while the three used in the second subsection were performed over a domain encompassing the entire North America a 22-km grid mesh. One of these three simulations uses a moderate large-scale nudging on the wind field, with a 6-h relaxation time from the top of the troposphere until 500 hPa, then diminishing linearly with height to become unforced at the surface (Separovic et al. 2011). For seasonal values observations from CRU3.20 dataset are used (Harris et al. 2013). For daily observations data from Era-interim and from Environment Canada’s homogenized weather stations are used (see Mékis and Vincent 2011; Vincent et al. 2012). Stations were chosen using two criteria: geographical coverage within the integration domain and data quality and completeness. Five stations were selected: Bagotville, Gaspé, Kuujjuaq, Kuujjuarapik, and Roberval (see Fig. 1).

Fig. 1
figure 1

Location of selected Environment Canada weather stations

3.1 Time upscaling from daily to monthly values

In this section we will analyse the time-correlation upscaling from RCM-downscaled reanalysis and the corresponding observed values. Downscaled daily data is taken from two CRCM5 simulations differing only in initial conditions (known internally at Ouranos as bba and bbb, hereafter mentioned as “twin” simulations), integrated for 32 years from 1980 to 2011. The gridpoints used are those that are closest to the selected weather stations. Daily values are divided in two time series, one for January and the other for July, in order to remove seasonality that may increase artificially the correlation. January and July are considered to have 32 days to facilitate computations. Daily values (a total of 1024 for each series) are upscaled using expression (1) to a 2-day average time series (a total of 512 for each series), to a 4-day average (a total of 256 for each series), to a 8-day average (a total of 128 for each series), to a 16-day (a total of 64 for each series), and to a 32-day average (a total of 32 for each series).

Figure 2 depicts time correlation for surface temperature between the CRCM5 downscaled reanalysis and the Environment Canada observations for the different time scales studied. Blue lines indicate January values and red ones July values. Different symbols are used for the twin simulations (a star and a cross). During winter, correlation seems to generally increase for longer timescales. During summer, correlation is lower than during winter, and the existence of an increase with timescale is not obvious. Twin simulations produce very similar although not identical results for both seasons. Figure 3 is similar to Fig. 2 but for precipitation. Without surprise correlations are lower than for temperature. Winter precipitation seems to have an overall increase with time scale, with twin simulations producing very similar results. Summer precipitation shows no clear signs of increase and twin simulations may differ considerably in some locations.

Fig. 2
figure 2

Time correlation of surface temperature between different CRCM5 simulations and station observations from Environment Canada, for different timescales ranging between 1 and 32 days. Blue and red lines identify January and July values, respectively. Difference in symbols indicates results obtained from twin simulations. Each graphic depicts one of the selected weather stations (see Fig. 1)

Fig. 3
figure 3

As in Fig. 2 but for precipitation

It is important to recall that the estimations of correlation presented at longer time scales are associated with larger random errors as the sample number is reduced by upscaling. Appendix 5 discusses and illustrates the magnitudes of such errors. Single-day time series contain near 1000 values and hence error bars can be found to the right of the diagram in Fig. 8. Series of 32-day averages contain around 30 values and can be found near the left side of the same diagram. As we can see considering the color scale, high correlation values are associated with small but asymmetric intervals. For winter temperatures, correlation values between 0.8 and 0.9 are common, which gives intervals of confidence (total length) of around 0.03 for 1-day series and near 0.3 for 32-day series. This puts into question the statistical robustness of the apparent increase of correlation by time upscaling during winter noted in Figs. 2, 3. For summer temperature and precipitation in both seasons the situation is even less clear since lower overall correlation values are associated with larger intervals of confidence.

This suggests that even if time upscaling might appear to produce an increase in correlation, establishing its statistical robustness may need more data. But naturally this prompts the following question: Is a barely detectable increase in correlation by upscaling worth the statistical battle?

3.2 Time upscaling from seasonal to yearly values

The previous section omitted the impact of seasons when studying correlation at different timescales. In this section, as in Sect. 2.2 we will concentrate in the upscaling from seasonal to yearly sampling.

Figure 4 depicts correlations between a number of datasets and the CRU3.20 data for temperature (Fig. 4a) and precipitation (Fig. 4b). On the vertical axis the correlations obtained for annual values are plotted, while in the horizontal axis values that estimate the annual quantity by weighted average of seasonal values are depicted [here we use an extension to four seasons of expression (4)]. Each color corresponds to a different dataset, three for different simulations of the CRCM5 (bao, ban and bar2), and one (era_int) for the Era Interim reanalysis. Runs bao and ban have identical setups differing only in initial conditions (twin simulations). Run bar2 is also identical but large-scale nudging is used. In order to give some statistical robustness correlations are computed for all grid points over the province of Québec, Canada (812 points in total; but note that spatial autocorrelation makes these points far from independent, particularly for the case of temperature). Colored stars indicate the median value for each dataset.

Fig. 4
figure 4

Correlations between a number of datasets and the CRU3.20 data for temperature (a) and precipitation (b). On the vertical axis the correlations obtained for annual values are plotted, while in the horizontal axis values estimates of the annual quantity by weighted average of seasonal values are depicted [see expression (4)]. Each color corresponds to a different dataset, three (bao, ban and bar2) to different versions of CRCM5, and one (era_int) to the Era Interim reanalysis. Runs bao and ban are identical simulations differing only in initial conditions. Run bar2 is also identical but uses a moderate form of large-scale spectral nudging. Values are taken for a period from 1981 to 2010 over the province of Québec, Canada (main delimited area in Fig. 1). Correlations are computed at all grid points (which corresponds to 812 points). Colored stars indicate the mean value for each dataset

The graphs should be interpreted as follows: when points lay above the diagonal it indicates that upscaling from seasonal to annual does in fact improve over simple combinations of seasonal correlation. As discussed in Appendix 5, errors in the estimations of correlation with a 30-year long time series are quite large, hence values in the scattergram have a considerable spread.

Figure 4a shows that for temperature most of the data cloud and their mean values lay above the diagonal, suggesting that upscaling does improve correlation. This is particularly clear for simulations bao and ban performed without large-scale nudging. For ERA-Int and the spectrally nudged simulation bar2, the improved correlation is more modest; but it should be kept in mind that it is difficult to improve very high correlations since it is a bounded quantity.

The case for precipitation is somewhat different (Fig. 4b, notice change of range in both axes). Correlations between observed and simulated precipitation are overall significantly lower than that for temperature, and upscaling seems not to have a distinctive positive effect. The highest correlations are slightly degraded by upscaling and the lower ones are slightly improved.

4 Conclusions

The aim of this study was to analyse the effect of time upscaling (aggregating) on the correlation between a time series produced by a reanalysis-driven RCM and observations. Lacking a general solution, we have approached the issue from four different simple analytical perspectives: 1- Two stochastic correlated time series, without autocorrelation and with identical variances, 2- As in 1, but allowing different variances and correlations between series, as typical in the case of seasonal statistics, 3- Time series with an autoregressive behaviour, typical of short time interval series, and 4- Stochastic time series that are modulated by a sinusoidal signal, such as a interannual oscillations.

The four approaches delivered different results. The first two suggested that in the absence of autocorrelation, upscaling does not improve correlation and may in fact deteriorate it if the model fails to produce the appropriate seasonal interannual variances. Substantial autocorrelation is however a typical property of daily time series—for fields such as surface temperature for example–, and this case was discussed in the third approach. Results indicated that no degradation occurs, but no substantial improvement is to be expected by upscaling either. The last case represented a stochastic time series with no autocorrelation—and no cross correlation—with a superimposed sinusoidal signal (which in fact introduces autocorrelation and cross correlation in the time series). This case can be thought of, for example, as yearly values modulated by interannual variability with a single dominant mode. In this case, upscaling produces a substantial increase in correlation for long time scale oscillations.

The answer to the question asked in the introduction is hence not straightforward from a theoretical perspective. Results for real time series will depend on which property dominates and which assumptions are more realistic for the case chosen. Examples shown here with RCM-simulated data and observations corroborate in part the more cautious analytical results, producing some hope of correlation increase with scale for winter but almost none for summer. More work with real data is needed to provide more robust answers for a variety of cases, although the hope of substantial (and practical) gains by upscaling seems to be limited.

Before more is known about the topic, the safest approach may be to assume that lack of correlation at short time scales will express itself as lack of correlation at longer scales. If one is interested in achieving high correlation with observations at time scales such as monthly, seasonal or annual, one may be constrained to use some form of large-scale nudging within the RCM in order to strengthen the control exerted by the driving data on short time scales. It is still to be seen how intense this nudging should be to attain satisfactory results.

In the examples presented here we have focused mainly on correlations between reanalysis-driven RCM simulations and observed values. The formal considerations discussed in Sect. 2 are however relevant for other situations as well. Consider the following examples of two different sources of a given variable with significant cross correlation:

  1. 1.

    Two nested simulations driven by the same driving data, either using the same RCM (such as twin simulations as in Alexandru et al. 2009), variants of a given RCM (such as spectrally nudged or not), or different RCMs.

  2. 2.

    An RCM simulation and its driving dataset, whether a reanalysis or some CGCM-simulated data. Note that this applies even if the variable to correlate is not used to drive the RCM, such as precipitation. Studies of correlation between reanalysis-driven RCMs and reanalysis are common features in the production of high-resolution pseudo-observations or poor’s man high-resolution analysis by dynamical downscaling (e.g. Kanamaru and Kanamitsu 2007).

  3. 3.

    CGCMs simulations nudged towards reanalysis and observations (e.g. Eden et al. 2012).

  4. 4.

    Two observational datasets. Studies on temporal correlation have been carried out, for example by Brands et al. (2012), who computed time correlations on a daily timescale between two different reanalysis.

Finally, here we have concentrated on time upscaling but this discussion could be extended to spatial-upscaling too. For example, whether for daily precipitation time correlation between RCM and observations improves when instead of a single station a regional scale is considered. This is also a question that deserves attention.