Spatial spread of COVID-19 outbreak in Italy using multiscale kinetic transport equations with uncertainty

In this paper we introduce a space-dependent multiscale model to describe the spatial spread of an infectious disease under uncertain data with particular interest in simulating the onset of the COVID-19 epidemic in Italy. While virus transmission is ruled by a SEIAR type compartmental model, within our approach the population is given by a sum of commuters moving on a extra-urban scale and non commuters interacting only on the smaller urban scale. A transport dynamic of the commuter population at large spatial scales, based on kinetic equations, is coupled with a diffusion model for non commuters at the urban scale. Thanks to a suitable scaling limit, the kinetic transport model used to describe the dynamics of commuters, within a given urban area coincides with the diffusion equations that characterize the movement of non-commuting individuals. Because of the high uncertainty in the data reported in the early phase of the epidemic, the presence of random inputs in both the initial data and the epidemic parameters is included in the model. A robust numerical method is designed to deal with the presence of multiple scales and the uncertainty quantification process. In our simulations, we considered a realistic geographical domain, describing the Lombardy region, in which the size of the cities, the number of infected individuals, the average number of daily commuters moving from one city to another, and the epidemic aspects are taken into account through a calibration of the model parameters based on the actual available data. The results show that the model is able to describe correctly the main features of the spatial expansion of the first wave of COVID-19 in northern Italy.


Introduction
The advent of the COVID-19 pandemic has caused a strong commitment of many researchers acting in different fields with the scope of trying to understand and give explanations to the global crisis we are experiencing. From the mathematical modeling point of view, several progresses have been done concerning the development of epidemic models capable of taking into account the different facets of this terrible disease. In particular, many recent researches have been addressed to the search of control strategies [2,3,33,53] to limit the spread and consequently hospitalizations and deaths, possibly reducing, at the same time, as much as possible the negative impact on the economy of the restrictive measures [5,26].
Most of the recently proposed model represent improvements at various levels of the seminal works on compartmental epidemiological modeling proposed originally by Kermack and McKendrick [37]. These approaches [2,6,15,21,30,34,44,53] are typically focused on the epidemiological aspects of the virus spread under the hypothesis of global mixing of the population, hence, without taking into account the role of the spatial component in the evolution of an outbreak.
Despite in many situations the above description is sufficient to delineate the global trend of an epidemic, there are cases in which the spatial homogeneity assumption does not hold true. In these situations, one seeks for local interventions in order to reduce the spread without eventually affecting regions where the incidence of the infection does not require special care, as for instance in the recent case of the COVID-19 in Italy [61]. Consequently, from the modelling point of view, the inclusion of the spatial dependence represents a key challenge [51].
Indeed, with the increasing amount of information on population mobility and the computational resources available today, the design and simulation of epidemic models based on partial differential equations (PDEs) that include the details of spatial dynamics can be considered a realistic goal. We recall that most of the existing epidemiological models taking into account spatial heterogeneities are based on reaction-diffusion equations [4,17,29,40,45,46,52,54,55,58]. Alternative modelling approaches are represented by the interaction of different homogeneous populations [33] or agent-based dynamics [31].
Most of these models do not allow for a clear distinction about the possible spatial behaviors of the population inside a given compartment. Consequently, although capable of originating realistic spatial patterns in situations where individuals move indiscriminately through the domain, such approaches are likely to be less effective in the case where one is interested in studying the spread of a virus in the human population. Under these circumstances, it is more realistic to consider only commuting individuals moving in major and preferred directions, not considering overall mass migration between distinct urban areas. Indeed, not all individuals move indiscriminately in the region of interest, as most of the population only interacts at the urban scale. Furthermore, in contrast to the use of diffusion models, the propagation rate of the infection along the spatial domain is obviously finite. Recently, trying to overcome some of the above criticisms, such as the paradox of the infinite propagation speed, alternative models based on hyperbolic PDEs have been proposed [7,9,10,14,22].
In this work, following the approach introduced in [14], we consider a realistic compartmental structure for the description of the epidemic dynamic of commuters and non commuters individuals in presence of uncertain data. The model consists of a system of kinetic transport equations describing a largescale (extra-urban) commuting population by a continuous density [16,20,36,47] in a two-dimensional environment. This density can be interpreted as the probability for an individual to be at a given location and move in a given direction at a given instant of time, in analogy to particle flows in rarefied gases [8,19,23,48,50,59]. The above system is coupled to a second system consisting of a set of diffusion equations that characterize the movement of the non-commuting population on a small (urban) scale. The epidemic spread is ruled by a Susceptible, Exposed, Infected, Asymptomatic and Removed (SEIAR) compartmental structure, in which both commuters and non commuters can interact. Using a suitable scaling process [14,43], the model allows us to highlight, firstly, the relationship with well-known existing approaches based on reaction-diffusion equations and, secondly, to pass naturally from a hyperbolic description in peripheral areas to a diffusive regime when reaching an urban area, with regard to the commuting population.
An important aspect concerns overcoming the limitation caused by standard deterministic models that rely on the assumption that the initial conditions, boundary conditions, and all involved epidemic and mobility parameters are known. However, as observed in the case of the COVID-19 epidemic, this assumption, especially in the early stages of the epidemic, is not reliable. For example, the initial conditions in terms of the number of infected and asymptomatic persons are certainly affected by uncertainty because data are limited and population screening cannot be error-free. Epidemic parameters, although normally estimated or calibrated, are also often candidates for being random variables in a realistic approach. Therefore, in order to take into account these limitations underlying deterministic models, in this paper we resort to a stochastic approach based on the introduction of random terms into the initial modeling [2,3,10,49].
Once the model is set up, its numerical solution on a computational domain describing a realistic geographic scenario poses several difficulties. In fact, the model consists of two coupled systems of PDEs, each characterized by five unknown functions living in a multidimensional space characterized by space, velocity, and stochastic variables. Additionally, we have to deal with the irregular shape of the spatial region and the multiscale nature of the dynamics (indeed, as previously stated, hyperbolic and parabolic behaviors coexist). Therefore, a particular care is devoted to the development of efficient and accurate numerical schemes. More precisely, a discretization of the system based on Gaussian quadrature points in velocity space [35,39] and a finite volume approach on unstructured grids [28,32] is considered. The adoption of asymptotic preserving time discretization techniques permits to avoid time step limitations introduced by the parabolic scaling without degradation of accuracy [11,12,25]. Finally, a non-intrusive stochastic collocation method, which guarantees spectral accuracy in the space of the uncertain parameters, is considered to deal with the uncertainty quantification process [10,60].
The rest of the paper is organized as follows. In Section 2 the mathematical model is introduced. We first introduce the kinetic transport formulation for the epidemic compartments of commuters together with the corresponding diffusive dynamic of the non commuters in a deterministic setting. Subsequently, we link the two hyperbolic/parabolic dynamics through a formal passage to the limit for the system of commuters. A definition of the basic reproduction number of the epidemic for the resulting model is also reported. Next, we illustrate how to generalize the model in presence of uncertainty. The details of the numerical scheme used to approximate the resulting stochastic system are summarized in Appendix A.1. Section 3 is devoted to present an application of the current modelling to the first outbreak of COVID-19 in Italy and its spread in the Lombardy Region. The capability of the model to accurately represent the first wave of the COVID-19 epidemic in Italy is discussed in detail through comparisons with recorded data reported by official sources [64]. Conclusions and future perspectives are finally given in Section 4.

A compartmental kinetic transport model
Let Ω ⊂ R 2 characterize a two-dimensional geographical area of interest and assume that individuals have been separated into two different groups: a commuting population, typically moving over long distances (extra-urban), and a non-commuting population, moving only in small-scale urban areas. In the first part of this Section, for ease of presentation, the multiscale kinetic transport model is introduced in a deterministic setting. The relation between the current hyperbolic transport model and classical diffusion models is discussed in the second part. In the third part, details regarding the basic reproduction number associated with the model are given. Finally, in the last part, we discuss the generalization of the deterministic model to the case where uncertainty is taken into account.

Characterizing commuter and non commuter dynamics
We consider a population of commuters at position x ∈ Ω moving with velocity directions v ∈ S 1 and denote the respective kinetic densities of susceptible (individuals who may be infected by the disease) by f S = f S (x, v, t), exposed (individuals in the latent period, which are not yet infectious) by The kinetic distribution of commuters is then given by and their total density is obtained by integration over the velocity space As a consequence, one can recover the number of susceptible, exposed and recovered irrespective of their direction of displacement by integration over the velocity space. This gives which we refer to as the density fractions of non-infectious individuals, whereas are the density fractions of infectious individuals. In this setting, the kinetic densities of the commuters satisfy the transport equations where the total densities of non infected individuals are defined by and similarly the total densities of infected by In the above equations, S u (x, t), E u (x, t), I u (x, t), A u (x, t), R u (x, t) are the density fractions of noncommuters who, by assumption, move only on an urban scale. These densities satisfy a diffusion dynamic acting only at the same local scale In the resulting model (1)-(2) that couples the commuting and non-commuting dynamics, the velocities v i = λ i (x)v in (1), as well as the diffusion coefficients D u i = D u i (x) in (2), with i ∈ {S, E, I, A, R}, are designed to take into account the heterogeneity of geographical areas, and are thus chosen dependent on the spatial location. Similarly, also the relaxation times τ i = τ i (x), i ∈ {S, E, I, A, R} are space dependent. The quantities γ I = γ I (x) and γ A = γ A (x) are the recovery rates of symptomatic and asymptomatic infected (inverse of the infectious periods), respectively, while a(x) represents the inverse of the latency period and σ(x) is the probability rate of developing severe symptoms [15,33,53].
The transmission of the infection is governed by the incidence functions F I (·, I T ) and F A (·, A T ). We assume local interactions to characterize the nonlinear incidence functions [18,42] F I (g, I T ) = β I gI p where the classic bi-linear case corresponds to p = 1, k I = k A = 0. Parameters β I = β I (x, t) and β A = β A (x, t) characterize the contact rates of highly symptomatic and mildly symptomatic/asymptomatic infectious individuals, accounting for both the number of contacts and the probability of transmission. Hence, they may vary based on the effects of government control actions, such as wearing of masks, shutdown of specific activities or lockdowns [2,34,37]. On the other hand, parameters κ I = κ I (x, t) and κ A = κ A (x, t) are the incidence damping coefficients based on the self-protective behavior assumed by the individuals due to the awareness of the epidemic risk [10,30,58]. Alternative incidence functions are given by whereΩ is a chosen portion of the domain which permits to take into account the fact that social distancing may depend on the average level of infection of the region rather than only on the local situation. The resulting model (1)-(2) will be referred to as multiscale kinetic SEIAR (MK-SEIAR) model. Note that, because of the presence of two populations acting at different scales, the model allows a more realistic description of the typical commuting dynamic involving only a fraction of the population and distinguishes it from the epidemic process affecting the entire population.

Commuters behavior in urban areas
The hyperbolic transport model for the commuters deserves some remarks. In fact, while it is clear that a hyperbolic description permits to describe correctly the daily extra-urban commuting part, the same individuals when moving inside the urban area are better described by a traditional diffusion model. A remarkable feature of the transport model (1), is that it permits to recover a classical diffusion behavior under the hypothesis that the relaxation times τ S,I,R tend to zero while keeping finite the diffusion coefficients More precisely, let us introduce the flux functions Then, integrating system (1) in v, we get the following set of equations for the macroscopic densities of commuters whereas the flux functions satisfy Clearly, the above system is not closed because the evolution of the fluxes in (7) involves higher order moments of the kinetic densities. The diffusion limit can be formally recovered, by introducing the space dependent diffusion coefficients (5) and letting τ S,I,R → 0. We get from the r.h.s. in (1) and, consequently, from (7) we recover Fick's law and similarly for the other densities. Thus, substituting (8) into (6) we get the diffusion system for the population of commuters [46,52,57] which is coupled with system (2) for the non-commuting counterpart. The capability of the model to account for different regimes, hyperbolic or parabolic, accordingly to the space dependent relaxation times τ i , i ∈ {S, E, I, A, R}, makes it suitable for describing the dynamics of human beings. Indeed, it is clear that the daily routine is a complex mixing of individuals moving at the scale of a city and individuals moving among different urban centers. In this situation, due to the lack of microscopic information and the high complexity of the dynamics, it is reasonable to avoid describing the details of movements within an urban area and model this through a diffusion operator. On the other hand, commuters when moving from one city to another follow well-established connections for which a description via transport operators is more appropriate.

Basic reproduction number
The standard threshold of epidemic models is the well-known reproduction number R 0 , which defines the average number of secondary infections produced when one infected individual is introduced into a host population in which everyone is susceptible [37] during its entire period of infectiousness. This number determines when an infection can invade and persist in a new host population. For many deterministic epidemic models, an infection begins in a fully susceptible population if and only if R 0 > 1. Its definition in the case of spatially dependent dynamics is not straightforward, particularly when considering its spatial dependence. In the following, assuming no inflow/outflow boundary conditions in Ω, integrating over velocity and space, we derive the following definition for the average reproduction number value on the domain Ω The derivation of the above expression for R 0 (t), computed following the next-generation matrix approach [24], is presented in detail in [10] using a suitable linearization of the corresponding nonlinear process for the space averaged quantities. It is worth to underline that from definition (10) it can be deduced that it is a combination of the growth of E T , I T and A T that determines the persistence of the epidemic, not solely the growth of E T in time, neither the growth of the simple sum E T + I T + A T . If, additionally, compartments I and A are considered homogeneously mixed in a unique compartment, allowing β I = β A = β, κ A = κ I = κ and γ I = γ A = γ, we recover a SEIR-type compartmental model and the reproduction number results as in [14]: Let us finally observe that, under the same no inflow/outflow boundary conditions, integrating in Ω equations (1) and (2) yields respectively the conservation of the total populations of commuters and non-commuters

Including data uncertainty
To extend the model (1)-(2) to the case in which uncertainties are taken into account, let us suppose that the population of commuters depend on an additional random vector where z 1 , . . . , z d are independent random variables. This vector is used to characterize possible sources of uncertainty in the physical system due to lack of information on the actual number of infected or specific epidemic characteristics of the infectious disease. Thus, in the system we have the following high-dimensional unknowns The same considerations apply to the non-commuter population, yielding Notice that besides the introduction of a new vector of variables, the structure of the model (1)-(2) does not change, i.e. there is no direct variation of the unknowns with respect to z, which, instead, have to be intended as parameters into the equations. We will further assume that also some epidemic parameters are affected by uncertainty. Therefore, for instance, parameters acting inside the incidence function may have an additional dependence of the form In the next section, we discuss several numerical examples based on the model (1)-(2) with uncertainty. We will consider that the initial number of detected infected I, derived from the data at disposal [62,64], represents only a lower bound while the true values are not known but affected by uncertainty. Consequently, also initial conditions of exposed E, asymptomatic A and susceptible S will contain a stochastic dependence.

Application to COVID-19 spread in Italy
To validate the proposed methodology in a realistic geographical and epidemiological scenario, a numerical test reproducing the epidemic outbreak of COVID-19 in the Lombardy Region of Italy, from February 27, 2020 to March 22, 2020, is designed, taking into account the uncertainty underlying initial conditions of infected individuals. We underline that the discretization of the multiscale system of PDEs (1)- (2) presented in this work is not trivial and requires the construction of a specific numerical method able to correctly describe the transition from a convective to a diffusive regime in realistic geometries. Additionally the method should be capable to deal efficiently with the uncertainty characterized by the stochastic nature of the model. Details on these numerical aspects are given in Appendix A.1 together with the tables of the data used for population mobility in Appendix A.2.

Computational setup
The computational domain is defined in terms of the boundary that circumscribes the Lombardy Region, which is available in [62] as a list of georeferenced points in the ED50/UTM Zone 32N reference coordinate system. To avoid ill-conditioned reconstruction matrices and other related problems that arise when dealing with large numbers in finite arithmetic, all coordinates are re-scaled by a factor of 10 6 . The resulting computational grid is composed of N E = 10792 triangular control volumes. The mesh grid is presented in Figure 1a To avoid the mobility of the population in the entire territory and to simulate a more realistic geographical scenario in which individuals travel along the main traffic paths of the Region, different values of propagation speeds are assigned in the domain which reflect, as close as possible, the real characteristics of the territory. Along the main connections of the Region, a mean value of λ = 0.04 is prescribed for compartments S, E, A and R, which ensures a maximum travel distance of 80 km within a day; while, for the same compartments, λ = 0.02 is ulterior fixed in the urbanized circles. A spatial width of h = 0.5 km is assigned to the traveling paths. On the other hand, assuming that highly infectious subjects are mostly detected in the most optimistic scenario, being subsequently quarantined or hospitalized, the speed assigned to compartment I is set null. However, the infected people, even if limited by quarantines and social distancing, can still contribute to the spread the disease via the diffusion process at the urban scale (mimicking for instance the still possible infections happening at the family level). A null value λ = 0 is set in the rest of the computational domain for all the epidemiological compartments. The resulting distribution of the characteristic speeds is visible from Figure 1c.
The relaxation time is set τ i = 10 4 for i ∈ {S, E, I, A, R}, so that the model recovers a hyperbolic regime in the entire region, apart from the main cities, where a parabolic setting is prescribed to correctly capture the diffusive behavior of the disease spreading which typically occurs in highly urbanized zones. Hence, in the urban area of each city the following relaxation time τ c,i is prescribed: with a diffusive relaxation time chosen to be τ 0 = 10 −4 . The resulting distribution of the relaxation times is presented in Figure 1d.

Uncertain initial data and epidemic parameters
Considering p c the number of citizens of a generic city (province) denoted with subscript c, the initial spatial distribution of the generic population f (x, y) is assigned, for each province and each epidemiological compartment, as a multivariate Gaussian function with the variance being the radius of the urban area r c : with (x c , y c ) representing the coordinates of a generic city center. The initial population setting, for each province of the Lombardy Region, is taken from [63] and reported in Table 1, Appendix A.2. Note that, the radius r c associated to each city, defined in Table 1 (first column), permits to exactly re-obtain the population p c when integrating over the computational domain the initial spatially distributed population.
Since at the beginning of the pandemic, tracking of positive individuals in Italy was very scarce, in this numerical test we consider that the initial amount of infected people is the leading quantity affected by uncertainty. To this aim, we introduce a single source of uncertainty z having uniform distribution, z ∼ U(0, 1) so that the initial conditions for compartment I, in each control volume, are prescribed as with I T,0 initial amount of highly infectious corresponding to the values reported by February 27, 2020 in the GitHub repository [64] daily updated by the Civil Protection Department of Italy for each city, listed in the last column of Table 1. This choice effectively associates all infected individuals detected with the I compartment, as a result of the screening policy adopted during February-March 2020 in Italy. In fact, tests to assess the presence of SARS-CoV-2 were performed almost exclusively on patients with consistent symptoms and fever at the beginning of this pandemic. Regarding the uncertainty of these data, we impose µ = 1, assuming that at least half of the actual highly symptomatic infected were detected at the beginning of the pandemic outbreak. For all the cities with zero infected detected by February 27, 2020 (e.g., Mantua, Varese, Como and Lecco), we choose to fix I T,0 = 1 in order to assign an effective uncertainty.
Based on the estimations reported in [10], the expected initial amount of exposed E T,0 and asymptomatic/mildly symptomatic individuals A T,0 is imposed so that E T,0 = 10 I T,0 and A T,0 = 9 I T,0 in each location. Therefore, also initial conditions for compartments E, A and S become stochastic, depending on the initial amount of severe infectious at each location: Finally, removed individuals are initially set null everywhere in the network, R T (0, z) = 0.0. To properly subdivide the population in commuters and non-commuters, regional mobility data are considered. In particular, the matrix of commuters presented in Table 2, Appendix A.2, reflects mobility data provided by the Lombardy Region for the regional fluxes of year 2020 [65], which is in agreement with the one derived from ISTAT data released in October, 2011, as also confirmed in [33]. Therefore, to each control volume, the total percentage of commuters referred to the province where it is located is assigned, and non-commuting individuals are computed as a result of conservation principles. From Table 2 it can be noticed that some connections are not taken into account simply because the amount of commuters along these routes is negligible if compared to the amount of individuals traveling in the other paths and with respect to the dimension of the populations.
From the first day simulated in this test, the population was aware of the risk associated with COVID-19 and recommendations such as washing hands often, not touching their faces, avoiding handshakes and keeping distances had already been disseminated by the government, hence, we initially fix coefficients k I = k A = 50.
The initial value of the transmission rate of asymptomatic/mildly infectious people is calibrated as the result of a least square problem with respect to the observed cumulative number of infected I T (t), through a deterministic SEIAR ODE model set up for the whole Lombardy Region, which provide the estimate β A = 0.58×10 −3 . As previously mentioned, since we are assuming that highly infectious subjects are mostly detected in the most optimistic scenario, being subsequently quarantined or hospitalized, the transmission rate of I is set β I = 0.03 β A , as in [10,15,33]. Finally, in the incidence function we fix p = 1.
With this parametric setup, we obtain an initial expected value of the basic reproduction number for the Region, evaluated as from definition (10), R 0 (0) = 3.2, which is in agreement with estimations reported in [10,15,33,56]. Nevertheless, with the proposed methodology it is possible to present the heterogeneity underlying the basic reproduction number at the local scale, as shown in Figure 2 for µ = 0, together with the initial global amount of infected people E T,0 (x, y) + I T,0 (x, y) + A T,0 (x, y) present in the domain.
To model the effects of the lockdown imposed by the government from March 9, 2020 in north of Italy [61], from that day the transmission rate β A is reduced by 50%, also increasing the coefficients k I = k A = 80 as a result of the population becoming increasingly aware of the epidemic risks. In addition, the percentage of commuting individuals is reduced by 60% for each compartment according to mobility data tracked through GPS systems of mobile phones and made temporarily available by Google [1,56]

Results and discussion
Numerical results of the test are reported in Figures 3-7. In Figure 3, the expected evolution in time of the infected individuals, together with 95% confidence intervals, is shown for exposed E, highly symptomatic       subjects I and asymptomatic or weakly symptomatic people A, for each city of the Lombardy Region.
Here it is already appreciable the heterogeneity of the diffusion of the virus. Indeed, from the different y-axis scales adopted for the plot of the provinces, it can be noticed that Milan, Bergamo and Brescia present a consistently higher contagion. From Figure 4 it can be observed that the lower bound of the confidence interval of the cumulative amount in time of highly symptomatic individuals is in line with data reported by the Civil Protection Department of Italy [64]. As expected, due to the uncertainty taken into account, the mean value of the numerical result in each city is higher than the registered one.
The comparison between the expected evolution in time of the cumulative amount of severe infectious with respect to the effective cumulative amount of total infectious people, including asymptomatic and Numerical results presented in terms of integrated variables for the whole Lombardy are shown in Figure 6, together with the expected temporal evolution of the reproduction number R 0 (t), again with 95% confidence bands. It is here highlighted that the drop of R 0 (t) on March 9 reflects the imposition of lockdown restrictions, as presented in the previous Section. Moreover, in this plot, results are reported starting from February 28, instead of February 27, to permit to the system to achieve a correct initialization of the commuters (who are totally placed in the origin location at the beginning of the simulation) during the first day simulated.
In Figure 7, final expectation and variance of the cumulative amount of infected people, namely E T + A T + I T , are reported in the 2D framework (first row). If comparing Figure 7a with 2a, it can be noticed that, at the end of March, the virus is no longer majorly affecting the province of Lodi and Cremona, but has been spread arriving to hit most of all Brescia, Milan and Bergamo. Finally, in the second row of Figure 7, the expectation of the susceptible population S T on the initial day simulated (February 27, 2020) is compared with the one obtained at the end of the simulation (March 22, 2020).
Here it can be verified that the majority of the population does not leave their home city, as per real behavior, but there is only a small percentage of commuters who move within the domain, along the prescribed routes. Similar results are obtained for compartments E and A, whose commuting part, even though small, strongly contributes to the spatial spread of the epidemic.

Conclusions
In this paper we introduced a realistic model for the spatial spread of a virus with a focus on the case of COVID-19. Unlike models currently in the literature, which typically ignore spatial details or alternatively introduce them as simple diffusive dynamics, in our approach we have tried to capture the essential characteristics of the movements of individuals, which are very different if we consider commuting individuals who for work reasons move over long distances, from one city to another, to individuals who instead carry out their activities on an urban scale. The separation of individuals into these two classes, and the use of different spatial dynamics characterized by appropriate systems of transport and diffusive equations, allows in particular to avoid mass migration phenomena typical of models based on a single population and the instantaneous propagation of infectious disease over long distances. From the epidemiological point of view, modeling is developed in a compartmental context described by a SEIARtype framework capable of describing the effect of exposed and asymptomatic individuals within the spatial spread of the disease. In addition, given the high uncertainty on the actual amount of individuals in the various compartments able to propagate the infection, the model was developed taking into account the presence of stochastic variables that therefore require an appropriate process of quantification. The resulting multiscale system of partial differential equations was then solved on realistic spatial geometries by a numerical method combining finite volume IMEX techniques for the deterministic part, with a non-intrusive collocation approach for the stochastic component. After a careful calibration of the model parameters based on the available data, an in-depth analysis of the results is reported in the case of the initial phase of the spread of COVID-19 in Italy occurred in the Lombardy region. The results show the ability of the model to correctly describe the epidemic dynamics and the importance of a heterogeneous spatial description and of the inclusion of stochastic parameters.

Acknowledgments
This work has been written within the activities of the GNCS group of INdAM (National Institute of High Mathematics). The support of MIUR-PRIN Project 2017, No. 2017KKJP4X "Innovative numerical methods for evolutionary partial differential equations and applications" is acknowledged.

A Supplementary material
In this appendix, we report the details of the numerical scheme adopted to approximate the MK-SEIAR system (1)-(2) together with the data tables concerning the details of the population and of the commuting flows.

A.1 Numerical method
We first give the details of the method in the case in which uncertainty is not present and successively we explain how the system (1)-(2) is solved in the case of stochasticity. For the commuters, the numerical scheme for the deterministic case is based on a discrete ordinate method in velocity in which the even and odd parity formulation is employed [25,39,41]. The details of such approach are given in A.1.1. Then a finite volume method working on two-dimensional unstructured meshes [13,32] for the discrete ordinate approximation of the commuters is introduced in A.1.2. The full discretization of the equations (1) is obtained through the use of suitable IMEX Runge-Kutta schemes [11,12]. In particular, the above choices permit to obtain a scheme which consistently captures the diffusion limit from the kinetic system when the scaling parameters τ S,I,R tends toward zero. This part of the method is discussed in A.1.3. Finally, the discretization of the stochastic part for the system (1)-(2) when uncertainty is present is explained in A.1.4.

A.1.1 Even and odd parities formulation
We rewrite (1) by using the so-called even and odd parities formulation. In other words, we denote v = (η, ξ) ∈ S 1 and then we obtain four equations with non-negative ξ, η ≥ 0 for each compartment of the commuters. The change of variables reads, omitting the time and space dependence for simplicity, as [39] r (1) while for the scalar fluxes one has with i = S, E, I, A, R. An equivalent formulation with respect to (1) can then be obtained thanks to this change of variables and reads as and One can observe that due to symmetry, we need to solve these equations for ξ, η in the positive quadrant only. Thus the number of unknowns in (1) and (12)- (13) is effectively the same.

A.1.2 Space discretization on unstructured grids
We consider now a spatial two-dimensional computational domain Ω which is discretized by a set of non overlapping polygons P i , i = 1, . . . N p . Each element P i exhibits an arbitrary number N Si of edges e j,i where the subscripts indicates that this is the edge shared by elements P i and P j . The boundary of the cell is consequently given by ∂P i = N S i j=1 e ji . The governing equations for the commuters rewritten in the odd and even formulation are then discretized on the unstructured mesh by means of a finite volume scheme which is conveniently rewritten in condensed form as where Q is the vector of conserved variables is the linear flux tensor and S(Q) represents the stiff source term defined in equations (12)- (13). As usual for finite volume schemes, data are represented by spatial cell averages, which are defined at time t n as where |P i | denotes the surface of element P i . A first order in time finite volume method is then obtained by integration of the governing system (14) over the space control volume |P i | Higher order in space is then achieved by substituting the cell averages by piecewise high order polynomials. We refer to these polynomial reconstructions to as w i (x) which are obtained from the given cell averages (15). In particular, our choice is to rely on a second order Central WENO (CWENO) reconstruction procedure along the lines of [13,27]. We omit the details for brevity. The numerical flux function F ij · n ij is given by a simple and robust local Lax-Friedrichs flux yielding where w + i,j , w − i,j are the high order boundary extrapolated data evaluated through the CWENO reconstruction procedure. The numerical dissipation is given by s max which is the maximum eigenvalue of the Jacobian matrix in spatial normal direction, Let us notice now that in the diffusion limit, i.e. as (τ S , τ I , τ R ) → 0, the source term S(Q) becomes stiff, therefore in order to avoid prohibitive time steps we need to discretize the commuters system implicitly. To this aim, a second order IMEX method which preserves the asymptotic limit given by the diffusion equations (9) is proposed and briefly described hereafter.

A.1.3 Time integration and numerical diffusion limit
We consider again system (1) formulated using the parities (12)- (13). We also assume τ S,I,R = τ and rewrite (12)- (13) in partitioned form as in which I , r (2) I , j (2) A , j A , j and f (u), g(u), E(v) are defined similarly. Now, following [12], the Implicit-Explict Runge-Kutta (IMEX-RK) approach appid to system (19) reads as where u (k) , v (k) are the so-called internal stages. The numerical solution reads In the above equations, the matricesÃ = (ã kj ), withã kj = 0 for j ≥ k, and A = (a kj ), with a kj = 0 for j > k are s × s matrices, with s number of Runge-Kutta stages, defining respectively the explicit and the implicit part of the scheme, and vectorsb = (b 1 , ...,b s ) T and b = (b 1 , ..., b s ) T are the quadrature weights. Furthermore, we choose the Runge-Kutta scheme in such a way that the following relations hold true a kj = b j , j = 1, . . . , s,ã kj =b j , j = 1, . . . , s − 1.
The scheme (21)- (22) treats implicitly the stiff terms and explicitly all the rest. Moreover, one can prove that the above scheme is a consistent discretization of the limit system in the diffusive regime. In fact, assuming for simplicity D S,I,R independent from space, the second equation in (21) can be rewritten as therefore, assuming (5), the limit τ → 0 yields in v (j) , i.e. we restore perfect symmetry in direction of propagation of the information. Using now the identity u (j) = U (j) into the first equation in (21) we get and using (24) into (25) thanks to the definitions of f and g gives Finally, integrating over the velocity field one has and thus, the internal stages correspond to the stages of the explicit scheme applied to the reactiondiffusion system (9). To conclude the proof one has to notice that thanks to the choice (23), the last stage is equivalent to the numerical solution. Thus, this is enough to guarantee that the scheme is a consistent discretization of the limit equation. Note that the limit system is consistent with the discretization of the non commuters diffusive system (2). In this latter case, we adopt the same finite volume setting for the unknowns

This simply reads
with Then, the same CWENO reconstruction and the same numerical local Lax-Friedrichs flux is employed where however, we account for a dissipation proportional to the diffusive terms. In other words, the numerical viscosity is given by the maximum eigenvalue of the viscous operator s V max = max (D u S , D u E , D u I , D u A , D u R ). Concerning the time discretization, the explicit part of the Runge-Kutta scheme introduced in the previous paragraph is employed.

A.1.4 Stochastic collocation method
In the case in which we deal with the stochastic system (1)-(2), we employ a generalized Polynomial Chaos (gPC) expansion technique [38,60]. We restrict to the case in which there is only one stochastic variable z in the system. The extension to the case of a vector of random variables is straightforward. The probability density function of the single random input is supposed known: ρ z : Γ → R + . In this case, the approximate solution for the commuters Q M (x, v, t, z) and the non commuters Q u M (x, t, z) are represented as truncations of the series of the orthonormal polynomials describing the random space, i.e.
where M is the number of terms of the truncated series and φ j (z) are orthonormal polynomials, with respect to the measure ρ z (z) dz. The expansion coefficients are obtained bŷ Q j (x, v, t) = Γ Q(x, v, t, z) φ j (z) ρ z (z) dz,Q u j (x, t) = Γ Q u (x, t, z) φ j (z) ρ z (z) dz, j = 1, . . . , M. (30) Then, the exact integrals for the expansion coefficients in Eq. (30) are replaced by a suitable quadrature formula characterized by the set {z n , w n } Np n=1 , where z n is the n-th collocation point, w n is the corresponding weight and N p represents the number of quadrature points. We then havê where Q(x, v, t, z n ) and Q u (x, t, z n ) with n = 1, . . . , N p are the solutions of the problem evaluated at the n-th collocation point for the commuters and non commuters. Thanks to the computation of the above coefficients than it is possible to compute all quantities of interest concerning the random variable. For example, the expectations are approximated as and In the same way, all other quantities of interest such as the variance of Q and Q u can be computed.

A.2 Population and mobility data
In this appendix we report tables containing data on the initial distribution of populations in the various urban areas considered in the Lombardy region, see Table 1, and data on the corresponding flows of commuters between cities, see Table 2. Table 1: Setting of the Lombardy provinces: urban radius r c , total inhabitants M and initial amount of highly infectious individuals I T,0 , detected by February 27, 2020 (initial day of the simulation). The total population is given by ISTAT data of December 31, 2019 [63], while data of highly infectious correspond to those reported in the GitHub repository of the Civil Protection Department of Italy [64]. Null values, listed with * , in the simulation are substituted with 1 to permit an effective assignation of uncertain initial condition.  provinces are reported in the following columns. Each entry is given as number of commuters of the origin province. The last column shows the amount of total commuters of the origin province and the corresponding percentage with respect to the total population of the city. This matrix is extracted from the origin-destination matrix provided by the Lombardy Region for the regional fluxes of year 2020 [65].