A metric tensor approach to data assimilation with adaptive moving meshes

https://doi.org/10.1016/j.jcp.2022.111407Get rights and content

Highlights

  • Metric tensor (MT) characterization of adaptive meshes in higher space dimensions.

  • Form meshes for data assimilation based on combinations of different MTs.

  • Metric tensors of ensemble meshes and observation locations, and for localization.

  • Adaptive common mesh to support Local Ensemble Transform Kalman Filter.

  • Conservative, moving mesh, DG discretization for tests on 1D, 2D inviscid Burgers.

Abstract

Adaptive moving spatial meshes are useful for solving physical models given by time-dependent partial differential equations. However, special consideration must be given when combining adaptive meshing procedures with ensemble-based data assimilation (DA) techniques. In particular, we focus on the case where each ensemble member evolves independently upon its own mesh and is interpolated to a common mesh for the DA update. This paper outlines a framework to develop time-dependent reference meshes using locations of observations and the metric tensors (MTs) or monitor functions that define the spatial meshes of the ensemble members. We develop a time-dependent spatial localization scheme based on the metric tensor (MT localization). We also explore how adaptive moving mesh techniques can control and inform the placement of mesh points to concentrate near the location of observations, reducing the error of observation interpolation. This is especially beneficial when we have observations in locations that would otherwise have a sparse spatial discretization. We illustrate the utility of our results using discontinuous Galerkin (DG) approximations of 1D and 2D inviscid Burgers equations. The numerical results show that the MT localization scheme compares favorably with standard Gaspari-Cohn localization techniques. In problems where the observations are sparse, the choice of common mesh has a direct impact on DA performance. The numerical results also demonstrate the advantage of DG-based interpolation over linear interpolation for the 2D inviscid Burgers equation.

Introduction

Data assimilation (DA) has its origins in numerical weather prediction and is now employed in many scientific and engineering disciplines. DA combines noisy data, usually from instrumental uncertainty or issues with scale, with models that are imperfect, due to simplifying assumptions or other inaccuracies, to improve predictions about the past, current, or future state of a system. Typical DA techniques require a prior uncertainty and use the likelihood of observations to calculate the posterior distribution. This Bayesian context provides not only predictions but also quantification of the uncertainty in these predictions, which is commonly realized by employing an ensemble of solutions.

When the physical model employed within a DA technique is the discretization of a time dependent partial differential equation (PDE), the potential exists to increase both accuracy and efficiency of obtaining approximate solutions through the use of non-uniform, time evolving spatial meshes. Combining DA with adaptive moving mesh techniques presents both challenges and opportunities. The advantages include the accurate, efficient approximation of ensemble solutions, especially those with sharp, moving interfaces, and the improved approximation/interpolation of observational data at fixed or time-dependent locations. Time-dependent adaptive meshes provide the ability to take optimal advantage of observations by focusing DA in regions of the model ensemble solutions where the observational constraint is most beneficial. Regions of importance in both space and time, based upon different measures of sensitivity, can be highlighted to improve DA skill in those regions. At times in which observational data is used to update ensemble solutions, the use of a well designed adaptive mesh that supports this update has the potential to reduce, balance, or control the impact of different sources of errors and sensitivities.

Our contribution in this paper is to develop a framework and techniques to utilize adaptive meshing techniques for DA with application to time dependent PDE models in one and higher space dimensions. The specific techniques employed include the use of a metric tensor characterization of adaptive meshes that generalizes to higher space dimensions and allows for easily forming combinations of different meshes. We develop a common mesh for the update of ensemble solutions at observation times that combines properties of both an average mesh that supports the ensemble solutions and a mesh concentrated near the location of observations. Meshes are obtained from metric tensors using the moving mesh PDE (MMPDElab [15]) techniques to form well-conditioned finite element PDE discretizations. Testing is performed with a moving mesh, discontinuous Galerkin (DG) discretization of 1D, 2D inviscid Burgers' equations for solutions with sharp, moving interfaces, together with the associated conservative interpolation capabilities, to show the efficacy of the techniques developed here. Localization techniques are developed based on metric tensors and robust results are obtained with RMSEs that do not vary significantly when restricted to regions of large variation in the true solution. In this work the ensemble solutions evolve on their own spatial meshes that are allowed to evolve during the analysis cycle. While this may improve the accuracy of the ensemble solutions, improved vectorization and applicability may be obtained by forming meshes that are fixed between ensemble members and over each analysis cycle.

Central to the techniques developed here is the characterization of meshes through a metric tensor. An adaptive mesh, while not usually uniform in the standard Euclidean metric, can be viewed as uniform with respect to some other metric. This metric is defined by a positive-definite matrix valued (locally defined and of dimension equal to the spatial dimension) monitor function, also called a metric tensor. In the finite element context we consider, the local metric tensor determines the shape, size, and orientation of the element. We also make use of a way to combine two or more metric tensors that are supported on the same mesh. Geometrically, the metric tensor intersection is the same as circumscribing an ellipsoid on an element of two meshes and then finding an ellipsoid that resides in the geometric intersection of the first two ellipsoids. The new element given by that intersecting ellipsoid is the result of the metric tensor intersection.

Localization techniques that limit the sphere of influence of observations are a critical component in the success of modern DA techniques. Broadly speaking, localization schemes fall into two categories: domain localization [12], [29] and covariance (or R) localization [11]. Domain localization schemes define a spatial radius and use that to define which mesh points are affected by a given observation. Covariance localization schemes use a correlation function to modify the covariance matrix that is used in the DA update, so that the covariance between an observation and the solution values decays to zero as the distance between the observation and the solution values increases.

We develop a domain localization scheme that employs the metric tensor. Employing a fixed, uniform radius of influence for all observations as a localization scheme may not be effective if there is a steep gradient in the solution. One could predetermine the location of the gradient and adjust the localization scheme accordingly, but if the regions of large gradient are time-dependent, this will usually result in the tuned localization parameter being quite small. On the other hand, since the metric tensor provides information about the dynamics of the ensemble solution, it can be used to define an adaptive localization scheme where the localization radius can vary in time and space.

One benefit of using an adaptive moving mesh is that fewer mesh points can be used while still maintaining the same accuracy. Having an adaptive time-dependent common mesh allows for a fewer number of nodes used in the common mesh as compared to a fixed, fine common mesh, increasing the efficiency of the linear algebra, e.g., when updating the mean and covariance with an ensemble Kalman filter. An efficient implementation of the Ensemble Kalman Filter (EnKF) [10] requires O((D+Ne)D2+(M+D)Ne2) flops when DM or more generally, for example when DM, O((M+D+Ne)Ne2) (see, e.g., [27]) where D is the dimension of the observation space, M is the dimension of the discretized dynamical system, and Ne is the number of ensemble members. In large scale geophysical applications we typically desire Ne20 (in general Ne should be roughly the number of positive and neutral Lyapunov exponents). A reduction in M based upon using fewer mesh points while maintaining or enhancing accuracy results in improved efficiency.

There are several recent works on integrating adaptive spatial meshing techniques with DA, although most of the focus has been on PDE models in one space dimension. This includes methods based on evolving meshes based on the solution of a differential equation, methods in which meshes are updated statically based upon interpolation, and remeshing techniques that add or subtract mesh points as the solution structure changes. In an ensemble-based method, an adaptive mesh PDE discretization can be used for each of the ensemble members. However, the computation of the ensemble mean and covariance is greatly simplified if, e.g., through interpolation, all ensemble members are supported on the same spatial mesh. If the ensemble members' meshes can evolve independently via an adaptive moving mesh scheme, special care must be taken to calculate the mean and covariance at each DA step. Previous works have explored two general approaches to this problem. One approach is to interpolate the ensemble solutions to a common mesh at each observational time step and assimilate the PDE variables only. The common mesh approach is used here, as well as in [2], [9]. Another method is to assimilate both the PDE variables and the common mesh locations, as done in [5], [32]. In [5], [32], the state variables of the PDE were augmented with the position of the nodes and incorporated into a DA scheme. The test problem consisted of a two-dimensional ice sheet assumed to be radially symmetric; therefore, it reduced to a problem with one spatial dimension. In [2] and [9] common meshes were developed based on combining, through interpolation, the ensemble meshes. This allowed for update of mean and covariance for Kalman filter based DA techniques while allowing each ensemble member to evolve on its own independent mesh. That is, at each observational timestep, the ensemble members were interpolated to the common mesh, updated with the DA analysis, and then interpolated back to their respective meshes. Specifically, a uniform, non-conservative (remeshing allowed with number of mesh points potentially varying with time) mesh was used in [2], with Lagrangian observations in one spatial dimension. Higher spatial dimensions were used in [9], with a fixed common mesh refined near observation locations. The work [32] employs a 1D non-conservative adaptive meshing scheme as in [2] and extends this approach through the use of an adaptive common mesh, where, like in [5], the state vector is augmented with the node locations.

The outline of this paper is as follows. Background of data assimilation and adaptive moving mesh techniques is given in Section 2. This includes the framework we develop to include equations describing mesh movement within a DA framework. In Section 3 we summarize several DA techniques that will benefit from the adaptive meshing techniques developed here. The development of adaptive meshing techniques for DA is detailed in Section 4. Metric tensors are introduced and their connection to non-uniform meshes is discussed. Techniques for combining meshes based on metric tensor intersection and for concentrating ensemble mesh(es) near observation locations are developed. The details of our implementation are in Section 5. This includes the discontinuous Galerkin discretization we employ and the specific metric tensor formulation we use to adaptively evolve the ensemble mesh(es). The details of our experimental setup and numerical results for both 1D and 2D inviscid Burgers equations are presented in Section 6.

Section snippets

Data assimilation framework and the ensemble Kalman filter

Data assimilation techniques seek to combine models and data to improve predictions and quantify uncertainty typically in a Bayesian context (see, e.g., [6], [24], [31], [33]). Consider a finite dimensional discrete time system for a state vector uRM (M>0) that evolves based uponun+1=Ψ(un)+ξn,ξnN(0,Σ), where Ψ() propagates the state forward in time, n stands for the nth time step, and ξn is assumed to be a normally distributed model error with covariance matrix Σ and mean 0. Equation (1) can

Bayes' theorem and data assimilation techniques

Many DA techniques are based upon a Bayesian approach that determine the posterior distribution from the prior distribution and the likelihood. Given an observation yn at time tn and a prior estimate P(un) of the state, Bayes' Theorem states thatP(un|yn)P(yn|un)P(un). This procedure extends to the sequential assimilation of observations at multiple times under the assumption that the state is Markovian. Note that since the model noise {ξj}j=1N is independent and identically distributed

Development of adaptive moving mesh methods for DA

In this section we develop techniques for DA with adaptive moving meshes. These techniques are based upon the use of a metric tensor that describes the mesh. In particular, a non-uniform mesh is uniform with respect to the metric tensor being employed. The use of metric tensors is applicable not only in one spatial dimension, but also in higher spatial dimensions. We first describe the development of metric tensors using the so-called equidistribution and alignment conditions. Metric tensor

Defining mesh movement

The following is a summary of the moving mesh PDE (MMPDE) approach developed in [14], [16], [18], [19], [20]. The central idea of the MMPDE moving mesh method is to view any nonuniform mesh as a uniform one in some metric M; that is, the elements have a constant volume and are equilateral in the metric M. These conditions are called the equidistribution and alignment criteria; see (20) and (21). It has been shown that if a mesh begins as nonsingular (that is, the elements have positive volume),

Numerical results

The following presents the application of these methods to the one and two dimensional inviscid Burgers equations. We generate synthetic observations by sampling from a truth run, obtained by solving this equation on an adaptive moving mesh. The ensemble members are initialized as perturbations of the initial conditions. Efficacy of the DA scheme is measured by the root mean squared error (RMSE), which is calculated asRMSE1Mutruthu¯2, where u¯ is the analysis mean. In all the experiments

Discussion and conclusions

Through the use of an adaptive common mesh, we develop an ensemble based DA scheme where each of the ensemble members evolve independently on their own adaptive meshes. At each observational timestep, the ensemble members are interpolated to the adaptive common mesh, updated according to the DA scheme, and then interpolated back to their individual meshes.

We follow the MMPDE adaptive meshing strategy where the mesh of each ensemble member is determined by a matrix-valued monitor function, also

CRediT authorship contribution statement

Cassidy Krause: Conceptualization, Investigation, Software, Writing – review & editing. Weizhang Huang: Conceptualization, Methodology, Writing – review & editing. David B. Mechem: Conceptualization, Writing – review & editing. Erik S. Van Vleck: Conceptualization, Investigation, Methodology, Software, Supervision, Writing – review & editing. Min Zhang: Software, Validation.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References (36)

  • B. Bonan et al.

    Data assimilation for moving mesh methods with an application to ice sheet modelling

    Nonlinear Process. Geophys.

    (2017)
  • A. Carrassi et al.

    Data assimilation in the geosciences: an overview on methods, issues and perspectives

    Wires Clim. Change

    (2018)
  • A. Carrassi et al.

    Data Assimilation for Chaotic Dynamics

    (2022)
  • J. Du et al.

    Ensemble data assimilation applied to an adaptive mesh ocean model

    Int. J. Numer. Methods Fluids

    (2016)
  • G. Evensen

    Sequential data assimilation with a nonlinear quasi-geostrophic model using Monte Carlo methods to forecast error statistics

    J. Geophys. Res., Oceans

    (1994)
  • T.M. Hamill et al.

    Distance-dependent filtering of background error covariance estimates in an ensemble Kalman filter

    Mon. Weather Rev.

    (2001)
  • P.L. Houtekamer et al.

    A sequential ensemble Kalman filter for atmospheric data assimilation

    Mon. Weather Rev.

    (2001)
  • W. Huang

    Mathematical principles of anisotropic mesh adaptation

    Commun. Comput. Phys.

    (2006)
  • Cited by (1)

    • Feature-informed data assimilation

      2023, Journal of Computational Physics

    This research was supported in part by NSF grants DMS-1714195 and DMS-1722578.

    View full text