Skip to content
BY 4.0 license Open Access Published by De Gruyter March 19, 2022

Decomposition of the total effect for two mediators: A natural mediated interaction effect framework

  • Xin Gao , Li Li and Li Luo EMAIL logo

Abstract

Mediation analysis has been used in many disciplines to explain the mechanism or process that underlies an observed relationship between an exposure variable and an outcome variable via the inclusion of mediators. Decompositions of the total effect (TE) of an exposure variable into effects characterizing mediation pathways and interactions have gained an increasing amount of interest in the last decade. In this work, we develop decompositions for scenarios where two mediators are causally sequential or non-sequential. Current developments in this area have primarily focused on either decompositions without interaction components or with interactions but assuming no causally sequential order between the mediators. We propose a new concept called natural mediated interaction (MI) effect that captures the two-way and three-way interactions for both scenarios and extends the two-way MIs in the literature. We develop a unified approach for decomposing the TE into the effects that are due to mediation only, interaction only, both mediation and interaction, neither mediation nor interaction within the counterfactual framework. Finally, we compare our proposed decomposition to an existing method in a non-sequential two-mediator scenario using simulated data, and illustrate the proposed decomposition for a sequential two-mediator scenario using a real data analysis.

MSC 2010: 62P10

1 Introduction

Mediation analysis has become the technique of choice to identify and explain the mechanism that underlies an observed relationship between an exposure or treatment variable and an outcome variable via the inclusion of intermediate variables, known as mediators. Decompositions of the total effect (TE) of the exposure into effects characterizing mediation pathways and interactions help researchers understand the effects through different mechanisms and have gained much attention in the literature and application in the last decade [1,2, 3,4,5, 6,7,8, 9,10]. In our motivating example, we are interested in the effects of drinking alcohol on systolic blood pressure (SBP) via the mediators, body mass index (BMI), and gamma-glutamyl transferase (GGT), and their interaction effects. Besides, the mediator BMI is previously reported to affect GGT and not vice versa, and hence the two mediators are causally sequential [3]. Current developments in this area for scenarios considering two mediators have primarily focused on either decomposition without interaction components, or decomposition allowing interactions but assuming no causally sequential order between the mediators [3,4,9]. Daniel et al. [3] and Steen et al. [4] discussed the decompositions in a general framework with causally sequential mediators; however, their decompositions do not include interaction components. Bellavia and Valeri [9] proposed a decomposition with components describing interactions, but they assumed these mediators are causally non-sequential. Taguri et al. [10] also considered scenarios with multiple mediators that are causally non-ordered, in which they developed a novel component termed “mediated interaction” (MI).

In this work, we develop decomposition methods for the scenarios when the two mediators are causally sequential and the interaction effects among the mediators and exposure possibly exist. Our approach also applies to a non-sequential two-mediator scenario. We present a unified approach for decomposing the TE into the components that are due to mediation only, interaction only, both mediation and interaction, neither mediation nor interaction within the counterfactual framework. Our decomposition methods are motivated by vanderWeele’s four-way decomposition [7] of the TE with one mediator, where the interaction effects include a reference interaction effect for interaction only and an MI effect for both mediation and interaction. VanderWeele [7] emphasized that these additive interaction terms are often considered of the greatest public health importance [11,12]. We also propose a new concept called natural MI effect for describing the two-way and three-way interactions in two-mediator scenarios that extend the MI from VanderWeele’s work [7]. Since the causal structures are more complex with two mediators, the decompositions have multiple terms for mediation only, interaction only, and both mediation and interaction. Identifiability issues appear in the presence of time-varying confounders, which will be naturally introduced by the mediators in a sequential structure [13,14]. We lay out the identification assumptions and provide identifiable counterfactual formulas in our proposed decomposition [15].

When the two mediators are casually non-sequential, our decomposition uses a different approach from what was proposed by Bellavia and Valeri [9]. For example, their population-averaged MI effect between A and M 1 is evaluated with M 2 fixed at a certain level while our natural MI effect between A and M 1 provides a natural interpretation and is essentially a weighted MI effect where the weights are determined by the distribution of M 2 in the population.

The rest of the article is organized as follows: Section 2 reviews VanderWeele’s four-way decomposition; Section 3 presents decompositions of TE for two-mediator scenarios; Section 4 relates the components of our proposed decompositions to the traditional definitions; Section 5 lays out identification assumptions and gives the empirical and regression-based formulas for computing each component in the decomposition with two causally sequential mediators; Section 6 presents a simulation study and real data analysis; and Section 7 concludes the article with discussions.

2 Decomposition of the TE in a single-mediator scenario

2.1 Counterfactual definitions

Consider a single-mediator scenario in Figure 1. Counterfactual formulas give the potential value of outcome Y or mediator M that would have been observed if the exposure A or mediator M were fixed at a certain level [8,16,17]. Let Y ( a ) denote the potential value of Y that would have been observed if the exposure A were fixed at a constant level a [8]. Similarly, M ( a ) denotes the potential value of M that would have been observed if A were fixed at a and Y ( a , m ) denotes the potential value of Y that would have been observed if A and M were fixed at a and m , respectively [8]. A nested counterfactual formula Y ( a , M ( a ) ) denotes the potential value of Y that would have been observed if the exposure were fixed at a and the mediator M were set to what would have been observed or potential value when the exposure were fixed at a (Figure 2) [8].

Figure 1 
                  Directed acyclic graph of a single-mediator scenario.
Figure 1

Directed acyclic graph of a single-mediator scenario.

Figure 2 
                  Graphical illustration of the nested counterfactual formula 
                        
                           
                           
                              Y
                              
                                 (
                                 
                                    a
                                    ,
                                    
                                       
                                          M
                                       
                                       
                                          1
                                       
                                    
                                    
                                       (
                                       
                                          
                                             
                                                a
                                             
                                             
                                                ∗
                                             
                                          
                                       
                                       )
                                    
                                 
                                 )
                              
                           
                           Y\left(a,{M}_{1}\left({a}^{\ast }))
                        
                     .
Figure 2

Graphical illustration of the nested counterfactual formula Y ( a , M 1 ( a ) ) .

2.2 Two-way decomposition

The TE of the exposure A for an individual is defined as the difference between Y ( a ) and Y ( a ) [8], where a and a are the treatment and reference level of the exposure A , respectively. The classical decomposition of the TE has two components: natural direct effect (NDE) and natural indirect effect (NIE) [8,17,18]. NDE represents the causal effect along the direct path from A to Y and NIE represents the causal effect along the indirect path from A through M to Y . The effects are defined using the following formulas:

TE = Y ( a ) Y ( a ) = Y ( a , M ( a ) ) Y ( a , M ( a ) ) = Y ( a , M ( a ) ) Y ( a , M ( a ) ) + Y ( a , M ( a ) ) Y ( a , M ( a ) ) , NDE = Y ( a , M ( a ) ) Y ( a , M ( a ) ) , NIE = Y ( a , M ( a ) ) Y ( a , M ( a ) ) .

The second equality of TE follows by the composition axiom [8,15] and the third equality of TE follows by subtracting and adding the same counterfactual formula Y ( a , M ( a ) ) . NDE is the difference in the potential value of outcome when A goes from a to a and M is at its potential value M ( a ) . NIE is the difference in the potential value of outcome had M gone from M ( a ) to M ( a ) while A is at its treatment level a . In the literature, NDE and NIE are also referred to as pure direct effect (PDE) and total indirect effect (TIE) [16], respectively. Furthermore, NDE also corresponds to a path-specific effect proposed by Pearl [17].

2.3 Four-way decomposition with interactions

VanderWeele [7] proposed a four-way decomposition in a single-mediator scenario where the exposure interacts with the mediator. The TE of the exposure on the outcome is decomposed into components due to mediation only, interaction only, both mediation and interaction, and neither mediation nor interaction. These four components are termed as pure indirect effect (PIE), reference interaction effect ( INT ref ( m ) ), MI effect ( INT med ), and controlled direct effect ( CDE ( m ) ), respectively, where m is an arbitrarily chosen fixed reference level of the mediator M . At the individual level, the four components are expressed in the following general forms [7]:

CDE ( m ) = Y ( a , m ) Y ( a , m ) , INT ref ( m ) = m [ Y ( a , m ) Y ( a , m ) Y ( a , m ) + Y ( a , m ) ] × I ( M ( a ) = m ) , INT med = m [ Y ( a , m ) Y ( a , m ) Y ( a , m ) + Y ( a , m ) ] × [ I ( M ( a ) = m ) I ( M ( a ) = m ) ] , PIE = m [ Y ( a , m ) Y ( a , m ) ] × [ I ( M ( a ) = m ) I ( M ( a ) = m ) ] .

The reference and MI effects can also be expressed in the form of the counterfactual formulas in our view:

INT ref ( m ) = Y ( a , M ( a ) ) Y ( a , M ( a ) ) Y ( a , m ) + Y ( a , m ) , INT med = Y ( a , M ( a ) ) Y ( a , M ( a ) ) Y ( a , M ( a ) ) + Y ( a , M ( a ) ) .

CDE measures the effect of A had M been fixed at level m . INT ref ( m ) measures the change in the effect of A had M gone from m to M ( a ) . If M ( a ) = m , INT ref ( m ) for the individual considered is reduced to zero. INT med describes the change in the effect of A had M gone from M ( a ) to M ( a ) . When A has no effect on the mediator, M ( a ) = M ( a ) , and INT med becomes zero. P I E describes the effect of M when A is set at a and M goes from M ( a ) to M ( a ) .

When A and M are both binary with the conditions a = 1 , a = 0 , and m = 0 , the counterfactual definitions of the components become:

C D E ( 0 ) = Y ( 1 , 0 ) Y ( 0 , 0 ) , INT ref ( 0 ) = [ Y ( 1 , 1 ) Y ( 1 , 0 ) Y ( 0 , 1 ) + Y ( 0 , 0 ) ] × M ( 0 ) , INT med = [ Y ( 1 , 1 ) Y ( 1 , 0 ) Y ( 0 , 1 ) + Y ( 0 , 0 ) ] × [ M ( 1 ) M ( 0 ) ] , PIE = [ Y ( 0 , 1 ) Y ( 0 , 0 ) ] × [ M ( 1 ) M ( 0 ) ] ,

where 1 is the treatment level and 0 is the reference level [7].

Both INT ref and INT med have an additive interaction [ Y ( 1 , 1 ) Y ( 1 , 0 ) Y ( 0 , 1 ) + Y ( 0 , 0 ) ] term, which will be non-zero for an individual if the joint effect of having both the exposure and the mediator present differs from the sum of the effects of having only the exposure or mediator present. The additive interaction effect is generally considered of great public health importance [11,12]. Provided the additive interaction exists, the difference between INT ref and INT med is that INT ref is non-zero only if the mediator is present in the absence of exposure (i.e., M ( 0 ) = 1 ), whereas INT med is non-zero only if the exposure has an effect on the mediator (i.e., M ( 1 ) M ( 0 ) 0 ).

Based on the counterfactual formula form of MI INT med , we propose the natural MI effect and provide the following definition. The MI effect and natural MI effect are mathematically equivalent in a single-mediator scenario; we define it from a different perspective only for building up the concepts for scenarios with two mediators in Section 3.

Definition 1

We define the natural MI effect of A and M ( NatINT AM ) to be the MI effect ( INT med ) in a single-mediator scenario:

NatINT AM INT med = Y ( a , M ( a ) ) Y ( a , M ( a ) ) Y ( a , M ( a ) ) + Y ( a , M ( a ) ) ,

where M ( a ) and M ( a ) denote the potential values of M that would have occurred if A were fixed at a and a , respectively.

3 Decomposition of the TE in two-mediator scenarios

When two mediators are considered, two-way interaction of the two mediators and three-way interaction of the exposure and the two mediators are likely to exist [7,8,9]. There may also be a causal sequence between the two mediators, i.e., there is a direct causal link between the two mediators. There is limited research on how to define interactions when the two mediators are causally sequential. We aim to develop interpretable interaction concepts and decomposition approaches for two-mediator scenarios.

3.1 Mediators causally non-sequential

We first consider the scenario when the two mediators are causally non-sequential, i.e., there is no direct causal link between the two mediators, which is shown in Figure 3. Below, we define two-way natural MI effects of A and M 1 , A and M 2 , M 1 and M 2 , and a three-way natural MI effect of A , M 1 , and M 2 .

Figure 3 
                  Directed acyclic graph with two non-sequential mediators.
Figure 3

Directed acyclic graph with two non-sequential mediators.

Definition 2

Natural MI effects in a causally non-sequential two-mediator scenario are defined as follows:

NatINT A M 1 Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) + Y ( a , M 1 ( a ) , M 2 ( a ) ) , NatINT A M 2 Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) + Y ( a , M 1 ( a ) , M 2 ( a ) ) , NatINT M 1 M 2 Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) + Y ( a , M 1 ( a ) , M 2 ( a ) ) , NatINT A M 1 M 2 Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) + Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) + Y ( a , M 1 ( a ) , M 2 ( a ) ) + Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) .

NatINT A M 1 , NatINT A M 2 , and NatINT A M 1 M 2 are components that capture the effects due to both mediation and interaction with the exposure. NatINT M 1 M 2 describes the effect due to mediation and interaction between the two mediators. When measuring the interaction between A and M 1 , M 2 is not fixed but takes its potential value M 2 ( a ) for each individual had the exposure been the reference level. Similarly, when measuring the interaction between A and M 2 , M 1 is not fixed but takes its potential value M 1 ( a ) for the individual. The three-way interaction NatINT A M 1 M 2 is similar to a three-way additive interaction. To demonstrate the similarity, we consider that A is binary with the conditions a = 1 and a = 0 ; NatINT A M 1 M 2 becomes

Y ( 1 , M 1 ( 1 ) , M 2 ( 1 ) ) Y ( 0 , M 1 ( 1 ) , M 2 ( 1 ) ) Y ( 1 , M 1 ( 0 ) , M 2 ( 1 ) ) + Y ( 0 , M 1 ( 0 ) , M 2 ( 1 ) ) Y ( 1 , M 1 ( 1 ) , M 2 ( 0 ) ) + Y ( 0 , M 1 ( 1 ) , M 2 ( 0 ) ) + Y ( 1 , M 1 ( 0 ) , M 2 ( 0 ) ) Y ( 0 , M 1 ( 0 ) , M 2 ( 0 ) ) .

The above three-way interaction measures the change in the two-way interaction between A and M 1 when M 2 goes from M 2 ( 0 ) to M 2 ( 1 ) . It also measures the change in the interaction between A and M 2 when M 1 goes from M 1 ( 0 ) to M 1 ( 1 ) or the change in the interaction between M 1 and M 2 when A goes from 0 to 1.

In Supplementary material S1, we show that the TE can be decomposed into ten components at the individual level:

TE = CDE ( m 1 , m 2 ) + INT ref- A M 1 ( m 1 , m 2 ) + INT ref- A M 2 ( m 1 , m 2 ) + INT ref- A M 1 M 2 ( m 1 , m 2 ) + NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 + NatINT M 1 M 2 + PIE M 1 + PIE M 2 ,

where m 1 and m 2 are fixed reference levels for M 1 and M 2 , respectively, and

CDE ( m 1 , m 2 ) = Y ( a , m 1 , m 2 ) Y ( a , m 1 , m 2 ) , INT ref- A M 1 ( m 1 , m 2 ) = Y ( a , M 1 ( a ) , m 2 ) Y ( a , M 1 ( a ) , m 2 ) Y ( a , m 1 , m 2 ) + Y ( a , m 1 , m 2 ) , INT ref- A M 2 ( m 1 , m 2 ) = Y ( a , m 1 , M 2 ( a ) ) Y ( a , m 1 , M 2 ( a ) ) Y ( a , m 1 , m 2 ) + Y ( a , m 1 , m 2 ) , INT ref- A M 1 M 2 ( m 1 , m 2 ) = Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , m 1 , M 2 ( a ) ) + Y ( a , m 1 , M 2 ( a ) ) Y ( a , M 1 ( a ) , m 2 ) + Y ( a , M 1 ( a ) , m 2 ) + Y ( a , m 1 , m 2 ) Y ( a , m 1 , m 2 ) , PIE M 1 = Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) , PIE M 2 = Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) .

Similar to the four-way decomposition, CDE denotes controlled direct effect due to neither mediation nor interaction, INT ref s denote reference interaction effects due to interactions only, and PIEs denote PIEs due to mediation only [7,16,17]. NatINT M 1 M 2 can be interpreted as the effect due to the mediation through both M 1 and M 2 , and the interaction between M 1 and M 2 . Since the interaction is not involved with the change in exposure A , the interpretation can be simply put as the effect due to the mediation through both M 1 and M 2 only. These ten components are displayed in Table 1 assuming that A , M 1 , and M 2 are binary with a = 1 , a = 0 , m 1 = 0 , and m 2 = 0 .

Table 1

Decomposition of the TE in a non-sequential two-mediator scenario when A , M 1 , and M 2 are binary with a = 1 , a = 0 , m 1 = 0 , and m 2 = 0

Effect a , b Definition Interpretation
CDE ( 0 , 0 ) Y ( 1 , 0 , 0 ) Y ( 0 , 0 , 0 ) Due to neither mediation nor interaction
INT ref- A M 1 ( 0 , 0 ) [ Y ( 1 , 1 , 0 ) Y ( 0 , 1 , 0 ) Y ( 1 , 0 , 0 ) + Y ( 0 , 0 , 0 ) ] × M 1 ( 0 ) Due to the interaction between A and M 1 only
INT ref- A M 2 ( 0 , 0 ) [ Y ( 1 , 0 , 1 ) Y ( 0 , 0 , 1 ) Y ( 1 , 0 , 0 ) + Y ( 0 , 0 , 0 ) ] × M 2 ( 0 ) Due to the interaction between A and M 2 only
INT ref- A M 1 M 2 ( 0 , 0 ) [ Y ( 1 , 1 , 1 ) Y ( 0 , 1 , 1 ) Y ( 1 , 0 , 1 ) + Y ( 0 , 0 , 1 ) Y ( 1 , 1 , 0 ) + Y ( 0 , 1 , 0 ) + Y ( 1 , 0 , 0 ) Y ( 0 , 0 , 0 ) ] × M 1 ( 0 ) × M 2 ( 0 ) Due to the interaction between A , M 1 , and M 2 only
NatINT A M 1 m 2 [ Y ( 1 , 1 , m 2 ) I ( M 2 ( 0 ) = m 2 ) Y ( 0 , 1 , m 2 ) I ( M 2 ( 0 ) = m 2 ) Y ( 1 , 0 , m 2 ) I ( M 2 ( 0 ) = m 2 ) + Y ( 0 , 0 , m 2 ) I ( M 2 ( 0 ) = m 2 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] Due to the mediation through M 1 and the interaction between A and M 1 conditional on the potential value of M 2 with the fixed reference level a = 0
NatINT A M 2 m 1 [ Y ( 1 , m 1 , 1 ) I ( M 1 ( 0 ) = m 1 ) Y ( 0 , m 1 , 1 ) I ( M 1 ( 0 ) = m 1 ) Y ( 1 , m 1 , 0 ) I ( M 1 ( 0 ) = m 1 ) + Y ( 0 , m 1 , 0 ) I ( M 1 ( 0 ) = m 1 ) ] × [ M 2 ( 1 ) M 2 ( 0 ) ] Due to the mediation through M 2 and the interaction between A and M 2 conditional on the potential value of M 1 with the fixed reference level a = 0
NatINT A M 1 M 2 [ Y ( 1 , 1 , 1 ) Y ( 0 , 1 , 1 ) Y ( 1 , 0 , 1 ) + Y ( 0 , 0 , 1 ) Y ( 1 , 1 , 0 ) + Y ( 0 , 1 , 0 ) + Y ( 1 , 0 , 0 ) Y ( 0 , 0 , 0 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] × [ M 2 ( 1 ) M 2 ( 0 ) ] Due to the mediation through both M 1 and M 2 and the interaction between A , M 1 , and M 2
NatINT M 1 M 2 [ Y ( 0 , 1 , 1 ) Y ( 0 , 0 , 1 ) Y ( 0 , 1 , 0 ) + Y ( 0 , 0 , 0 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] × [ M 2 ( 1 ) M 2 ( 0 ) ] Due to the mediation through both M 1 and M 2 only
PIE M 1 m 2 [ Y ( 0 , 1 , m 2 ) I ( M 2 ( 0 ) = m 2 ) Y ( 0 , 0 , m 2 ) I ( M 2 ( 0 ) = m 2 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] Due to the mediation through M 1 only conditional on the potential value of M 2 with the fixed reference level a = 0
PIE M 2 m 1 [ Y ( 0 , m 1 , 1 ) I ( M 1 ( 0 ) = m 1 ) Y ( 0 , m 1 , 0 ) I ( M 1 ( 0 ) = m 1 ) ] × [ M 2 ( 1 ) M 2 ( 0 ) ] Due to the mediation through M 2 only conditional on the potential value of M 1 with the fixed reference level a = 0

a The CDE and reference interaction effects are the same as those proposed by Bellavia and Valeri [9].

b CDE denotes controlled direct effect; INT ref denotes reference interaction effect; NatINT denotes natural MI effect; PIE denotes PIE.

Bellavia and Valeri [9] proposed a ten-way decomposition for the same directed acyclic graph in Figure 3. We show in Supplementary material S2 that their decomposition resembles our proposed decomposition under certain conditions. Their CDE and INT ref s are identical to the corresponding terms in our decomposition but their MI effects and pure NIEs are generally different from our natural MIs and PIEs. Figure 4a illustrates their MI effect between A and M 1 where M 2 is assigned a fixed value at m 2 = 0 assuming M 1 and M 2 are binary. Figure 4b illustrates the natural MI effect between A and M 1 , where both M 1 and M 2 take their potential values. In another publication, Taguri et al. [10] developed a four-way decomposition method and proposed the MI component to examine the contribution of the additive interaction effects between the mediators to the joint NIE, assuming that the mediators are not causally ordered. Our natural MI effect between M 1 and M 2 has some similarity to the MI component in terms of mathematical forms. However, there are three main differences between the Taguri et al. method and our proposed decomposition method. First, our ten-way decomposition also considers the MI effects between the exposure and the mediators. Second, the exposure is fixed at the treatment level in the MI component but our natural MI between M 1 and M 2 sets the exposure at the reference level. Third, our decomposition methods apply to scenarios with two causally sequential or non-sequential mediators.

Figure 4 
                  Comparison between the MI effect and the natural MI effect between 
                        
                           
                           
                              A
                           
                           A
                        
                      and 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      at the individual level in a non-sequential two-mediator scenario. (a) 
                        
                           
                           
                              
                                 
                                    INTmed
                                 
                                 
                                    
                                       
                                          A
                                          M
                                       
                                       
                                          1
                                       
                                    
                                 
                              
                           
                           {{\rm{INTmed}}}_{{AM}_{1}}
                        
                      in Bellavia’s and Valeri’s method, where 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                      is assumed to be fixed at 0 for all individuals. (b) 
                        
                           
                           
                              
                                 
                                    NatINT
                                 
                                 
                                    
                                       
                                          A
                                          M
                                       
                                       
                                          1
                                       
                                    
                                 
                              
                           
                           {{\rm{NatINT}}}_{{AM}_{1}}
                        
                      where 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                      takes its potential value 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                              
                                 (
                                 
                                    0
                                 
                                 )
                              
                           
                           {M}_{2}\left(0)
                        
                      without such assumption.
Figure 4

Comparison between the MI effect and the natural MI effect between A and M 1 at the individual level in a non-sequential two-mediator scenario. (a) INTmed A M 1 in Bellavia’s and Valeri’s method, where M 2 is assumed to be fixed at 0 for all individuals. (b) NatINT A M 1 where M 2 takes its potential value M 2 ( 0 ) without such assumption.

The expected values of our natural MI effects provide natural interpretations by accounting for the distributions of M 1 ( 0 ) and M 2 ( 0 ) . For example, if the population distribution of M 2 ( 0 ) has a probability of 1 taking the value 0, E [ NatINT A M 1 ] becomes the expected value of the MI effect between A and M 1 as proposed by Bellavia and Valeri. However, if the population distribution of M 2 ( 0 ) does not have a probability of 1 taking the value 0, E [ NatINT A M 1 ] is more suitable to describe the population average of the counterfactual interaction effect. Table 2 presents our results of natural MI effects and PIEs under the assumption M 1 ( 0 ) = M 2 ( 0 ) = 0 , which are identical to those proposed by Bellavia and Valeri [9]. A detailed comparison of the mediated effects between Bellavia’s and Valeri’s method and our proposed decomposition under linear models assuming continuous mediators and outcome is described in Section 5.3 and Table 5. The differences between the two methods are further discussed in Section 6.1 with a simulated data set.

Table 2

Proposed mediated effects in a non-sequential two-mediator scenario with binary A , M 1 , and M 2 under the Assumption M 1 ( 0 ) = M 2 ( 0 ) = 0

Effect a Definition Interpretation
NatINT A M 1 [ Y ( 1 , 1 , 0 ) Y ( 0 , 1 , 0 ) Y ( 1 , 0 , 0 ) + Y ( 0 , 0 , 0 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] Due to the mediation through M 1 and the interaction between A and M 1 assuming M 2 ( 0 ) = 0
NatINT A M 2 [ Y ( 1 , 0 , 1 ) Y ( 0 , 0 , 1 ) Y ( 1 , 0 , 0 ) + Y ( 0 , 0 , 0 ) ] × [ M 2 ( 1 ) M 2 ( 0 ) ] Due to the mediation through M 2 and the interaction between A and M 2 assuming M 1 ( 0 ) = 0
NatINT A M 1 M 2 [ Y ( 1 , 1 , 1 ) Y ( 0 , 1 , 1 ) Y ( 1 , 0 , 1 ) + Y ( 0 , 0 , 1 ) Y ( 1 , 1 , 0 ) + Y ( 0 , 1 , 0 ) + Y ( 1 , 0 , 0 ) Y ( 0 , 0 , 0 ) ] × [ M 1 ( 1 ) M 2 ( 1 ) M 1 ( 0 ) M 2 ( 0 ) ] Due to the mediation through both M 1 and M 2 and the interaction between A , M 1 , and M 2 assuming M 1 ( 0 ) = M 2 ( 0 ) = 0
NatINT M 1 M 2 [ Y ( 0 , 1 , 1 ) Y ( 0 , 0 , 1 ) Y ( 0 , 1 , 0 ) + Y ( 0 , 0 , 0 ) ] × [ M 1 ( 1 ) M 2 ( 1 ) M 1 ( 0 ) M 2 ( 0 ) ] Due to the mediation through both M 1 and M 2 only assuming M 1 ( 0 ) = M 2 ( 0 ) = 0
PIE M 1 [ Y ( 0 , 1 , 0 ) Y ( 0 , 0 , 0 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] Due to the mediation through M 1 only assuming M 2 ( 0 ) = 0
PIE M 2 [ Y ( 0 , 0 , 1 ) Y ( 0 , 0 , 0 ) ] × [ M 2 ( 1 ) M 2 ( 0 ) ] Due to the mediation through M 2 only assuming M 1 ( 0 ) = 0

a NatINT denotes natural MI effect; PIE denotes pure indirect effect.

3.2 Mediators causally sequential

In this section, we consider the scenario where the two mediators are causally sequential, i.e., there is a direct causal link from mediator M 1 to M 2 (Figure 5). Let M 2 ( a , M 1 ( a ) ) be the potential value of M 2 if A were fixed at a and M 1 were at its potential value had A been set at a . Similarly, M 2 ( a , M 1 ( a ) ) denotes the potential value of M 2 if A were fixed at a and M 1 were at its potential value had A been set at a . Counterfactual values for Y are expressed using nested formulas but not all of them are non-parametrically identifiable [15]. For example, Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) is not identifiable since it has two distinct counterfactual values of mediator M 1 , i.e., M 1 ( a ) and M 1 ( a ) , which means M 1 is activated by two different values of A at the same time. Avin et al. [15] showed that such counterfactual formulas are not identifiable. We present identifiable decomposition components only with those identifiable counterfactual formulas of Y .

Figure 5 
                  Directed acyclic graph with two sequential mediators where there exists a direct causal link pointing from 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      to 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                     .
Figure 5

Directed acyclic graph with two sequential mediators where there exists a direct causal link pointing from M 1 to M 2 .

Definition 3

Natural MI effects in a causally sequential two-mediator scenario are defined as follows:

NatINT A M 1 Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) + Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) , NatINT A M 2 Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) + Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) , NatINT M 1 M 2 Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) + Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) , NatINT A M 1 M 2 Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) + Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) + Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) + Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) .

These interaction terms are similar to those in Definition 2 except that M 2 has an extra input from M 1 . In NatINT A M 1 , M 2 is neither fixed nor set at a level independent of M 1 ; rather, M 2 changes whenever M 1 changes. Therefore, NatINT A M 1 captures the change in the TE of M 1 (going from M 1 ( a ) to M 1 ( a ) ) on the outcome when A goes from a to a . In NatINT M 1 M 2 , M 2 would still partially depend on the level of M 1 . Hence, this component describes the interaction between M 1 and M 2 had M 2 only change its exposure input. Similarly, the three-way interaction NatINT A M 1 M 2 can be interpreted as the change in the interaction between A and M 1 when M 2 has its exposure input going from a to a .

We show in Supplementary material S3 that the TE can be decomposed into ten components at the individual level:

TE = CDE ( m 1 , m 2 ) + INT ref- A M 1 ( m 1 , m 2 ) + INT ref- A M 2 ( m 1 , m 2 ) + INT ref- A M 1 M 2 ( m 1 , m 2 ) + NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 + NatINT M 1 M 2 + PIE M 1 + SNIE M 2 ,

where

CDE ( m 1 , m 2 ) = Y ( a , m 1 , m 2 ) Y ( a , m 1 , m 2 ) , INT ref- A M 1 ( m 1 , m 2 ) = Y ( a , M 1 ( a ) , m 2 ) Y ( a , M 1 ( a ) , m 2 ) Y ( a , m 1 , m 2 ) + Y ( a , m 1 , m 2 ) , INT ref- A M 2 ( m 1 , m 2 ) = Y ( a , m 1 , M 2 ( a , m 1 ) ) Y ( a , m 1 , M 2 ( a , m 1 ) ) Y ( a , m 1 , m 2 ) + Y ( a , m 1 , m 2 ) , INT ref- A M 1 M 2 ( m 1 , m 2 ) = Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , m 1 , M 2 ( a , m 1 ) ) + Y ( a , m 1 , M 2 ( a , m 1 ) ) Y ( a , M 1 ( a ) , m 2 ) + Y ( a , M 1 ( a ) , m 2 ) + Y ( a , m 1 , m 2 ) Y ( a , m 1 , m 2 ) , PIE M 1 = Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) , SNIE M 2 = Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) Y ( a , M 1 ( a ) , M 2 ( a , M 1 ( a ) ) ) .

Since the complexity significantly increases in a sequential two-mediator scenario with a direct causal link pointing from M 1 to M 2 , a few important points need to be addressed. First, we need to ensure that all the counterfactual formulas in the decomposition are identifiable especially when finding INT ref- A M 2 ( m 1 , m 2 ) and INT ref- A M 1 M 2 ( m 1 , m 2 ) . We use the method from Figure 3 in Pearl [17] to graphically illustrate the counterfactual formulas. Figure 6a depicts Y ( a , m 1 , M 2 ( a , M 1 ( a ) ) ) as an example of a non-identifiable counterfactual formula and could be seen as a variant of the problematic counterfactual formulas proposed by Avin et al. [15]. We show how such counterfactual formulas might appear in INT ref- A M 2 ( m 1 , m 2 ) and INT ref- A M 1 M 2 ( m 1 , m 2 ) and describe their non-identifiability in Supplementary material S4. Briefly, M 1 can potentially take two different values within Y ( a , m 1 , M 2 ( a , M 1 ( a ) ) ) , i.e., m 1 and M 1 ( a ) can be different, which results in non-identifiability. In our approach to find INT ref- A M 2 ( m 1 , m 2 ) , we set M 1 to a fixed reference level m 1 and also use it as the second input argument of M 2 . With this approach, M 1 only takes one value in each counterfactual formula of Y as illustrated in Figure 6b, and therefore the non-identifiability would not occur. A graphical illustration for the reference interaction effect between A and M 2 is shown in Figure 7.

Figure 6 
                  (a) The graphical illustration of 
                        
                           
                           
                              Y
                              
                                 (
                                 
                                    a
                                    ,
                                    
                                       
                                          m
                                       
                                       
                                          1
                                       
                                       
                                          ∗
                                       
                                    
                                    ,
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                    
                                       (
                                       
                                          
                                             
                                                a
                                             
                                             
                                                ∗
                                             
                                          
                                          ,
                                          
                                             
                                                M
                                             
                                             
                                                1
                                             
                                          
                                          
                                             (
                                             
                                                
                                                   
                                                      a
                                                   
                                                   
                                                      ∗
                                                   
                                                
                                             
                                             )
                                          
                                       
                                       )
                                    
                                 
                                 )
                              
                           
                           Y\left(a,{m}_{1}^{\ast },{M}_{2}\left({a}^{\ast },{M}_{1}\left({a}^{\ast })))
                        
                      which is an example of a type of non-identifiable counterfactual formula with 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      taking two different values, 
                        
                           
                           
                              
                                 
                                    m
                                 
                                 
                                    1
                                 
                                 
                                    ∗
                                 
                              
                           
                           {m}_{1}^{\ast }
                        
                      and 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                              
                                 (
                                 
                                    
                                       
                                          a
                                       
                                       
                                          ∗
                                       
                                    
                                 
                                 )
                              
                           
                           {M}_{1}\left({a}^{\ast })
                        
                      in this example. (b) An identifiable counterfactual formula 
                        
                           
                           
                              Y
                              
                                 (
                                 
                                    a
                                    ,
                                    
                                       
                                          m
                                       
                                       
                                          1
                                       
                                       
                                          ∗
                                       
                                    
                                    ,
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                    
                                       (
                                       
                                          
                                             
                                                a
                                             
                                             
                                                ∗
                                             
                                          
                                          ,
                                          
                                             
                                                m
                                             
                                             
                                                1
                                             
                                             
                                                ∗
                                             
                                          
                                       
                                       )
                                    
                                 
                                 )
                              
                           
                           Y\left(a,{m}_{1}^{\ast },{M}_{2}\left({a}^{\ast },{m}_{1}^{\ast }))
                        
                     , where 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      takes one fixed value 
                        
                           
                           
                              
                                 
                                    m
                                 
                                 
                                    1
                                 
                                 
                                    ∗
                                 
                              
                           
                           {m}_{1}^{\ast }
                        
                     .
Figure 6

(a) The graphical illustration of Y ( a , m 1 , M 2 ( a , M 1 ( a ) ) ) which is an example of a type of non-identifiable counterfactual formula with M 1 taking two different values, m 1 and M 1 ( a ) in this example. (b) An identifiable counterfactual formula Y ( a , m 1 , M 2 ( a , m 1 ) ) , where M 1 takes one fixed value m 1 .

Figure 7 
                  Graphical illustration of the reference interaction effect between 
                        
                           
                           
                              A
                           
                           A
                        
                      and 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                      in a sequential two-mediator scenario, where 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      is fixed at the reference level 
                        
                           
                           
                              
                                 
                                    m
                                 
                                 
                                    1
                                 
                                 
                                    ∗
                                 
                              
                           
                           {m}_{1}^{\ast }
                        
                      so that the identifiability is ensured. 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                      takes 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                              
                                 (
                                 
                                    
                                       
                                          a
                                       
                                       
                                          ∗
                                       
                                    
                                    ,
                                    
                                       
                                          m
                                       
                                       
                                          1
                                       
                                       
                                          ∗
                                       
                                    
                                 
                                 )
                              
                           
                           {M}_{2}\left({a}^{\ast },{m}_{1}^{\ast })
                        
                      and 
                        
                           
                           
                              
                                 
                                    m
                                 
                                 
                                    2
                                 
                                 
                                    ∗
                                 
                              
                           
                           {m}_{2}^{\ast }
                        
                      as its treatment level and reference level, respectively.
Figure 7

Graphical illustration of the reference interaction effect between A and M 2 in a sequential two-mediator scenario, where M 1 is fixed at the reference level m 1 so that the identifiability is ensured. M 2 takes M 2 ( a , m 1 ) and m 2 as its treatment level and reference level, respectively.

Second, the causal effect along the path A M 1 M 2 Y and the causal effect along the path A M 2 Y combine to give the complete mediated effect through M 2 (Figure 5). However, the part from A M 1 M 2 Y is non-identifiable [15], and therefore we use the notion of seminatural indirect effect [19] instead of the PIE for the mediated effect through M 2 in a sequential two-mediator scenario. The seminatural indirect effect through M 2 , SNIE M 2 , measures the causal effect along the path A M 2 Y and can be interpreted as the effect due to partial mediation through M 2 only [19,20]. A graphical illustration of SNIE M 2 is presented in Figure 8.

Figure 8 
                  Graphical illustration of the seminatural indirect effect through 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                     , 
                        
                           
                           
                              
                                 
                                    SNIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                 
                              
                           
                           {{\rm{SNIE}}}_{{M}_{2}}
                        
                     , which evaluates the causal effect along the path 
                        
                           
                           
                              A
                              →
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                              →
                              Y
                           
                           A\to {M}_{2}\to Y
                        
                      and can be interpreted as the effect due to partial mediation through 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                      only.
Figure 8

Graphical illustration of the seminatural indirect effect through M 2 , SNIE M 2 , which evaluates the causal effect along the path A M 2 Y and can be interpreted as the effect due to partial mediation through M 2 only.

These ten components and their interpretations are shown in Table 3 for the special case when A , M 1 , and M 2 are all binary and additionally a = 1 , a = 0 , m 1 = 0 , and m 2 = 0 .

Table 3

Decomposition of the TE in a sequential two-mediator scenario when A , M 1 , and M 2 are binary with a = 1 , a = 0 , m 1 = 0 , and m 2 = 0

Effect a Definition Interpretation
CDE ( 0 , 0 ) Y ( 1 , 0 , 0 ) Y ( 0 , 0 , 0 ) Due to neither mediation nor interaction
INT ref- A M 1 ( 0 , 0 ) [ Y ( 1 , 1 , 0 ) Y ( 0 , 1 , 0 ) Y ( 1 , 0 , 0 ) + Y ( 0 , 0 , 0 ) ] × M 1 ( 0 ) Due to the interaction between A and M 1 only
INT ref- A M 2 ( 0 , 0 ) [ Y ( 1 , 0 , 1 ) Y ( 0 , 0 , 1 ) Y ( 1 , 0 , 0 ) + Y ( 0 , 0 , 0 ) ] × M 2 ( 0 , 0 ) Due to the interaction between A and M 2 only
INT ref- A M 1 M 2 ( 0 , 0 ) [ Y ( 1 , 1 , 1 ) Y ( 0 , 1 , 1 ) Y ( 1 , 1 , 0 ) + Y ( 0 , 1 , 0 ) ] × M 1 ( 0 ) × M 2 ( 0 , 1 ) + [ Y ( 1 , 0 , 1 ) + Y ( 0 , 0 , 1 ) + Y ( 1 , 0 , 0 ) Y ( 0 , 0 , 0 ) ] × M 1 ( 0 ) × M 2 ( 0 , 0 ) Due to the interaction between A , M 1 , and M 2 only
NatINT A M 1 m 2 [ Y ( 1 , 1 , m 2 ) I ( M 2 ( 0 , 1 ) = m 2 ) Y ( 0 , 1 , m 2 ) I ( M 2 ( 0 , 1 ) = m 2 ) Y ( 1 , 0 , m 2 ) I ( M 2 ( 0 , 0 ) = m 2 ) + Y ( 0 , 0 , m 2 ) I ( M 2 ( 0 , 0 ) = m 2 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] Due to the mediation through M 1 and the interaction between A and M 1 conditional on the potential values of M 2 with the fixed reference level a = 0
NatINT A M 2 m 1 [ Y ( 1 , m 1 , 1 ) I ( M 1 ( 0 ) = m 1 ) Y ( 0 , m 1 , 1 ) I ( M 1 ( 0 ) = m 1 )   Y ( 1 , m 1 , 0 ) I ( M 1 ( 0 ) = m 1 ) + Y ( 0 , m 1 , 0 ) I ( M 1 ( 0 ) = m 1 ) ] × [ M 2 ( 1 , m 1 ) M 2 ( 0 , m 1 ) ] Due to the mediation through M 2 and the interaction between A and M 2 conditional on the potential value of M 1 with the fixed reference level a = 0
NatINT A M 1 M 2 [ Y ( 1 , 1 , 1 ) Y ( 0 , 1 , 1 ) Y ( 1 , 1 , 0 ) + Y ( 0 , 1 , 0 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] × [ M 2 ( 1 , 1 ) M 2 ( 0 , 1 ) ]  + [ Y ( 1 , 0 , 1 ) + Y ( 0 , 0 , 1 ) + Y ( 1 , 0 , 0 ) Y ( 0 , 0 , 0 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] × [ M 2 ( 1 , 0 ) M 2 ( 0 , 0 ) ] Due to the mediation through both M 1 and M 2 and the interaction between A , M 1 , and M 2
NatINT M 1 M 2 [ Y ( 0 , 1 , 1 ) Y ( 0 , 1 , 0 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] × [ M 2 ( 1 , 1 ) M 2 ( 0 , 1 ) ] + [ Y ( 0 , 0 , 1 ) + Y ( 0 , 0 , 0 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] × [ M 2 ( 1 , 0 ) M 2 ( 0 , 0 ) ] Due to the mediation through both M 1 and M 2 only
PIE M 1 m 2 [ Y ( 0 , 1 , m 2 ) I ( M 2 ( 0 , 1 ) = m 2 )   Y ( 0 , 0 , m 2 ) I ( M 2 ( 0 , 0 ) = m 2 ) ] × [ M 1 ( 1 ) M 1 ( 0 ) ] Due to the mediation through M 1 only conditional on the potential values of M 2 with the fixed reference level a = 0
SNIE M 2 m 1 [ Y ( 0 , m 1 , 1 ) × I ( M 1 ( 0 ) = m 1 )   Y ( 0 , m 1 , 0 ) × I ( M 1 ( 0 ) = m 1 ) ] × [ M 2 ( 1 , m 1 ) M 2 ( 0 , m 1 ) ] Due to the partial mediation through M 2 only conditional on the potential value of M 1 with the fixed reference level a = 0

a CDE denotes controlled direct effect; INT ref denotes reference interaction effect; NatINT denotes natural MI effect; PIE denotes pure indirect effect.

4 Relations to traditional definitions

For both a non-sequential and a sequential two-mediator scenario, the ten components can be grouped into different portions with traditional definitions that are of great interest. In this section, we illustrate the relations of our proposed decompositions to the traditional definitions introduced in previous literature [7,16,17,21].

4.1 Non-sequential two-mediator scenario

Recall that the TE can be decomposed into the following ten components in a non-sequential two-mediator scenario:

TE = CDE ( m 1 , m 2 ) + INT ref- A M 1 ( m 1 , m 2 ) + INT ref- A M 2 ( m 1 , m 2 ) + INT ref- A M 1 M 2 ( m 1 , m 2 ) + NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 + NatINT M 1 M 2 + PIE M 1 + PIE M 2 .

First, the sum of the CDE and the reference interaction effects equals the PDE that evaluates the causal effect through the direct path A Y and is defined as the difference in the outcome when the exposure goes from a to a while the mediators take their potential values, M 1 ( a ) and M 2 ( a ) [16]. Namely, we have,

(1) PDE = Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) = CDE ( m 1 , m 2 ) + INT ref- A M 1 ( m 1 , m 2 ) + INT ref- A M 2 ( m 1 , m 2 ) + INT ref- A M 1 M 2 ( m 1 , m 2 ) .

Intuitively, the CDE and the reference interaction effects are the only components in the decomposition that do not require any mediated effects to exist as shown in equation (1). The four-way decomposition [7] also has the corresponding relation but the reference interaction effect only consists of one term.

The TDE [16] is different from PDE in the way that the potential values M 1 ( a ) and M 2 ( a ) are employed instead of M 1 ( a ) and M 2 ( a ) . TDE can be expressed as the sum of four components consisting of PDE, NatINT A M 1 , NatINT A M 2 , and NatINT A M 1 M 2 :

(2) TDE = Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) = PDE + NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 .

The natural MI effect between M 1 and M 2 , NatINT M 1 M 2 , is not included in equation (2). This is because NatINT M 1 M 2 measures the interdependence of the mediated effects through the two mediators while the exposure is fixed at a for the direct path.

The NIE through M 1 , NIE M 1 , is defined by disabling the direct path with the fixed reference level a as well as suppressing the indirect effect through M 2 with the potential value M 2 ( a ) , which can be seen as the type 2 mediator-specific effect proposed by Daniel et al. [3] without a direct causal link pointing from M 1 to M 2 . We show in Supplementary material S1 that NIE M 1 is the sum of NatINT M 1 M 2 and PIE M 1 , which can be expressed as the following equation:

NIE M 1 = Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) = NatINT M 1 M 2 + PIE M 1 ,

where PIE M 1 satisfies the definition of a path-specific effect through M 1 [17].

The PIE through M 2 , PIE M 2 , is also a path specific effect. Figure 9 depicts an alternative mediation decomposition and illustrates the relations between the ten components and the traditional definitions in a non-sequential two-mediator scenario. Other relations that are not shown in Figure 9 can also be obtained. For example, the TIE [16] can be expressed as the sum of the following components:

TIE = Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) = NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 + NatINT M 1 M 2 + PIE M 1 + PIE M 2 ,

since

TE-PDE = Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) + Y ( a , M 1 ( a ) , M 2 ( a ) ) = Y ( a , M 1 ( a ) , M 2 ( a ) ) Y ( a , M 1 ( a ) , M 2 ( a ) ) = T I E .

Figure 9 
                  A flowchart illustrating an alternative mediation decomposition. For a non-sequential two-mediator scenario, the PDE consists of the CDE (
                        
                           
                           
                              CDE
                              
                                 (
                                 
                                    
                                       
                                          m
                                       
                                       
                                          1
                                       
                                       
                                          ∗
                                       
                                    
                                    ,
                                    
                                       
                                          m
                                       
                                       
                                          2
                                       
                                       
                                          ∗
                                       
                                    
                                 
                                 )
                              
                           
                           {\rm{CDE}}\left({m}_{1}^{\ast },{m}_{2}^{\ast })
                        
                     ) and the reference interaction effects (
                        
                           
                           
                              
                                 
                                    INT
                                 
                                 
                                    ref
                                 
                              
                           
                           {{\rm{INT}}}_{{\rm{ref}}}
                        
                     s); the TDE consists of the PDE and the natural mediated interaction effects (NatINTs) except for the one between 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      and 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                     ; the NIE through 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      (
                        
                           
                           
                              
                                 
                                    NIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          1
                                       
                                    
                                 
                              
                           
                           {{\rm{NIE}}}_{{M}_{1}}
                        
                     ) consists of the PIE through 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      (
                        
                           
                           
                              
                                 
                                    PIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          1
                                       
                                    
                                 
                              
                           
                           {{\rm{PIE}}}_{{M}_{1}}
                        
                     ) and the natural MI effect between 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      and 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                      (
                        
                           
                           
                              
                                 
                                    NatINT
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          1
                                       
                                    
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                 
                              
                           
                           {{\rm{NatINT}}}_{{M}_{1}{M}_{2}}
                        
                     ); the TE consists of the TDE, the NIE through 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    1
                                 
                              
                           
                           {M}_{1}
                        
                      (
                        
                           
                           
                              
                                 
                                    NIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          1
                                       
                                    
                                 
                              
                           
                           {{\rm{NIE}}}_{{M}_{1}}
                        
                     ), and the PIE through 
                        
                           
                           
                              
                                 
                                    M
                                 
                                 
                                    2
                                 
                              
                           
                           {M}_{2}
                        
                      (
                        
                           
                           
                              
                                 
                                    PIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                 
                              
                           
                           {{\rm{PIE}}}_{{M}_{2}}
                        
                     ). For a sequential two-mediator scenario, one can still follow the flowchart by replacing 
                        
                           
                           
                              
                                 
                                    PIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                 
                              
                           
                           {{\rm{PIE}}}_{{M}_{2}}
                        
                      with 
                        
                           
                           
                              
                                 
                                    SNIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                 
                              
                           
                           {{\rm{SNIE}}}_{{M}_{2}}
                        
                     .
Figure 9

A flowchart illustrating an alternative mediation decomposition. For a non-sequential two-mediator scenario, the PDE consists of the CDE ( CDE ( m 1 , m 2 ) ) and the reference interaction effects ( INT ref s); the TDE consists of the PDE and the natural mediated interaction effects (NatINTs) except for the one between M 1 and M 2 ; the NIE through M 1 ( NIE M 1 ) consists of the PIE through M 1 ( PIE M 1 ) and the natural MI effect between M 1 and M 2 ( NatINT M 1 M 2 ); the TE consists of the TDE, the NIE through M 1 ( NIE M 1 ), and the PIE through M 2 ( PIE M 2 ). For a sequential two-mediator scenario, one can still follow the flowchart by replacing PIE M 2 with SNIE M 2 .

The portion eliminated (PE) is another useful measure that evaluates how much the causal effect of the exposure on the outcome would be removed if the mediators were set to 0 [16,21]. It can be expressed as follows:

PE = TE -CDE ( m 1 , m 2 ) = INT ref- A M 1 ( m 1 , m 2 ) + INT ref- A M 2 ( m 1 , m 2 ) + INT ref- A M 1 M 2 ( m 1 , m 2 ) + NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 + NatINT M 1 M 2 + PIE M 1 + PIE M 2 ,

where the graphical illustration for this alternative decomposition with PE is shown in Figure 10.

Figure 10 
                  A flowchart illustrating an alternative mediation decomposition. For a non-sequential two-mediator scenario, the PE can be found by summing up the reference interaction effects (
                        
                           
                           
                              
                                 
                                    INT
                                 
                                 
                                    ref
                                 
                              
                           
                           {{\rm{INT}}}_{{\rm{ref}}}
                        
                     s), the natural mediated interaction effects (NatINTs), and the PIEs. The PE can also be calculated by subtracting the CDE (
                        
                           
                           
                              CDE
                              
                                 (
                                 
                                    
                                       
                                          m
                                       
                                       
                                          1
                                       
                                       
                                          ∗
                                       
                                    
                                    ,
                                    
                                       
                                          m
                                       
                                       
                                          2
                                       
                                       
                                          ∗
                                       
                                    
                                 
                                 )
                              
                           
                           {\rm{CDE}}\left({m}_{1}^{\ast },{m}_{2}^{\ast })
                        
                     ) from the TE. For a sequential two-mediator scenario, one can still follow the flowchart by replacing 
                        
                           
                           
                              
                                 
                                    PIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                 
                              
                           
                           {{\rm{PIE}}}_{{M}_{2}}
                        
                      with 
                        
                           
                           
                              
                                 
                                    SNIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                 
                              
                           
                           {{\rm{SNIE}}}_{{M}_{2}}
                        
                     .
Figure 10

A flowchart illustrating an alternative mediation decomposition. For a non-sequential two-mediator scenario, the PE can be found by summing up the reference interaction effects ( INT ref s), the natural mediated interaction effects (NatINTs), and the PIEs. The PE can also be calculated by subtracting the CDE ( CDE ( m 1 , m 2 ) ) from the TE. For a sequential two-mediator scenario, one can still follow the flowchart by replacing PIE M 2 with SNIE M 2 .

If the components related to the effect due to interaction are of great interest, the portion attributable to interaction (PAI) [7] can be found by summing up the reference and natural MI effects. Namely, we have,

PAI = INT ref- A M 1 ( m 1 , m 2 ) + INT ref- A M 2 ( m 1 , m 2 ) + INT ref- A M 1 M 2 ( m 1 , m 2 ) + NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 + NatINT M 1 M 2 ,

which leads to a four-way decomposition for a non-sequential two-mediator scenario:

TE = CDE ( m 1 , m 2 ) + PAI + PIE M 1 + PIE M 2 .

Figure 11 presents an overall picture for the interaction and mediation decompositions with the ten components for a non-sequential two-mediator scenario. Suggested choices for the multiway interaction decompositions are summarized in Table 4.

Figure 11 
                  A flowchart illustrating alternative mediation and interaction decompositions. For a non-sequential two-mediator scenario, the left part shows an interaction decomposition. The portion attributable to interaction (PAI) consists of the reference interaction effects (
                        
                           
                           
                              
                                 
                                    INT
                                 
                                 
                                    ref
                                 
                              
                           
                           {{\rm{INT}}}_{{\rm{ref}}}
                        
                     s) and the natural mediated interaction effects (NatINTs). The TE consists of the CDE (
                        
                           
                           
                              CDE
                              
                                 (
                                 
                                    
                                       
                                          m
                                       
                                       
                                          1
                                       
                                       
                                          ∗
                                       
                                    
                                    ,
                                    
                                       
                                          m
                                       
                                       
                                          2
                                       
                                       
                                          ∗
                                       
                                    
                                 
                                 )
                              
                           
                           {\rm{CDE}}\left({m}_{1}^{\ast },{m}_{2}^{\ast })
                        
                     ), the portion attributable to interaction (PAI), and the PIEs. The right part shows a mediation decomposition. The PDE consists of the CDE (
                        
                           
                           
                              CDE
                              
                                 (
                                 
                                    
                                       
                                          m
                                       
                                       
                                          1
                                       
                                       
                                          ∗
                                       
                                    
                                    ,
                                    
                                       
                                          m
                                       
                                       
                                          2
                                       
                                       
                                          ∗
                                       
                                    
                                 
                                 )
                              
                           
                           {\rm{CDE}}\left({m}_{1}^{\ast },{m}_{2}^{\ast })
                        
                     ) and the reference interaction effects (
                        
                           
                           
                              
                                 
                                    INT
                                 
                                 
                                    ref
                                 
                              
                           
                           {{\rm{INT}}}_{{\rm{ref}}}
                        
                     s). The TIE consists of the NatINTs and the PIEs. The TE consists of the PDE and the TIE. For a sequential two-mediator scenario, one can still follow the flowchart by replacing 
                        
                           
                           
                              
                                 
                                    PIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                 
                              
                           
                           {{\rm{PIE}}}_{{M}_{2}}
                        
                      with 
                        
                           
                           
                              
                                 
                                    SNIE
                                 
                                 
                                    
                                       
                                          M
                                       
                                       
                                          2
                                       
                                    
                                 
                              
                           
                           {{\rm{SNIE}}}_{{M}_{2}}
                        
                     .
Figure 11

A flowchart illustrating alternative mediation and interaction decompositions. For a non-sequential two-mediator scenario, the left part shows an interaction decomposition. The portion attributable to interaction (PAI) consists of the reference interaction effects ( INT ref s) and the natural mediated interaction effects (NatINTs). The TE consists of the CDE ( CDE ( m 1 , m 2 ) ), the portion attributable to interaction (PAI), and the PIEs. The right part shows a mediation decomposition. The PDE consists of the CDE ( CDE ( m 1 , m 2 ) ) and the reference interaction effects ( INT ref s). The TIE consists of the NatINTs and the PIEs. The TE consists of the PDE and the TIE. For a sequential two-mediator scenario, one can still follow the flowchart by replacing PIE M 2 with SNIE M 2 .

Table 4

Suggested interaction decompositions for both a non-sequential and a sequential two-mediator scenario a

Number of components Decomposition b
2-Way decomposition (no mediation) CDE ( m 1 , m 2 ) + PAI
4-Way decomposition CDE ( m 1 , m 2 ) + PAI + PIE M 1 + PIE M 2 (or SNIE M 2 )
4-Way decomposition TDE + NatINT M 1 M 2 + PIE M 1 + PIE M 2 (or SNIE M 2 )
5-Way decomposition CDE ( m 1 , m 2 ) + INT ref- A M 1 ( m 1 , m 2 ) + INT ref- A M 2 ( m 1 , m 2 ) + INT ref- A M 1 M 2 ( m 1 , m 2 ) + TIE
7-Way decomposition PDE + NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 + NatINT M 1 M 2 + PIE M 1 + PIE M 2 (or SNIE M 2 )
10-Way decomposition CDE ( m 1 , m 2 ) + INT ref- A M 1 ( m 1 , m 2 ) + INT ref- A M 2 ( m 1 , m 2 ) + INT ref- A M 1 M 2 ( m 1 , m 2 ) + NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 + NatINT M 1 M 2 + PIE M 1 + PIE M 2 (or SNIE M 2 )

a Use SNIE M 2 instead of PIE M 2 in a sequential two-mediator scenario.

b CDE denotes controlled direct effect; INT ref denotes reference interaction effect; NatINT denotes natural MI effect; PIE denotes pure indirect effect; PAI denotes portion attributable to interaction; SNIE denotes seminatural indirect effect; TDE denotes total direct effect; TIE denotes total indirect effect; PDE denotes pure direct effect.

4.2 Sequential two-mediator scenario

We recall the ten components of the decomposed TE for a sequential two-mediator scenario:

TE = CDE ( m 1 , m 2 ) + INT ref- A M 1 ( m 1 , m 2 ) + INT ref- A M 2 ( m 1 , m 2 ) + INT ref- A M 1 M 2 ( m 1 , m 2 ) + NatINT A M 1 + NatINT A M 2 + NatINT A M 1 M 2 + NatINT M 1 M 2 + PIE M 1 + SNIE M 2 .

As discussed in Section 3.2, the complete mediated effect through M 2 cannot be identified with non-parametric models because of the direct causal link pointing from M 1 to M 2 , and hence the seminatural indirect effect through M 2 , SNIE M 2 , is used instead. One can also employ traditional definitions to perform alternative interaction and mediation decompositions for a sequential two-mediator scenario by replacing PIE M 2 with SNIE M 2 .

5 Identification assumptions and empirical formulas

The decompositions for one- and two-mediator scenarios thus far have been primarily conceptual. The individual-level effects in the decompositions cannot be identified from data, but under certain assumptions on confounding the population-averages of those components can be identified from data [6].

5.1 Identification assumptions

We first consider a single-mediator scenario. Four identification assumptions are required [22], which are listed below as ( A 1 )–( A 4 ):

Y ( a , m ) A C , ( A 1 ) Y ( a , m ) M { A , C } , ( A 2 ) M ( a ) A C , ( A 3 ) Y ( a , m ) M ( a ) C , ( A 4 )

where C is a set of covariates. The assumptions above state that given a covariate set C or { A , C } , there exist no unmeasured variables confounding the association between exposure A and outcome Y ( A 1 ), no unmeasured variables confounding the association between mediator M and outcome Y ( A 2 ), and no unmeasured variables confounding the association between exposure A and mediator M ( A 3 ) [8]. ( A 4 ) is a strong assumption and a few researchers published their works on this topic [4,7,23]. It could be interpreted as there exist no variables that are causal descendants of exposure A , and in the meantime, that confound the association between mediator M and outcome Y [4,17].

The analogs of ( A 1 )–( A 4 ) for a directed acyclic graph with two sequential mediators can be found by first considering M 1 and M 2 as a set [4]. Namely, we have four corresponding identification assumptions ( A 1 )–( A 4 ):

Y ( a , m 1 , m 2 ) A C , ( A 1 ) Y ( a , m 1 , m 2 ) { M 1 , M 2 } { A , C } , ( A 2 ) { M 1 ( a ) , M 2 ( a , m 1 ) } A C , ( A 3 ) Y ( a , m 1 , m 2 ) { M 1 ( a ) , M 2 ( a , m 1 ) } C . ( A 4 )

Similarly, the assumptions above state that given a covariate set C or { A , C } , there exist no unmeasured variables confounding the association between exposure A and outcome Y ( A 1 ), no unmeasured variables confounding the association between the mediator set { M 1 , M 2 } and outcome Y ( A 2 ), no unmeasured variables confounding the association between exposure A and the mediator set { M 1 , M 2 } ( A 3 ), and no unmeasured variables that are causal descendants of exposure A , and in the meantime, that confound the association between the mediator set { M 1 , M 2 } and outcome Y ( A 4 ) [4,22].

In order to account for the confounding between M 1 and M 2 , two more assumptions are required:

M 2 ( a , m 1 ) M 1 { A , C } , ( A 5 ) M 2 ( a , m 1 ) M 1 ( a ) C , ( A 6 )

where ( A 5 ) and ( A 6 ) state, respectively, that there exist no unmeasured variables confounding the association between M 1 and M 2 given { A , C } , and no unmeasured variables that are causal descendants of exposure A , and in the meantime, are confounding the association between M 1 and M 2 [4].

Steen et al. [4] presented comprehensive identification assumptions for the causal structures with multiple mediators and pointed out that weaker identification assumptions than ( A 1 )–( A 6 ) can be considered under certain decompositions.

5.2 Empirical formulas

Suppose a set of covariates C satisfies the assumptions on confounding for a decomposition. We can obtain the expected value of each component in the decomposition using the iterated conditional expectation rule. We focus on the scenario with two causally sequential mediators. Suppose M 1 and M 2 are categorical and let p a m 1 m 2 c = E [ Y A = a , M 1 = m 1 , M 2 = m 2 , C = c ] . The following formulas can be obtained:

E [ CDE ( m 1 , m 2 ) c ] = p a m 1 m 2 c p a m 1 m 2 c E [ INT ref- A M 1 ( m 1 , m 2 ) c ] = m 1 ( p a m 1 m 2 c p a m 1 m 2 c p a m 1 m 2 c + p a m 1 m 2 c ) × Pr ( M 1 = m 1 a , c ) E [ INT ref- A M 2 ( m 1 , m 2 ) c ] = m 2 m 1 [ ( p a m 1 m 2 c p a m 1 m 2 c ) × Pr ( M 2 = m 2 a , m 1 , c ) + ( p a m 1 m 2 c + p a m 1 m 2 c ) × Pr ( M 2 = m 2 a , m 1 , c ) ] × Pr ( M 1 = m 1 a , c )

E [ INT ref- A M 1 M 2 ( m 1 , m 2 ) c ] = m 2 m 1 [ ( p a m 1 m 2 c p a m 1 m 2 c p a m 1 m 2 c + p a m 1 m 2 c + p a m 1 m 2 c p a m 1 m 2 c ) × Pr ( M 2 = m 2 a , m 1 , c ) + ( p a m 1 m 2 c + p a m 1 m 2 c ) × Pr ( M 2 = m 2 a , m 1 , c ) ] × Pr ( M 1 = m 1 a , c ) E [ NatINT A M 1 c ] = m 2 m 1 ( p a m 1 m 2 c p a m 1 m 2 c ) × Pr ( M 2 = m 2 a , m 1 , c ) × [ Pr ( M 1 = m 1 a , c ) Pr ( M 1 = m 1 a , c ) ] E [ NatINT A M 2 c ] = m 2 m 1 ( p a m 1 m 2 c p a m 1 m 2 c ) × Pr ( M 1 = m 1 a , c ) × [ Pr ( M 2 = m 2 a , m 1 , c ) Pr ( M 2 = m 2 a , m 1 , c ) ] E [ NatINT A M 1 M 2 c ] = m 2 m 1 ( p a m 1 m 2 c p a m 1 m 2 c ) × [ Pr ( M 1 = m 1 a , c ) Pr ( M 1 = m 1 a , c ) ] × [ Pr ( M 2 = m 2 a , m 1 , c ) Pr ( M 2 = m 2 a , m 1 , c ) ] E [ NatINT M 1 M 2 c ] = m 2 m 1 p a m 1 m 2 c [ Pr ( M 1 = m 1 a , c ) Pr ( M 1 = m 1 a , c ) ] [ Pr ( M 2 = m 2 a , m 1 , c ) Pr ( M 2 = m 2 a , m 1 , c ) ] E [ PIE M 1 c ] = m 2 m 1 p a m 1 m 2 c Pr ( M 2 = m 2 a , m 1 , c ) × [ Pr ( M 1 = m 1 a , c ) Pr ( M 1 = m 1 a , c ) ] E [ SNIE M 2 c ] = m 2 m 1 p a m 1 m 2 c × Pr ( M 1 = m 1 a , c ) × [ Pr ( M 2 = m 2 a , m 1 , c ) Pr ( M 2 = m 2 a , m 1 , c ) ] .

When M 1 and M 2 are continuous, empirical formulas can be obtained by replacing the sums by integrations and the conditional probabilities by conditional densities.

5.3 Relations to linear models

Suppose Y , M 1 , and M 2 are continuous. For the scenario with two causally sequential mediators, we assume that the following regression models for Y , M 1 , and M 2 are specified:

E [ Y A , M 1 , M 2 , C ] = θ 0 + θ 1 A + θ 2 M 1 + θ 3 M 2 + θ 4 A M 1 + θ 5 A M 2 + θ 6 M 1 M 2 + θ 7 A M 1 M 2 + θ 8 C , E [ M 2 A , M 1 , C ] = β 0 + β 1 A + β 2 M 1 + β 3 A M 1 + β 4 C , E [ M 1 A , C ] = γ 0 + γ 1 A + γ 2 C ,

where C is a confounding set that satisfies the identification assumptions ( A 1 )–( A 6 ). The expected values of the effect components are as follows:

E [ CDE ( m 1 , m 2 ) c ] = ( θ 1 + θ 4 m 1 + θ 5 m 2 + θ 7 m 1 m 2 ) ( a a ) E [ INT ref- A M 1 ( m 1 , m 2 ) c ] = ( γ 0 + γ 1 a + γ 2 c m 1 ) ( θ 4 + θ 7 m 2 ) ( a a ) E [ INT ref- A M 2 ( m 1 , m 2 ) c ] = ( θ 5 + θ 7 m 1 ) ( β 0 + β 1 a + β 2 m 1 + β 3 a m 1 + β 4 c m 2 ) ( a a ) E [ INT ref- A M 1 M 2 ( m 1 , m 2 ) c ] = { θ 1 + θ 7 ( β 0 + β 1 a + β 4 c ) ( γ 0 + γ 1 a + γ 2 c ) + θ 5 ( β 2 + β 3 a ) ( γ 0 + γ 1 a + γ 2 c ) + θ 7 ( β 2 + β 3 a ) [ σ M 1 2 + ( γ 0 + γ 1 a + γ 2 c ) 2 ] ( θ 1 + θ 5 m 2 ) θ 7 m 2 ( γ 0 + γ 1 a + γ 2 c ) θ 5 ( β 2 m 1 + β 3 a m 1 m 2 ) θ 7 m 1 ( β 0 + β 1 a + β 2 m 1 + β 3 a m 1 + β 4 c m 2 ) } ( a a ) E [ NatINT A M 1 c ] = [ θ 4 γ 1 + θ 7 γ 1 ( β 0 + β 1 a + β 4 c ) + θ 5 γ 1 ( β 2 + β 3 a ) + 2 θ 7 γ 1 ( β 2 + β 3 a ) ( γ 0 + γ 2 c ) + θ 7 γ 1 2 ( β 2 + β 3 a ) ( a + a ) ] ( a a ) 2

E [ NatINT A M 2 c ] = [ θ 5 β 1 + θ 7 β 1 ( γ 0 + γ 1 a + γ 2 c ) + θ 5 β 3 ( γ 0 + γ 1 a + γ 2 c ) + θ 7 β 3 [ σ M 1 2 + ( γ 0 + γ 1 a + γ 2 c ) 2 ] ] ( a a ) 2 E [ NatINT A M 1 M 2 c ] = [ θ 7 β 1 γ 1 + θ 5 β 3 γ 1 + 2 θ 7 β 3 γ 1 ( γ 0 + γ 2 c ) + θ 7 β 3 γ 1 2 ( a + a ) ] ( a a ) 3 E [ NatINT M 1 M 2 c ] = [ β 1 γ 1 ( θ 6 + θ 7 a ) + β 3 γ 1 ( θ 3 + θ 5 a ) + 2 β 3 γ 1 ( θ 6 + θ 7 a ) ( γ 0 + γ 2 c ) + β 3 γ 1 2 ( θ 6 + θ 7 a ) ( a + a ) ] ( a a ) 2 E [ PIE M 1 c ] = [ γ 1 ( θ 2 + θ 4 a ) + γ 1 ( θ 6 + θ 7 a ) ( β 0 + β 1 a + β 4 c ) + γ 1 ( θ 3 + θ 5 a ) ( β 2 + β 3 a ) + 2 γ 1 ( θ 6 + θ 7 a ) ( β 2 + β 3 a ) ( γ 0 + γ 2 c ) + γ 1 2 ( θ 6 + θ 7 a ) ( β 2 + β 3 a ) ( a + a ) ] ( a a ) E [ SNIE M 2 c ] = [ β 1 ( θ 3 + θ 5 a ) + β 1 ( θ 6 + θ 7 a ) ( γ 0 + γ 1 a + γ 2 c ) + β 3 ( θ 3 + θ 5 a ) ( γ 0 + γ 1 a + γ 2 c ) + β 3 ( θ 6 + θ 7 a ) [ σ M 1 2 + ( γ 0 + γ 1 a + γ 2 c ) 2 ] ] ( a a ) E [ TE c ] = [ θ 1 + θ 5 ( β 0 + β 4 c ) + β 1 θ 3 + θ 4 ( γ 0 + γ 2 c ) + γ 1 θ 2 + θ 7 ( β 0 + β 4 c ) ( γ 0 + γ 2 c ) + β 1 θ 6 ( γ 0 + γ 2 c ) + γ 1 θ 6 ( β 0 + β 4 c ) + θ 5 β 2 ( γ 0 + γ 2 ) + θ 3 β 3 ( γ 0 + γ 2 c ) + θ 3 β 2 γ 1 + θ 7 β 2 σ M 1 2 + θ 6 β 3 σ M 1 2 + θ 7 β 2 ( γ 0 + γ 2 c ) 2 + θ 6 β 3 ( γ 0 + γ 2 c ) 2 + 2 γ 1 θ 6 β 2 ( γ 0 + γ 2 c ) ] ( a a ) + [ β 1 θ 5 + γ 1 θ 4 + β 1 θ 7 ( γ 0 + γ 2 c ) + γ 1 θ 7 ( β 0 + β 4 c ) + γ 1 β 1 θ 6 + θ 5 β 3 ( γ 0 + γ 2 c ) + θ 5 β 2 γ 1 + θ 3 β 3 γ 1 + θ 7 β 3 σ M 1 2 + θ 7 β 3 ( γ 0 + γ 2 c ) 2 + 2 γ 1 θ 7 β 2 ( γ 0 + γ 2 c ) + 2 γ 1 θ 6 β 3 ( γ 0 + γ 2 c ) + θ 6 β 2 γ 1 2 ] ( a 2 a 2 ) + [ γ 1 β 1 θ 7 + θ 5 β 3 γ 1 + 2 γ 1 θ 7 β 3 ( γ 0 + γ 2 c ) + θ 7 β 2 γ 1 2 + θ 6 β 3 γ 1 2 ] ( a 3 a 3 ) + θ 7 β 3 γ 1 2 ( a 4 a 4 ) ,

where σ M 1 2 denotes the constant variance of random error term for M 1 . A complete derivation for the aforementioned formulas are presented in Supplementary material S5.

For a scenario with two causally non-sequential mediators, again we assume that a set of covariates C satisfies the identification assumptions for the decomposition and that the following regression models for Y , M 1 , and M 2 are specified:

E [ Y A , M 1 , M 2 , C ] = θ 0 + θ 1 A + θ 2 M 1 + θ 3 M 2 + θ 4 A M 1 + θ 5 A M 2 + θ 6 M 1 M 2 + θ 7 A M 1 M 2 + θ 8 C , E [ M 2 A , C ] = β 0 + β 1 A + β 4 C , E [ M 1 A , C ] = γ 0 + γ 1 A + γ 2 C .

The results can be obtained as a special case of those derived from the scenario with two causally sequential mediators by setting parameters β 2 and β 3 to zero. Table 5 presents a side-by-side comparison of the expected value of six selected components in our proposed decomposition that are potentially different from the mediated effects in the study by Bellavia and Valeri [9]. Formulas are derived under linear structural equation models in a non-sequential two-mediator scenario with continuous outcome and mediators. Both decompositions have identical CDE and reference interaction effects. It was noted that the mediated effects in Bellavia and Valeri depend on two arbitrarily chosen values for M 1 ( a ) and M 2 ( a ) , respectively. For example, the expected value of MI effect between A and M 1 can be expressed as follows:

E [ INTmed A M 1 M 2 ( a ) = m 2 , c ] = ( θ 4 + θ 7 m 2 ) γ 1 ( a a ) 2 ,

where m 2 is an arbitrarily chosen value for M 2 ( a ) .

Table 5

Comparison of the mediated effects between Bellavia’s and Valeri’s method a and our proposed decomposition b in the formulas c under linear structural equation models in a non-sequential two-mediator scenario

Bellavia’s and Valeri’s method Our proposed decomposition
Component d,e Formula Component f Formula
E [ INTmed A M 1 m 2 , c ] ( θ 4 + θ 7 m 2 ) γ 1 ( a a ) 2 E [ NatINT A M 1 c ] [ θ 4 + θ 7 ( β 0 + β 1 a + β 4 c ) ] γ 1 ( a a ) 2
E [ INTmed A M 2 m 1 , c ] ( θ 5 + θ 7 m 1 ) β 1 ( a a ) 2 E [ NatINT A M 2 c ] [ θ 5 + θ 7 ( γ 0 + γ 1 a + γ 2 c ) ] β 1 ( a a ) 2
E [ INTmed A M 1 M 2 m 1 , m 2 , c ] [ β 1 ( γ 0 + γ 2 c m 1 ) + γ 1 ( β 0 + β 4 c m 2 ) + β 1 γ 1 ( a + a ) ] θ 7 ( a a ) 2 E [ NatINT A M 1 M 2 c ] θ 7 β 1 γ 1 ( a a ) 3
E [ PNIE M 1 M 2 m 1 , m 2 , c ] [ γ 1 ( β 0 + β 4 c ) + β 1 ( γ 0 + γ 2 c ) γ 1 m 2 β 1 m 1 + γ 1 β 1 ( a + a ) ] × ( θ 6 + θ 7 a ) ( a a ) E [ NatINT M 1 M 2 c ] β 1 γ 1 ( θ 6 + θ 7 a ) ( a a ) 2
E [ PNIE M 1 m 2 , c ] [ θ 2 + θ 4 a + ( θ 6 + θ 7 a ) m 2 ] γ 1 ( a a ) E [ PIE M 1 c ] [ θ 2 + θ 4 a + ( θ 6 + θ 7 a ) ( β 0 + β 1 a + β 4 c ) ] γ 1 ( a a )
E [ PNIE M 2 m 1 , c ] [ θ 3 + θ 5 a + ( θ 6 + θ 7 a ) m 1 ] β 1 ( a a ) E [ PIE M 2 c ] [ θ 3 + θ 5 a + ( θ 6 + θ 7 a ) ( γ 0 + γ 1 a + γ 2 c ) ] β 1 ( a a )

a The formulas in Bellavia’s and Valeri’s method are derived according to Web Table 2 in the study by Bellavia and Valeri [9].

b The formulas in our proposed decomposition are obtained by setting β 2 and β 3 to 0 in a sequential two-mediator scenario.

c All formulas under linear structural equation models are based on a continuous outcome Y and two continuous non-sequential mediators M 1 and M 2 . The structural equation models are as follows:

E [ Y A , M 1 , M 2 , C ] = θ 0 + θ 1 A + θ 2 M 1 + θ 3 M 2 + θ 4 A M 1 + θ 5 A M 2 + θ 6 M 1 M 2 + θ 7 A M 1 M 2 + θ 8 C , E [ M 2 A , C ] = β 0 + β 1 A + β 4 C , E [ M 1 A , C ] = γ 0 + γ 1 A + γ 2 C .

d The components in Bellavia’s and Valeri’s method are conditional on M 1 ( a ) = m 1 and/or M 2 ( a ) = m 2 . Only m 1 and/or m 2 are shown in Table 5 for simplicity.

e INTmed denotes MI effect; PNIE denotes pure NIE.

f NatINT denotes natural MI effect; PIE denotes pure indirect effect.

Compared to E [ INTmed A M 1 ] , the expected value of natural MI effect between A and M 1 is given as follows:

E [ NatINT A M 1 c ] = [ θ 4 + θ 7 ( β 0 + β 1 a + β 4 c ) ] γ 1 ( a a ) 2 .

The key difference is that E [ NatINT A M 1 ] does not assume any arbitrarily chosen value for M 2 ( a ) but uses the population averaged value of M 2 ( a ) in the linear model which is β 0 + β 1 a + β 4 c . Hence, E [ NatINT A M 1 ] provides a natural interpretation of the MI between A and M 1 .

6 Illustrations with simulated and real data

We use a simulated data set to compare our method to Bellavia’s and Valeri’s method [9] in a non-sequential two-mediator scenario. We also analyzed a real data set in a sequential two-mediator scenario using the formulas derived in Section 5.3 for illustration.

6.1 Illustration with a simulated data set in a non-sequential two-mediator scenario

To compare Bellavia’s and Valeri’s method and our proposed decomposition with two non-sequential mediators (Figure 3), we simulated n = 1,000 observations from the following linear structural equation models:

E [ Y A , M 1 , M 2 , C ] = 0.2 + 0.3 A + 0.3 M 1 + 0.4 M 2 + 0.01 A M 1 + 0.02 A M 2 + 0.6 M 1 M 2 + 0.7 A M 1 M 2 + 0.2 C , E [ M 2 A , C ] = 0.2 + 0.3 A + 0.2 C , E [ M 1 A , C ] = 0.2 + 0.3 A + 0.2 C ,

where the exposure A , mediators M 1 and M 2 , outcome Y , and covariate C are all continuous random variables.

The covariate C is the only confounder for the associations among A , M 1 , M 2 , and Y and was randomly drawn from N ( 0.2 , 0.5 ) , where 0.5 is the standard deviation. We randomly drew the exposure A from N ( 0.3 + 3 c , 0.5 ) , M 1 and M 2 from N ( 0.2 + 0.3 a + 0.2 c , 0.5 ) , and Y from N ( 0.2 + 0.3 a + 0.3 m 1 + 0.4 m 2 + 0.01 a m 1 + 0.02 a m 2 + 0.6 m 1 m 2 + 0.7 a m 1 m 2 + 0.2 c , 0.5 ) .

The treatment and reference level of A are a = 1 and a = 0 , respectively. The fixed reference levels of M 1 and M 2 , m 1 and m 2 , were set to 0 in calculating the CDE, reference interaction effects, and the mediated effects in Bellavia’s and Valeri’s method. We plugged in the maximum likelihood estimators for the coefficients and unbiased estimator for the constant variance into the regression-based formulas to obtain point estimates of the effects in the decompositions and used 100,000 bootstrap samples to obtain the 95% confidence intervals [24]. Table 6 shows the simulation results and interpretations of the identical components, including the CDE, reference interaction effects, PDE, and TE. Table 7 presents the simulation results of other decomposition components that are expected to be different.

Table 6

Simulation results a and corresponding interpretations b of identical components in Bellavia’s and Valeri’s method and our proposed decomposition

Component c True value Estimate 95% CI Interpretation
CDE ( 0 , 0 ) 0.3000 0.2891 0.2210, 0.3590 Due to neither mediation nor interaction with fixed reference levels m 1 = m 2 = 0
INT ref- A M 1 ( 0 , 0 ) 0.0024 0.0003 0.0082 , 0.0088 Due to the interaction between A and M 1 only with fixed reference levels m 1 = m 2 = 0
INT ref- A M 2 ( 0 , 0 ) 0.0048 0.0101 0.0029, 0.0181 Due to the interaction between A and M 2 only with fixed reference levels m 1 = m 2 = 0
INT ref- A M 1 M 2 ( 0 , 0 ) 0.0403 0.0332 0.0212, 0.0470 Due to the interaction between A , M 1 , and M 2 only with fixed reference levels m 1 = m 2 = 0
PDE 0.3475 0.3327 0.2647, 0.4012 The causal effect through the direct path A Y
TE 0.8707 0.8697 0.7841, 0.9561 The overall causal effect of A on Y

a The simulation results are calculated from the following structural equation models:

E [ Y A , M 1 , M 2 , C ] = 0.2 + 0.3 A + 0.3 M 1 + 0.4 M 2 + 0.01 A M 1 + 0.02 A M 2 + 0.6 M 1 M 2 + 0.7 A M 1 M 2 + 0.2 C , E [ M 2 A , C ] = 0.2 + 0.3 A + 0.2 C , E [ M 1 A , C ] = 0.2 + 0.3 A + 0.2 C .

b All effects are calculated from the contrast between a = 1 and a = 0 .

c CDE denotes controlled direct effect; INT ref denotes reference interaction effect; PDE denotes pure direct effect; TE denotes TE.

Table 7

Simulation results a of different components in Bellavia’s and Valeri’s method and our proposed decomposition

Bellavia’s and Valeri’s method Our proposed decomposition
Component b True value Estimate 95% CI Component c True value Estimate 95% CI
INTmed A M 1 0.0030 0.0004 0.0107 , 0.0116 NatINT A M 1 0.0534 0.0439 0.0260, 0.0634
INTmed A M 2 0.0060 0.0165 0.0048, 0.0283 NatINT A M 2 0.0564 0.0703 0.0499, 0.0922
INTmed A M 1 M 2 0.1638 0.1680 0.1474, 0.1887 NatINT A M 1 M 2 0.0630 0.0706 0.0521, 0.0911
PNIE M 1 M 2 0.1404 0.1286 0.1021, 0.1573 NatINT M 1 M 2 0.0540 0.0541 0.0378, 0.0734
PNIE M 1 0.0900 0.0902 0.0626, 0.1207 PIE M 1 0.1332 0.1236 0.0919, 0.1579
PNIE M 2 0.1200 0.1333 0.1011, 0.1688 PIE M 2 0.1632 0.1745 0.1353, 0.2174

a The simulation results are calculated from the following structural equation models:

E [ Y A , M 1 , M 2 , C ] = 0.2 + 0.3 A + 0.3 M 1 + 0.4 M 2 + 0.01 A M 1 + 0.02 A M 2 + 0.6 M 1 M 2 + 0.7 A M 1 M 2 + 0.2 C , E [ M 2 A , C ] = 0.2 + 0.3 A + 0.2 C , E [ M 1 A , C ] = 0.2 + 0.3 A + 0.2 C .

b INTmed denotes MI effect; PNIE denotes pure NIE.

c NatINT denotes natural MI effect; PIE denotes pure indirect effect.

Bellavia’s and Valeri’s method has a few drawbacks. First of all, the mediated effects in Bellavia’s and Valeri’s method vary with respect to the arbitrary choices of m 1 and m 2 . Second, the interpretations of the mediated effects in Bellavia and Valeri have to account for the choices of m 1 and m 2 , and therefore have a lack of generalizability (Table 8). At last, it is difficult to extend Bellavia’s and Valeri’s method into the scenarios with multiple sequential mediators by fixing the mediators at certain levels. For example, in a sequential two-mediator scenario (Figure 5), the direct causal link pointing from M 1 to M 2 would have to be removed by setting M 2 to a fixed value. Namely, the causal relationship between M 1 and M 2 in a sequential two-mediator scenario would be lost. In contrast, our proposed decomposition overcomes these disadvantages by allowing the mediators to naturally vary with respect to the exposure.

Table 8

Corresponding interpretations a for the simulation results of different components in Bellavia’s and Valeri’s method and our proposed decomposition

Bellavia’s and Valeri’s method Our proposed decomposition
Component b Interpretation Component c Interpretation
INTmed A M 1 Due to the mediation through M 1 and the interaction between A and M 1 assuming M 2 ( 0 ) = 0 NatINT A M 1 Due to the mediation through M 1 and the interaction between A and M 1 with M 2 ( 0 ) estimated from data
INTmed A M 2 Due to the mediation through M 2 and the interaction between A and M 2 assuming M 1 ( 0 ) = 0 NatINT A M 2 Due to the mediation through M 2 and the interaction between A and M 2 with M 1 ( 0 ) estimated from data
INTmed A M 1 M 2 Due to the mediation through both M 1 and M 2 and the interaction between A , M 1 , and M 2 assuming M 1 ( 0 ) = M 2 ( 0 ) = 0 NatINT A M 1 M 2 Due to the mediation through both M 1 and M 2 and the interaction between A , M 1 , and M 2 with M 1 ( 0 ) and M 2 ( 0 ) estimated from data
PNIE M 1 M 2 Due to the mediation through both M 1 and M 2 only assuming M 1 ( 0 ) = M 2 ( 0 ) = 0 NatINT M 1 M 2 Due to the mediation through both M 1 and M 2 only with M 1 ( 0 ) and M 2 ( 0 ) estimated from data
PNIE M 1 Due to the mediation through M 1 only assuming M 2 ( 0 ) = 0 PIE M 1 Due to the mediation through M 1 only with M 2 ( 0 ) estimated from data
PNIE M 2 Due to the mediation through M 2 only assuming M 1 ( 0 ) = 0 PIE M 2 Due to the mediation through M 2 only with M 1 ( 0 ) estimated from data

a All effects are calculated from the contrast between a = 1 and a = 0 .

b INTmed denotes MI effect; PNIE denotes pure NIE.

c NatINT denotes natural MI effect; PIE denotes pure indirect effect.

6.2 Illustration with real data in a sequential two-mediator scenario

6.2.1 Justification of the causal diagram

In our motivating example, we aim to examine the effect of alcohol consumption on hypertension, and the components of the TE that are due to the mediation or interaction with GGT and BMI. The hypothetical causal diagram with two sequential mediators is shown in Figure 12. We adopted the causal diagram from the study by Daniel et al. [3], and provided additional evidence from literature reports to support the causal diagram. While GGT is traditionally used as a biological marker for excessive alcohol consumption and liver function [25], it has been suggestive to be a robust marker for oxidative stress [26,27]. There is growing evidence that obesity, especially central obesity, may result in increased serum GGT levels [28,29]. Experimental and clinical studies have demonstrated the important role of GGT in antioxidant defense, detoxification, and inflammation processes [30]. There are a number of reports that have investigated the effects of GGT on the risk and prognosis of complex diseases such as cancer [31] and cardiovascular disease [32]. A study that has conducted a 12-week alcohol relapse prevention trial reported that participant with positive GGT ( 50 IU) had 10 mmHg greater SBP and 9 mmHg greater diastolic blood pressure (DBP) than those with negative GGT [33]. Mechanistic studies investigating the role of increases in GGT activity in predicting hypertension (commonly defined as SBP 140 mmHg or DBP 90 mmHg ) could be due to a connection with the increased level of arterial stiffness [34,35]. We acknowledge that the biological and pathological mechanisms involving the interactions among adiposity, ethanol, and GGT remain less understood. However, several epidemiological and clinical studies have investigated and reported the combined and interactive effects of excessive ethanol consumption and obesity on the biochemical variables. A study based on an analysis of 8,373 adults in the 2005–2008 National Health and Nutrition Examination Survey showed that the co-occurrence of obesity and patterns of alcohol use are significantly associated with elevated serum GGT [36]. Another study reported additive interaction effects between moderate drinking and obesity on serum GGT activities [37]. A longitudinal study investigating the relationship between serum GGT and risk of hypertension stratified by alcohol consumption status and BMI groups has reported a stronger association among current drinkers than that among non-drinkers [38]. In the same study of subgroup analysis by BMI groups, significant association between serum GGT and hypertension was only found among participants above the median of anthropometric measures (e.g., BMI > 26.4 ) [38]. These studies suggest potential complex two-way or even three-way interaction effects between BMI, alcohol consumption, and GGT on hypertension that warrant further investigation.

Figure 12 
                     Directed acyclic graph for the study on hazard of drinking alcohol, where alcohol drinking is used as the exposure, BMI and log-transformed GGT as the two sequential mediators, SBP as the outcome, and sex and age as two confounders.
Figure 12

Directed acyclic graph for the study on hazard of drinking alcohol, where alcohol drinking is used as the exposure, BMI and log-transformed GGT as the two sequential mediators, SBP as the outcome, and sex and age as two confounders.

To illustrate the concept of natural MI effect and the decomposition methods, we used the 2013–2014, 2015–2016, and 2017–2018 National Health and Nutrition Examination Survey data with 8,920 observations [3,39]. The data set was downloaded from http://www.cdc.gov/nhanes. Exposure A is alcohol drinking and treated as a binary random variable (never/moderate or heavy). As suggested by the Dietary Guidelines for Americans from US Department of Agriculture and US Department of Health and Human Services [40], we define heavy alcohol drinking as consuming 3 or more drinks in a day for males, and consuming 2 or more drinks in a day for females. In our causal diagram, the mediator BMI ( M 1 ) is measured in kg/m 2 , the mediator GGT ( M 2 ) is measured in U/L, and the outcome SBP ( Y ) is measured in mmHg. Sex (females or males) and age (measured in years) are considered a sufficient set satisfying the assumptions on confounding.

Log transformation was performed on GGT due to the skewness of the data. The fixed reference levels of M 1 and log ( M 2 ) were chosen to be the estimated means from data, where m 1 = 29.21 and log ( m 2 ) = 3.09 . Three linear models were fit for Y , log ( M 2 ) , and M 1 , which include all possible interactions among the exposure and mediators. The 95% confidence intervals were obtained by using a bootstrap method [24].

Table 9 presents the decomposition of the TE conditional on males and the mean level of age at 45.96. The CDE is 1.1014 (95% CI = 0.4900 to 1.7218); the reference interaction effect between A and M 1 is 0.0329 ( 0.0277 to 0.0963); the reference interaction effect between A and log ( M 2 ) is 0.0745 ( 0.0150 to 0.1706); the reference interaction effect between A , M 1 , and log ( M 2 ) is 0.0025 ( 0.1108 to 0.1151); the natural MI effect between A and M 1 is 0.0167 ( 0.0670 to 0.0305); the natural MI effect between A and log ( M 2 ) is 0.1307 ( 0.0383 to 0.3023); the natural MI effect between A , M 1 , and log ( M 2 ) is 0.0003 ( 0.0136 to 0.0143); the natural MI effect between M 1 and log ( M 2 ) is 0.0059 ( 0.0195 to 0.0050); the PDE is 1.2113 (0.6011 to 1.8326); the PIE through M 1 is 0.2137 (0.0927 to 0.3417); the seminatural indirect effect through log ( M 2 ) is 0.3952 (0.2581 to 0.5470); and the TE is 1.9287 (1.2874 to 2.5807). The results of the decomposition of the TE conditional on females and the mean level of age are shown in Table 10.

Table 9

Illustration with real data: decomposition of TE conditional on males and the mean age a

Component b Estimate 95% CI
CDE ( m 1 , log ( m 2 ) ) 1.1014 0.4900, 1.7218
INT ref- A M 1 ( m 1 , log ( m 2 ) ) 0.0329 0.0277 , 0.0963
INT ref- A log ( M 2 ) ( m 1 , log ( m 2 ) ) 0.0745 0.0150 , 0.1706
INT ref- A M 1 log ( M 2 ) ( m 1 , log ( m 2 ) ) 0.0025 0.1108 , 0.1151
NatINT A M 1 −0.0167 0.0670 , 0.0305
NatINT A log ( M 2 ) 0.1307 0.0383 , 0.3023
NatINT A M 1 log ( M 2 ) 0.0003 0.0136 , 0.0143
NatINT M 1 log ( M 2 ) −0.0059 0.0195 , 0.0050
PDE 1.2113 0.6011, 1.8326
PIE M 1 0.2137 0.0927, 0.3417
SNIE log ( M 2 ) 0.3952 0.2581, 0.5470
TE 1.9287 1.2874, 2.5807

a The exposure A is alcohol drinking; the mediator M 1 is BMI; the mediator M 2 is GGT; the outcome Y is SBP; the confounding covariate set contains sex and age.

b CDE denotes controlled direct effect; INT ref denotes reference interaction effect; NatINT denotes natural MI effect; PDE denotes pure direct effect; PIE denotes pure indirect effect; SNIE denotes seminatural indirect effect; TE denotes total effect.

Table 10

Illustration with real data: decomposition of TE conditional on females and the mean age a

Component b Estimate 95% CI
CDE ( m 1 , log ( m 2 ) ) 1.1014 0.4900, 1.7218
INT ref- A M 1 ( m 1 , log ( m 2 ) ) −0.0097 0.0426 , 0.0093
INT ref- A log ( M 2 ) ( m 1 , log ( m 2 ) ) −0.2218 0.4945 , 0.0458
INT ref- A M 1 log ( M 2 ) ( m 1 , log ( m 2 ) ) 0.0153 0.0971 , 0.1270
NatINT A M 1 −0.0195 0.0719 , 0.0290
NatINT A log ( M 2 ) 0.1312 0.0310 , 0.2968
NatINT A M 1 log ( M 2 ) 0.0003 0.0132 , 0.0139
NatINT M 1 log ( M 2 ) −0.0058 0.0190 , 0.0049
PDE 0.8853 0.2567, 1.5150
PIE M 1 0.2193 0.0949, 0.3512
SNIE log ( M 2 ) 0.3852 0.2527, 0.5319
TE 1.5960 0.9731, 2.2246

a The exposure A is alcohol drinking; the mediator M 1 is BMI; the mediator M 2 is GGT; the outcome Y is SBP; the confounding covariate set contains sex and age.

b CDE denotes controlled direct effect; INT ref denotes reference interaction effect; NatINT denotes natural MI effect; PDE denotes pure direct effect; PIE denotes pure indirect effect; SNIE denotes seminatural indirect effect; TE denotes total effect.

Overall, we observed a significant increase in SBP among heavy alcohol drinkers in both males (TE: 1.9287; 95% CI: 1.2874, 2.5807) and females (TE: 1.5960; 95% CI: 0.9731, 2.2246) compared to never/moderate drinkers. Detailed decomposition using our method showed that all three path effects (PDE, PIE M 1 and SNIE log ( M 2 ) ) significantly contribute to the TE. Among the natural MI effect components, we observed that the interaction effects between alcohol drinking and GGT have the highest magnitude in both females and males, although not statistically significant. The natural MI between alcohol drinking and GGT can be interpreted as the expected value of the product of the mediation effect through GGT and the additive interaction effects between heavy drinkers and the GGT levels, while the BMI is fixed at the potential value for never/moderate drinkers. Compared to never/moderate drinkers, heavy drinkers are associated with an average of 0.13 units higher SBP that is due to the MI effects between alcohol drinking and GGT. This suggests that the mediating and interactive mechanisms for alcohol drinking and GGT are likely operating in the same direction, which results in further increased SBP at the average population level in both females and males. We note that there are potential limitations of the real data analysis. First, we assume that the linear structural equation models are correctly specified. A bias would occur if the true relationships were non-linear. Second, observations with missing data were not considered in the analysis. Third, the data analysis is primarily for illustration purpose. Our data analysis may have limited power in detecting statistically significant reference or MI effects. However, it clearly demonstrates how to decompose the TE into different components. Results suggest that the detected significant TE may be driven by the components other than the interaction effects in this population. These results would also provide helpful information on developing targeted prevention strategies for hypertension. Finally, the causal interpretations in this example should be made with discretion because the identification assumptions on unmeasured confounding might be violated.

7 Conclusion

In this work, we develop decompositions for scenarios where the two mediators are causally sequential or non-sequential. We propose a unified approach for decomposing the TE into components that are due to mediation only, interaction only, both mediation and interaction, and neither mediation nor interaction within the counterfactual framework. The decomposition was implemented via a new concept called natural MI effect that we proposed to describe the two-way and three-way interactions for both scenarios that extend the two-way MIs in existing literature. To estimate the components of our proposed decompositions, we lay out the identification assumptions. We also derive the formulas when the response is assumed to be continuous with linear structural equation models. We use both simulated and real data sets to illustrate our method.

We believe that our proposed new concept of natural MI effects and the decomposition methods for the causal framework with two sequential or non-sequential mediators provide a powerful tool to decipher the refined path effects while appropriately account for interaction effects among the exposure and mediators. The counterfactual interaction effects evaluate the interaction terms that involve mediators by treating them at the natural levels. There is a gap in existing research of decomposing TE into mediation and interaction effects for the scenario of multiple sequential mediators, and our proposed methods have the potential to fill in the gap. Our future work will include developing decomposition methods for causal structures involving multiple sequential mediators and multiple exposures. We will also investigate the interventional analogue version of this decomposition and the corresponding interpretation of the effects in the future work.

  1. Funding information: This research was partially supported by UNM Comprehensive Cancer Center Support Grant NCI P30CA118100, the Biostatistics shared resource, UNM METALS Superfund Research Center (NIEHS 1P42ES025589), and Center for Native American Environmental Health Equity Research (NIMHD/NIEHS 9P50MD015706).

  2. Conflict of interest: Authors state no conflict of interest.

  3. Data availability statement: The R scripts for the simulation study and real data analysis are available at: https://github.com/flourish-727/data_analysis.

References

[1] VanderWeele TJ, Vansteelandt S. Mediation analysis with multiple mediators. Epidemiol Methods. 2014;2(1):95–115. 10.1515/em-2012-0010Search in Google Scholar PubMed PubMed Central

[2] VanderWeele TJ, Vansteelandt S, Robins JM. Effect decomposition in the presence of an exposure-induced mediator-outcome confounder. Epidemiology. 2014;25(2):300–6. 10.1097/EDE.0000000000000034Search in Google Scholar PubMed PubMed Central

[3] Daniel RM, De Stavola BL, Cousens SN, Vansteelandt S. Causal mediation analysis with multiple mediators. Biometrics. 2015;71(1):1–14. 10.1111/biom.12248Search in Google Scholar PubMed PubMed Central

[4] Steen J, Loeys T, Moerkerke B, Vansteelandt S. Flexible mediation analysis with multiple mediators. Am J Epidemiol. 2017;186(2):184–93. 10.1093/aje/kwx051Search in Google Scholar PubMed

[5] Mittinty MN, Lynch JW, Forbes AB, Gurrin LC. Effect decomposition through multiple causally nonordered mediators in the presence of exposure-induced mediator-outcome confounding. Stat Med. 2019;38(26):5085–102. 10.1002/sim.8352Search in Google Scholar PubMed

[6] VanderWeele TJ. A three-way decomposition of a total effect into direct, indirect, and interactive effects. Epidemiology. 2013;24(2):224–32. 10.1097/EDE.0b013e318281a64eSearch in Google Scholar PubMed PubMed Central

[7] VanderWeele TJ. A unification of mediation and interaction: a 4-way decomposition. Epidemiology. 2014;25(5):749–61. 10.1097/EDE.0000000000000121Search in Google Scholar PubMed PubMed Central

[8] VanderWeele TJ. Explanation in Causal Inference: Methods for Mediation and Interaction. New York: Oxford University Press; 2015. 10.1093/ije/dyw277Search in Google Scholar PubMed PubMed Central

[9] Bellavia A, Valeri L. Decomposition of the total effect in the presence of multiple mediators and interactions. Am J Epidemiol. 2018;187(6):1311–8. 10.1093/aje/kwx355Search in Google Scholar PubMed PubMed Central

[10] Taguri M, Featherstone J, Cheng J. Causal mediation analysis with multiple causally non-ordered mediators. Stat Methods Med Res. 2018;27(1):3–19. 10.1177/0962280215615899Search in Google Scholar PubMed PubMed Central

[11] Rothman KJ, Greenland S, Lash TL. Concepts of interaction. In: Modern epidemiology. Chapter 5, 3rd ed. Philadelphia, PA: Lippincott Williams and Wilkins; 2008. p. 71–84. 10.1093/oxfordjournals.aje.a113015Search in Google Scholar PubMed

[12] Hosmer DW, Lemeshow S. Confidence interval estimation of interaction. Epidemiology. 1992;3(5):452–56. 10.1097/00001648-199209000-00012Search in Google Scholar PubMed

[13] VanderWeele TJ, Tchetgen Tchetgen EJ. Mediation analysis with time varying exposures and mediators. J R Statist Soc B. 2017;79(3):917–38. 10.1111/rssb.12194Search in Google Scholar PubMed PubMed Central

[14] Daniel RM, De Stavola BL, Cousens SN. Gformula: Estimating causal effects in the presence of time-varying confounding or mediation using the g-computation formula. Stata J. 2011;11(4):479–517. 10.1177/1536867X1201100401Search in Google Scholar

[15] Avin C, Shpitser I, Pearl J. Identifiability of path-specific effects. In: Proceedings of the International Joint Conferences on Artificial Intelligence. Edinburgh, Schotland; 2005. p. 357–63. Search in Google Scholar

[16] Robins JM, Greenland S. Identifiability and exchangeability for direct and indirect effects. Epidemiology. 1992;3(2):143–55. 10.1097/00001648-199203000-00013Search in Google Scholar PubMed

[17] Pearl J. Direct and indirect effects. In: Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence. San Francisco, CA: Morgan Kaufmann Publishers Inc; 2001. p. 411–20. 10.1145/3501714.3501736Search in Google Scholar

[18] Robins JM. Semantics of causal DAG models and the identification of direct and indirect effects. In: Green JP, Hjort NL, Richardson S, eds. Highly structured stochastic systems. New York: Oxford University Press; 2003. p. 70–81. Search in Google Scholar

[19] Pearl J. Interpretation and identification of causal mediation. Psychol Methods. 2014;19(4):459–81. 10.1037/a0036434Search in Google Scholar PubMed

[20] Huber M. Identifying causal mechanisms in experiments (primarily) based on inverse probability weighting (Technical Report). St. Gallen, Switzerland: University of St. Gallen, Department of Economics; 2012. Search in Google Scholar

[21] VanderWeele TJ. Policy-relevant proportions for direct effects. Epidemiology. 2013;24(1):175–6. 10.1097/EDE.0b013e3182781410Search in Google Scholar PubMed PubMed Central

[22] VanderWeele TJ, Vansteelandt S. Conceptual issues concerning mediation, interventions and composition. Stat Its Interface. 2009;2(4):457–68. 10.4310/SII.2009.v2.n4.a7Search in Google Scholar

[23] Robins JM, Richardson TS. Alternative graphical causal models and the identification of direct effects. In: Shrout P, eds. Causality and psychopathology: finding the determinants of disorders and their cures. New York: Oxford University Press; 2010. 10.1093/oso/9780199754649.003.0011Search in Google Scholar

[24] Valeri L, VanderWeele TJ. Mediation analysis allowing for exposure-mediator interactions and causal interpretation: Theoretical assumptions and implementation with SAS and SPSS macros. Psychol Methods. 2013;18(2):137–50. 10.1037/a0031034Search in Google Scholar PubMed PubMed Central

[25] Conigrave KM, Davies P, Haber P, Whitfield JB. Traditional markers of excessive alcohol use. Addiction. 2003;98(Suppl 2):31–43. 10.1046/j.1359-6357.2003.00581.xSearch in Google Scholar PubMed

[26] Lim JS, Yang JH, Chun BY, Kam S, Jacobs, Jr. DR, Lee DH. Is serum gamma-glutamyltransferase inversely associated with serum antioxidants as a marker of oxidative stress? Free Radic Biol Med. 2004;37(7):1018–23. 10.1016/j.freeradbiomed.2004.06.032Search in Google Scholar PubMed

[27] Lee DH, Blomhoff R, Jacobs, Jr. DR Is serum gamma glutamyltransferase a marker of oxidative stress? Free Radic Res. 2004;38(6):535–9. 10.1080/10715760410001694026Search in Google Scholar PubMed

[28] Colicchio P, Tarantino G, delGenio F, Sorrentino P, Saldalamacchia G, Finelli C, et al. Non-alcoholic fatty liver disease in young adult severely obese non-diabetic patients in South Italy. Ann Nutr Metab. 2005;49(5):289–95. 10.1159/000087295Search in Google Scholar PubMed

[29] Daeppen JB, Smith TL, Schuckit MA. Influence of age and body mass index on gamma-glutamyltransferase activity: A 15-year follow-up evaluation in a community sample. Alcohol Clin Exp Res. 1998;22(4):941–4. 10.1097/00000374-199806000-00027Search in Google Scholar

[30] Zhang H, Forman HJ. Redox regulation of gamma-glutamyl transpeptidase. Am J Respir Cell Mol Biol. 2009;41(5):509–15. 10.1165/rcmb.2009-0169TRSearch in Google Scholar PubMed PubMed Central

[31] Fentiman IS. Gamma-glutamyl transferase: risk and prognosis of cancer. Br J Cancer. 2012;106(9):1467–8. 10.1038/bjc.2012.128Search in Google Scholar PubMed PubMed Central

[32] Jiang S, Jiang D, Tao Y. Role of gamma-glutamyltransferase in cardiovascular diseases. Exp Clin Cardiol. 2013;18(1):53–6. Search in Google Scholar

[33] Baros AM, Wright TM, Latham PK, Miller PM, Anton RF. Alcohol consumption, % CDT, GGT and blood pressure change during alcohol treatment. Alcohol Alcohol. 2008;43(2):192–7. 10.1093/alcalc/agm156Search in Google Scholar PubMed

[34] Song SH, Kwak IS, Kim YJ, Kim SJ, Lee SB, Lee DW. Can gamma-glutamyltransferase be an additional marker of arterial stiffness? Circ J. 2007;71(11):1715–20. 10.1253/circj.71.1715Search in Google Scholar PubMed

[35] Saijo Y, Utsugi M, Yoshioka E, Horikawa N, Sato T, Gong Y. The relationship of gamma-glutamyltransferase to C-reactive protein and arterial stiffness. Nutr Metab Cardiovasc Dis. 2008;18(3):211–9. 10.1016/j.numecd.2006.10.002Search in Google Scholar

[36] Tsai J, Ford ES, Zhao G, Li C, Greenlund KJ, Croft JB. Co-occurrence of obesity and patterns of alcohol use associated with elevated serum hepatic enzymes in US adults. J Behav Med. 2012;35(2):200–10. 10.1007/s10865-011-9353-5Search in Google Scholar

[37] Puukka K, Hietala J, Koivisto H, Anttila P, Bloigu R, Niemelä O. Additive effects of moderate drinking and obesity on serum gamma-glutamyl transferase activity. Am J Clin Nutr. 2006;83(6):1351–4. 10.1093/ajcn/83.6.1351Search in Google Scholar

[38] Stranges S, Trevisan M, Dorn JM, Dmochowski J, Donahue RP. Body fat distribution, liver enzymes, and risk of hypertension: evidence from the western New York study. Hypertension. 2005;46(5):1186–93. 10.1161/01.HYP.0000185688.81320.4dSearch in Google Scholar

[39] Leon DA, Saburova L, Tomkins S, Andreev E, Kiryanov N, McKee M, et al. Hazardous alcohol drinking and premature mortality in Russia: A population based case-control study. Lancet. 2007;369(9578):2001–9. 10.1016/S0140-6736(07)60941-6Search in Google Scholar

[40] U.S. Department of Agriculture and U.S. Department of Health and Human Services. 2020-2025 Dietary Guidelines for Americans. 9th edn, Washington, DC; 2020. Search in Google Scholar

Received: 2020-08-05
Revised: 2022-01-28
Accepted: 2022-02-04
Published Online: 2022-03-19

© 2022 Xin Gao et al., published by De Gruyter

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 28.4.2024 from https://www.degruyter.com/document/doi/10.1515/jci-2020-0017/html
Scroll to top button