Diphoton excess through dark mediators

Preliminary ATLAS and CMS results from the first 13 TeV LHC run have encountered an intriguing excess of events in the diphoton channel around the invariant mass of 750 GeV. We investigate a possibility that the current excess is due to a heavy resonance decaying to light metastable states, which in turn give displaced decays to very highly collimated e+e− pairs. Such decays may pass the photon selection criteria, and successfully mimic the diphoton events, especially at low counts. We investigate two classes of such models, characterized by the following underlying production and decay chains: gg → S → A′A′ → (e+e−)(e+e−) and qq¯→Z′→sa→e+e−e+e−\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ q\overline{q}\to {Z}^{\prime}\to sa\to\ \left({e}^{+}{e}^{-}\right)\left({e}^{+}{e}^{-}\right) $$\end{document}, where at the first step a heavy scalar, S, or vector, Z′, resonances are produced that decay to light metastable vectors, A′, or (pseudo-)scalars, s and a. Setting the parameters of the models to explain the existing excess, and taking the ATLAS detector geometry into account, we marginalize over the properties of heavy resonances in order to derive the expected lifetimes and couplings of metastable light resonances. We observe that in the case of A′, the suggested range of masses and mixing angles ϵ is within reach of several new-generation intensity frontier experiments.


Introduction
The start of the LHC run at 13 TeV center-of-mass energy has brought an unexpectedfrom the minimalist point of view -excess of events in the diphoton channel with the invariant mass of about 750 GeV [1,2]. In the Standard Model (SM) of particles and fields this energy is not associated with any known resonance, and may be the first sign for elusive New Physics (NP). The appearance of the "bump" in the diphoton spectrum, despite its rather limited statistical significance that may disappear or strengthen with more data, has generated a lot of excitement among physicists who wait for any manifestation of NP beyond SM (BSM) at the weak scale.
It is true that in most models of NP, the diphoton channel would not necessarily be the "discovery mode". That is, other manifestations of a (tenuous) 750 GeV resonance might have been expected first. Nevertheless, large classes of models where said resonance is produced from the fusion of the SM gauge bosons and/or quark-antiquark pairs with subsequent decay to the diphoton states have appeared in the literature, with most of them being tailored for the occasion. While the mass of a new resonance suggested by the CMS and (mostly) ATLAS data is to be around 750 GeV, its spin and parity remain open for discussion. Spin-zero and spin-two resonances come as the most natural candidates, while the spin-one resonance is disfavored by the so-called "Landau-Yang theorem" that forbids two photons in any state with the total angular momentum equal to one [3,4]. The couplings of the spin-zero resonances to photons or gluons cannot be expected to arise at JHEP07(2016)063 dimension four or lower operator level, and therefore it is reasonable to expect that the 750 GeV resonance is also coupled to the weak-scale particles, charged under the SM gauge groups. The loops of these particles (for example, vector-like fermions [5][6][7][8][9][10][11][12][13][14][15][16][17][18]) may have led to the effective couplings of the NP resonance to gauge bosons . If this picture is indeed valid, then more signatures of weak-scale NP are likely to come from future data.
While noting the significance of the excess is limited, it is reasonable to question every element of the existing anomaly. In particular, it is important to ask whether light BSM final states may be confused with the diphoton signal. A general framework for such a scenario has been already discussed in several publications [43][44][45][46]. A heavy resonance X produced by the gluon-gluon or quark-antiquark fusion may decay to a pair of light BSM states Y that have weak instability against subsequent decays to electron-positron pairs or photon pairs. We will call the Y states "dark mediators" (see e.g. refs. [47][48][49][50][51][52]). If the decay length of Y is commensurate with the linear geometry of the detector (e.g., of the inner tracker and eletro-magnetic calorimeter) and its mass is in the MeV-GeV range, then emergent highly collimated pairs of photons and/or electron-positron pairs may successfully mimic actual photons. Therefore, the interesting part of this scenario is that a new 750 GeV resonance opens the door to light weakly coupled states coupled to the SM sector, which is a particular realization of the "hidden valley" idea [53][54][55][56][57][58][59][60][61][62][63][64].
In recent years, dedicated searches of light weakly coupled states coupled to electrons, photons, muons and other light particles have become an important BSM direction at the intensity frontier [53,. They have resulted in significantly strengthened constraints on the mass-coupling parameter space of light NP particles. The purpose of this paper is to explore the consequences of the scenario where a 750 GeV resonance decays to dark mediators in terms of its implications for the intensity frontier searches. To that effect, we construct two explicit models, with heavy spin-zero and spin-one resonances, that decay to dark mediators. The parameters of the models are chosen to fit the current ATLAS excess of the diphoton events under the assumption that decaying mediators do indeed pass the selection criteria for photon identification. In the process, we make careful accounting for the ATLAS geometry and the distribution of dark mediators over the effective decay length. The end result is a suggested range for masses and couplings of dark mediators that falls largely within reach of the next generation of intensity frontier experiments (e.g. [72,73,82,89,[99][100][101][102]).
The paper is organized as follows. We first introduce the theoretical framework for the dark mediator explanation of the 750 GeV candidate resonance in section 2. We then calculate the strength of expected signal, evaluate the probability of light particle decays inside the relevant parts of ATLAS detector, and present favored parameter spaces for various models in section 3. Different experimental strategies that would allow differentiating diphoton from di-dark mediator events are discussed in section 4. We conclude in section 5.

750 GeV scalar resonance
In this sub-section, we consider a model of a heavy dark scalar (or pseudo-scalar) resonance S produced via gluon fusion that decays to the pair of two metastable "dark photon" JHEP07(2016)063 particles A . Each A gives displaced decays to e + e − pairs so that the whole chain can be represented as gg → S → A A → (e + e − )(e + e − ). (2.1) Here we explore a possibility that m S 750 GeV, but A is light, m A < O(few GeV). Because each dark photon carries a significant fraction of energy of the 750-GeV scalar, the e + e − pair from the decay of A are extremely collimated. The opening angle of e + e − pair is around 2m A /E A , where m A and E A are the mass and energy of A , respectively. For sub-GeV A s this angle is less than 0.01. Therefore it is plausible that events originating from the decay of A could pass the selection criteria for a real photon setting by e.g. the ATLAS collaboration.
Dark photon models have been studied extensively in the literature since 1980's [103,104]. In recent years, the attention to dark photons have been spearheaded by their possible connection to various particle physics and astrophysical "anomalies" (see e.g. [49,50,76,105]). The minimal dark photon model consists of a new massive vector field that couples to the SM U(1) via the so-called "kinetic mixing" operator, are field strengths of the U(1) D and U(1) Y gauge group respectively. The mass term in eq. (2.2) breaks the U(1) D explicitly but does not ruin the renormalizability. Y is the kinetic mixing parameter, which we will explicitly assume to be much smaller than one. It dictates the magnitude of the coupling of A to the SM sector. Even if the boundary conditions in the deep UV are such that Y (Λ U V ) = 0, a non-zero mixing can be mediated by loop processes with heavy particles charged under both U(1) groups [103]. In such a scenario, the choice Y 1 is justified due to the expected loop suppression. After electroweak symmetry breaking (EWSB), the SM gauge field B µ and W 3 µ mix with the new gauge field A µ . The resulting mass eigenstate Z couples to the SM electromagnetic and weak neutral currents. In the limit the mixing between A and the SM Z-boson is negligible, while the coupling between Z and SM fermions are given by where we introduce ≡ Y cos θ W . For more detailed discussions on the kinetic mixing, see appendix A. Finally, to avoid the proliferation of notations, we will call the physical Z particle as A , and refer to it as the dark photon. Our goal is to derive the acceptable range for masses and couplings in the proposed scenario. To achieve this, we need to specify the couplings of scalar S to gluons and dark photons beyond the effective dim = 5 operators. To that effect, we introduce a vector-like colored fermion, T , and a dark fermion, ψ, which is a singlet under the SM gauge group. The resulting Lagrangian reads where S is the 750 scalar resonance and A is the light on-shell dark photon that faking photons.
where f stands for a generic SM fermion. The covariant derivative here is where Q f and Q d are U(1) EM and U(1) D charges, respectively. e, g s and g d are U(1) EM , SU(3) c , and U(1) D gauge couplings, respectively. λ T and λ d are the Yukawa couplings of S to T and ψ fermions, respectively. Notice that one does not have to choose positive parity, and ST iγ 5 T pseudo-scalar couplings could also serve the same purpose. T and ψ fermion loops mediate the production and decay of S resonance, as shown in figure 1. Having formulated the model, we are now ready to evaluate the strength of the fake diphoton signal in it. We start from the master formula for the signal, where σ pp→S is the cross section for producing 750 GeV resonance S and Br S→A A is the branching ratio of this resonance decaying to two dark photons. P A A → (e + e − )(e + e − ) γγ is the probability that two dark photons decay to electron-positron pairs inside the detector (within appreciable distance), passing the selection criteria for the diphoton events, and successful reconstruction. It is the most complicated object, depending on factors such as the detector geometry, the detector acceptance, the reconstruction efficiency, as well as the decay length of A , and the mass of A that affects the size and the shape of the shower in the EM calorimeter. We will abbreviate P ( A A → (e + e − )(e + e − )| γγ) as P acc . The existing excess in the diphoton channel found by ATLAS [1] is at the level of σ Signal 5 − 10 fb, which corresponds to ∼ 16 to 32 events.
Production and decay of S in a U(1) D model. Data suggest that the total width of S is around 5-45 GeV, and therefore the narrow width approximation for S suffices for our accuracy. The production cross section of S through gluon fusion is given by where √ s = 13 TeV is the center of mass energy and f g (x, Q 2 ) is the gluon parton distribution function evaluated at Q 2 . We assume that the decay width of S → gg entering in (2. parameters to be scanned, we will adopt m T = 1 TeV throughout, which is safe relative to direct searches. Note that for such a massive particle in the loops, the form factor of the effective g − g − S vertex does not need to be taken into account. A very well known formula for the calculation of the width (e.g., see [106]) gives where τ T = 4m 2 T /m 2 S . In this expression, the invariant function f (τ ) is quite familiar from the Higgs physics literature, The cross section (2.7) can be further improved by taking into account NLO corrections.
With these expressions, we find that a fiducial value for the σ pp→S cross section at m T ∼ 1 TeV and λ T ∼ 1 to be around 40 fb. The branching ratio of S to dark photons directly follows from the three decay channels of the heavy scalar: S → gg, S →ψψ, and S → A A , If kinematically accessible, the decay of S to dark fermions ψ could be the largest: The S → A A decay is induced by the ψ loop and is given by where τ ψ = 4m 2 ψ /m 2 S . Note that in this expression we have taken m A to zero, as it is negligibly small compared to m S and m ψ .
The total width and branching ratios of the S-resonance are illustrated in figure 2. We have taken λ T to what we will consider its uppermost value, 4π, (which would imply a strongly interacting S − T sector). We can observe that if the decays to dark fermions ψ are allowed, one could easily achieve a width of the S resonance of ∼ 40 GeV. Rather large branching ratios to pairs of dark photons can be achieved for α d ∼ O(1). We note in passing that the hierarchy of mass scales, m ψ m A and large coupling constant λ d will create a variety of interesting effects for the dark matter phenomenology, should ψ remain stable on cosmological time scales (see e.g. [50,107]).
We now come to the most technically challenging part, the evaluation of A decays mimicking the diphoton signal, P acc . In a hypothetical limit of an infinite detector with 100% efficiency and 100% faking rate for a dark photon as a regular photon, this probability  is simply (Br A →e + e − ) 2 . The branching of dark photons to electrons is well-known [69], and is 100% below the dimuon threshold, while the A width is given by (2.13) At higher m A one has to include muon and hadronic decay channels, i.e., Γ A = Γ A →e + e − + Γ A →µ + µ − + Γ A →hadronic . In practice, of course, there are strict geometric requirements where the decays of the dark photons must occur so that they can be confused with a real photon. Obviously, a very important requirement is that both dark photons decay before or inside the first layer of the EM calorimeter, which depends rather sensitively on the decay length. Suppose that the parent S particle is produced almost at rest, and then decays into two dark photons, each of them carries energy around m S /2, where m S is the mass of the heavy scalar S. Then the decay length of the dark photon can be written as where β A is the velocity of A observed in the fixed laboratory frame and is the lifetime of A in its rest frame. Evidently, γ A 1 and β A is almost one. The decay length follows an approximate scaling

JHEP07(2016)063
with largest deviations of this scaling at m A ∼ m ρ . Below the dimuon threshold, we have the following useful expression, These numbers immediately tell us that currently allowed region of the dark photon parameter space can indeed be compatible with dark photons decaying within reasonable distance inside the LHC detectors so that they can be confused with real photons. If initial boost distributions of S particles, and angular dependences of its production and of detector geometry could have been neglected, then P acc would be determined by the relation between some relevant length scale of the detector, L det , and L A .
where P l<L det is the probability of a single photon to decay inside L det . This is of course a very crude formula that has to be carefully augmented for the detector geometry, boosts, and other factors, which we will attempt to do in section 3. We also note that should one of the dark photons decay outside the detector, this would mimic the mono-photon signal with the probability that scales as setting up the stage for an important constraint that would come from corresponding searches.
Variations on the dark photon model. In this subsection we would like to note that the dark photon model is not the only possibility for a weakly unstable light vector particles. Indeed, there are other UV complete choices based on anomaly-free symmetries, such as B − L, L L1 − L L2 (where L1 and L2 stand for different lepton flavors) etc. If we take, for example, a model with U(1)-gauged L e − L τ symmetry, then the main couplings of its gauge boson V to leptons are Here, g Le−Lτ is the U(1) Le−Lτ gauge coupling, so that the coupling to electrons is rescaled compared to the dark photon case as e → g Le−Lτ . In the entire mass range from a few MeV to 3.6 GeV the vector boson V decays to electrons and neutrinos, with equal probabilities so that Br V →e + e − = 0.5. Despite the fact that one can choose g Le−Lτ in the same range as e and thus adjust the decay length of V to be commensurate with L det , this model does not look as a good candidate to mimic the diphoton signal, for the following reasons. Firstly, g Le−Lτ is required to be very small, g Le−Lτ < 10 −2 , from the decay length requirements, which would correspond to a tiny α d . This in turn would require some additional model-building to generate an appreciable branching of S to V V states. Another reason is that this model will give a non-removable mono-photon signal due to the decay to neutrinos at a rate more than twice the diphoton signal, for any ratio of L det /L V . On account of these two difficulties, we will abandon further investigation of U(1) Le−Lτ models in connection to the 750 GeV resonance.

JHEP07(2016)063
Z 0 q q s a Figure 3. Feynman diagram for qq → Z → sa, where Z is the 750 GeV vector resonance and s(a) is the light on-shell scalar (pseudo-scalar) that faking photons.

750 GeV vector resonance
If light unstable particles can indeed fake real photons at the LHC, new possibilities for the spin of the 750 GeV resonance open up. In this section we will consider an option of dark mediators being scalar and pseudo-scalar, while the decayed 750 GeV resonance being a spin-1 vector boson. Notice that this is a novel possibility bypassing the Landau-Yang theorem (see e.g. earlier related discussion in ref. [108,109]).
The scenario of this section is based on the following sequence, where all new particles Z , s, a are assumed to be singlets under the SM gauge group. Scalar s and pseudo-scalar a can be combined in a complex scalar field that we assume is charged under some new U(1) D group with dark charge Q d = 1. The mass of a heavy dark Z boson is taken around 750 GeV. The coupling of Z to the SM can again proceed via the kinetic mixing operator. To avoid confusion with the case of the previous section, we will call the heavy boson Z (while the light one is A ). The Feynman diagrams for the process is shown as figure 3. The kinetic mixing operator will couple the Z to hypercharge of the SM particles (as opposed the electric charge in case of small vector mass). Since for the chosen m Z mass scale the coupling between Z and SM fermions are given by See appendix A for more details. The resulting effective Lagrangian reads

JHEP07(2016)063
and f L,R includes all left-handed/right-handed SM quarks and leptons. Q f , T 3 L,R and Y L,R represent their U(1) EM , SU(2) L , and U(1) Y charges respectively. e, g and g Z are electric coupling, weak coupling and the U(1) D gauge coupling, respectively. L dec is the most "delicate" part of the Lagrangian that is responsible for the decays of a and s particles. Notice that one cannot simply write down λ s sēe and λ a aēiγ 5 e operators at the fundamental level, as they would explicitly violate both the SM and U(1) D gauge invariances. Nevertheless these operators can be in fact generalized to the following gauge invariant structures of higher dimension: Since it is clear that displaced decays are only possible for λ S 1 and typically as small as 10 −4 while heavy m Z implies a large dark vev v d , the scale Λ Φ can be well above the LHC energy reach. We leave it at that, without trying to provide further UV completion to the effective operator (2.25). A further uncertainty in this approach arises from a possibility of nontrivial lepton flavor structure of (2.25). To avoid possible complications, we will assume that these couplings are flavor-diagonal, and will limit m s,a to be below the dimuon threshold.
Production and decay of Z . Going over to the production mechanism, we notice, of course, that Z does not couple to gluons, and have to be produced in qq fusion. Although the probability of finding (anti-)quarks inside the proton at high energy is smaller compared to that of gluons, the leading order contribution of this process is at tree-level and thus the cross section can be comparable to gluon-initiated but loop-suppressed processes. The production cross section of Z reads TeV is the center of mass energy and f q (x, Q 2 )(fq(x, Q 2 )) is the quark (anti-quark) parton distribution function evaluated at Q 2 . At the same time, the increase in parton luminosity between run I and run II for the production of the 750 GeV resonance is less pronounced for qq compared to gluons, by about a factor of order 3. The decay channels of Z are similar to those of the SM Z-boson but with an additional channel, JHEP07(2016)063 Z → sa available in this model. The decay width to the SM fermions is given by (2.28) where N c represents the number of colors of the SM fermions (f ) and Y L (Y R ) stands for the hypercharge of the left-handed (right-handed) SM fermions. The decay width of the "dark" sa channel is We take the limit m s,a m Z in the second equality. Figure 4 shows the total width of Z (green, solid) as well as its partial widths Γ(Z → ff ) (blue, dotted) and Γ(Z → sa) (red, dashed) for = 0.1 and m s = m a = 100 MeV. Γ(Z → ff ) does not vary with g Z since it only depends on while Γ(Z → sa) is proportional to g 2 Z and therefore grows with g Z . One can also see that for small g Z ∼ 0.01 the dominant decay branching ratio is from Z → ff and the total width of Z is also very small. However, for a large enough g Z ∼ 3, not only the dominant channel becomes Z → sa, but also the width of Z can reach ∼ 45 GeV due to Z → sa decays without any difficulty. Therefore in the following analysis we use g Z ∼ 3 as a representative point. Also notice that since the branching ratio of Z → sa is close to 1 at that point, the parameter g Z cancels in the branching ratio and has very small effect on subsequent considerations.  The decay lengths of s and a are as follows, where Γ s and Γ a are total widths of s and a, respectively. We only explore the region below the dimuon threshold so that one can have Br s→e + e − and Br a→e + e − of order one.

Geometry of LHC relevant for the diphoton signal
The ATLAS detector can be viewed as a series of ever-larger concentric cylinders around the beam line. From the inner region to the outer region, the main detector elements are silicon pixel and strip trackers, electromagnetic calorimeters (ECALs), hadron calorimeters (HCALs), and muon spectrometers. A 1/4 of the z view of the detector is demonstrated in figure 5. The inner detector tracking system is used to reconstruct primary vertices up to a radius in the transverse plane (r) less than 0.8 m [110]. Recently, ATLAS has upgraded the inner detector system and inserted another layer, the Insertable B-Layer (IBL) [111], near the beam-pipe with 0.03 m < r < 0.04 m to enhance the tracking ability and overcome the increased pileup at the LHC run-II. Therefore we define the fiducial volume of the inner detector to be in the region 0.03 m < r < 0.8 m. A photon passing through the fiducial volume of the inner detector can convert into an electron-positron pair, which leaves tracks in the fiducial volume. As a result, such photons are classified as converted photon candidates by the ATLAS collaboration.
The ECAL (as well as HCAL) is composed of a barrel and two endcaps. The ATLAS ECAL is a lead-liqid argon sampling calorimeter. The relevant geometrical parameters of the ECAL components are summarized in figure 5 and table 1 [112]. The ECAL consists of three layers, starting at r = 1.5 m. A photon is categorized as an unconverted photon candidate if it converts inside the region between 0.8 m and 1.5 m, consisting of the final part of the tracking system and a gap between the inner tracker and the first layer of the ECAL, since it does not leave any reconstructible tracks. In summary, the fiducial volumes of the event reconstruction for the converted and unconverted photons are 0.03 m< r < 0.8 m and 0.8 m< r < 1.6 m, respectively. Note that if the second layer of the ECAL is also included, the fiducial volume of the unconverted photons is 0.8 m< r < 1.93 m.
The inner detectors of ATLAS and CMS are very similar in geometrical coverage. As shown in the CMS TDR [113], the innermost tracker layer starts at r ∼ 44 mm. The fiducial region of the calorimeter ends at 1.79 m. These numbers are not too different from those of ATLAS (r = 31 mm to 1.59 m, respectively in our paper). The slight difference in the significance between the two collaborations may be due to the fact that CMS has around 20% less data compared to that of ATLAS. In addition, the angular resolution of the EM calorimeter at ATLAS might be better in distinguishing the collimated e + e − from a single JHEP07(2016)063 photon. As a result, one could expect a potentially smaller excess at CMS. However, the slight discrepancy between ATLAS and CMS could be just statistical fluctuations. More data is required to make a conclusive statement.

Displaced dark mediator decay signal
In order to obtain a more realistic evaluation of P acc than the one given in eq. (2.17), we need to take into account the distribution of the initial momentum of heavy resonances (S or Z ) affecting the boosts of emerging light particles, which in turns translates into a distribution of the decay lengths L A or L s,a .
Different production mechanisms for S and Z suggest differences in their boost factors. The scalar S is produced through gluon fusion, which means that the initial states are similarly distributed. On the other hand, in our second example, Z is produced through qq initial states, which is asymmetric because it is more probable to find a quark than an anti-quark in a proton due to differences in their parton distribution functions. As a consequence it is more likely that Z will have more of a longitudinal boost compared to S, while for the latter we find that the production-near-rest picture largely holds.
Suppose that the distribution of a heavy resonance initial velocities, or boosts, is given by f (β). The function satisfies normalization condition We simulate f (β) using standard MC tools in practice. Furthermore, given the geometry of the detector is cylindrical, and that all decays of light particles to collimated e + e − pairs within radial segments (distance from the origin) r min (θ) < r < r max (θ) pass the photon selection criteria, P acc is proportional to where P fid is the probability for dark mediators decaying inside the fiducial regions. P fid can be expressed as with are the decay lengths of the dark mediators 1 and 2 in the laboratory frame (denoted with subscript "L").  Table 2. Probabilities of dark photon decays inside the ATLAS detector, P fid , for the 750 GeV scalar resonance scenario. Various decay length L d in meter and fiducial regions are considered.
Events with at least one of the decays occurring inside the tracker volume are categorized as "Converted". The "Unconverted 1" category includes events where both dark mediators decay inside the remaining part of the fiducial volume (gap region and the first layer of the ECAL). Similarly, the "Converted 1+2" and "Unconverted 1+2" categories are the generalization of the Converted and Unconverted 1 categories by including the second layer of the ECAL into the fiducial volume of the event reconstruction.
in the laboratory frame. r i,min and r i,max are lower and upper boundaries of r i of the fiducial volume, which both are functions of θ. θ 1,min and θ 1,max represent lower and upper boundaries of θ 1 of the fiducial region. Note that θ 1 and θ 2 are not independent. cos θ 2 can be expressed in terms of cos θ 1 and β where β is the velocity of the parent particle (Z or S) after the production with a boost factor γ ≡ 1/ 1 − β 2 . We refer the readers to appendix B for the more detailed derivation of the decay probability including the boost effect. Given the geometry of the detector and the probability (B.14), we can calculate P fid for both 750 GeV scalar and vector resonance scenarios. We give the results for P fid for the 750 GeV scalar resonance scenario in table 2 as an example. P fid a function of decay length L A . One can observe that as the decay length grows, the P fid drops precipitously.
From eq. (B.14) to obtain the final P acc , we still need to multiple the right hand side by the acceptance rate and diphoton reconstruction efficiency, i.e., where γ = 95% is the reconstruction efficiency for a single photon [1]. The selection cuts on |η| has already been considered in the calculation P fid . The rest selection cuts in [1] are as follows: We use Monte-Carlo simulation to implement above cuts and obtain the acceptance A. The resulting acceptance A (after |η| cuts) is 68% (84%) for 750 GeV scalar (vector) resonance scenario. Substituting the acceptance and efficiency back to eq. (3.7), we get P acc that consequently yields σ Signal through eq. (2.6).

Preferred region of light particle parameter space
In this subsection we perform a "fusion" of all different components of our calculation in order to derive the allowed parameter space for light particles. Our strategy is to be conservative, which means we should allow the largest possible variations in the properties of the 750 GeV resonance. To that effect, we take the largest possible range for the coupling that regulates the production of S through the gluon fusion, 0 ≤ λ T ≤ 4π. The upper boundary would correspond to the largest production cross section, and therefore admits the lowest possible P acc . At this point we will also assume that every electron-positron decay of light particles is going to pass the photon selection criteria. Violation of this assumption in practice is possible for higher A masses, which would reduce the region of interest on the − m A parameter space.
A fixed minimum value for the P acc has, of course, two solutions in terms of L A . If the decay length is too short, all the decays will happen inside or close to the beam pipe, while if the decay length is too large, only a small finite number of A pairs would decay in or before the ECAL. For the dark photon model, we obtain the allowed region that would be consistent with our scenario for the 750 GeV resonance. The preferred part of the dark photon parameter space is shown in figure 6 with m T = 1 TeV, m ψ = 300 GeV, λ d = 2 while λ T and α d are varied. In figure 7, the same parameters as those of figure 6 are used except for m ψ = 600 GeV, which corresponds to a narrow width as shown in the left panel of figure 2. (Notice that the choices of m ψ and α d fit the reported widths of a possible 750 GeV resonance with m ψ = 300 and 600 corresponding to wide and narrow widths, respectively.) The yellow shaded region is favored by the 750 GeV resonance. One can also see that the allowed parameter space has a band structure, which follows from the L A ∝ ( m A ) −2 scaling. Wiggles, deviations and a dip near 1 GeV occurs due to the enhancement of hadronic decays of A and the reduction of Br e + e − . In the left panel of the plot λ T is set to its maximum value while α d is varied, while on the right panel α d = 1 and λ T is scanned. We observe that as the couplings diminish so does the allowed part of the parameter space. However, some allowed parameter space still exists for λ T ∼ O(1) or α d ∼ O(0.1). It is also worth mentioning that above m A = 2m µ there is an appreciable branching to muons, so that one should expect "fake photon" and muon pair, or two muon pair events appearing in the same model that should reconstruct to the same invariant mass.
On the whole, one can see that intensity frontier searches cannot fully exclude the suggested region of the model parameter space. It is easy to understand why: in the adopted LHC scenario, A particles have relatively small mixing angles ∼ O(10 −4 ), which for most fixed target searches would not lead to detectable displaced decays. At the same time, it is too small a coupling to be currently ruled out by the search for "bumps" in the e + e − spectrum. We also include the exclusion region imposed by the ATLAS monophoton constraints [114]. This constraint comes from the situation when one A decays before or inside the ECAL faking a photon, while the second A completely escapes the detector before decaying. The current limit on the cross section is 6.1 (5.3) fb at 95% C.L. This constraint will be relevant for the longer L A , and this is seen in figures 6 and figure 7 with the gray band being parallel but below the yellow one. It is worth mentioning   [114]. It excludes part of parameter space for α d = 1, λ T = 4π that we marked as purple-gray lines. Nevertheless, the mono-photon search does not further exclude preferred parameter space for smaller α d and λ T values listed in the plot. In the plot, we also include current constraints  and future prospects [99,100,102,115,116] on the 2 versus m A plane for dark photons that decay directly to SM particles (see also e.g. [101]).  In addition, with more data collected the mono-photon search should be able to provide a stronger constraint at the LHC run-II.
Next we present the result of the Z model in the λ 2 S −m s/a parameter space in figure 8 with various contours corresponding to = 0.05, 0.1 and 0.2. The left and right panels correspond to wide and narrow widths with g Z = 3 and 0.3, respectively. As illustrated in figure 2 BR(Z → sa) grows with the total width, and therefore the regions enclosed by a contours in the left panel increase for fixed values of compared to those in the right panel because of the larger branching fraction. The yellow shaded region are favored by the 750 GeV Z resonance while the gray shaded region is excluded by the mono-photon searches with = 0.2. For m s/a = 0.1 GeV one can have λ 2 S between 10 −10 and 10 −8 . The allowed range of in the yellow shaded region is 0.02 0.2. The lower limit is to ensure having enough production cross section while the upper limit comes from the measurement of the Z-boson mass and width [79] or the oblique parameter Y [117,118]. 1 Notice that the model with < 0.1 and g d g 1 is very difficult to constrain via "conventional" qq → Z → µ + µ − searches due to a small branching ratio for the Z decay to SM particles, which leads to 4 scaling of the signal.

Potential methods to exclude models with dark mediators
So far, there is only a limited amount of data available. However, with more data it is likely that one can statistically discriminate between real photon events and decays of dark mediators. While the properties of the photons are of course fully specified by QED and atomic physics, the main input parameters for the dark mediator decays will be its energy, mass and the decay length (such as E A , m A and L A as in the dark photon example). Below we outline important differences between the conversions of real photons and decays of dark mediators.
1. Affinity of conversions to the material inside the detector. Photons convert to pairs in the field of the nucleus, and therefore the distribution of conversion points roughly follows the number density of atoms weighted with the square of the atomic number, Z 2 n A . The dark mediators, on the other hand, can decay anywhere in the detector, including hollow parts. The distribution of vertices for the converted photon events should provide a useful discrimination.
2. Events beyond the first layer of the ECAL. The decays of dark mediators can occur in the ECAL beyond the first layer of the calorimeter, which would correspond to an unusual penetration depth for a regular photon. In fairness, the probability of decay within the second or third layer of the calorimeter is not very large for the models considered, and more data is needed for this criterion to become useful. But even with current statistics, the searches of "late converting" photons in association with regular photons is of interest and should be pursued.
3. Distribution of converted vs unconverted events. Exponential dependence on the distance travelled, for a short decay length L d , will always enhance the fraction of fake unconverted events. That is most dark mediators at short L d will decay before reaching the ECAL. Therefore this can be a useful criterion for part of the parameter space.

4.
Energy distribution of electron-positron pairs. It is well known that the electronpositron pairs created by Bethe-Heitler process (regular conversion) have an appreciable fraction of events with asymmetric energy distribution (E e + E e − or E e − E e + ) whereas a vast fraction of dark mediator decays has E e + ∼ E e − . This fact is well appreciated in the direct dark photon searches. An abnormally low fraction of asymmetric pairs could be a signature of dark mediators. 5. Shape and point of origin for the shower. Unconverted photons may have a small but non-zero penetration depth inside the first layer of the ECAL, while dark photons decaying in the gap between the tracker and the ECAL enter the calorimeter as pairs, and thus shower immediately. This will affect the shape of the shower, its starting point, and possibly the energy reconstructed from the standard procedures.

JHEP07(2016)063
6. Abnormal separation of electron-positron pair. In this paper we have avoided the discussion of the drop in efficiency for converted and/or unconverted photons when the mass of the dark mediator become large. When a dark mediator such as A decays to the electron-positron pair, each electron receives a perpendicular momentum p ⊥ ∼ m A /E A . After some distance travelled, this may lead to an abnormally large separation of electrons and positrons, compared to a similar behavior of a regular conversion pair, when they cross a layer of the pixel detector and/or reach the ECAL.
Detailed implementation of this criterion should determine the maximum mass for a dark mediator capable of faking a photon.
We believe that the possibility of dark mediators mimicking real photons deserves a closer look by the experimental collaborations. A few items outlined above may serve as a basis for developing a statistical procedure that would emphasize or suppress fake photons vs real photons and vice versa. It is also worth mentioning that due to the difference in the linear sizes of the ATLAS and CMS detectors (hence a different sensitivity to L d ), there can be an additional discriminating power in a combined treatment. Also, it may be that dark mediator decays create a large number of events that are neglected for one or many of the above reasons. Therefore a closer look in a sample with loosened criteria for photon identification may also contribute to constraining or validating dark mediator models.

Conclusion
In this paper we have considered the exotic possibility that metastable BSM particles of low mass could be produced as a result of a heavy resonance decay. Being weakly unstable, these particles decay to electron-positron pairs that may in fact resemble the conversion pairs originating from a regular photon. The prime candidates for such metastable particles are dark photons, as well as light scalars and pseudoscalars, which all have small branchings to neutrinos and therefore do not generate a large missing transverse momentum signal. We have examined both possibilities, without imposing very restrictive assumptions on the properties of the 750 GeV resonance. We have found that the parameter space for light particles (e.g. the dark photon models prefer a somewhat wide range of parameters along the m A /(100 MeV)×( /10 −4 ) ∼ O(1) line) that emerges from this analysis is not excluded by the current limits. However, a number of new proposals at different stages of maturity exists [99,100,102,115,116] which will eventually probe deep inside the region of interest.
In the models we consider, the mono-photon searches provide an important constraint. Also, should the light particles be able to decay to muons, a search of two collimated muons plus a "fake photon" reconstructing to the same invariant mass is a promising search channel.
Irrespective of the future status of the 750 GeV resonance, it seems important for the experimental collaborators to build statistical discriminators that would allow (given enough data) to distinguish between regular SM photon events and would-be-photon dark mediator decays. We have provided a discussion of some avenues along which this problem might be addressed.

JHEP07(2016)063
Note added. As this paper was being finalized, we became aware of recent preprints [119,120] that also have detailed discussions on the possibility that the diphoton resonance may also be explained by dark mediators mimicking real photon signals, and thus have overlap with some of the discussions in our paper.

Acknowledgments
We would like to thank A. Soffer and Y. Zhang for helpful discussions. Y.Z. would like to thank R. Essig for the discussion of Z models. The work of C.-Y.C, M.L. and M.P. is supported by NSERC, Canada. Research at the Perimeter Institute is supported in part by the Government of Canada through NSERC and by the Province of Ontario through MEDT. Y.Z. is supported through DoE grant DESC0008061.

A Kinetic mixing
The relevant Lagrangian for the kinetic mixing includes three parts: the abelian gauge part, L gauge , the vector boson mass part, L mass , and the interactions between gauge bosons and SM fermions, L int . The abelian gauge part is given by Here we use sin Y as the kinetic mixing coefficient to simplify later expressions. Note that lim Y →0 sin Y = Y . The mixing term F B is removed by field redefinition where the fields with tilde are the redefined fields. We follow the standard symmetry breaking conventions for the field B. The SM SU(2) × U(1) covariant derivative can be written as and rotated into where θ W is the weak mixing angle, g and g are SM SU(2) L and U(1) Y gauge couplings.

JHEP07(2016)063
Above we begin to use a short-handed notation, i.e., c, s, t stands for sin, cos, tan respectively. The subscript stands for the function variables and W stands for θ W . The relevant mass part of the Lagrangian is In the last step of the above equation, the Lagrangian is expressed in the redefined fields. However, the mass matrix is still non-diagonal. We introduce one more rotation to eliminate the mixing, We use the bar to represent fields in their physical (mass) basis. The eigenstates of the mass matrix in (A.8) yield m 2 Z and m 2 Z as and m Z 0 is the SM Z-boson mass before kinetic mixing. The appearance of the sign function is because we intend to assign m Z = m Z 0 when the mixing coefficient vanishes. Comparing the mass matrix before and after the diagonalization, we found

JHEP07(2016)063
We have adopted multiple matrix transformations so far. The transfer matrix from the original gauge basis to the final physical basis is (A. 16) The interaction part between the gauge bosons and the SM fermions can be written as where e is the SM U(1) EM gauge coupling, and J EM and J Z 0 are the SM EM and neutral current, respectively. A and Z 0 can be expressed in the physical basis as Therefore the interaction in the physical basis is From the above expression, we read off the relevant part for the Z f f interaction as Similarly, the Zf f interaction is modified to In the limit m A m Z 0 and Y 1, we have Above we introduce the "usual" kinetic mixing parameter ≡ Y cos θ W . In the other limit m A m Z 0 and Y 1, we obtain Note that the couplings between Z and fermions are independent of the Z mass in both limits.

JHEP07(2016)063 B Decay probability with boost effect
In this section, we derive the decay probability of dark mediators in the laboratory frame. We first start with the decay probability in the rest frame of the parent particle S which can be written as P r = dW 1 L r,1 e −r 1 /L r,1 1 L r,2 e −r 2 /L r,2 dr 1 dr 2 , (B.1) with dW = 1 4πp 2 0 d 3 p r,1 δ(| p r,1 | − p 0 ). (B. 2) The normalization factor in eq. (B.2) is chosen such that W = 1 after carrying out the integration. p 0 m S /2 is the magnitude of the momentum of the daughter particles and the subscript "r" indicates that the observable is in the rest frame of the parent particle. r i is the radial coordinate of the dark mediator i. We neglect the mass of the daughter particles since they are much lighter than the parent particle. The delta function is used to impose the on-shell condition. Note that the momenta of the two daughter particles are related p r,1 = − p r,2 so the delta function requires both daughter particles to be on-shell. r i and L r,i are radial coordinate and the decay length of the daughter particle i (i = 1, 2) in the rest frame of the parent particle. We also assume that the boost is only along the beam-pipe, i.e. the z direction. Using a Lorentz transformation one can express the z component of the momentum of the daughter particle in the rest frame in terms of the observables in the laboratory frame where p z L and E L are the z component momentum and the energy of the daughter particle in the laboratory frame, respectively. The subscript L indicates that the observable is in the laboratory frame. θ represents the polar angle of the daughter particles in the laboratory frame. We have used p z L = p L cos θ and E L p L in the last step. β is the relative velocity between the rest frame of the parent particle and the laboratory frame. The boost factor γ = 1/ 1 − β 2 . Similarly we obtain the following equations dp z r = γ(1 − β p z L E L )dp z L γ(1 − β cos θ)dp z L , (B.4) | p r | = p 2 ⊥ + γ 2 (p z L − βE L ) 2 p L sin 2 θ + γ 2 (cos θ − β) 2 , (B.5) where p ⊥ = p L sin θ is the momentum of the daughter particle in the transverse plane.

JHEP07(2016)063
Likewise, dW in the laboratory frame is as follows dW = d cos θ 2 dp L δ p L − p 0 sin 2 θ + γ 2 (cos θ − β) 2 ×γ(1 − β cos θ)(sin 2 θ + γ 2 (cos θ − β) 2 ) −3/2 . (B.7) Furthermore, based on the momentum conservation we know that there are relations between daughter particles 1 and 2. where We have used the delta function in the last step of eq. (B.14). Note that p 0 τ A /m A is the decay length of the daughter particles in the rest frame of the parent particle. r i,min and r i,max are lower and upper boundaries of r i of the fiducial volume. θ 1,min and θ 1,max are lower and upper boundaries of θ 1 of the fiducial region. f (β) is the normalized velocity distribution of the parent particle, i.e.

JHEP07(2016)063
Open Access. This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.