A mass reconstruction technique for a heavy resonance decaying to $\tau^+\tau^-$

For a resonance decaying to $\tau^+\tau^-$, it is difficult to reconstruct its mass accurately because of the presence of neutrinos in the decay products of the $\tau$ leptons. If the resonance is heavy enough, we show that its mass can be well determined from the momentum component of the $\tau$ decay products perpendicular to the velocity of the $\tau$ lepton, $p_{\perp}$, and the mass of the visible/invisible decay products, $m_{vis/inv}$, for $\tau$ decaying to hadrons/leptons. All kinematically allowed values of $p_{\perp}$ and $m_{vis/inv}$ are sampled according to their joint probability distributions determined by the MC simulations, and the mass of the mother resonance is assumed to lie at the position with the maximal probability. Since $p_{\perp}$ and $m_{vis/inv}$ are invariant under the boost in the $\tau$ lepton direction, the joint probability distributions are independent of the $\tau$'s origin. Thus this technique is able to determine the mass of an unknown resonance with no efficiency loss. It is tested using the MC simulations of the physics processes $pp \to Z/h(125)/h(750)+X\to \tau\tau+X$ at 13~TeV. The ratio of the full width at half maximum to the peak value of the reconstructed mass distribution is found to be 20\%-40\% using the information of missing transverse energy.


Introduction
Following the discovery of the Standard Model (SM) Higgs boson [1,2], it is of great interest to search for new high-mass particles, such as additional Higgs bosons [3-6]. Recently, in the γγ mass spectrum, a resonance around the mass 750 GeV/$c^2$ was seen with a significance of about 3σ [7,8]. In the SM and many beyond-SM theories, the coupling of the Higgs boson to a fermion is proportional to the mass of the fermion. For a neutral Higgs boson, $h^0$, the branching fraction of $h^0 \to \tau^+\tau^-$ would therefore be dominant compared to those of the other leptons, $e$ and $\mu$. Experimentally, the presence of neutrinos makes it challenging to reconstruct the invariant mass $M(\tau\tau)$ accurately [9,10]. Accordingly, the performance of the mass reconstruction technique affects the sensitivity of probing new physics.
The τ lepton has two types of decay modes. One is the decay to a charged lepton plus two neutrinos, $\tau \to l + \nu_l + \nu_\tau$, $l = e, \mu$ (denoted by $\tau_l$); the other is the decay to hadron(s) plus one neutrino, $\tau \to \mathrm{hadrons} + \nu_\tau$ (denoted by $\tau_h$). Experimentally, the visible decay product is the charged lepton ($e$, $\mu$) or the hadron(s), while the invisible decay product is the neutrino(s). For the charged leptons, the momentum can be measured well by the tracking system. The hadronic decay products are reconstructed from the energy clusters in the electromagnetic and hadronic calorimeters using the anti-$k_t$ jet finding algorithm [11,12]. One of their characteristics is the presence of one or three charged tracks, possibly accompanied by neutral hadrons, which leads to a collimated shower in the calorimeters with a few nearby tracks. The energy deposits in the calorimeters are used to reconstruct the four-momenta of the hadronic τ candidates. The tracking and calorimeter information are combined to identify the hadronic τ decays and suppress the misidentification rate of jets.
For the $M(\tau\tau)$ reconstruction, an important observable is the missing transverse energy, $\slashed{E}^{obs}_{x/y}$, defined as $\slashed{E}^{obs}_{x/y} \equiv -1 \times \sum_{\mathrm{observed\ objects}} p_{x/y}$, where the colliding axis is defined as the z axis and the sum is performed over all observed objects, including electrons, muons, hadronic taus, jets, etc. If only the neutrinos from the τ decays contribute to the missing energy, $\slashed{E}^{obs}_{x/y}$ can then be used to improve the $M(\tau\tau)$ reconstruction, as will be shown below. Here we only use the transverse part of the missing energy. This is because in hadron colliders the interaction happens among the colliding partons. The partons in the highly boosted hadrons, such as protons, would have momenta collinear with the original momentum ...
... as a function of the mass hypothesis $M$. $X_1$ and $X_2$ denote the free parameters that specify the τ decay kinematics. $X$ is chosen to be $(x_{vis}, \phi_\tau, m_{inv})$. Here $x_{vis}$ is the fraction of the τ lepton energy carried by the visible decay products in the laboratory frame; $\phi_\tau$ is the azimuthal angle of the τ lepton direction in the laboratory frame; $m_{inv}$ is the invariant mass of the neutrino(s) in the τ decays. The likelihood $\mathcal{P}(X_1, X_2, \slashed{E}_x, \slashed{E}_y, P^{vis}_1, P^{vis}_2)$ is the product of three likelihood functions: two functions model the probability distributions of the decay parameters $X_i$ of the two τ leptons and one function quantifies the compatibility of a τ-pair decay hypothesis with the reconstructed missing transverse energy $\slashed{E}_{x/y}$. The relative $M(\tau\tau)$ resolution is (10-20)%.
Other interesting methods can be found in Refs. [20-23]. In this paper, we propose an alternative method, which utilizes the momentum component of the τ decay products perpendicular to the velocity of the τ lepton, $p_\perp$. We will compare this method with the CAT, MMC and SVFIT, as the TMM only reconstructs $M(\tau\tau)$ partially. In Sec. 2, we present the principle of this method. In Sec. 3, the method is tested on the MC simulations. In Sec. 4, we apply the selection criteria used by the ATLAS Collaboration in searching for the SM Higgs boson decaying to the τ pair [10] to an MC sample of $pp \to h(125)+X \to \tau\tau+X$ via the gluon-gluon fusion process. The performance of this method is more realistic in this case. We give the conclusions in Sec. 5.

Principle of the method
To illustrate our method of reconstructing the mass of a resonance $R$ decaying to $\tau^+\tau^-$, we write down the following equations.
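The equations themselves are not reproduced in this text. A form consistent with the symbol definitions that follow, offered here as an assumption rather than a quotation, is the invariant mass of the τ pair together with momentum conservation in each τ decay:

```latex
\begin{align}
M_R^2 &= 2m_\tau^2 + 2\left(E_{\tau_1}E_{\tau_2} - p_{\tau_1}p_{\tau_2}\cos\theta_{\tau_1\tau_2}\right), \\
p_{\tau_i}^2 &= p_{vis_i}^2 + p_{inv_i}^2 + 2\,p_{vis_i}\,p_{inv_i}\cos\theta_{vis_i,inv_i}, \qquad i = 1, 2 .
\end{align}
```

Written this way, the relations contain the measured $p_{vis_i}$ and exactly five unknowns: $p_{inv_1}$, $p_{inv_2}$, $\theta_{vis_1,inv_1}$, $\theta_{vis_2,inv_2}$ and $\theta_{\tau_1\tau_2}$.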
Here $M_R$ is the mass of the resonance $R$; $p_{\tau_i}$ is the magnitude of the momentum of the τ leptons; $E_{\tau_i} \equiv \sqrt{p_{\tau_i}^2 + m_\tau^2}$; $p_{vis_i}$ and $p_{inv_i}$ are the magnitudes of the momenta of the τ decay products, the visible parts (charged leptons or hadronic τ jet) and the neutrinos respectively; $\theta_{\tau_1\tau_2}$ is the angle between the two τ leptons; and $\theta_{vis_i,inv_i}$ is the angle between the charged lepton/hadronic jet and the corresponding neutrino(s). To determine the mass of the resonance, $M_R$, there are five unknown quantities, $p_{inv_1}$, $p_{inv_2}$, $\theta_{vis_1,inv_1}$, $\theta_{vis_2,inv_2}$ and $\theta_{\tau_1\tau_2}$, as shown in Eqs. 6-7. The other quantities, $p_{vis_1}$ and $p_{vis_2}$, can be measured by the detectors.
First of all, we show that $\theta_{\tau_1\tau_2}$ can be well estimated by the angle between the visible decay products of the two τ leptons, $\theta_{vis_1,vis_2}$, in the case of a heavy resonance $R$. We write down the τ mass constraints.
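The constraint itself is not reproduced in this text. Assuming it states that the invariant mass of the visible plus invisible decay products equals the τ mass, it would read (for each τ, with the index omitted):

```latex
\begin{equation}
m_\tau^2 = m_{vis}^2 + m_{inv}^2 + 2\left(E_{vis}E_{inv} - p_{vis}\,p_{inv}\cos\theta_{vis,inv}\right)
\end{equation}
```

This is only four-momentum conservation in the decay; the way the authors group the terms may differ.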
Here the subscripts 1, 2 are omitted. $m_{vis}$ is the mass of the charged lepton in the $\tau_l$ mode or the mass of the hadronic jet in the $\tau_h$ mode; $m_{inv}$ is the mass of the neutrinos in the $\tau_l$ mode or equal to 0 in the $\tau_h$ mode; and $E_{vis/inv} \equiv \sqrt{p_{vis/inv}^2 + m_{vis/inv}^2}$. If the resonance is heavy enough, $M_R \gg m_\tau$, we have $p_{vis} \gg m_{vis}$ and $p_{inv} \gg m_{inv}$. From the τ mass constraints shown in Eq. 8, the angle between the visible decay product(s) and the invisible neutrino(s) turns out to be $\cos\theta_{vis,inv} \simeq 1 + \frac{m_{vis}^2}{2p_{vis}^2} + \frac{m_{inv}^2}{2p_{inv}^2} - \frac{m_\tau^2 - m_{vis}^2 - m_{inv}^2}{2p_{vis}p_{inv}}$. Using the relation $\cos\theta \simeq 1 - \theta^2/2$ for small θ, we find that $|\theta_{vis,inv}|$ is of the order of $m_{vis}/p_{vis}$ or $m_{inv}/p_{inv}$, denoted by $O(m_{vis}/p_{vis}, m_{inv}/p_{inv})$. Therefore, in the case of $M_R \gg m_\tau$, the difference between $\theta_{\tau_1\tau_2}$ and $\theta_{vis_1,vis_2}$ is of the same order, as shown in Eq. 12.
Resorting to Eq. 6 and Eq. 12, we can estimate the uncertainty of the reconstructed $M_R$ due to the approximation $\theta_{\tau_1\tau_2} \simeq \theta_{vis_1,vis_2}$.
In this estimate we have used $\Delta\theta_{\tau_1\tau_2} \equiv |\theta_{\tau_1\tau_2} - \theta_{vis_1,vis_2}|$. It can be seen that this uncertainty is reduced in the region where $\theta_{vis_1,vis_2}$ is close to π (thus $|\sin\theta_{vis_1,vis_2}|$ is close to 0). If the approximation $\theta_{\tau_1\tau_2} \simeq \theta_{vis_1,vis_2}$ is assumed, we then have four unknown quantities in total, namely, $(p_{inv}, m_{inv})$ in the $\tau_l$ mode, or $(p_{inv}, m_{vis})$ in the $\tau_h$ mode, for each of the two τ leptons. The angles $\theta_{vis_1,inv_1}$ and $\theta_{vis_2,inv_2}$ can be expressed as functions of the unknowns through the τ mass constraints, Eq. 8. $M(\tau\tau)$ can be determined if $p_{inv}$ and $m_{vis/inv}$ are given. However, $p_{inv}$ is related to the momentum of the τ lepton. Thus the probability distribution of $p_{inv}$ depends upon the mass of the mother resonance and is not available for an unknown resonance. Instead, we choose to use the momentum component of the τ decay products perpendicular to the velocity of the τ lepton, $p_\perp$ (it is the same for the visible and invisible decay products). $p_\perp$ is invariant under the boost in the direction of the τ lepton. The joint probability distributions (JPD), $P(p_\perp, m_{vis/inv})$, are then independent of the mass of the mother resonance.
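The uncertainty estimate for the angle approximation discussed above is not reproduced in this text. As a hedged sketch, assuming the resonance mass is the invariant mass of the two τ leptons, $M_R^2 = 2m_\tau^2 + 2(E_{\tau_1}E_{\tau_2} - p_{\tau_1}p_{\tau_2}\cos\theta_{\tau_1\tau_2})$, and varying only the angle at fixed momenta, a first-order propagation gives:

```latex
\begin{align}
2M_R\,\Delta M_R &\simeq 2\,p_{\tau_1}p_{\tau_2}\,|\sin\theta_{\tau_1\tau_2}|\,\Delta\theta_{\tau_1\tau_2}, \\
\frac{\Delta M_R}{M_R} &\simeq \frac{p_{\tau_1}p_{\tau_2}\,|\sin\theta_{vis_1,vis_2}|}{M_R^2}\,\Delta\theta_{\tau_1\tau_2}.
\end{align}
```

The factor $|\sin\theta_{vis_1,vis_2}|$ makes explicit why the effect is smallest for back-to-back topologies. This is an illustrative derivation and not necessarily the expression used by the authors.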
To derive the expression of $p_\tau$ as a function of $p_\perp$ and $m_{vis/inv}$, let us consider the Lorentz transformation relating the laboratory frame and the center-of-mass (c.m.) frame of the τ lepton. The quantities in the c.m. frame carry a "*" and can be easily calculated for given $m_{inv}$, $m_{vis}$ and $p_\perp$. Let $v$ be the velocity of the τ lepton; the laboratory and c.m. quantities of the visible decay products are then related by the boost along $v$ (Eqs. 14). In these equations, $p^*_{\parallel vis} = \pm\sqrt{p^{*2} - p_\perp^2}$ is the momentum component of the visible τ decay products parallel to the velocity of the τ lepton and $E^*_{vis} = \sqrt{p^{*2} + m_{vis}^2}$ is the energy of the visible τ decay products, where $p^*$ is the momentum of the τ decay products in the c.m. frame. Using Eqs. 14 and $p_\tau = \gamma\beta m_\tau$, we can obtain $p_\tau$ with a two-fold ambiguity, where $p^*_\parallel \equiv \sqrt{p^{*2} - p_\perp^2}$ and the two-fold solution is due to $p^*_{\parallel vis} = \pm\sqrt{p^{*2} - p_\perp^2}$ (physically, it is because the momentum component $p_{\parallel vis}$ in the laboratory frame can be parallel or anti-parallel to that in the c.m. frame of the τ lepton).
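As an illustration of the inversion described above, the following minimal sketch recovers the τ momentum from a measured $p_{vis}$ and sampled values of $(p_\perp, m_{vis}, m_{inv})$. It relies on standard two-body kinematics in the τ rest frame and on the boost relations $E_{vis} = \gamma E^*_{vis} + \gamma\beta p^*_{\parallel}$ and $p_{\parallel vis} = \gamma p^*_{\parallel} + \gamma\beta E^*_{vis}$, which give $\gamma\beta = (E^*_{vis}p_{\parallel vis} \mp p^*_{\parallel}E_{vis})/(m_{vis}^2 + p_\perp^2)$. The function names, the τ mass value and the requirement that the visible system moves forward along the τ direction in the laboratory are assumptions of this sketch, not taken from the paper.

```python
import math

M_TAU = 1.77686  # GeV; tau mass, an assumed input of this sketch


def rest_frame_kinematics(m_vis, m_inv, m_tau=M_TAU):
    """Momentum and visible energy in the tau rest frame (two-body kinematics)."""
    s = (m_tau**2 - (m_vis + m_inv)**2) * (m_tau**2 - (m_vis - m_inv)**2)
    if m_vis + m_inv > m_tau or s < 0.0:
        raise ValueError("masses are not kinematically allowed")
    p_star = math.sqrt(s) / (2.0 * m_tau)
    e_vis_star = (m_tau**2 + m_vis**2 - m_inv**2) / (2.0 * m_tau)
    return p_star, e_vis_star


def tau_momentum_solutions(p_vis, p_perp, m_vis, m_inv, m_tau=M_TAU):
    """Return the physically allowed tau momenta (zero, one or two values)."""
    p_star, e_vis_star = rest_frame_kinematics(m_vis, m_inv, m_tau)
    denom = m_vis**2 + p_perp**2
    if p_perp > p_star or p_perp > p_vis or denom == 0.0:
        return []
    e_vis = math.sqrt(p_vis**2 + m_vis**2)
    # Assumption: the visible system moves forward along the tau direction.
    p_par_lab = math.sqrt(p_vis**2 - p_perp**2)
    p_par_star = math.sqrt(p_star**2 - p_perp**2)
    solutions = []
    for sign in (+1.0, -1.0):  # sign of the parallel momentum in the rest frame
        gamma_beta = (e_vis_star * p_par_lab - sign * p_par_star * e_vis) / denom
        if gamma_beta > 0.0:  # a negative value would reverse the tau direction
            solutions.append(m_tau * gamma_beta)
    return solutions


def boost_visible(p_tau, p_perp, m_vis, m_inv, sign=+1.0, m_tau=M_TAU):
    """Forward boost: lab-frame visible momentum for a given tau momentum."""
    p_star, e_vis_star = rest_frame_kinematics(m_vis, m_inv, m_tau)
    gamma_beta = p_tau / m_tau
    gamma = math.sqrt(1.0 + gamma_beta**2)
    p_par_star = sign * math.sqrt(p_star**2 - p_perp**2)
    p_par_lab = gamma * p_par_star + gamma_beta * e_vis_star
    return math.hypot(p_par_lab, p_perp)
```

A closure check is to boost a τ of known momentum with `boost_visible` and verify that `tau_momentum_solutions` returns that momentum as one of its two solutions.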
For convenience, we write down explicitly the equations related to our method.
In these equations (Eqs. 16), the subscript $i = 1, 2$ and $\theta_{\tau_1\tau_2}$ is replaced by $\theta_{vis_1,vis_2}$ in the first equation. The next step is to sample all possible values of $(p_\perp, m_{inv})$ for the $\tau_l$ mode or $(p_\perp, m_{vis})$ for the $\tau_h$ mode according to their JPDs, which can be determined by the MC simulations. For each event accumulated by the detectors, we sample the unknowns many times (10000 is sufficient) and obtain a distribution of $M(\tau\tau)$ according to Eqs. 16. We assume that the mass of the resonance lies at the position with the maximal probability. For each sampling entry, the two-fold solutions, shown in the second line of Eqs. 16, are used as long as they are physically allowed. To use the missing transverse energy, each sampling entry is given a weight, $w(\slashed{E})$, defined in terms of the following quantities: $\slashed{E}^{obs}_{x/y}$ are the x/y components of the observed missing energy; $\sigma_{\slashed{E}_x}$ and $\sigma_{\slashed{E}_y}$ are the corresponding resolutions; and $\slashed{E}_{x/y}$ are the x/y components of the sum of the momenta of the neutrinos, shown in Eq. 18.
In Eq. 18 the collinear approximation is used; $\theta_{vis_i}$ and $\phi_{vis_i}$ are the polar angle and the azimuthal angle of the visible decay products respectively. To comply with the experimental performance [14,24], the missing energy resolution is parameterized as a function of $\sum E_T$, the scalar sum over all the observed objects (electrons, muons, hadronic taus, jets, etc.).
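The weighting step can be sketched as follows. The neutrino sum follows the collinear approximation described above, with each neutrino system taken along its visible partner. The Gaussian form of the weight and the function names are assumptions of this sketch, since the explicit definition of $w(\slashed{E})$ is not reproduced in this text; the resolutions are passed in as plain parameters.

```python
import math


def neutrino_met(p_inv, theta_vis, phi_vis):
    """x/y components of the summed neutrino momenta, with each neutrino
    system placed along its visible partner (collinear approximation)."""
    met_x = sum(p * math.sin(t) * math.cos(f)
                for p, t, f in zip(p_inv, theta_vis, phi_vis))
    met_y = sum(p * math.sin(t) * math.sin(f)
                for p, t, f in zip(p_inv, theta_vis, phi_vis))
    return met_x, met_y


def met_weight(met_obs, met_hyp, sigma_x, sigma_y):
    """Compatibility weight between observed and hypothesised missing energy.
    The Gaussian form is an assumption of this sketch."""
    dx = (met_obs[0] - met_hyp[0]) / sigma_x
    dy = (met_obs[1] - met_hyp[1]) / sigma_y
    return math.exp(-0.5 * (dx * dx + dy * dy))
```

Each sampled entry would then enter the $M(\tau\tau)$ distribution with the value returned by `met_weight` instead of unit weight.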

Performance of the technique
To test the performance of this technique, the MC samples for the physics processes $pp \to Z/h(125)/h(750)+X \to \tau^+\tau^-+X$ at 13 TeV are produced with MadGraph5 [26]. The parton showers are simulated by Pythia 8 [27] and the detector response is simulated by Delphes 3 [28]. The detector simulation is adjusted to match the Run 1 performance of the ATLAS detector. Most relevantly, the reconstruction efficiency of the hadronic τ jets is about 60% with a jet faking rate of about 1%. The effect of the pileup interactions is not considered until the end of this section. Here h(125) denotes the SM Higgs boson with a mass of about 125 GeV/$c^2$ [25] and a width of 4.07 MeV. h(750) denotes a possible high-mass Higgs boson. Its mass and width are 750 GeV/$c^2$ and 40 GeV respectively in the simulations.
In the leptonic decay mode $\tau_l$, we do not distinguish the charged leptons $e$ and $\mu$, as their masses are negligible compared to the τ mass. In the hadronic decay mode $\tau_h$, the events with one charged track and three charged tracks are considered separately. Events with two τ candidates, charged leptons or hadronic τ jets, with opposite charge sign are selected. The transverse momentum, $p_T$, is required to be larger than 20 GeV/$c$ and the pseudorapidity $|\eta|$ is required to be less than 2.5 for both τ candidates. They are further required to be isolated. The charged leptons satisfy $p_T(\Delta R = 0.5)/p_T(l) < 10\%$. Here $p_T(l)$ is the transverse momentum of the lepton and $p_T(\Delta R = 0.5)$ is the transverse momentum of the additional observed objects in the cone around the lepton candidate, where the cone size is defined by the angular distance between the additional objects and the lepton candidate, $\Delta R = 0.5$. For a hadronic τ candidate, the angular distance with respect to an electron or a muon is required to be larger than 0.2. Experimentally, additional isolation conditions are imposed on the hadronic τ candidates to reduce the jet faking rate. For example, the ATLAS Collaboration uses discriminating variables based on the tracks with $p_T > 1$ GeV/$c$ and the energy deposited in the calorimeter in the cone $\Delta R < 0.2$ and in the region $0.2 < \Delta R < 0.4$ around the hadronic τ candidate's direction [10]. Here, in the Delphes simulation, no further isolation condition is used for the hadronic τs.
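The basic pair selection described above can be sketched as follows. The `TauCandidate` container and the function names are hypothetical, introduced only for illustration; the requirement on the angular distance between a hadronic τ candidate and nearby electrons or muons is omitted to keep the example short.

```python
from dataclasses import dataclass


@dataclass
class TauCandidate:
    pt: float             # transverse momentum in GeV/c
    eta: float            # pseudorapidity
    charge: int           # +1 or -1
    is_lepton: bool       # True for e/mu, False for a hadronic tau jet
    cone_pt: float = 0.0  # summed pT of other objects within Delta R = 0.5


def passes_candidate_cuts(cand, pt_min=20.0, eta_max=2.5, iso_max=0.10):
    """Kinematic cuts for any candidate, plus isolation for charged leptons."""
    if cand.pt <= pt_min or abs(cand.eta) >= eta_max:
        return False
    if cand.is_lepton and cand.cone_pt / cand.pt >= iso_max:
        return False
    return True


def passes_pair_selection(c1, c2):
    """Two candidates of opposite charge, each passing the candidate cuts."""
    return (c1.charge * c2.charge < 0
            and passes_candidate_cuts(c1)
            and passes_candidate_cuts(c2))
```

For example, an isolated 35 GeV/$c$ muon paired with a 28 GeV/$c$ hadronic τ jet of opposite charge passes, while the same pair with equal charges does not.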
First, the collinear approximation, namely that the angle between the τ leptons is well estimated by the angle between the two visible τ candidates, is verified in the MC simulations. Figure 1 (a) shows the distribution of $\theta_{\tau_1\tau_2}$ versus $\theta_{vis_1,vis_2}$, while Fig. 1 (b) shows the distribution of $(\theta_{\tau_1\tau_2} - \theta_{vis_1,vis_2})/\theta_{vis_1,vis_2}$. We find that the correlation coefficient between the two angles is 0.998 and the relative difference $|\theta_{\tau_1\tau_2} - \theta_{vis_1,vis_2}|/\theta_{vis_1,vis_2}$ is well below 10%. To be exact, the mean value of the distribution in Fig. 1 (b) is 0.1% and the root mean square (RMS) is 2.6%. Figure 2 shows the distributions of the momentum component $p_\perp$, the invisible mass distribution in the $\tau_l$ mode and the visible mass distribution in the $\tau_h$ mode for the three mother resonances. In Fig. 2 (b), the hadronic modes with one charged track, $\tau \to \pi/K/\rho + \nu_\tau$, can be well recognized. We see that the probability distributions have little dependence upon the mass of the mother resonance. $p_\perp$ and $m_{vis/inv}$ are correlated. Figure 3 shows the JPDs $P(p_\perp, m_{inv})$ in the $\tau_l$ mode and $P(p_\perp, m_{vis})$ in the $\tau_h$ mode.
For each event, we sample $(p_\perp, m_{vis/inv})$ according to their JPDs 10000 times. A distribution of $M(\tau\tau)$ is obtained using Eqs. 16. The best estimate of $M(\tau\tau)$ for this event is assumed to be the peak position of this distribution. The final reconstructed distributions of $M(\tau\tau)$ for $Z/h(125)/h(750) \to \tau^+\tau^-$ are shown in Fig. 4, Fig. 5 and Fig. 6 respectively. For each resonance, we also present the results based on the JPDs $P(p_\perp, m_{vis/inv})$ from the simulation of the other resonances. In fact, the JPDs describe the decay kinematics of the τ lepton and have nothing to do with the resonance decaying to τ lepton pairs. Therefore, we can reconstruct the mass of an unknown resonance based on the JPDs from a known resonance. As Fig. 4, Fig. 5 and Fig. 6 show, the performance is nearly the same no matter which JPDs are used. We adopt two quantities to measure the performance, namely, the reconstruction efficiency due to the technique itself and the relative mass resolution. They are crucial elements in searching for new resonances and distinguishing the resonance signal from the background. They are explained below.
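The per-event estimate described above, the peak of the sampled $M(\tau\tau)$ distribution, can be sketched with a simple weighted histogram. The binning (range and number of bins) and the function name are assumptions of this sketch; the text does not specify how the peak position is extracted.

```python
def peak_of_weighted_histogram(values, weights, lo, hi, n_bins):
    """Return the centre of the highest bin of a weighted histogram."""
    width = (hi - lo) / n_bins
    counts = [0.0] * n_bins
    for value, weight in zip(values, weights):
        if lo <= value < hi:
            # Guard against rounding pushing a value into a non-existent bin.
            index = min(int((value - lo) / width), n_bins - 1)
            counts[index] += weight
    i_max = max(range(n_bins), key=counts.__getitem__)
    return lo + (i_max + 0.5) * width
```

Here `values` would hold the sampled masses for one event and `weights` the corresponding missing-energy weights (or all ones if the missing energy is not used).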
The reconstruction efficiency describes the rate of successful mass reconstruction and is defined as $\varepsilon \equiv N_{sel+success}/N_{sel}$, where $N_{sel}$ is the number of events passing the selection criteria, and $N_{sel+success}$ is the number of events which are successfully reconstructed from the selected events. In the CAT, the mass cannot be reasonably reconstructed for some events, and the efficiency is about 40%-70%. In the MMC, the efficiency loss is only 1%. It is due to large fluctuations of the $\slashed{E}_T$ measurement or other scan variables and the limited number of scans. The definition of the relative mass resolution, used in Ref. [10], is the ratio of the full width at half maximum (FWHM) to the peak value ($m_{peak}$) of the mass distribution, denoted by FWHM/$m_{peak}$.
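The two figures of merit can be computed from event counts and a binned mass distribution as in the sketch below. The linear interpolation of the half-maximum crossings and the single-peak assumption are choices of this sketch, not taken from the paper or from Ref. [10].

```python
def reconstruction_efficiency(n_sel, n_sel_success):
    """Fraction of selected events with a successfully reconstructed mass."""
    if n_sel <= 0:
        raise ValueError("n_sel must be positive")
    return n_sel_success / n_sel


def _interpolate(x0, y0, x1, y1, y):
    """x at which the straight line through (x0, y0) and (x1, y1) reaches y."""
    return x0 + (y - y0) * (x1 - x0) / (y1 - y0)


def fwhm_over_peak(centres, counts):
    """FWHM / m_peak of a binned mass distribution (single peak assumed)."""
    i_peak = max(range(len(counts)), key=counts.__getitem__)
    half = counts[i_peak] / 2.0
    left = i_peak
    while left > 0 and counts[left] >= half:
        left -= 1
    right = i_peak
    while right < len(counts) - 1 and counts[right] >= half:
        right += 1
    # Interpolate the half-maximum crossings; fall back to the histogram edge
    # if the distribution never drops below half maximum on that side.
    if counts[left] < half:
        x_left = _interpolate(centres[left], counts[left],
                              centres[left + 1], counts[left + 1], half)
    else:
        x_left = centres[left]
    if counts[right] < half:
        x_right = _interpolate(centres[right], counts[right],
                               centres[right - 1], counts[right - 1], half)
    else:
        x_right = centres[right]
    return (x_right - x_left) / centres[i_peak]
```

For a symmetric triangular distribution peaking at 100 GeV/$c^2$ with half-maximum points at 90 and 110 GeV/$c^2$, `fwhm_over_peak` gives 0.2.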
The comparison of the performance of the CAT and this method is summarized in Table 1. If the resonance is heavier, the collinear approximation is better and thus the CAT works better. Our method has a stable performance with no efficiency loss and gives a relative mass resolution of (30-40)%.
So far, we have not discussed the backgrounds for this method. In searching for the SM Higgs decay $h(125) \to \tau^+\tau^-$, for which only evidence has been reported [9,10], the decay $Z \to \tau^+\tau^-$ is the dominant background since the masses of the two bosons are close. An $M(\tau\tau)$ reconstruction technique with a higher reconstruction efficiency and a better mass resolution will improve the signal-background separation and increase the signal significance. Another important background is the multi-jet background, which dominates the fake hadronic τs. It is usually estimated by a data-driven method, namely, it is represented by the data events with two τ candidates having the same charge sign.
At the end of this section, we take $h(125) \to \tau\tau$ as an example to investigate the effect of the pileup interactions. Here the average pileup is assumed to be 50. Repeating the simulations with the pileup interactions included, we find that the mass reconstruction performance becomes worse, especially for the $\tau_h\tau_h$ channel, as shown in Fig. 7. In this case, however, our method still gives better mass resolutions than the CAT. The results are summarized in the last two lines of Table 1. It should be noted that very simple selection requirements are used in this section. In the next section, we will test this method in a more realistic case.

... 8 TeV. In this case, the performance of this technique would be more realistic. Table 2 lists the thresholds on the transverse momentum while Table 3 lists the remaining cuts. The definitions of the variables in the tables can be found in Ref. [10]. Figure 8 (a) shows the distribution of $(\theta_{\tau_1\tau_2} - \theta_{vis_1,vis_2})/\theta_{vis_1,vis_2}$, and the relation $\theta_{\tau_1\tau_2} \simeq \theta_{vis_1,vis_2}$ still holds well. The distributions of $|\phi_{vis_1} - \phi_{vis_2}|$ before and after the event selection are shown in Fig. 8 (b). Most of the back-to-back events are rejected by the selections. It is then expected that the CAT would work well, as $|\phi_{vis_1} - \phi_{vis_2}|$ is far from π after the event selection.
Table 2. Summary of the transverse momentum thresholds and rapidity cuts applied in the analysis. The indices 1 and 2 denote the leading (highest $p_T$) and sub-leading final state objects.
Table 3. Summary of the selection criteria applied in the analysis. The definitions of the variables can be found in Ref. [10].
... mass is assumed to correspond to the value with the maximal probability. This method utilizes the fact that the quantities $p_\perp$ and $m_{vis/inv}$ are invariant under the boost in the direction of the τ lepton. Based on the MC simulations of $pp \to Z/h(125)/h(750)+X \to \tau^+\tau^-+X$, this method gives a relative mass resolution, FWHM/$m_{peak}$, of about 20%-40% using the information of missing energy, with no efficiency loss. Finally, we would like to comment that this method would work better at the Circular Electron Positron Collider (CEPC), since the well-determined initial four-momenta of the colliding beams provide more kinematic constraints.