Measurement of the B 0 s → J / ψη lifetime

Using a data set corresponding to an integrated luminosity of 3 fb−1, collected by the LHCb experiment in pp collisions at centre-of-mass energies of 7 and 8 TeV, the effective lifetime in the B0 s → J/ψη decay mode, τeff, is measured to be τeff = 1.479± 0.034 (stat)± 0.011 (syst) ps. Assuming CP conservation, τeff corresponds to the lifetime of the light B 0 s mass eigenstate. This is the first measurement of the effective lifetime in this decay mode. Accepted by Phys. Lett. B. c © CERN on behalf of the LHCb collaboration, licence CC-BY-4.0. †Authors are listed at the end of this paper. ar X iv :1 60 7. 06 31 4v 2 [ he pex ] 2 8 Ja n 20 17


Introduction
Studies of B 0 s − B 0 s mixing provide important tests of the Standard Model (SM) of particle physics. In the SM, mixing occurs via box diagrams. Extensions to the SM may introduce additional CP -violating phases that alter the value of the B 0 s − B 0 s mixing weak phase, φ s , from that of the SM [1]. The B 0 s system exhibits a sizeable difference in the decay widths Γ L and Γ H , where L and H refer to the light and heavy B 0 s mass eigenstates, respectively. The effective lifetime, τ eff , of a B 0 s meson decay mode is measured by approximating the decay time distribution, determined in the B 0 s rest system, by a single exponential function. For final states that can be accessed by both B 0 s and B 0 s mesons the effective lifetime depends on their CP components and is also sensitive to φ s [2,3].
In this analysis τ eff is determined for the CP -even B 0 s → J/ψ η decay mode. As φ s is measured to be small [4,5] the mass eigenstates are also CP eigenstates to better than per mille level and τ eff measured in B 0 s → J/ψ η decays is equal, to good approximation, to the lifetime of the light B 0 s mass eigenstate, τ L ∝ Γ −1 L . In the SM τ L is predicted to be 1.43 ± 0.03 ps [6]. Measurements of τ L have previously been reported by LHCb in the B 0 s → D + s D − s and B 0 s → K + K − decay modes [7,8]. The latter is dominated by penguin diagrams, which could arise within and beyond the SM and gives rise to direct CP violation. This then leads to a different τ eff , when compared to measurements in the B 0 s → D + s D − s and B 0 s → J/ψ η decays which are mediated by tree diagrams. Improved precision on the effective lifetimes τ L and τ H will enable more stringent tests of the consistency between direct measurements of the decay width difference ∆Γ s = Γ L −Γ H measured in B 0 s → J/ψ φ decays and those inferred using effective lifetimes.
The measurement of the effective B 0 s → J/ψ η lifetime presented in this Letter uses 3 fb −1 of data collected in pp collisions at centre-of-mass energies of 7 TeV and 8 TeV during 2011 and 2012 using the LHCb detector. The J/ψ meson is reconstructed via the dimuon decay mode and the η meson via the diphoton decay mode. The presence of only two charged particles in the final state minimizes systematic uncertainties related to the tracking system.

Detector and simulation
The LHCb detector [9,10] is a single-arm forward spectrometer covering the pseudorapidity range 2 < η < 5, designed for the study of particles containing b or c quarks. The detector includes a high-precision tracking system consisting of a silicon-strip vertex detector surrounding the pp interaction region, a large-area silicon-strip detector (TT) located upstream of a dipole magnet with a bending power of about 4 Tm, and three stations of silicon-strip detectors and straw drift tubes placed downstream of the magnet. The tracking system provides a measurement of momentum, p, of charged particles with a relative uncertainty that varies from 0.5 % at low momentum to 1.0 % at 200 GeV/c. Large samples of J/ψ → µ + µ − and B + → J/ψ K + decays, collected concurrently with the data set used here, were used to calibrate the momentum scale of the spectrometer to a precision of 0.03 % [11]. The minimum distance of a track to a primary vertex (PV), the impact parameter (IP), is measured with a resolution of (15 + 29/p T ) µm, where p T is the component of the momentum transverse to the beam, in GeV/c. Different types of charged hadrons are distinguished using information from two ring-imaging Cherenkov detectors. Photons, electrons and hadrons are identified by a calorimeter system consisting of scintillating-pad and preshower detectors, an electromagnetic calorimeter and a hadronic calorimeter. The calorimeter response is calibrated using samples of π 0 → γγ decays. For this analysis a further calibration was made using the decay η → γγ, which results in a precision of 0.07 % on the neutral energy scale. Muons are identified by a system composed of alternating layers of iron and multiwire proportional chambers. The online event selection is performed by a trigger [12], which consists of a hardware stage, based on information from the calorimeter and muon systems, followed by a software stage, where a full event reconstruction is made. Candidate events are required to pass the hardware trigger, which selects muon and dimuon candidates with high p T based upon muon system information. The subsequent software trigger is composed of two stages. The first performs a partial event reconstruction and requires events to have two well-identified oppositely charged muons with an invariant mass larger than 2.7 GeV/c 2 . The second stage performs a full event reconstruction. Events are retained for further processing if they contain a displaced J/ψ → µ + µ − candidate. The decay vertex is required to be well separated from each reconstructed PV of the proton-proton interaction by requiring the distance between the PV and the J/ψ decay vertex divided by its uncertainty to be greater than three. This introduces a non-uniform efficiency for b-hadron candidates that have a decay time less than 0.1 ps.
Simulated pp collisions are generated using Pythia [13] with a specific LHCb configuration [14]. Decays of hadronic particles are described by EvtGen [15], in which final-state radiation is generated using Photos [16]. The interaction of the generated particles with the detector, and its response, are implemented using the Geant4 toolkit [17] as described in Ref. [18].

Selection
A two-step procedure, is used to optimize the selection of B 0 s → J/ψ η decay candidates. These studies use simulated data samples together with the high mass sideband of the data (5650 < m(J/ψ η) < 5850 MeV/c 2 ), which is not used in the subsequent determination of τ eff . In a first step, loose selection criteria are applied that reduce background significantly whilst retaining high signal efficiency. Subsequently, a multivariate selection (MVA) is used to reduce further the combinatorial background. This is optimized using pseudoexperiments to obtain the best precision on the measured B 0 s lifetime. The selection starts from a pair of oppositely charged particles, identified as muons, that form a common decay vertex. Combinatorial background is suppressed by requiring that χ 2 IP of the muon candidates 1 to all reconstructed PVs to be larger than four. To ensure a high reconstruction efficiency the muon candidates are required to have a pseudorapidity between 2.0 and 4.5. The invariant mass of the dimuon candidate must be within 50 MeV/c 2 of the known J/ψ mass [19]. In addition, the trigger requirement that the J/ψ decay length divided by its uncertainty is greater than three is reapplied.
Photons are selected from neutral clusters reconstructed in the electromagnetic calorimeter [10] that have a transverse energy in excess of 300 MeV and a confidence level to be 1 The quantity χ 2 IP is defined as the difference between the χ 2 of the PV reconstructed with and without the considered particle. a photon, P γ , greater than 0.009. The latter requirement has an efficiency of 98 % for the simulated signal sample whilst removing 23 % of the background in the high mass sideband. To suppress combinatorial background, if either of the photons when combined with any other photon candidate in the event has an invariant mass within 25 MeV/c 2 of the known π 0 meson mass [19] the candidate is rejected.
Candidate η → γγ decays are selected from diphoton combinations with an invariant mass within 70 MeV/c 2 of the known η mass [19] and with a transverse momentum larger than 2 GeV/c. The decay angle between the photon momentum in the η rest frame and the direction of Lorentz boost from the laboratory frame to the η rest frame, θ * η , is required to satisfy cos θ * η < 0.8. The J/ψ and η candidates are combined to form candidate B 0 (s) mesons. The average number of PVs in each event is 1.8 (2.0) for the 2011 (2012) dataset. When multiple PVs are reconstructed, the one with the minimum χ 2 IP to the B 0 (s) candidate is chosen. A kinematic fit is performed to improve the invariant mass resolution [20]. In this fit the momentum vector of the B 0 (s) candidate is constrained to point to the PV and the intermediate resonance masses are constrained to their known values. The reduced χ 2 of this fit, χ 2 /ndf, is required to be less than five. The measured B 0 (s) decay time must be larger than 0.3 ps and less than 10 ps. If more than one PV is reconstructed in an event the properties of the unassociated vertices are studied. Any candidate for which there is a second PV with χ 2 IP < 50 is rejected. This requirement has an efficiency of 98% that is almost flat as a function of the decay time and reduces background due to incorrect association of the B 0 (s) candidate to a PV. Finally, as in Ref. [21], the position of the PV along the beam-line is required to be within 10 cm of the nominal interaction point, where the standard deviation of this variable is approximately 5 cm. This criterion leads to a 10 % reduction in signal yield but defines a fiducial region where the reconstruction efficiency is uniform.
The second step of the selection process is based on a neural network [22], which is trained using the simulated signal sample and the high-mass sideband of the data for background. Seven variables that show good agreement between data and simulation and that do not significantly bias the B 0 (s) decay time distribution are used to train the neural net: the χ 2 /ndf of the kinematic fit; the p T of the B 0 (s) and η mesons; the minimum p T of the two photons; cos θ * η ; the minimum P γ of the two photons and the total hit multiplicity in the TT sub-detector.
The requirement on the MVA output was chosen to minimize the statistical uncertainty on the fitted τ eff using a sample of 100 pseudoexperiments. The chosen value removes 94 % of background candidates whilst retaining 69 % of the signal candidates. After applying these requirements 2 % of events contain multiple candidates from which only one, chosen at random, is kept.

Fit model
The effective lifetime is determined by performing a two-dimensional maximum likelihood fit to the unbinned distributions of the B 0 (s) candidate invariant mass and decay time where l is the candidate decay length, p the candidate momentum and m the reconstructed invariant mass of the candidate. The fit is performed for candidates with 5050 < m(J/ψ η) < 5650 MeV/c 2 and 0.3 < t < 10 ps. The fit model has four components: the B 0 s → J/ψ η signal, background from the B 0 → J/ψ η decay, background from partially reconstructed B 0 s → J/ψ ηX decays, and combinatorial background. In the fit, the decay-time distribution of each component is convolved with a Gaussian resolution function whose width is fixed to the standard deviation of the decay-time resolution in simulated data. A decay-time acceptance function accounts for the dependence of the signal efficiency on several effects. The procedure used to model the decay-time acceptance is described in detail in Ref. [21]. The overall acceptance, A tot , is factorised into the product of the selection (A sel ), trigger (A trig ) and vertex (A β ) acceptance functions, determined as described below. The effect of the selection requirements, dominated by the cut on the displacement of the muons from the PV, is studied using simulation and parameterised with the form where t is the decay time, and c 0 , c 1 and c 2 are parameters determined from the simulation and summarized in Table 1. In the second level of the software trigger a cut is applied on Table 1: Acceptance parameters due to the selection requirements (A sel ). The correlation coefficients are ρ c 0 c 1 = 0.51, ρ c 0 c 2 = 0.62 and ρ c 1 c 2 = 0.95.

Parameter
Value the decay length significance of the J/ψ candidate, which biases the decay time distribution. The trigger efficiency, A trig , is measured separately for the 2011 and 2012 datasets using events that are selected by a dedicated prescaled trigger in which this requirement is removed. It increases approximately linearly from 98% at t = 0.3 ps to 100% 4 ps. The resulting acceptance shape is parameterised in bins of decay time with linear interpolation between the bins. Finally, the reconstruction efficiency of the vertex detector decreases as the distance of closest approach of the decay products to the pp beam-line increases. This effect is studied using B + → J/ψ K + decays where the kaon is reconstructed without using vertex detector information [21] and parameterised with the form where the parameters β and γ are determined separately for the 2011 and 2012 data. The obtained values are summarized in Table 2. Figure 1 shows the overall acceptance curves obtained for the 2011 and 2012 datasets. The shape of A tot is mainly determined by A sel , whose uncertainty is dominated by the size of the simulation sample. The overall acceptance correction is relatively small. Fitting the simulated data with and without the correction τ eff changes by 13 fs. The invariant mass distribution for the B 0 s → J/ψ η signal is parameterised by a Student's t-distribution. The Bukin [23] and JohnsonSU [24] functions are considered for systematic variations. In the fit to the data, the shape parameters of this distribution are fixed to the simulation values. The decay time distribution for this component is modelled with an exponential function convolved with the detector resolution and multiplied by the detector acceptance, as discussed above.
The and the sum of two exponentials in decay time. In the fit to the data the lifetime of the shorter lived component is fixed to the value found in the fit to the sideband. As a systematic variation of the mass model, an exponential function is considered. Background from partially reconstructed decays of b hadrons is studied using a simulated bb sample. Using this sample an additional background component, due to partially reconstructed B 0 s → J/ψ ηX decays, is identified. Background from this source lies at invariant masses below 5100 MeV/c 2 and has a lifetime of 1.33 ± 0.10 ps. This component is modelled by a Novosibirsk function [32] in mass and an exponential in time. All parameters for this component apart from the yield are fixed to the simulation values in the fit to the data.
The fit has eight free parameters: the yield of the B 0 s → J/ψ η component (N B 0 s ), the combinatorial background yield (N comb ), the partially reconstructed background yield (N partial ), the B 0 s mass, the lifetime of the signal component (τ eff ), the coefficient of the combinatorial background component in mass (a comb ), the longer lived background lifetime (τ comb ) and the fraction of the short-lived background (f comb ). Independent fits are performed for the 2011 and 2012 data and a weighted average of the two lifetime values is made. The correctness of the fit procedure is validated using the full simulation and pseudoexperiments. No significant bias is found and the uncertainties estimated by the fit are found to be accurate.  Table 3. The fitted signal yields of the two years scale according to the known integrated luminosity and b-hadron production cross-section. There is some tension in the relative yield of the partially reconstructed background between the two years. However, this parameter is almost uncorrelated with τ eff and this tension has no impact on the result. The average of the fitted values of τ eff is τ eff = 1.479 ± 0.034 ps, where the uncertainty is statistical.

Results
The main source of systematic uncertainty is due to the modelling of the decay time acceptance function (Section 4). Varying the parameters of the acceptance function within their correlated uncertainties, a variation of the fitted lifetime of 10 fs is found, which is assigned as a systematic uncertainty. Uncertainties on A sel due to the parameterisation of this effect are evaluated to be negligible by replacing the functional form with a histogram. The statistical and systematic uncertainties on A β are evaluated by repeating the fit and varying the parameterisation within its uncertainties. The statistical uncertainty on A trig is propagated by generating an ensemble of histograms with each bin varied within its statistical uncertainty. Systematic uncertainties on A trig are estimated to be small by varying the binning of the histogram and considering an alternative analytic form. In simulation studies the efficiency of the MVA is found to be independent of the decay time within uncertainties. Conservatively, allowing for a linear dependence, an uncertainty of 1.7 fs is assigned.
The influence of the decay time resolution is estimated by increasing its value from 51 to 70 fs. This variation covers the variation of the resolution with decay time and any possible discrepancy in the resolution between data and simulation. The change in τ eff from this variation is negligible. The impact of the uncertainties in f r , the B 0 s − B 0 mass splitting, and the B 0 lifetime are evaluated by repeating the fit procedure varying these parameters within their quoted uncertainties.
Further uncertainties arise from the modelling of the time distributions of the background components. In the default fit the lifetime of the short-lived component is fixed to the value found in a fit to the mass sideband. Removing this constraint changes the result by 4 fs, which is assigned as a systematic uncertainty. The uncertainty due to the fixed lifetime of the partially reconstructed component is found to be negligible.
Uncertainties arising from the modelling of the signal and background mass distributions are evaluated using the discrete profiling method described in Ref. [33] and found to be negligible. Further small uncertainties arise due to the limited knowledge of the length scale of the detector along the beam axis (z−scale), the charged particle momentum scale and the neutral particle energy scale.
The stability of the result has been tested against a number of possible variations, such as the fitted invariant mass range, the requirement on the IP of the muons, the MVA requirement and analysing the sample according to the number of reconstructed PVs. No significant change in the final result is found and hence no further systematic uncertainty   is assigned. All the uncertainties are summarized in Table 4. Adding them in quadrature leads to a total systematic uncertainty of 11.1 fs which is dominated by the size of the simulation sample used to determine the acceptance and to validate the analysis procedure.

Summary
Using data collected by LHCb, the effective lifetime in the B 0 s → J/ψ η decay mode is measured to be τ eff = 1.479 ± 0.034 (stat) ± 0.011 (syst) ps.
In the limit of CP conservation, τ eff is equal to the lifetime of the light B 0 s mass eigenstate τ L . The present measurement is consistent with, and has similar precision to, the effective lifetime determined using the B 0 s → D + s D − s decay mode [7], τ eff (D + s D − s ) = 1.379 ± 0.026 (stat)±0.017 (syst) ps and also with the value measured in the B 0 s → K + K − mode [8], τ eff (K + K − ) = 1.407 ± 0.016 (stat) ± 0.007 (syst) ps, where penguin diagrams are expected to be more important. Averaging the tree level measurements gives τ eff = 1.42 ± 0.02 ps in good agreement with the expectations of the Standard Model [6], τ L = 1.43 ± 0.03 ps and the value quoted by HFAG [34] from measurements made in the B 0 s → J/ψ φ mode, τ L = 1.420 ± 0.006 ps. The values from these different measurements are compared in Fig. 3.