1 Introduction

All previously published measurements of the top quark mass have been obtained using samples of top quark-antiquark pairs. A combination of measurements from the CDF and D0 experiments at the Tevatron and ATLAS and CMS experiments at the LHC yields a value of \(173.34 \pm 0.27\,\text {(stat)} \pm 0.71\,\text {(syst)} \,\text {GeV} \) for the top quark mass \(m_{\mathrm{t}}\) [1]. Measuring \(m_{\mathrm{t}}\) in single top quark production enriches the range of available measurements, exploiting a sample which is almost statistically independent from those used by previous ones, and with systematic uncertainties partially uncorrelated from those considered in \(\mathrm{t}\overline{\mathrm{t}}\) production. Because of the different production mechanism, the mass extraction is affected differently by the modelling of both perturbative effects, such as initial- and final-state radiation, and nonperturbative effects, such as colour reconnection, in quantum chromodynamics (QCD). Some discussion on these topics, though mainly restricted to the case of pair production, can be found in Refs. [2, 3]. In perspective, the lower level of gluon radiation and final state combinatorial arrangements with respect to \(\mathrm{t}\overline{\mathrm{t}}\) production will make this channel a good candidate for precision measurements of \(m_{\mathrm{t}}\) when larger samples of events are available.

At the CERN LHC, top quarks are mainly produced as \(\mathrm{t}\overline{\mathrm{t}}\) pairs, through gluon-gluon fusion or quark-antiquark annihilation, mediated by the strong interaction. The standard model (SM) predicts single top quark production through electroweak processes, with a rate about one third that of the \(\mathrm{t}\overline{\mathrm{t}}\) production cross section. This has been confirmed by observations at the Tevatron  [4] and LHC  [5, 6].

In this paper, top quark candidates are reconstructed via their decay to a \(\mathrm {W}\) boson and a \(\mathrm{b}\) quark, with the \(\mathrm {W}\) boson decaying to a muon and a neutrino. The event selection is tailored, before looking at data in the signal region, to enhance the single top quark content in the final sample and so have a result as independent as possible from those obtained using \(\mathrm{t}\overline{\mathrm{t}}\) events.

The paper is organised as follows. Section 2 describes the CMS detector, followed by information about the data sample and simulation used in the analysis in Sect. 3. The selection of events and the reconstruction of the top quark candidates is given in Sect. 4, and the description of the maximum-likelihood fit to derive the top quark mass is in Sect. 5. Section 6 describes the systematic uncertainties affecting the measurement and Sect. 7 summarises the results.

2 The CMS detector

The central feature of the CMS apparatus is a superconducting solenoid of 6 \(\text {\,m}\) internal diameter, providing a magnetic field of 3.8 \(\text {\,T}\). Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass and scintillator hadron calorimeter, each composed of a barrel and two endcap sections. Forward calorimeters extend the pseudorapidity coverage provided by the barrel and endcap detectors. Muons are measured in gas-ionisation detectors embedded in the steel flux-return yoke outside the solenoid.

A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [7].

3 Data and simulated samples

The measurement reported here is performed using the \(\sqrt{s}=\)8 TeV proton-proton collision data sample collected in 2012 with the CMS detector, corresponding to an integrated luminosity of 19.7 fb\(^{-1}\).

At the lowest order in perturbation theory, single top quark production proceeds through the t-channel, s-channel, and associated \(\mathrm{t} \mathrm {W}\) production modes. The t channel provides the largest contribution to the single top quark cross section. The corresponding amplitude can be calculated using one of two different schemes [8,9,10]: in the 5-flavour scheme, b quarks are considered as coming from the interacting proton, and the leading-order (LO) diagram is a \(2 \rightarrow 2\) process (Fig. 1, upper); in the 4-flavour scheme, b quarks are not present in the initial state, and the LO diagram is a \(2 \rightarrow 3\) process (Fig. 1, lower). The predicted t-channel single top quark cross section for pp collisions at a centre-of-mass energy of 8 TeV is \(\sigma _{\mathrm{t}}= 54.9^{+2.3}_{-1.9}\) \(\text {\,pb}\) for the top quark and \(\sigma _{\overline{\mathrm{t}}}=29.7^{+1.7}_{-1.5}\) \(\text {\,pb}\) for the top antiquark. These values are obtained by a next-to-leading-order (NLO) calculation in quantum QCD with hathor v.2.1 [11, 12], assuming a top quark mass of 172.5\(\,\text {GeV}\). The parton distribution functions (PDFs) and \(\alpha _S\) uncertainties are calculated using the PDF4LHC prescription [13, 14] with the MSTW2008 68% confidence level (CL) NLO [15, 16], CT10 NLO [17], and NNPDF2.3 [18] PDF sets.

Fig. 1
figure 1

Feynman diagrams representing the dominant single top quark production mechanisms in the t channel

At 8 TeV, the predicted \(\mathrm{t}\overline{\mathrm{t}}\) production cross section is \(\sigma (\mathrm{t}\overline{\mathrm{t}}) = 252.9^{+6.4}_{-8.6}\,\text { (scale)} \pm 11.7\,(\mathrm {PDF} + \alpha _S)\,\text {\,pb} \) as calculated with the Top++2.0 program to next-to-next-to-leading order in perturbative QCD, including soft-gluon resummation to next-to-next-to-leading-log order (see Ref. [19] and references therein), and assuming a top quark mass of 172.5\(\,\text {GeV}\). In this calculation, the total scale uncertainty is obtained from the independent variation of the factorisation and renormalisation scales, \(\mu _\mathrm {F}\) and \(\mu _\mathrm {R}\), by a factor 2 and 1/2; the total PDF and \(\alpha _S \) uncertainties are estimated following the PDF4LHC prescription [14] with the MSTW2008 68% CL NNLO [16], CT10 NNLO [18], and NNPDF2.3 [20] FFN PDF sets.

Simulated events are used to optimise the event selection and to study the background processes and the expected performance of the analysis. The signal t-channel events are generated with the powheg generator, version 1.0 [21], in the 5-flavour scheme, interfaced with pythia  [22], version 6.426, for parton showering and hadronisation. Single top quark s-channel and tW associated production are considered as backgrounds for this measurement and simulated with the same generator. Top quark pair production, single vector boson production associated with jets (referred to as \(\mathrm {W}\)/\({\mathrm{Z}}\) +jets in the following), and double vector boson (diboson) production are amongst the background processes taken into consideration and have been simulated with the MadGraph generator, version 5.148 [23], interfaced with pythia for parton showering. The pythia generator is used to simulate QCD multijet event samples enriched with isolated muons. The value of the top quark mass used in all simulated samples is 172.5\(\,\text {GeV}\). All samples are generated using the CTEQ6.6M [24] PDF set and use the Z2* underlying event tune [25]. The factorisation and renormalisation scales are both set to \(m_{\mathrm {t}}\) for the single top quark samples, while a dynamic scale is used for the other samples, defined as the sum in quadrature of the transverse momentum (\(p_{\mathrm {T}}\)) and the mass of the particles produced in the central process. The passage of particles through the detector is simulated using the Geant4 toolkit [26]. The simulation includes additional overlapping pp collisions (pileup) with a multiplicity that is tuned to match the one observed in data.

4 Event selection and reconstruction

Signal events are characterised by a single isolated muon, momentum imbalance due to the presence of a neutrino, and one central b jet from the top quark decay. In addition, events often feature the presence of a light quark jet in the forward direction, from the hard-scattering process.

The online selection requires the presence of one isolated muon candidate with \(p_{\mathrm {T}}\) greater than 24\(\,\text {GeV}\) and absolute value of the pseudorapidity (\(\eta \)) below 2.1. Events are required to have at least one primary vertex reconstructed from at least four tracks, with a distance from the nominal beam-interaction point of less than 24 cm along the z axis and less than 2 cm in the transverse plane. In cases where more than one primary vertex is found, the one featuring the largest value of \(\Sigma p_{\mathrm {T}} ^2\) is retained (“leading vertex”), where the sum runs over all the tracks assigned to that vertex.

All particles are reconstructed and identified with the CMS particle-flow algorithm [27, 28]. Muon candidates are further required to have \(p_{\mathrm {T}} > 26\,\text {GeV} \), thus ensuring they are selected in the region of maximal trigger efficiency. Muon candidates are also required to be isolated. This is ensured by requiring \(I_{\text {rel}} < 0.12\), where \(I_{\text {rel}}\) is defined as the sum of the transverse energies deposited by long-lived charged hadrons, photons, and neutral hadrons in a cone of size \(\Delta R =\sqrt{\smash [b]{( \Delta \eta )^2 + ( \Delta \phi )^2}} = 0.4\) around the muon direction (\(\phi \) being the azimuthal angle, in radians), divided by the muon \(p_{\mathrm {T}}\) itself. An offset correction is applied to remove the additional energy included in the jets that come from pileup [29]. Events are rejected if an additional muon (electron) candidate is present, passing the selection criteria \(p_{\mathrm {T}} > 10\, (20)\) \(\,\text {GeV}\), \(|\eta |< 2.5\), and \(I_{\text {rel}} < 0.2\ (0.15)\).

To define jets, the reconstructed particles are clustered using the anti-\(k_{\mathrm {T}}\) algorithm [30] with a distance parameter of 0.5. Charged particles are excluded if they originate from a primary vertex that is not the leading vertex. The energy deposition in the jet due to neutral pileup particles is inferred and subtracted by considering charged pileup particles inside the jet cone. Additional corrections to the jet energies are derived from the study of dijet events and photon+jets events [31]. Jets are required to have \(|\eta | < 4.7\) and to have a corrected transverse energy greater than 40\(\,\text {GeV}\). Jets associated with the hadronisation of b quarks (“b jets”) are identified using a \(\mathrm{b}\) tagging algorithm based on the 3D impact parameter of the tracks in the jet to define a \(\mathrm{b}\) tagging discriminator [32]. The threshold for this variable is chosen such that the probability to misidentify jets coming from the hadronisation of light quarks (\(\mathrm{u}\), \(\mathrm{d}\), \(\mathrm{s}\)) or gluons is small (0.1%), while ensuring an efficiency of 46% for selecting jets coming from b quarks, as determined from the simulation of events with top quark topologies. Event weights are applied to adjust the b jet yields in the simulation to account for differences in the \(\mathrm{b}\) tagging efficiency between data and simulation.

The missing transverse momentum (\({\vec p}_{\mathrm {T}}^{\text {miss}}\)) is calculated as the negative vector sum of the transverse momenta of all reconstructed particles. Corrections to the jet energies, as well as an offset correction accounting for pileup interactions, are propagated to \({\vec p}_{\mathrm {T}}^{\text {miss}}\). The missing transverse momentum magnitude (\(p_{\mathrm {T}} ^\text {miss}\)) is required to exceed 50\(\,\text {GeV}\), to suppress the QCD multijet background.

To reject jets from pileup, non \(\mathrm{b}\)-tagged jets are rejected if the root-mean-square \(\eta \)-\(\phi \) radius of the particles constituting the jet with respect to the jet axis is larger than 0.025. To suppress background from QCD multijet events, the transverse mass of the \(\mathrm {W}\) boson \(m_\mathrm {T}(\mathrm {W})\) must be larger than 50\(\,\text {GeV}\), where \(m_\mathrm {T}(\mathrm {W})\) is constructed from the missing transverse momentum and muon transverse momentum vectors as

$$\begin{aligned} m_{\mathrm {T}}(\mathrm {W}) = \sqrt{\left( p_{\mathrm {T}} ^{\mu } + p_{\mathrm {T}} ^\text {miss} \right) ^2 - \left( p_x^{\mu } + p^{\text {miss}}_{\mathrm {T},x} \right) ^2 - \left( p_y^{\mu } + p^{\text {miss}}_{\mathrm {T},y} \right) ^2}. \end{aligned}$$
(1)

The same event reconstruction and selection of top quark candidates adopted by the CMS single top quark t-channel cross section measurement at 8 TeV in Ref. [5] is used. Due to the detector acceptance and jet selection requirements, most signal events are characterised by the presence of two reconstructed jets, one of which comes from the hadronisation of a \(\mathrm{b}\) quark. Therefore, events with two reconstructed jets, exactly one of which is b tagged, constitute the “signal region” (referred to as ‘2J1T’ in the following). Other event topologies are used to study background properties: the sample with two reconstructed jets, neither of which is b tagged (‘2J0T’) is dominated by \(\mathrm {W}\)+jets events; the sample with three reconstructed jets, where two jets are b tagged (‘3J2T’) is dominated by \(\mathrm{t}\overline{\mathrm{t}}\) events. For all topologies considered, the jet with the highest value of the \(\mathrm{b}\) tagging discriminator is used for top quark reconstruction, while that with the lowest value is taken to be the light-quark jet associated with top quark production (Fig. 1).

To enrich the sample in single top quark events, further requirements are applied to variables that exhibit good discriminating power with respect to \(\mathrm{t}\overline{\mathrm{t}}\) events, as described below. The selection criteria have been chosen after studying their effect on the purity of the sample, while verifying that the statistical uncertainty achievable on the top quark mass would not be excessively degraded.

A feature of single top quark production in the t channel is that the top quark is accompanied by a light-quark jet (the quark labelled \(\mathrm{q} '\) in Fig. 1), which is produced in a more forward direction than jets coming from \(\mathrm{t}\overline{\mathrm{t}}\) production or other background processes. This is reflected in the distribution of the absolute value of the pseudorapidity of the light-quark jet \(|\eta _{\mathrm {j}'} |\), shown in Fig. 2 (upper) for all reconstructed top quark candidates. A requirement of \(|\eta _{\mathrm {j}'} | > 2.5\) is applied to the sample. The stability of the selection has been checked by verifying that, if the events with \(|\eta _{\mathrm {j}'} | > 4\) were excluded, the final result would not be affected.

In t-channel single top quark production, top quarks are produced more frequently than top antiquarks due to the charge asymmetry of the proton-proton initial state [33], as seen in the muon charge distribution (Fig. 2, lower). To obtain as pure a sample as possible, only events with positively charged muons are retained.

Fig. 2
figure 2

Distribution of the light-quark jet pseudorapidity (upper) and of the muon charge (lower) for all top quark candidates in the muonic decay channel. Points with error bars represent data, stacked histograms show expected contributions from Monte Carlo simulation. The hatched area represents the uncertainty on the Monte Carlo predictions associated to the finite size of the samples and their normalization, and the integrated luminosity

5 Determination of the top quark mass

For each selected event, the top quark mass is reconstructed from the invariant mass \(m_{\mu \nu \mathrm{b}}\) calculated from the muon, the neutrino, and the \(\mathrm{b}\) jet. The 4-momenta of the muon and the jet are measured, while, for the neutrino, the 4-momentum is determined by using the missing transverse momentum in the event and a kinematical constraint on the \(\mathrm {\mu }\) \(\nu \) invariant mass, required to be consistent with the mass \(m_{\mathrm {W}}\) of the \(\mathrm {W}\) boson [34]:

$$\begin{aligned} m_{\mathrm {W}}^2= & {} \left( E^{\mu } + \sqrt{(p_{\mathrm {T}} ^\text {miss})^2+(p^{\nu }_{z})^2} \right) ^2 - \left( p_x^{\mu } + p^{\text {miss}}_{\mathrm {T},x} \right) ^2 \nonumber \\&- \left( p_y^{\mu } + p^{\text {miss}}_{\mathrm {T},y} \right) ^2-\left( p_{z}^{\mu } + p_{z}^{\nu } \right) ^2 , \end{aligned}$$
(2)

where \(E^{\mu }\) is the muon energy, \(p_{x}^{\mu }\), \(p_{y}^{\mu }\) and \(p_{z}^{\mu }\) are the components of the muon momentum, \(p_{z}^{\nu }\) is the longitudinal component of the neutrino momentum, and \(p_{\mathrm {T}} ^\text {miss}\) is used for the transverse components of the neutrino momentum. Equation 2 is quadratic in \(p_{z}^{\nu }\): when two real solutions are found, the one with the smallest value of \(|p_{z}^{\nu }|\) is taken; in the case of complex solutions, the imaginary component is eliminated by modifying \(p^{\text {miss}}_{\mathrm {T},x}\) and \(p^{\text {miss}}_{\mathrm {T},y}\) independently, so as to give \(m_\mathrm {T}(\mathrm {W}) = m_{\mathrm {W}}\)  [35].

Figure 3 shows the \(m_{\mu \nu \mathrm{b}}\) distributions before and after the final event selection. According to Monte Carlo simulation, after the final selection, 73% of the reconstructed top quarks come from single top quark production, and of these about 97% come from t-channel production.

Fig. 3
figure 3

Reconstructed \(\mu \nu \mathrm{b} \) invariant mass distribution for data (points with error bars) and Monte Carlo events (stacked histograms). (Left) initial selection; (right) final selection after the charge and light-quark jet pseudorapidity requirements. The ratio of the observed number of events in data to the number predicted by simulation is shown in the lower plots. The hatched area represents the uncertainty on the Monte Carlo predictions associated to the finite size of the samples and their normalization, and the integrated luminosity

The top quark mass is measured with an extended unbinned maximum-likelihood fit to the \(m_{\mu \nu \mathrm{b}}\) distribution. The numbers of events for the various contributions, except for the single top quark t-channel one, are fixed to the values extracted from simulation, taking into account the different theoretical cross sections [5]. The description of the parameterisation of the signal and background components used in the fit is presented below. The free parameters of the fit are the number of single top quark signal events and the parameters of the signal shape.

5.1 Parameterisation of top quark components

The shapes of the \(m_{\mu \nu \mathrm{b}}\) distributions for samples where a top quark is present are studied using simulated events.

The \(\mathrm{t}\overline{\mathrm{t}}\) component exhibits a wider peak, with a larger high-mass tail, compared to the single top quark t-channel component. The simulation shows that the number of muon and \(\mathrm{b}\) jet pairs correctly assigned to the parent top quark is around 55% for \(\mathrm{t}\overline{\mathrm{t}}\) events, while this fraction exceeds 90% for signal events. Both contributions can be fitted by Crystal Ball functions [36], with independent parameters \(\mu \) and \(\sigma \) representing the Gaussian core, and \(\alpha \) and n describing where the power-law tail begins and the exponent of the tail, respectively. The distributions obtained from the simulated samples before the final selection are shown in Fig. 4. The difference between the values of the \(\mu \) parameter of the Crystal Ball function obtained from the fits is \(m_{\mathrm{t}}(\text {t channel}) - m_{\mathrm{t}}(\mathrm{t}\overline{\mathrm{t}}) = 0.30 \pm 0.17\,\text {GeV} \), where the uncertainty is the statistical uncertainty from the fit.

The remaining single top quark components (s-channel and \(\mathrm{t} \mathrm {W}\) production) account for only about 3.5% of the final sample and their contribution is absorbed in the \(\mathrm{t}\overline{\mathrm{t}}\) component, since their distributions exhibit broader peaks with respect to the t channel.

Fig. 4
figure 4

Reconstructed \(\mu \nu \mathrm{b} \) invariant mass from Monte Carlo simulated events for single top quark t channel (upper) and \(\mathrm{t}\overline{\mathrm{t}}\) (lower). The continuous lines show the results of fits to Crystal Ball shapes

The parameter \(\mu \) of the Crystal Ball function describing the single top quark t-channel component is used to estimate the top quark mass. The mass is obtained by shifting the value of \(\mu \) resulting from the fit by an amount \(\Delta m\) depending on \(\mu \) itself. In order to calibrate the magnitude of the shift, the fit has been repeated on a set of simulated samples including all signal and background processes, where the t-channel single top quark and \(\mathrm{t}\overline{\mathrm{t}}\) events were generated with different values of the top quark mass, all other events remaining unchanged. Figure 5 shows the resulting values of \(\mu \) as a function of the generated top quark mass (upper) and the mass calibration curve from a fit to these values (lower). The \(\Delta m\) shift to be applied to the fitted value of \(\mu \) is expressed as a linear function of \(\mu \) itself. The shaded grey area represents the uncertainty associated with \(\Delta m\), obtained from the statistical uncertainties of the fits.

Fig. 5
figure 5

Mass calibration from fits to samples with different generated top quark mass. (Top) fit results as a function of the generated top quark mass. The straight line shows the result of a linear fit to the chosen top quark mass values. (Lower) mass shift, as a function of the fitted top quark mass (straight line). The shaded grey area represents the associated systematic uncertainty

5.2 Parameterisation of the non-top-quark background

The \(\mathrm {W}\)+jets events are expected to provide the largest contribution to the residual background. The ‘2J0T’ sample is mostly populated by such events and contains a large number of events, making it in principle a suitable control region to study the expected contribution of \(\mathrm {W}\)+jets events to the background in the signal region. However, the simulation shows that the reconstructed invariant mass distribution for \(\mathrm {W}\)+jets events in the ‘2J0T’ sample differs from that of the ‘2J1T’ sample. Thus, simulation has been used for the characterisation of the \(\mathrm {W}\)+jets component, as well as for all other non-top-quark background contributions. The shape of the invariant mass distribution for the sum of all non-top-quark background sources is well reproduced by a Novosibirsk function [37], with parameters \(\mu \) and \(\sigma \) representing the Gaussian core, and \(\tau \) describing the skewness of the distribution. The option to use the full simulated sample before the final selection, as is done for events containing top quarks, has not been chosen, as the parameters of the fitted function vary significantly with the requirement on \(|\eta _{\mathrm {j}'} |\), as shown in Fig. 6. Therefore, the sample obtained after applying the final selection is used to determine the shape parameters in the final fit.

Fig. 6
figure 6

Reconstructed \(\mu \nu \mathrm{b} \) invariant mass for non-top-quark background events, from Monte Carlo simulation. (Top) before final selection; (lower) after final selection. The continuous lines show the results of fits to Novosibirsk functions

5.3 Determination of the top quark mass from the fit

The invariant mass distribution of the selected top quark candidates is fitted with three components corresponding to signal, \(\mathrm{t}\overline{\mathrm{t}}\) and non-top-quark processes, using the probability density functions described above. The mass is obtained from the resulting value of the mean of the Gaussian core of the Crystal Ball function fitting the single top quark contribution, applying the calibration procedure described above. All parameters of the single top quark component are left free in the fit. The difference between the peak position of the t-channel and \(\mathrm{t}\overline{\mathrm{t}}\) components is kept fixed to the value measured in simulation, to reduce the statistical fluctuations due to the small number of residual \(\mathrm{t}\overline{\mathrm{t}}\) events. All remaining parameters (including normalisations) are fixed to the values extracted from simulation, after applying the final event selection.

The results of the fits to the simulated sample and to the collision data sample are shown in Fig. 7. The number of t-channel events returned by the fit is \(N_{\text {t-ch}}^{\text {fit}} = 2188 \pm 72\), in agreement with the number expected from simulation, \(N_{\text {t-ch}}^{\mathrm {MC}} = 2216^{+94}_{-78}\). A value of \(m_{\mathrm{t}} = 172.95 \pm 0.77\,\text {(stat)} \,\text {GeV} \) is obtained after applying the mass calibration (Fig. 5). A systematic uncertainty of \(0.39\,\text {GeV} \) is associated to the mass calibration procedure.

Fig. 7
figure 7

Result of the fit to the reconstructed \(\mu \nu \mathrm{b} \) invariant mass. Top Monte Carlo simulation; lower data. In each plot, the solid line represents the result of the full fit; the dotted line shows the non-top-quark component, while the dashed line shows the sum of all background components

5.4 Cross-checks

The consistency and stability of the fit are assessed using pseudo-experiments. Ensembles of experiments are simulated using the signal and background templates, with their normalisations distributed according to Poisson statistics. In each pseudo-experiment, the same fit described above is repeated and the top quark mass and the signal yield are derived. The resulting distributions of the top quark mass and its root-mean-square show that the fit does not have any significant bias, with the difference between fitted and generated top quark masses, normalised to the fitted mass uncertainty (“pull”), distributed as expected.

Additionally, a test has been made where both the single top quark contribution and the \(\mathrm{t}\overline{\mathrm{t}}\) components are fitted with a single Crystal Ball function. The results do not change appreciably within the present uncertainties with respect to the nominal fit.

The mass measurement for the single top quark contribution is derived after having removed the single top antiquark events. As a check, the analysis has been repeated and the top quark mass has been measured using single top antiquark events. The difference between the two measurements is \(0.8 \pm 1.2\,\text {GeV} \), with a difference of \(-0.6 \pm 1.5\,\text {GeV} \) expected from simulation. Furthermore, the fit has been performed by simultaneously fitting single top quark and single top antiquark candidates: the fitted mass does not statistically differ with respect to the result obtained with the nominal fit. These studies confirm that the selection of only the top quark candidates does not introduce any bias in the measured top quark mass.

6 Systematic uncertainties

Many of the uncertainties described below use modifications of the simulation to assess the impact on the final result. These modifications affect the shapes and normalisations of the templates used by the fit. Their contributions have been evaluated following the strategy adopted in Ref. [38]: the uncertainties are categorised consistently to allow effective combinations with other top quark mass measurements.

In the following, the sources of uncertainties identified as relevant for the measurement are described, as well as the procedure adopted to evaluate their impact. All the uncertainties are then combined in quadrature to derive the total systematic uncertainty.

  • Jet energy scale (JES): JES factors are applied to the jet energy response in simulation to match that observed in data. The JES uncertainties are \(p_{\mathrm {T}}\)- and \(\eta \)-dependent, and are taken into account by scaling the energies of all jets up and down according to their individual uncertainties, as determined in dedicated studies [31]. The scaling is then propagated to the calculation of \(p_{\mathrm {T}} ^\text {miss}\), and all other quantities dependent on the jet energies. The mass fit is repeated on the ‘scaled’ simulated sample and the shift with respect to the nominal fit is taken as a measure of the uncertainty. The uncertainties in the JES are subdivided into independent sources and grouped into different categories following the prescription defined in Ref. [39], aimed at simplifying the combination of measurements reported by the different LHC experiments. A total of five categories are identified referring to the effect of uncertainties related to the absolute scale determination using Drell–Yan events (“in-situ correlation group”), relative (\(\eta \)-dependent) calibration, and high- and low-\(p_{\mathrm {T}}\) extrapolation (“inter-calibration group”), flavour-specific corrections (“flavour-correlation group”), pileup corrections using an offset dependence on the jet \(p_{\mathrm {T}}\) (“pileup \(p_{\mathrm {T}}\) uncertainty”), and remaining sources, uncorrelated between ATLAS and CMS (“uncorrelated group”).

  • b quark hadronisation model: This is the term that accounts for the flavour-dependent uncertainties arising from the simulation of the parton fragmentation. The total uncertainty can be decomposed into two separate contributions: the \(\mathrm{b}\) quark fragmentation uncertainty and the uncertainty from \(\mathrm{b}\) hadron decays. The b quark fragmentation uncertainty has been derived in the same way as in the top quark mass measurement using semileptonic \(\mathrm{t}\overline{\mathrm{t}}\) events [38]. The Bowler–Lund fragmentation function for \(\mathrm{b}\) hadrons is retuned to agree with the \(x_{\mathrm {B}}\) data measured by the ALEPH [40] and DELPHI [41] Collaborations, where \(x_{\mathrm {B}}\) represents the fraction of the \(\mathrm{b}\) quark energy retained by a \(\mathrm{b}\) hadron. A weight is attributed to each event, according to the \(x_{\mathrm {B}}\) value, and the difference with respect to the nominal setup is taken as the systematic uncertainty. The systematic uncertainty from the semileptonic branching ratio of \(\mathrm{b}\) hadrons is taken from Ref. [38], in which the branching fractions were varied by \(-0.45\) and \(+0.77\%\) to give the possible range of values and the associated uncertainty.

  • Jet energy resolution (JER): After correcting for the mismatch between the data and simulation for the energy resolution, the uncertainty is determined by varying the corrected JER within its \(\eta \)-dependent ±1 standard deviation uncertainties. These changes are then propagated to the calculation of \(p_{\mathrm {T}} ^\text {miss}\).

  • Muon momentum scale: This contribution is determined by varying the reconstructed muon momenta by their uncertainties. These are determined as a function of the muon \(\eta \) and \(p_{\mathrm {T}}\) with a “tag-and-probe” method based on Drell–Yan data, as described in Ref. [42].

  • Unclustered missing transverse momentum: The uncertainty arising from the component of the missing transverse momentum that is not due to particles reconstructed as leptons and photons or clustered in jets (“unclustered \(p_{\mathrm {T}} ^\text {miss}\) ”) is determined by varying it by ±10%.

  • Pileup: This is the uncertainty coming from the modelling of the pileup in data. It is taken as the sum of the effect of the uncertainty in the pileup rate (evaluated with pseudo-experiments in which the effective inelastic pp cross section has been varied by ±5%) and the pileup term extracted from the JES ‘uncorrelated’ group (see above).

  • b tagging efficiency: To calculate the uncertainties from the \(\mathrm{b}\) tagging efficiency and the misidentification rate, the \(p_{\mathrm {T}}\)- and \(\eta \)-dependent \(\mathrm{b}\) tagging and misidentification scale factors are varied within their uncertainties for heavy- and light-flavour jets, as estimated from control samples [32]. The resulting changes are propagated to the event weights applied to the simulated events to obtain the uncertainties.

  • Fit calibration: The mass is derived from the value of \(\mu \) returned by the fit according to the mass calibration procedure described before: the same procedure provides an associated systematic uncertainty (Fig. 5, lower).

  • Background estimate: This is the uncertainty resulting from the use of simulated backgrounds in the mass determination. One contribution to the systematic uncertainty is determined by varying the background normalisations by ±1 times their standard deviation uncertainties. In addition, in the fit, the shape parameters of both the \(\mathrm{t}\overline{\mathrm{t}}\) and the W+jets components are fixed: these parameters are varied by ±1 times their standard deviation uncertainties. As there are theoretical uncertainties on the inputs to the simulation which may alter the background shapes used in the mass fit, an additional ‘radiation and matrix-element to parton-shower matching’ uncertainty is included, as described below.

  • Generator model: Depending on whether the b quarks are considered part of the proton or not, the production of single top quarks can be studied in the 5- or 4-flavour schemes [10], respectively. The signal sample used in this work is produced with the 5-flavour scheme, where the b quarks are considered as constituents of the proton. To estimate the systematic uncertainty due to treating the b quarks like the lighter quarks, a comparison between a 5- and a 4-flavour-scheme (\(2 \rightarrow 2\) and \(2 \rightarrow 3\), respectively) samples has been performed: in the latter, the b quarks are generated in the hard scattering from gluon splitting. The samples used for the comparison are produced using the CompHEP generator [43], version 4.5.1, with the same configuration as the nominal signal sample.

  • Hadronisation model: This uncertainty is already covered by the JES uncertainty and \(\mathrm{b}\) quark hadronisation uncertainties considered above. As a cross-check, the nominal simulation is compared with results obtained using the herwig generator [44], version 6.520, tune AUET2 [45], for parton showering and hadronisation. The resulting difference is in agreement with what is obtained for the JES uncertainty.

  • Radiation and matrix-element to parton-shower matching: This is the category which covers the QCD factorisation and renormalisation scale (with the nominal values of \(\mu _{\mathrm {R}}=\mu _{\mathrm {F}}=Q^2\)) and initial- and final-state radiation uncertainties. For the renormalisation and factorisation scale uncertainty determination, dedicated samples with \(\mu _{\mathrm {R}}\) and \(\mu _{\mathrm {F}}\) scales shifted up or down by a factor of 2 are used. The uncertainty is determined by comparing the central result with the shifted ones. For the matrix-element to parton-shower matching thresholds, \(\mathrm{t}\overline{\mathrm{t}}\) and \(\mathrm {W}\)+jets samples in which the thresholds have been shifted up or down by a factor of 2 are used, with the systematic uncertainty evaluated in the same way as for the scale uncertainty. This is not relevant for the signal data set, which does not have a matrix-element to parton-shower matching. This procedure covers initial- and final-state radiation uncertainties. All variations are applied independently of each other and the corresponding uncertainties are treated as uncorrelated.

  • Underlying event: This term represents the uncertainty coming from the modelling of the underlying event (UE), the particles from the interaction that do not enter into the hard parton-parton interaction. It is evaluated by comparing the results from two different tunes of pythia, the default Z2* tune and the Perugia tune [46]. The differences in the value of the fitted mass are within the statistical uncertainty determined by the size of the simulated samples. In fact, the two opposite variations result in mass shifts with the same sign. For this reason, the uncertainty from this source is estimated as the maximum statistical uncertainty of the changes.

  • Colour reconnection: This uncertainty is evaluated by comparing two different UE tunes in which one has the nominal colour-reconnection effects and the other has these turned off.

  • Parton distribution functions: The PDF4LHC [14] prescriptions are followed to calculate the uncertainty due to the choice of the PDFs. The variation of the fitted top quark mass is estimated by using alternative sets of PDFs with respect to the nominal one, namely the MSTW2008CP [17], CT10 [24], and NNPDF2.3 [15] sets.

The systematic uncertainties that affect the top quark mass measurement are summarised in Table 1. Sources of systematic uncertainties that are totally or partially uncorrelated with the top quark mass measurements using \(\mathrm{t}\overline{\mathrm{t}}\) events are the fit calibration, the background estimate, the generator model and theoretical parameters for the simulation of signal events, and the colour-reconnection effects.

Table 1 Systematic uncertainties in the top quark mass

7 Summary

The top quark mass is measured in a sample enriched in events with a single top quark for the first time. Top quarks are reconstructed from decays to a \(\mathrm {W}\) boson and a \(\mathrm{b}\) quark, with the \(\mathrm {W}\) boson decaying to a muon and a neutrino. In the final sample, events with a top quark from single production in the t-channel account for 73% of the total number of events with a top quark. The measurement is obtained from a fit to the distribution of the reconstructed mass of top quark candidates, where the t-channel single top quark component is modelled separately from the contribution of other top quark production channels. The measured value is \(m_{\mathrm{t}} = 172.95 \pm 0.77\,\text {(stat)} ^{+0.97}_{-0.93}\,\text {(syst)} \,\text {GeV} \). This is in agreement with the current combination of Tevatron and LHC results, \(173.34 \pm 0.27\,\text {(stat)} \pm 0.71\,\text {(syst)} \,\text {GeV} \), which is based on measurements using \(\mathrm{t}\overline{\mathrm{t}}\) events. Because many of the systematic uncertainties affecting the measurement of \(m_{\mathrm{t}}\) using single top quark t-channel events are totally or partially uncorrelated with the measurements using \(\mathrm{t}\overline{\mathrm{t}}\) events, and in addition the data sample analysed is largely statistically independent of the samples previously used, the result presented in this paper will be useful in the determination of world averages of the top quark mass.