1 Introduction

The three main production modes of single top quarks and antiquarks in proton–proton (pp) collisions occur via electroweak interactions and are commonly categorised through the virtuality of the exchanged \(\text {W}\)  boson four-momentum. They are called t channel (t ch) when the four-momentum is space-like, s channel when it is time-like, and \(\text {W}\)-associated (\({\text{tW}} \)) when the four-momentum is on shell. At the CERN LHC, the production via the t channel has the largest cross section of the three modes whose most-relevant Born-level Feynman diagrams are shown in Fig. 1. In the rest of this paper, “quark” is used to generically denote a quark or an antiquark, unless otherwise specified.

Fig. 1
figure 1

Born-level Feynman diagrams for single top quark production in the t channel. Corresponding diagrams also exist for single top antiquark production

The \(t\text {-channel}\) production process was first observed by the D0 and CDF experiments at the Tevatron [1, 2]. Its inclusive cross section has been measured with high precision at the CERN LHC by the ATLAS and CMS Collaborations at \(\sqrt{s}=7\), 8, and 13\(\text {TeV}\)  [3,4,5,6,7,8]. Differential cross sections have been determined as well at 7 and 8\(\text {TeV}\)  [3, 5, 9].

Differential cross section measurements can contribute to constraining the effective field theory operators [10], the top quark mass, the renormalisation and factorisation scales, and the parton distribution functions (PDFs) of the proton [11]. In particular, the ratio of the \(t\text {-channel}\) top quark to antiquark production is sensitive to the ratio of the up to down quark content of the proton [12, 13]. Furthermore, differential angular distributions can be used to assess the electroweak coupling structure at the \(\text {W}\)tb vertex. A “vector−axial-vector” (V−A) coupling is predicted in the standard model (SM), leading to the production of highly polarised top quarks [14,15,16]. A powerful observable to investigate the coupling structure in \(t\text {-channel}\) production is given by the top quark polarisation angle \(\theta _{\mathrm{pol}}^\star \), defined via

$$\begin{aligned} \cos \theta _{\mathrm{pol}}^\star = \frac{\vec {p}_{{{\text {q}}}^{\prime }}^{\star }\cdot \vec {p}_{{\ell }{}{}}^{\star }}{|\vec {p}_{{{\text {q}}}^{\prime }}^{\star } | |\vec {p}_{{\ell }{}{}}^{\star } |}, \end{aligned}$$
(1)

where the superscript signifies that the momenta of the charged lepton, \({\ell }{}{} \) (muon or electron), from the top quark decay, and the spectator quark, \({{\text {q}}}^{\prime } \), are calculated in the top quark rest frame. The normalised differential cross section as a function of \(\cos \theta _{\mathrm{pol}}^\star \) at the parton level is related to the top quark polarisation, P, as

$$\begin{aligned} \frac{1}{\sigma }\frac{\text {d}\sigma }{\text {d}\cos \theta _{\mathrm{pol}}^\star }=\frac{1}{2}\left( 1+2 A_{{\ell }{}{}}\cos \theta _{\mathrm{pol}}^\star \right) , A_{{\ell }{}{}}=\frac{1}{2}P\alpha _{{\ell }{}{}}, \end{aligned}$$
(2)

where \(A_{{\ell }{}{}}\) denotes the spin asymmetry and \(\alpha _{{\ell }{}{}}\) is the so-called spin-analysing power of the charged lepton [16]. The spin asymmetry and/or polarisation have been measured in \({\text {p}{}{}} {\text {p}{}{}} \) collision data by the ATLAS and CMS Collaborations at \(\sqrt{s}=8\) \(\,\text {Te}\text {V}\) using various analysis techniques [9, 17, 18].

In this paper, the differential cross section of combined single top quark and antiquark production in the t channel is measured by the CMS experiment at \(\sqrt{s}=13\) \(\,\text {Te}\text {V}\) as a function of the top quark transverse momentum (\(p_{\mathrm{T}} \)), rapidity, and polarisation angle, the \(p_{\mathrm{T}} \) and rapidity of the charged lepton that originates from the top quark decay, and the \(p_{\mathrm{T}} \) of the \(\text {W}{}{}\)  boson from the top quark decay. The spin asymmetry is further determined from the measured differential cross section with respect to the polarisation angle. Additionally, a measurement of the differential charge ratio is performed as a function of the \(p_{\mathrm{T}} \) and rapidities of the top quark and charged lepton, and the \(p_{\mathrm{T}} \) of the \(\text {W}{}{}\)  boson. Differential cross sections are measured at both the parton and particle levels using an unfolding procedure.

The analysis strategy and the structure of the paper are outlined in the following. A brief description of the CMS detector is given in Sect. 2, followed by a summary of the analysed data and simulated event samples in Sect. 3. The reconstruction of physics objects and the event selection are detailed in Sect. 4. To determine the contributions from signal and backgrounds a maximum-likelihood fit (ML) is performed separately in each bin of the measurement. In the fit, shape distributions, referred to in the following as templates, are fitted to the data. For the signal and all background processes, samples of simulated events are used to determine the shape distributions, except for the templates of events containing only jets produced through the strong interaction, which are referred to as “multijet” events in this paper. The procedure to estimate the templates of multijet events based on data in a sideband region is provided in Sect. 5. Section 6 describes the measurement of the number of \(t\text {-channel}\) single top quark events from data through an ML fit. In the fit, statistical and experimental systematic uncertainties are profiled, where the latter encompasses uncertainties related to the reconstruction, identification, and calibration of the selected events and physics objects. The resulting distributions of the observables are validated in control and signal regions in Sect. 7. The fit results are input to an unfolding procedure to determine the differential cross sections and charge ratios at the parton and particle levels, as detailed in Sect. 8. The sources of experimental and theoretical systematic uncertainties are described in Sect. 9. The results are presented in Sect. 10 and the paper is summarised in Sect. 11.

2 The CMS detector and event reconstruction

The central feature of the CMS apparatus is a superconducting solenoid of 6\(\,\text {m}\) internal diameter, providing a magnetic field of 3.8\(\,\text {T}\). Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Forward calorimeters (HF) extend the pseudorapidity (\(\eta \)) coverage provided by the barrel and endcap detectors. Muons are detected in gas-ionisation chambers embedded in the steel flux-return yoke outside the solenoid. A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [19].

The particle-flow (PF) algorithm [20] aims to reconstruct and identify each particle in an event with an optimised combination of information from various elements of the CMS detector. The energy of electrons is estimated from a combination of the electron momentum at the primary interaction vertex, as determined by the tracker, the energy of the corresponding ECAL cluster, and the energy sum of all bremsstrahlung photons spatially compatible with originating from the electron track. The energy of muons is obtained from the curvature of a global track estimated from reconstructed hits in the inner tracker and muon systems. The energy of charged hadrons is determined from a combination of their momentum measured in the tracker and the matching ECAL and HCAL energy deposits. Finally, the energy of neutral hadrons is obtained from the corresponding ECAL and HCAL energy deposits. In the regions \(|\eta |>3\), electromagnetic and hadronic shower components are identified in the HF.

Events of interest are selected using a two-tiered trigger system [21]. The first level, composed of custom hardware processors, uses information from the calorimeters and muon detectors whereas a version of the full event reconstruction software optimised for fast processing is performed at the second level, which runs on a farm of processors.

The missing transverse momentum vector, \({\vec p}_{\mathrm{T}}^{\text {miss}}\), is defined as the projection onto the plane perpendicular to the beams of the negative vector momentum sum of all PF candidates in an event. Its magnitude is referred to as \(p_{\mathrm{T}} ^\text {miss}\).

3 Data set and simulated samples

The analysed \({\text {p}{}{}} {\text {p}{}{}} \) collision data set was recorded in 2016 by the CMS detector and corresponds to an integrated luminosity of 35.9\(\,\text {fb}^{-1}\)  [22]. Events were triggered by requiring at least one isolated muon candidate with \(p_{\mathrm{T}} >24\) \(\,\text {Ge}\text {V}\) and \(|\eta | < 2.4\) or one electron candidate with \(p_{\mathrm{T}} >32\) \(\,\text {Ge}\text {V}\) and \(|\eta | < 2.1\), with additional requirements [23] that select genuine electrons with an efficiency of about 80%.

Various samples of simulated events are used in this measurement to evaluate the detector resolution, efficiency, and acceptance, estimate the contributions from background processes, and determine the differential cross sections at the parton and particle levels.

Single top quark events in the t channel are simulated at next-to-leading order (NLO) in the four-flavour scheme (4FS) with powheg  v2 [24, 25] interfaced with pythia v8.212 [26] for the parton shower simulation, using the CUETP8M1 [27] tune interfaced with madspin  [28] for simulating the top quark decay. For comparison, alternative NLO \(t\text {-channel}\) samples have been generated in the 4FS and five-flavour scheme (5FS), using MadGraph 5_amc@nlo v2.2.2 [29] interfaced with pythia.

The powheg  v2 generator is also used to simulate events from top quark pair production (\(\mathrm {t}\bar{\mathrm {t}}\)) at NLO. Parton showering is simulated with pythia using the CUETP8M2T4 tune [30]. The production of single top quark events via the tW channel is simulated at NLO using powheg  v1 [31] in the 5FS interfaced with pythia using the CUETP8M1 tune for the parton shower simulation. The overlap with top quark pair production is removed by applying the diagram removal scheme [32]. Samples of \(\text {W}{}{} \text {+jets}\) events are generated with MadGraph 5_amc@nlo v2.3.3 at NLO, and interfaced with pythia using the CUETP8M1 tune. The production of leptonically decaying \(\text {W}{}{}\)  bosons in association with jets is simulated with up to two additional partons at the matrix element level, and the FxFx scheme [33] is used for jet merging. Lastly, \(\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets}\) events are generated with MadGraph 5_amc@nlo v2.2.2 at leading order (LO), interfaced with pythia using the MLM jet matching scheme [34].

In these simulated samples, the NNPDF3.0 [35] NLO set is used as the default PDF, and a nominal top quark mass of 172.5\(\,\text {Ge}\text {V}\) is chosen where applicable. The simulated events are overlaid with additional collision interactions (“pileup”) according to the distribution inferred from the data. All generated events undergo a full Geant4  [36] simulation of the detector response.

The \(t\text {-channel}\) cross section in \({\text {p}{}{}} {\text {p}{}{}} \) collisions at \(\sqrt{s}=13\) \(\,\text {Te}\text {V}\) is predicted to be \(\sigma _{{\text {t}}} =136.0{\,}^{+5.4}_{-4.6}\) \(\,\text {pb}\) for the top quark and \(\sigma _{{ {{\text {t}}}}} =81.0{\,}^{+4.1}_{-3.6}\) \(\,\text {pb}\) for the top antiquark, calculated for a top quark mass of 172.5\(\,\text {Ge}\text {V}\) at NLO in quantum chromodynamics (QCD) using the hathor v2.1 [11, 37] program. The PDF and the strong coupling constant (\(\alpha _S \)) uncertainties are calculated using the PDF4LHC prescription [38, 39] with the MSTW2008 NLO 68% confidence level [40, 41], CT10 [42] NLO, and NNPDF2.3 [43] NLO PDF sets, and are added in quadrature with the renormalisation and factorisation scale uncertainty. The simulated samples of single top quark and antiquark events employed in this measurement—generated with similar settings—were normalised using the predicted cross sections above. Predictions at next-to-next-to-leading order are available as well [12] and are 3% smaller than the corresponding cross sections at NLO. However, these are not utilised since they have been calculated using a different PDF set and top quark mass value.

4 Event selection

Proton–proton collision events containing one isolated muon or electron and two or three jets are analysed. This signature selects events where the \(\text {W}{}{}\)  boson from a single top quark decays into a charged lepton and a neutrino. One of the selected jets is expected to stem from the hadronisation of a bottom quark that originates from the top quark decay. Another jet (\(j^{\prime }\)) from a light-flavoured quark (up, down, or strange) is expected from the spectator quark (labelled \({{\text {q}}}^{\prime }\) in Fig. 1) that is produced in association with the top quark. The jet from the spectator quark is characteristically found at relatively low angles with respect to the beam axis.

The reconstructed vertex with the largest value of summed physics-object \(p_{\mathrm{T}} ^2\) is taken to be the primary \({\text {p}{}{}} {\text {p}{}{}} \) interaction vertex. The physics objects are the jets, clustered using the jet finding algorithm described in Refs. [44, 45] with the tracks assigned to the vertex as inputs, and the negative vector \(\vec {p}_\mathrm {T}\) sum of those jets.

Muon candidates are accepted if they have \(p_{\mathrm{T}} >26\) \(\,\text {Ge}\text {V}\), \(|\eta |<2.4\), and pass the following identification requirements optimised for the selection of genuine muons. A global muon track must have a track fit with a \(\chi ^2\) per degree of freedom <10, have hits in the silicon tracker and muon systems, including at least six in the tracker, of which at least one must be in the pixel detector. Additionally, track segments are required in at least two muon stations to suppress signals from hadronic showers spilling into the muon system. Muon candidates are required to be isolated with a relative isolation parameter \(I_{\mathrm{rel}}^{{{\upmu }{}{}}} <6\)%, which is defined as the scalar sum of the transverse energies \(E_{\mathrm{T}}\) deposited in the ECAL and HCAL within a cone of radius \(\varDelta R = \sqrt{\smash [b]{(\varDelta \eta )^2+(\varDelta \phi )^2}} < 0.4\), divided by the muon \(p_{\mathrm{T}} \). The transverse energy is defined as \(E_{\mathrm{T}} =E\,\sin (\theta )\) with E and \(\theta \) being the energy and polar angle, respectively, of photons and charged and neutral hadrons. Here, \(\varDelta \eta \) and \(\varDelta \phi \) are the pseudorapidity and azimuthal angle, respectively, measured relative to the muon direction. The isolation parameter is corrected by subtracting the energy deposited by pileup, which is estimated from the energy deposited by charged hadrons within the isolation cone that are associated with pileup vertices [46].

Electron candidates are required to have \(p_{\mathrm{T}} >35\) \(\,\text {Ge}\text {V}\), \(|\eta |<1.48\), and fulfil a set of additional quality requirements as follows: the distance between the matched ECAL cluster position and the extrapolated electron track has to be within \(|\varDelta \eta |<3.08\times 10^{-3}\) and \(|\varDelta \phi |<8.16\times 10^{-2}\); the absolute difference between the inverse of the energy estimated from the ECAL cluster and the inverse of the electron track momentum must be less than \(12.9\,\text {Me}\text {V} ^{-1}\); the ratio of the HCAL to the ECAL energy associated with the electron is required to be less than 4.14%; the energy-weighted lateral width of the electron shower in the ECAL along the \(\eta \) direction is restricted to \({<}9.98\times 10^{-3}\). Electrons from photon conversions are suppressed by requiring that the corresponding track has no missing hits in the inner layers of the tracker and that they do not stem from a photon conversion vertex. Electron candidates have to be isolated using the so-called effective-area-corrected relative isolation parameter [47] by requiring \(I_\text {rel}^{{\text {e}}} <5.88\)%. This parameter is defined similarly to the muon isolation parameter as the sum of the charged and neutral particle energies within a cone of \(\varDelta R<0.3\) around the electron candidate, divided by the electron \(p_{\mathrm{T}} \). The relative contribution from pileup is estimated as \(A_\text {eff}\,\rho \) and subtracted from the isolation parameter, where \(A_\text {eff}\) denotes an \(\eta \)-dependent effective area, and \(\rho \) is the median of the \(E_{\mathrm{T}}\) density in a \(\delta \eta {\times }\delta \phi \) region calculated using the charged particle tracks associated with the pileup vertices.

The selected muon (electron) candidate has to be within 2.0 (0.5)\(\,\text {mm}\) in the transverse plane and 5.0 (1.0)\(\,\text {mm}\) along the beam direction of the primary vertex.

Electron candidates with showers in the ECAL endcap (\(1.48<|\eta |<2.5\)) are not used in the measurement because of the higher background consisting of hadrons misidentified as electrons and of electrons originating from decays of heavy-flavour hadrons, which is found to be about four times larger compared to the ECAL barrel region.

Events are rejected if additional muon or electron candidates passing looser selection criteria are present. The selection requirements for these additional muons/electrons are as follows: looser identification and isolation criteria, \(p_{\mathrm{T}} >10\) (15)\(\,\text {Ge}\text {V}\) for muons (electrons), and \(|\eta |<2.5\).

The transverse \(\text {W}{}{}\)  boson mass is calculated from the formula

$$\begin{aligned} m_{\mathrm{T}} (\text {W}{}{}) =\sqrt{\smash [b]{2p_{\mathrm{T}} ^{\ell }p_{\mathrm{T}} ^\text {miss} \left[ 1-\cos (\phi ^\ell -\phi ^\text {miss})\right] }} \end{aligned}$$
(3)

using the \(p_{\mathrm{T}} \) and the \(\phi \) of the charged lepton and \({\vec p}_{\mathrm{T}}^{\text {miss}}\).

Jets are reconstructed from PF candidates and clustered by applying the anti-\(k_{\mathrm{T}}\) algorithm [44] with a distance parameter of 0.4 using the FastJet package [45]. The influence of pileup is mitigated using the charged hadron subtraction technique [48]. The jet momentum is determined as the vectorial sum of all particle momenta in the jet. An offset correction is applied to the jet \(p_{\mathrm{T}} \) to account for contributions from pileup. Further corrections are applied to account for the nonuniform detector response in \(\eta \) and \(p_{\mathrm{T}} \) of the jets. The corrected jet momentum is found from simulation to be within 2 to 10% of the true momentum over the whole \(p_{\mathrm{T}}\) spectrum and detector acceptance. The corrections are propagated to the measured \({\vec p}_{\mathrm{T}}^{\text {miss}}\). A potential overlap of a jet with the selected lepton is removed by ignoring jets that are found within a cone of \(\varDelta R<0.4\) around a selected lepton candidate. The analysis considers jets within \(|\eta |<4.7\) whose calibrated \(p_{\mathrm{T}} \) is greater than 40\(\,\text {Ge}\text {V}\), with the exception of the HCAL–HF transition region (\(2.7<|\eta |<3\)) in which jets must have a \(p_{\mathrm{T}} \) of at least 50\(\,\text {Ge}\text {V}\) to reduce the contribution from detector noise. The event is accepted for further analysis if two or three jets are present.

To reduce the large background from \(\text {W}{}{} \text {+jets}\) events, a b  tagging algorithm based on a multivariate analysis (MVA) called “combined MVA” [49], which combines the results from various other b  tagging algorithms, is used for identifying jets produced from the hadronisation of b  quarks within the acceptance of the silicon tracker (\(|\eta |<2.4\)). A tight selection is applied on the discriminant of the algorithm, which gives an efficiency of \({\approx }\)50% for jets originating from true b  quarks and misidentification rates of \({\approx }\)0.1% for light jets from u, d, or s quarks or gluons and \({\approx }\)3% for jets from c quarks, as determined from simulation.

Corrections are applied to the simulated events to account for known differences with respect to data. Lepton trigger, reconstruction, and identification efficiencies are estimated with a “tag-and-probe” method [50] from \(\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets}\) events for data and simulation from which corrections are derived in bins of lepton \(\eta \) and \(p_{\mathrm{T}}\). The b  tagging performance in simulation is corrected to match the tagging efficiency observed in data, using scale factors that depend on the \(p_{\mathrm{T}} \) and \(\eta \) of the selected jets. The scale factors are estimated by dedicated analyses performed with independent data samples [49]. In particular, the mistagging rate of non-b jets in data is determined using the “negative-tag” method [51]. A smearing of the jet momenta is applied to account for the known difference in jet energy resolution in simulation compared to data. The profile of pileup interactions is reweighted in simulation to match the one in data derived from the measured instantaneous luminosity.

To classify signal and control samples of events, different event categories are defined, denoted “NjMb ”, where N is the total number of selected jets (2 or 3) and M is the number of those jets passing the b  tagging requirement (0, 1, or 2). The 2j1b category has the highest sensitivity to the signal yield, whereas the 2j0b and 3j2b categories, enriched in background processes with different compositions, are used to assess the background modelling.

One top quark candidate is reconstructed per event in the 2j1b signal category assuming \(t\text {-channel}\) single top quark production. The procedure commences by first reconstructing the \(\text {W}{}{}\)  boson. The component of the neutrino candidate momentum along the beam direction \(p_{z}\) is found by imposing a \(\text {W}{}{}\)  boson mass constraint (80.4\(\,\text {Ge}\text {V}\)) on the system formed by the charged lepton and \({\vec p}_{\mathrm{T}}^{\text {miss}} \), the latter being interpreted as the projection in the transverse plane of the four-momentum of the unknown neutrino, as in Ref. [52]. The four-momentum of the top quark candidate (from which its mass, \(p_{\mathrm{T}} \), and rapidity are derived) is then calculated as the vector sum of the four-momenta of the charged lepton, the b-tagged jet, and the neutrino candidate. The other (nontagged) jet is interpreted as originating from the spectator quark, which recoils against the \(\text {W}{}{}\)  boson.

5 Multijet background estimation

Since the probability for a simulated multijet event to mimic the final state of the signal process is very small, it becomes impractical to simulate a sufficiently large number of events for this background. Therefore, the background from multijet events in the analysis phase space region is estimated in a two-step procedure based on data in a sideband region. First, templates of the \(m_{\mathrm{T}} (\text {W}{}{}) \) distribution from multijet events are obtained from data in a sideband region. Their normalisations are then estimated in a second step through a template-based ML fit to the events in the 2j1b and 3j2b categories, simultaneously with the number of signal events, as described in Sect. 6. In this section, a dedicated ML fit is discussed that is performed on events in the 2j0b category only for validating the procedure. The outcome of this ML fit is not used further in the measurement.

In the muon channel, the sideband region is defined by inverting the muon isolation requirement (\(I_{\mathrm{rel}}^{{{\upmu }{}{}}} >20\%\)), which results in a region dominated by multijet events. In the electron channel, the electron candidate is required to fail loose identification criteria, yielding a sideband region consisting not only of nonisolated electrons but also of electrons that fail the photon conversion criteria or are accompanied by large amounts of bremsstrahlung, thus reflecting a combination of various effects. The templates used in the ML fit are determined for this category by subtracting the contamination from other processes, estimated using simulation and which amounts to about 10 (5)% in the muon (electron) channel, from the data.

The template shapes have been validated for various observables in the 2j0b \(\text {W}{}{} \text {+jets}\) control category where the fraction of selected multijet events amounts to approximately 10 (20)% for muon (electron) events, which is comparable to those in the signal category. The \(m_{\mathrm{T}} (\text {W}{}{}) \) distributions are shown in Fig. 2 for the muon (left) and electron (right) channel after the multijet templates (extracted from data) and the templates of the processes with prompt leptons (extracted from the simulated events) have been normalised to the result of a dedicated ML fit using only events in the 2j0b category. This dedicated fit encompasses only two components, which are the multijet template whose yield is unconstrained in the fit, and all other processes grouped together, with a constraint of \({\pm }30\%\) on their combined yield using a log-normal prior. The fit is performed while simultaneously profiling the impact of experimental systematic uncertainties (as discussed in Sect. 9) affecting the yield and shape of the templates. After the fit, the derived multijet templates and the simulated samples in both channels are found to describe the distributions of data well, thus validating the procedure for estimating the contribution of multijet events from data. For the measurement, the normalisations of the multijet templates in the 2j1b and 3j2b categories are estimated using a different procedure, as described in Sect. 6.

Fig. 2
figure 2

Distributions of the transverse \(\text {W}{}{}\)  boson mass in the 2 jets, 0 b  tag control category for the (left) muon and (right) electron channels after scaling the simulated and multijet templates to the result of a dedicated ML fit performed on this category of events. The hatched band displays the fit uncertainty. The lower plots give the ratio of the data to the fit results. The right-most bins include the event overflows

6 Signal yield estimation

The number of \(t\text {-channel}\) single top quark events in data is determined from an ML fit using the distributions of \(m_{\mathrm{T}} (\text {W}{}{}) \) and of two boosted decision tree (BDT) discriminants in the 2j1b category, and the \(m_{\mathrm{T}} (\text {W}{}{}) \) distribution in the 3j2b category. Simultaneously, the background yields and the impact of the experimental systematic uncertainties, modelled using nuisance parameters that influence yield and shape, are profiled.

The first BDT, labelled \(\mathrm {BDT}_{t\text {-ch}}\), has been trained separately on muon and electron events to discriminate \(t\text {-channel}\) single top quark events from \(\mathrm {t}\bar{\mathrm {t}}\), \(\text {W}{}{} \text {+jets}\), and multijet events using corresponding samples of simulated events. The following five observables have been chosen as input:

  • the absolute value of the pseudorapidity of the untagged jet, \(|\eta (j^{\prime }) |\);

  • the reconstructed top quark mass, \(m_{{\ell }{}{} {{\upnu }{}{}} {{\text {b}}}}\);

  • the transverse \(\text {W}{}{}\)  boson mass, \(m_{\mathrm{T}} (\text {W}{}{})\);

  • the distance in \(\eta \text {--}\phi \) space (\(\varDelta R\)) between the b-tagged and the untagged jet, \(\varDelta R({\text {b}},j^{\prime })\);

  • the absolute difference in pseudorapidity between the b-tagged jet used to reconstruct the top quark and the selected lepton, \(|\varDelta \eta ({\text {b}},{\ell }{}{})|\).

These have been selected based on their sensitivity for separating signal from background events, while exhibiting low correlations with the observables used to measure the differential cross sections. The resulting distribution of the \(\mathrm {BDT}_{t\text {-ch}}\) discriminant is presented in Fig. 3 (left).

The \(\mathrm {BDT}_{t\text {-ch}}\) discriminant shapes of the \(\text {W}{}{} \text {+jets}\) and \(\mathrm {t}\bar{\mathrm {t}}\) backgrounds are found to be very similar. To obtain sensitivity in the fit to both backgrounds individually, a second BDT, labelled \(\mathrm {BDT}_{\mathrm {t}\bar{\mathrm {t}}/\text {W}{}{}}\), has been trained separately on muon and electron events to classify events only for these two processes using the following six input observables: \(m_{{\ell }{}{} {{\upnu }{}{}} {{\text {b}}}}\); \(p_{\mathrm{T}} ^\text {miss}\); \(\varDelta R({\text {b}},j^{\prime })\); \(|\varDelta \eta ({\text {b}},{\ell }{}{})|\); the \(\text {W}{}{}\)  boson helicity angle, \(\cos \theta _{\text {W}{}{}}^\star \), defined as the angle between the lepton momentum and the negative of the top quark momentum in the \(\text {W}{}{}\)  boson rest frame [16]; and the event shape C, defined using the momentum tensor

$$\begin{aligned} S^{ab}=\frac{\sum _{i}^{\text {jets},{\ell }{}{},{\vec p}_{\mathrm{T}}^{\text {miss}}}p_{i}^{a} p_{i}^{b}}{\sum _{i}^{\text {jets},{\ell }{}{},{\vec p}_{\mathrm{T}}^{\text {miss}}}|\vec {p}_{i} |^2}, \end{aligned}$$
(4)

as \(C=3(\lambda _1\lambda _2+\lambda _1\lambda _3+\lambda _2\lambda _3)\), where \(\lambda _{1}\), \(\lambda _{2}\), and \(\lambda _{3}\) denote the eigenvalues of the momentum tensor \(S^{ab}\) with \(\lambda _1+\lambda _2+\lambda _3=1\). In the two most extreme cases, the event shape C vanishes for perfectly back-to-back dijet events (\(C=0\)) and reaches its maximum (\(C=1\)) if the final-state momenta are distributed isotropically. For the measurement, the \(\mathrm {BDT}_{\mathrm {t}\bar{\mathrm {t}}/\text {W}{}{}}\) discriminant is evaluated only in the phase space region defined by \(m_{\mathrm{T}} (\text {W}{}{}) >50\) \(\,\text {Ge}\text {V}\) and \(\mathrm {BDT}_{t\text {-ch}} <0\), which is found to be largely dominated by background events. Thus, the \(\mathrm {BDT}_{\mathrm {t}\bar{\mathrm {t}}/\text {W}{}{}}\) input observables do not have to be selected explicitly such that they possess low correlation with the observables used to measure the differential cross sections. The resulting \(\mathrm {BDT}_{\mathrm {t}\bar{\mathrm {t}}/\text {W}{}{}}\) discriminant distribution is displayed in Fig. 3 (right).

Fig. 3
figure 3

Distributions of the BDT discriminants in the 2 jets, 1 b  tag category: (left) \(\mathrm {BDT}_{t\text {-ch}}\) trained to separate signal from background events; (right) \(\mathrm {BDT}_{\mathrm {t}\bar{\mathrm {t}}/\text {W}{}{}}\) trained to separate \(\mathrm {t}\bar{\mathrm {t}}\) from \(\text {W}{}{} \text {+jets}\) events in a background-dominated category. Events in the muon and electron channels have been summed. The predictions have been scaled to the result of the inclusive ML fit and the hatched band displays the fit uncertainty. The regions of the distributions used in the fits are indicated in the lower panels, which show the ratio of the data to the fit result

The ML fit is performed using the following four distributions from events in various categories:

  • the \(m_{\mathrm{T}} (\text {W}{}{})\) distribution for events with \(m_{\mathrm{T}} (\text {W}{}{}) <50\) \(\,\text {Ge}\text {V}\) in the 2j1b category, which is particularly sensitive to the number of multijet events;

  • the \(\mathrm {BDT}_{\mathrm {t}\bar{\mathrm {t}}/\text {W}{}{}}\) discriminant distribution for events with \(m_{\mathrm{T}} (\text {W}{}{}) >50\) \(\,\text {Ge}\text {V}\) and \(\mathrm {BDT}_{t\text {-ch}} <0\) in the 2j1b category, which defines a region enriched in \(\mathrm {t}\bar{\mathrm {t}}\) and \(\text {W}{}{} \text {+jets}\) but depleted of signal and multijet events;

  • the \(\mathrm {BDT}_{t\text {-ch}}\) discriminant distribution for events with \(m_{\mathrm{T}} (\text {W}{}{}) >50\) \(\,\text {Ge}\text {V}\) and \(\mathrm {BDT}_{t\text {-ch}} >0\) in the 2j1b category, which is enriched in signal events;

  • the \(m_{\mathrm{T}} (\text {W}{}{})\) distribution in the 3j2b category, which provides additional sensitivity to the \(\mathrm {t}\bar{\mathrm {t}}\) yield, and thus further reduces the correlation between the estimated yields.

The \(m_{\mathrm{T}} (\text {W}{}{})\) distributions in the 2j1b and 3j2b categories are shown in Fig. 4 on the left and right, respectively. In the fit, each distribution is split in two by separating events depending on the charge of the selected muon or electron in the event. This results in eight distributions per lepton channel and thus 16 distributions in the \({{\upmu }{}{}}/{\text {e}} \) combined fit. A coarser equidistant binning of the distributions, as opposed to the one shown in Figs. 3 and 4, is used in the ML fits to prevent cases where single bins are depleted of background estimates as follows: four bins are used for each of the \(m_{\mathrm{T}} (\text {W}{}{})\) and \(\mathrm {BDT}_{t\text {-ch}}\) distributions in the 2j1b category; eight bins are used for the \(\mathrm {BDT}_{\mathrm {t}\bar{\mathrm {t}}/\text {W}{}{}}\) distribution; and ten bins are used for the \(m_{\mathrm{T}} (\text {W}{}{})\) distribution in the 3j2b category.

Fig. 4
figure 4

Distributions of the transverse \(\text {W}{}{}\)  boson mass for events in the (left) 2 jets, 1 b  tag and (right) 3 jets, 2 b  tags categories. Events in the muon and electron channels have been summed. The predictions have been scaled to the result of the inclusive ML fit and the hatched band displays the fit uncertainty. The regions of the distributions used in the fits are indicated in the lower panels, which show the ratio of the data to the fit result. The right-most bins include the event overflows

The yields of \(t\text {-channel}\) single top quark and antiquark events are measured independently. Background events containing top quarks (\(\mathrm {t}\bar{\mathrm {t}}\), \({\text{tW}}\)) are grouped together, and only their total yield is estimated. The top quark background yield is constrained using a log-normal prior with a width of \({\pm }10\)% to account for the uncertainty in the theoretical \(\mathrm {t}\bar{\mathrm {t}}\) and \({\text{tW}}\) production cross sections, and the uncertainty when two out of the four jets expected from semileptonic \(\mathrm {t}\bar{\mathrm {t}}\) production are not within the acceptance, as is the case in the 2j1b category. The electroweak background processes, \(\text {W}{}{} \text {+jets}\) and \(\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets}\), are grouped together as well, and an uncertainty of \({\pm }30\)% in their combined yield is applied using a log-normal prior constraint. This is motivated by the theoretical uncertainty in the modelling of the \(\text {W}{}{}\) and \(\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\) production rates in association with two or more (heavy-flavour) jets [53, 54]. The yields of multijet events are assumed to be independent per lepton type and event category. Their yields are constrained by a log-normal prior with a width of \({\pm }100\)% with respect to the template normalisations obtained from data in the sideband regions. In addition, an uncertainty in the predicted lepton charge ratio per background process, accounting for charge misreconstruction and uncertainties in the charge ratio [55], is taken into account using a Gaussian prior with a width of \({\pm }1\)% in the fit, for a total of 14 fit parameters. The impact of the finite number of simulated events on the templates is accounted for by employing the “Barlow–Beeston-lite” method [56].

Experimental systematic uncertainties, as detailed in Sect. 9, are profiled in the fit simultaneously with the yields and charge ratios. Each source is assigned a nuisance parameter according to which the shape and yield of the fit templates are modified.

The resulting event yields from a simultaneous fit to the data in the muon and electron channels are listed in Table 1. Overall, the distributions used in the fit, shown in Figs. 3 and 4, are found to be well modelled by the samples of simulated events and the multijet templates from data after normalising them to the fit result.

Table 1 Measured and observed event yields in the 2j1b category for each lepton channel and charge. The uncertainties in the yields are the combination of statistical and experimental systematic uncertainties

For each differential cross section measurement, the observable of interest is divided into intervals, discussed in Sect.  8, and a fit is performed in which the signal and background yields can vary independently in each of the intervals. The likelihoodL to be maximised in such fits can be expressed as

$$\begin{aligned}&\ln \left( L(\vec {\beta },\vec {\nu },\vec {R})\right) \nonumber \\&\quad = -\sum _{k}^\text {dist}\sum _{j}^\text {int}\sum _{i}^\text {bins}\left( d_{kji}\ln {p_{kji}(\vec {\beta _{j}},\vec {\nu },\vec {R})-p_{kji}(\vec {\beta _{j}},\vec {\nu },\vec {R})}\right) \nonumber \\&\qquad +\text {constraints}, \end{aligned}$$
(5)

where d denotes the number of observed events and p is the estimated yield. The summation overk denotes the 16 distributions (“dist”), j denotes the interval (“int”) in the observable (e.g. for the top quark \(p_{\mathrm{T}}\): 0–50\(\,\text {Ge}\text {V}\), 50–80 \(\,\text {Ge}\text {V}\), 80–120\(\,\text {Ge}\text {V}\), 120–180 \(\,\text {Ge}\text {V}\), and 180–300\(\,\text {Ge}\text {V}\)), and i denotes a bin in one of the 16 distributions per interval. The prediction \(\vec {p}_{kj}\), which includes all bins i for distribution k and interval j, is given by

$$\begin{aligned} \vec {p}_{kj}(\vec {\beta }_{j},\vec {\nu },\vec {R})&=\beta _{{\text {t}},j}\vec {T}_{{\text {t}},kj}^{t\text {-ch}}(\vec {\nu })+\beta _{\overline{{\mathrm{t}}},j}\vec {T}_{\overline{{\mathrm{t}}},kj}^{t\text {-ch}}(\vec {\nu })\nonumber \\&\quad +\beta _{\mathrm {t}\bar{\mathrm {t}}/{\text {tW}} ,j}\vec {T}_{kj}^{\mathrm {t}\bar{\mathrm {t}}/{\text {tW}} }(R_{j},\vec {\nu })\nonumber \\&\quad +\beta _{\text {W}{}{}/\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets},j}\vec {T}_{kj}^{\text {W}{}{}/\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets}}(R_{j},\vec {\nu })\nonumber \\&\quad +\beta _{\text {multijet},j}({\ell }{}{},r)\vec {T}_{kj}^{\text {multijet}}(R_{j}({\ell }{}{},r),\vec {\nu }), \end{aligned}$$
(6)

where \(\vec {\nu }\) are the nuisance parameters, R the charge ratios of each background process, and \(\beta \) the normalisations of the templates \(\vec {T}\), which are independent per lepton flavour \({\ell }{}{} \) and category \(r\in \){2j1b, 3j2b} for the multijet templates. The profiling of systematic uncertainties leads to a correlation between the \(t\text {-channel}\) top quark and antiquark yields in the same interval of about 20–30%. These correlations are propagated to the differential cross sections for each top quark charge, and are accounted for when calculating their sum and ratio.

Since the kinematic selection of electron events is restricted to \(p_{\mathrm{T}} >35\) \(\,\text {Ge}\text {V}\) and \(|\eta |<1.48\), which is tighter than for muon events (\(p_{\mathrm{T}} >26\) \(\,\text {Ge}\text {V}\), \(|\eta |<2.4\)), the signal yields in the lowest interval of the lepton \(p_{\mathrm{T}} \) and in the highest two intervals of the lepton rapidity spectra are estimated from the muon channel alone in the combined \({{\upmu }{}{}}/{\text {e}} \) fit.

7 Validation of signal and background modelling

The distributions of the observables that are unfolded are validated by comparing the predictions to the data in a background-dominated as well as in a signal-enriched region before unfolding. Both regions are defined for events in the 2j1b category that also satisfy \(m_{\mathrm{T}} (\text {W}{}{}) >50\) \(\,\text {Ge}\text {V}\) to suppress the contribution from multijet production. The modelling of the \(\mathrm {t}\bar{\mathrm {t}}{}{}/{\text{tW}}\) and \(\text {W}{}{}/\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets}\) backgrounds is validated in a background-dominated region obtained from events having \(\mathrm {BDT}_{t\text {-ch}} <0\). To validate the modelling of the \(t\text {-channel}\) process, events are instead required to pass \(\mathrm {BDT}_{t\text {-ch}} >0.7\), resulting in a sample enriched in signal events. These two regions and their selections are only defined and applied for validation purposes, and not used for measuring the differential cross sections for which the individual fit results are used in the unfolding instead.

The resulting distributions in both regions for all six observables that are unfolded are shown in Figs. 5 and 6 after the predictions have been scaled to the inclusive fit result. Overall good agreement between the data and the fit result is observed in the background-dominated region, thus validating the modelling of the \(\mathrm {t}\bar{\mathrm{t}}{}{}/{\text{tW}}\) and \(\text {W}{}{}/\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets}\) backgrounds. In the signal region, reasonable agreement is also observed.

Fig. 5
figure 5

Distributions of the observables in a (left column) background-dominated and a (right column) signal-enriched region for events passing the 2 jets, 1 b  tag selection: (upper row) top quark \(p_{\mathrm{T}}\); (middle row) charged lepton \(p_{\mathrm{T}}\); (lower row) \(\text {W}{}{}\)  boson \(p_{\mathrm{T}}\). Events in the muon and electron channels have been summed. The predictions have been scaled to the result of the inclusive ML fit and the hatched band displays the fit uncertainty. The plots on the left give the number of events per bin, while those on the right show the number of events per bin divided by the bin width. The lower panel in each plot gives the ratio of the data to the fit results. The right-most bins include the event overflows

Fig. 6
figure 6

Distributions of the observables in a (left column) background-dominated and a (right column) signal-enriched region for events passing the 2 jets, 1 b  tag selection: (upper row) top quark rapidity; (middle row) charged lepton rapidity; (lower row) cosine of the top quark polarisation angle. Events in the muon and electron channels have been summed. The predictions have been scaled to the result of the inclusive ML fit and the hatched band displays the fit uncertainty. The plots on the left give the number of events per bin, while those on the right show the number of events per bin divided by the bin width. The lower panel in each plot gives the ratio of the data to the fit results

8 Unfolding

The distributions from reconstructed events are affected by the detector resolution, selection efficiencies, and kinematic reconstruction, which lead to distortions with respect to the corresponding distributions at the parton or particle levels. The size of these effects varies with the event kinematics. In order to correct for these effects and determine the parton- and particle-level distributions, an unfolding method is applied to the reconstructed distributions. In this analysis, the tunfold algorithm [57] is chosen, which treats unfolding as a minimisation problem of the function

$$\begin{aligned} \chi ^2&=\left( \vec {y}-\varvec{R}\varvec{\epsilon }\vec {x}\right) ^\mathrm {T}\varvec{V}_{y}^{-1}\left( \vec {y}-\varvec{R}\varvec{\epsilon }\vec {x}\right) \nonumber \\&\quad +\underbrace{\tau ^2\left\Vert \varvec{L}(\vec {x}-\vec {x}_{0})\right\Vert ^2}_\text {regularisation}+\lambda \sum _{i}\left( \vec {y}-\varvec{R}\varvec{\epsilon }\vec {x}\right) _{i}, \end{aligned}$$
(7)

where \(\vec {y}\) denotes the measured yields in data, \(\varvec{V}_{y}\) is the covariance matrix of the measured yields, and \(\vec {x}\) is the corresponding differential cross section at parton or particle level. The matrices \(\varvec{R}\) and \(\varvec{\epsilon }\) denote the transition probability and selection efficiencies, respectively, both estimated from simulation. The signal yields and covariances are estimated through ML fits using the \(m_{\mathrm{T}} (\text {W}{}{})\), \(\mathrm {BDT}_{\mathrm {t}\bar{\mathrm {t}}/\text {W}{}{}}\), and \(\mathrm {BDT}_{t\text {-ch}}\) distributions, as detailed in Sect. 6.

A penalty term, based on the curvature of the unfolded spectrum [58, 59] encoded in the matrix \(\varvec{L}\), is added in the minimisation to suppress oscillating solutions originating from amplified statistical fluctuations. This “regularisation” procedure has a strength \(\tau \) that is chosen to minimise the global correlation between the unfolded bins. The “bias vector” \(\vec {x}_{0}\) is set to the expected spectrum from simulation. Pseudo-experiments using simulated data are performed to verify that the unfolding method estimates the uncertainties correctly, while keeping the regularisation bias at a minimum. No regularisation is applied when unfolding the lepton \(p_{\mathrm{T}}\) and rapidity spectra since the migrations between bins are found to be negligible. The overall normalisation of the unfolded spectrum is determined by performing a simultaneous minimisation with respect to the Lagrange multiplier \(\lambda \).

The parton-level top quark in simulation is defined as the generated on-shell top quark after quantum electrodynamic (QED) and QCD radiation, taking into account the intrinsic transverse momentum of initial-state partons. Events are required to contain either a muon or an electron from the top quark decay chain. This also includes muons or electrons from intermediately produced \({{\uptau }{}{}} \) leptons. In such events, the \(\text {W}{}{}\)  boson is chosen to be the direct daughter of the top quark. The spectator quark is selected from among the light quarks after QED and QCD radiation that are not products of the top quark decay. In case of ambiguities arising from initial-state radiation, the spectator quark that minimises the \(p_{\mathrm{T}} \) of the combined spectator quark and top quark system is chosen.

The top quark at the particle level (called “pseudo top quark”) is defined in simulated events by performing an event reconstruction based on the set of stable simulated particles after hadronisation [60]. In the context of this study, all particles with a lifetime of more than 30\(\,\text {ps}\) are considered stable. So-called “dressed” muons and electrons are constructed by accounting for the additional momenta carried by photons within a cone of \(\varDelta R<0.1\) around the corresponding prompt lepton that do not originate from hadronisation products. The \({\vec p}_{\mathrm{T}}^{\text {miss}} \) is defined as the summed momentum of all prompt neutrinos in the event. Jets at the particle level are clustered from all stable particles excluding prompt muons, prompt electrons, prompt photons, and all neutrinos using the anti-\(k_{\mathrm{T}}\) algorithm with a distance parameter of \(R=0.4\). From these objects, a pseudo top quark is reconstructed by first solving for the unknown neutrino \(p_{z}\) momentum, which is identical to the top quark reconstruction procedure applied to data, as described in Sect. 4. Events containing a single dressed muon or electron with \(p_{\mathrm{T}} >26\) \(\,\text {Ge}\text {V}\) and \(|\eta |<2.4\), together with two jets with \(p_{\mathrm{T}} >40\) \(\,\text {Ge}\text {V}\) and \(|\eta |<4.7\), are considered at the particle level. Jets that are closer than \(\varDelta R=0.4\) to the selected dressed muon or electron are ignored. The jet that yields a top quark mass closest to 172.5\(\,\text {Ge}\text {V}\) is assumed to come from the top quark decay, while the other jet is taken as the spectator jet.

The size of the binning intervals are chosen to minimise the migrations between the reconstructed bins while retaining sensitivity to the shapes of the distributions. The stability (purity) is defined as the probability that the parton- or particle-level (reconstructed) values of an observable within a certain range also have their reconstructed (parton-/particle-level) counterparts in the same range. Both quantities are found to be greater than or equal to 50% in most bins of all distributions, with the exception of a few bins at the parton level where purity and stability drop to 40%, and the first two bins of the polarisation angle distribution at the parton level where both quantities drop to about 25%. The stability and purity values are about 10% larger for the particle-level distributions than for the parton-level ones. The acceptance times efficiency for selecting \(t\text {-channel}\) single top quark events at the detector level is found to be 2–8 (20–30)% for muon events and 1–5 (10–20)% for electron events with respect to the parton (particle) level across the unfolding bins.

9 Systematic uncertainties

The measurements are affected by various sources of systematic uncertainty. For each systematic variation, new templates and response matrices are derived. Systematic variations can create correlations between the \(t\text {-channel}\) top quark and antiquark yields since both yields are estimated simultaneously from data through an ML fit, as described in Sect. 6.

The following experimental systematic uncertainties are profiled in the ML fit.

  • Background composition: As described in Sect. 6, the \(\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets}\) and \(\text {W}{}{} \text {+jets}\) processes and the \(\mathrm {t}\bar{\mathrm {t}}\) and \({\text{tW}} \) processes are separately grouped together in the ML fit. The ratios of the \(\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets}\) to the \(\text {W}{}{} \text {+jets}\) yields and the \(\mathrm {t}\bar{\mathrm {t}}\) to the \({\text{tW}} \) yields are assigned a \({\pm }20\)% uncertainty. This covers the uncertainty in the small \(\text {Z}{}{}/{{\upgamma }{}{}} ^{*}\text {+jets}\) and \({\text{tW}} \) yields, for which the analysis has little sensitivity.

  • Multijet shape estimation: The multijet event distributions are estimated from data by inversion of the muon isolation criterion or the electron identification criteria. The uncertainty in the shape of these distributions is estimated by varying the criteria. The requirement on the muon isolation parameter in the sideband region is modified from \(I_{\mathrm{rel}}^{{{\upmu }{}{}}} >20\)% to either \(20< I_{\mathrm{rel}}^{{{\upmu }{}{}}} < 40\)% or \(I_{\mathrm{rel}}^{{{\upmu }{}{}}} > 40\)%, and the electron isolation parameter to either \(I_\text {rel}^{{\text {e}}} < 30\)% or \(I_\text {rel}^{{\text {e}}} > 5.88\)%, while inverting the identification criteria. Another variation is done by requiring electrons in the sideband region to explicitly pass or fail the photon conversion criterion, which is also part of the electron identification requirement.

  • Efficiency of b  tagging and misidentification: The scale factors used to reweight the b  tagging and misidentification efficiencies in simulation to the ones estimated from data are varied within their uncertainties based on the true flavour of the selected jets [49].

  • Jet energy scale and resolution: The jet energy scale and resolution corrections are varied within their uncertainties [61]. The shifts induced in the jet momenta are propagated to \({\vec p}_{\mathrm{T}}^{\text {miss}} \) as well.

  • Unclustered energy: The contributions to \(p_{\mathrm{T}} ^\text {miss}\) of PF candidates that have not been clustered into jets are varied within their respective energy resolutions [62].

  • Pileup: The simulated distribution of pileup interactions is modified by shifting the total inelastic \({\text {p}{}{}} {\text {p}{}{}} \) cross section by \({\pm }5\%\) [63].

  • Lepton efficiencies: The scale factors that account for differences in the lepton selection and reconstruction efficiencies between data and simulation are varied within their uncertainties [23, 46].

The systematic uncertainties in the theoretical modelling of the simulated samples are estimated by using new templates and response matrices in the ML fit and unfolding for each variation. For each uncertainty source, the maximum difference of the up/down variations with the result using the nominal templates and response matrix is taken as the estimated uncertainty per bin. These are added in quadrature to the experimental uncertainty per bin.

The following sources of theoretical uncertainty have been evaluated.

  • Modelling of top quark \(p_{\mathrm{T}}\) in \(\mathrm {t}\bar{\mathrm {t}}\) events: Differential cross section measurements of \(\mathrm {t}\bar{\mathrm {t}}\) production by CMS [64, 65] have shown that the \(p_{\mathrm{T}}\) spectrum of top quarks in \(\mathrm {t}\bar{\mathrm {t}}\) events is significantly softer than predicted by NLO simulations. To correct for this effect, simulated \(\mathrm {t}\bar{\mathrm {t}}\) events are reweighted according to the scale factors derived from measurements at 13\(\,\text {Te}\text {V}\)  [65]. The difference in the predictions when using the default \(\mathrm {t}\bar{\mathrm {t}}\) simulation sample is taken as an additional uncertainty.

  • Top quark mass: The nominal top quark mass of 172.5\(\,\text {Ge}\text {V}\) is modified by \({\pm }0.5\) \(\,\text {Ge}\text {V}\) in the simulation [66]. The difference with respect to the nominal simulation results is taken as the corresponding uncertainty.

  • Parton distribution functions: The effect of the uncertainty in the PDFs is estimated by reweighting the simulated events using the recommended variations in the NNPDF3.0 NLO set, including a variation of \(\alpha _S \) [35]. The reweighting is performed using precomputed weights stored in the event record by the matrix element generator [67].

  • Renormalisation/factorisation scales: A reweighting procedure similar to that used for the PDFs is carried out on simulated \(t\text {-channel}\), \(\text {W}{}{} \text {+jets}\), and \(\mathrm {t}\bar{\mathrm {t}}\) simulated events to estimate the effect of the uncertainties in the renormalisation and factorisation scales. The weights correspond to independent variations by factors of 0.5 and 2 in the scales with respect to their nominal values. The envelope of all possible combinations of up-varied/down-varied scales with the exception of the extreme up/down combinations is considered as an uncertainty. This uncertainty is evaluated independently for the \(t\text {-channel}\), \(\text {W}{}{} \text {+jets}\), and \(\mathrm {t}\bar{\mathrm {t}}\) simulated event samples.

  • Parton shower: The uncertainties in the parton shower simulation are evaluated by comparing the nominal samples to dedicated samples with varied shower parameters. For \(t\text {-channel}\) single top quark production, the differences with respect to samples with a varied factorisation scale by a factor of 0.5 or 2 or with a varied powheg \(h_\text {damp}\) parameter are taken as two independent uncertainties. For the simulated \(\mathrm {t}\bar{\mathrm {t}}\) samples, the variation of the factorisation scale in both initial- and final-state radiation, and the \(h_\text {damp}\) parameter are evaluated as three independent uncertainties.

  • Underlying event tune: The impact of uncertainties arising from the CUETP8M2T4 underlying event tune [30] used in the simulation of \(\mathrm {t}\bar{\mathrm {t}}\) events is evaluated using dedicated samples with the tune varied within its uncertainties.

  • Colour reconnection: The default model of colour reconnection in pythia is based on multiple-particle interactions (MPI) with early resonance decays switched off. An uncertainty in the choice of this model is taken into account by repeating the measurement using three alternative models of colour reconnection in the simulation of \(t\text {-channel}\) single top quark and \(\mathrm {t}\bar{\mathrm {t}}\) production: the MPI-based scheme with early resonance decays switched on, a gluon-move scheme [68], and a QCD-inspired scheme [69].

  • Fragmentation model: The fragmentation of b quarks, modelled by the Bowler-Lund function [70], is varied within its uncertainties for \(t\text {-channel}\) single top quark and \(\mathrm {t}\bar{\mathrm {t}}\) production. Additionally, the impact when using the Peterson model [71] for b quark fragmentation instead is assessed.

In addition, an uncertainty of \({\pm }2.5\%\) in the measurement of the integrated luminosity of the data set [22] is taken into account by scaling the evaluated covariance matrix per observable accordingly.

10 Results

Differential cross sections of \(t\text {-channel}\) single top quark production as a function of the top quark \(p_{\mathrm{T}} \), rapidity, and polarisation angle, the \(p_{\mathrm{T}} \) and rapidity of the charged lepton (muon or electron) that originates from the top quark decay, and the \(p_{\mathrm{T}} \) of the \(\text {W}{}{}\)  boson from the top quark decay are presented in Figs. 7 and 8 at the parton and particle levels, respectively. The normalised differential cross sections of the same observables at the parton and particle levels are provided in Figs. 9 and 10. The total uncertainty is indicated by the vertical lines, while horizontal bars indicate the statistical and experimental uncertainties, which have been profiled in the ML fit, and thus exclude the uncertainties in the theoretical modelling and the luminosity. The differential cross sections refer to \(t\text {-channel}\) single top quark production where the top quark decays semileptonically (into either muon or electron) including events where the charged lepton stems from an intermediate \({{\uptau }{}{}} \) lepton decay. The results are compared to the predictions by the powheg generator interfaced with pythia in the 4FS and the MadGraph 5_amc@nlo generator interfaced with pythia in the 4FS and 5FS.

Fig. 7
figure 7

Differential cross sections for the sum of \(t\text {-channel}\) single top quark and antiquark production at the parton level: (upper row) top quark \(p_{\mathrm{T}}\) and rapidity; (middle row) charged lepton \(p_{\mathrm{T}}\) and rapidity; (lower left) \(\text {W}{}{}\)  boson \(p_{\mathrm{T}}\); (lower right) cosine of the top quark polarisation angle. The total uncertainty is indicated by the vertical lines, while horizontal bars indicate the statistical and experimental uncertainties, which have been profiled in the ML fit, and thus exclude the uncertainties in the theoretical modelling and the luminosity. Three different predictions from event generators are shown by the solid, dashed, and dotted lines. The lower panels show the ratios of the predictions to the data

Fig. 8
figure 8

Differential cross sections for the sum of \(t\text {-channel}\) single top quark and antiquark production at the particle level: (upper row) top quark \(p_{\mathrm{T}}\) and rapidity; (middle row) charged lepton \(p_{\mathrm{T}}\) and rapidity; (lower left) \(\text {W}{}{}\)  boson \(p_{\mathrm{T}}\); (lower right) cosine of the top quark polarisation angle. The total uncertainty is indicated by the vertical lines, while horizontal bars indicate the statistical and experimental uncertainties, which have been profiled in the ML fit, and thus exclude the uncertainties in the theoretical modelling and the luminosity. Three different predictions from event generators are shown by the solid, dashed, and dotted lines. The lower panels show the ratios of the predictions to the data

Fig. 9
figure 9

Normalised differential cross sections for the sum of \(t\text {-channel}\) single top quark and antiquark production at the parton level: (upper row) top quark \(p_{\mathrm{T}}\) and rapidity; (middle row) charged lepton \(p_{\mathrm{T}}\) and rapidity; (lower left) \(\text {W}{}{}\)  boson \(p_{\mathrm{T}}\); (lower right) cosine of the top quark polarisation angle. The total uncertainty is indicated by the vertical lines, while horizontal bars indicate the statistical and experimental uncertainties, which have been profiled in the ML fit, and thus exclude the uncertainties in the theoretical modelling. Three different predictions from event generators are shown by the solid, dashed, and dotted lines. The lower panels show the ratios of the predictions to the data

Fig. 10
figure 10

Normalised differential cross sections for the sum of \(t\text {-channel}\) single top quark and antiquark production at the particle level: (upper row) top quark \(p_{\mathrm{T}}\) and rapidity; (middle row) charged lepton \(p_{\mathrm{T}}\) and rapidity; (lower left) \(\text {W}{}{}\)  boson \(p_{\mathrm{T}}\); (lower right) cosine of the top quark polarisation angle. The total uncertainty is indicated by the vertical lines, while horizontal bars indicate the statistical and experimental uncertainties, which have been profiled in the ML fit, and thus exclude the uncertainties in the theoretical modelling. Three different predictions from event generators are shown by the solid, dashed, and dotted lines. The lower panels show the ratios of the predictions to the data

An overall good agreement of the results with the predictions from the 4FS is observed, except for a slight deviation at low top quark \(p_{\mathrm{T}} \). The predictions from the 5FS for the top quark and \(\text {W}{}{}\)  boson \(p_{\mathrm{T}}\) distributions do not agree as well with the data.

Differential ratios of the top quark production rates to the sum of the top quark and antiquark rates as a function of the top quark \(p_{\mathrm{T}} \) and rapidity, the \(p_{\mathrm{T}} \) and rapidity of the charged lepton, and the \(\text {W}{}{}\)  boson \(p_{\mathrm{T}} \) are presented in Figs. 11 and 12 at the parton and particle levels, respectively. It is found that the standard definition of the charge ratio in the literature, i.e. \(\sigma _{{\text {t}}}/\sigma _{{ {{\text {t}}}}} \), can yield large variances when the precision in certain intervals of the differential cross section for the top antiquark is low. Therefore, the charge ratio is defined as \(\sigma _{{\text {t}}}/\sigma _{{\text {t+t}}} \) in this paper. The ratios have been calculated from the measured cross sections at the parton and particle levels, while accounting for correlations between the top quark and antiquark spectra, as detailed in Sects. 6 and 9. The resulting charge ratios are compared to the predictions by the NNPDF3.0 NLO, MMHT14 NLO [72], and CT10 NLO PDF sets, which have been calculated using the powheg signal sample—generated in the 4FS and interfaced with pythia. The uncertainty bands shown in Figs. 11 and 12 represent the total uncertainty from varying the corresponding PDF eigenvectors and \(\alpha _S \). Within the uncertainties, the measured charge ratios are in good agreement with the predictions from all three PDF sets.

Fig. 11
figure 11

Ratio of the top quark to the sum of the top quark and antiquark \(t\text {-channel}\) differential cross section at the parton level: (upper row) top quark \(p_{\mathrm{T}}\) and rapidity; (middle row) charged lepton \(p_{\mathrm{T}}\) and rapidity; (lower row) \(\text {W}{}{}\)  boson \(p_{\mathrm{T}}\). The total uncertainty is indicated by the vertical lines, while horizontal bars indicate the statistical and experimental uncertainties, which have been profiled in the ML fit, and thus exclude the uncertainties in the theoretical modelling. Predictions from three different PDF sets are shown by the solid, dashed, and dotted lines. The lower panels show the ratios of the predictions to the data

Fig. 12
figure 12

Ratio of the top quark to the sum of the top quark and antiquark \(t\text {-channel}\) differential cross section at the particle level: (upper row) top quark \(p_{\mathrm{T}}\) and rapidity; (middle row) charged lepton \(p_{\mathrm{T}}\) and rapidity; (lower row) \(\text {W}{}{}\)  boson \(p_{\mathrm{T}}\). The total uncertainty is indicated by the vertical lines, while horizontal bars indicate the statistical and experimental uncertainties, which have been profiled in the ML fit, and thus exclude the uncertainties in the theoretical modelling. Predictions from three different PDF sets are shown by the solid, dashed, and dotted lines. The lower panels show the ratios of the predictions to the data

The spin asymmetry, sensitive to the top quark polarisation, is determined from the differential cross section as a function of the polarisation angle at the parton level (Fig. 7, lower right). A linear \(\chi ^{2}\)-based fit, assuming the expected functional dependence given in Eq. (2), is used to take the correlations between the unfolded bins into account. The measured spin asymmetry in the muon and electron channel and their combination is given in Table 2.

Table 2 The measured spin asymmetry in the muon and electron channel and their combination. A breakdown of the systematic uncertainties is also provided. Minor systematic uncertainties (lepton efficiencies, pileup, and unclustered energy) have been grouped into the “Others” category

The measured asymmetries are in good agreement with the predicted SM value of 0.436, found using powheg at NLO, with a negligible uncertainty. Good agreement is also found with a corresponding measurement by the ATLAS Collaboration at \(\sqrt{s}=8\) \(\,\text {Te}\text {V}\)  [17]. This measurement is found to be more precise than a previous analysis of the spin asymmetry at \(\sqrt{s}=8\) \(\,\text {Te}\text {V}\) by the CMS Collaboration [9]. In particular, the deviation found therein, corresponding to 2.0 standard deviations, is not seen.

11 Summary

Differential cross sections for \(t\text {-channel}\) single top quark and antiquark production in proton–proton collisions at \(\sqrt{s}=13\) \(\,\text {Te}\text {V}\) have been measured by the CMS experiment at the LHC using a sample of proton–proton collision data, corresponding to an integrated luminosity of 35.9\(\,\text {fb}^{-1}\). The cross sections are determined as a function of the top quark transverse momentum (\(p_{\mathrm{T}} \)), rapidity, and polarisation angle, the charged lepton \(p_{\mathrm{T}} \) and rapidity, and the \(p_{\mathrm{T}} \) of the \(\text {W}{}{}\)  boson from the top quark decay. In addition, the charge ratio has been measured as a function of the top quark, charged lepton, and \(\text {W}{}{}\)  boson kinematic observables. Events containing one muon or electron and two or three jets are used. The single top quark and antiquark yields are determined through maximum-likelihood fits to the data distributions. The differential cross sections are then obtained at the parton and particle levels by unfolding the measured signal yields.

The results are compared to various next-to-leading-order predictions, and found to be in good agreement. Furthermore, the top quark spin asymmetry, which is sensitive to the top quark polarisation, has been measured using the differential cross section as a function of the top quark polarisation angle at the parton level. The resulting value of \(0.440 \pm 0.070\) is in good agreement with the standard model prediction.

These results demonstrate a good understanding of the underlying electroweak production mechanism of single top quarks at \(\sqrt{s}=13\) \(\,\text {Te}\text {V}\) and in particular of the electroweak vector−axial-vector coupling predicting highly polarized top quarks. Lastly, the differential charge ratios, sensitive to the ratio of the up to down quark content of the proton, are found to be consistent with the predictions by various sets of parton distribution functions.