1 Introduction

The standard model (SM) of particle physics is extremely successful in describing the phenomena of elementary particles and their interactions. Nevertheless, it is believed to be only a low-energy realisation of a more general theory. In its current form, it fails to explain several observations, such as the nature of dark matter, the baryon asymmetry of the universe and the stabilisation of the Higgs boson mass against radiative corrections from the Planck scale. These shortcomings could be remedied by the existence of new particles at the TeV scale, which motivates extensive searches at the Large Hadron Collider (LHC).

One of the most compelling theories beyond the SM is Supersymmetry (SUSY) [1,2,3,4,5,6]. SUSY is a space-time symmetry that for each SM particle postulates the existence of a partner particle whose spin (S) differs by one-half unit. The introduction of gauge-invariant and renormalisable interactions into SUSY models can violate the conservation of baryon number (B) and lepton number (L), resulting in a proton lifetime shorter than current experimental limits [7]. This is usually solved by assuming that the multiplicative quantum number R-parity [8], defined as \(R~=~(-1)^{3(B-L)+2S}\), is conserved.

In the framework of a generic R-parity-conserving model, SUSY particles are produced in pairs, and the lightest supersymmetric particle (LSP) is stable and a candidate for dark matter [9, 10]. The scalar partners of right-handed and left-handed quarks (squarks), \(\tilde{q}_{\mathrm {R}}\) and \(\tilde{q}_{\mathrm {L}}\), can mix to form two mass eigenstates, \(\tilde{q}_1\) and \(\tilde{q}_2\), with \(\tilde{q}_1\) defined to be the lighter one. In the case of the supersymmetric partner of the top quark, \(\tilde{t}\), large mixing effects can lead to one top squark mass eigenstate, \(\tilde{t}_1\), that is significantly lighter than the other squarks. The charginos and neutralinos are mixtures of the bino, winos and Higgsinos that are superpartners of the U(1) and SU(2) gauge bosons and the Higgs bosons, respectively. Their mass eigenstates are referred to as \(\tilde{\chi }_{i}^{\pm }\) \((i=1,2)\) and \(\tilde{\chi }_{j}^{0}\) \((j=1,2,3,4)\) in order of increasing masses. In a large variety of models, the LSP is the lightest neutralino \(\tilde{\chi }^0_1\).

Fig. 1
figure 1

Diagrams representing the four main signals targeted by the analyses: a the decay of the top squark via the lightest chargino (\(\tilde{t} \rightarrow b {\tilde{\chi }^\pm _1} \)), b the two-body decay into an on-shell top quark and the lightest neutralino (\(\tilde{t} \rightarrow t {\tilde{\chi }^0_1} \)), c the three-body decay mode into an on-shell W boson, a b-quark and the lightest neutralino (\(\tilde{t} \rightarrow b W {\tilde{\chi }^0_1} \)) and d the four-body decay mode (\(\tilde{t} \rightarrow b f f' {\tilde{\chi }^0_1} \)) where the two fermions f and \(f'\) are a lepton with its neutrino in this article

In this paper a search for direct pair production of the top squark is reported, in final states with two isolated leptons (electrons or muons) and missing transverse momentum. The search utilises \(36.1~\hbox {fb}^{-1}\) of proton–proton collision data collected by the ATLAS experiment in 2015 and 2016 at a centre-of-mass energy \(\sqrt{s}=13\) TeV.

The top squark is assumed to decay into either the lightest chargino or the lightest neutralino. Depending on the mass difference between the top squark and the lighter SUSY particles, different decay modes are relevant. The decays \(\tilde{t} \rightarrow t {\tilde{\chi }^0_1} \) and \(\tilde{t} \rightarrow b {\tilde{\chi }^\pm _1} \) (where t and b represent either the quark or the anti-quark, depending on the charge conjugation) with \({\tilde{\chi }^\pm _1} \rightarrow W {\tilde{\chi }^0_1} \) dominate when they are kinematically accessible. For intermediate mass differences, \(m_{{\tilde{\chi }^0_1}}+m_{W}+m_{b}< m_{\tilde{t}} < m_{{\tilde{\chi }^0_1}}+m_{t}\), the three-body decay \(\tilde{t} \rightarrow b W {\tilde{\chi }^0_1} \) is considered. For smaller mass differences, the four-body decay channel \(\tilde{t} \rightarrow b f f' {\tilde{\chi }^0_1} \), where f and \(f'\) are two fermions from the \(W^{*}\) decay, is assumed to occur. In this search, f and \(f'\) are a lepton and its associated neutrino. For each of these decay modes, shown by the diagrams in Fig. 1, a dedicated event selection is performed to optimise the search significance, as detailed in Table 1.

The results of the searches are interpreted in simplified models [11,12,13] as a function of the top squark and lightest neutralino masses. Additionally, results are also interpreted in one phenomenological minimal supersymmetric standard model (pMSSM) [14,15,16,17] model including the following decay modes: \(\tilde{t} \rightarrow t {\tilde{\chi }^0_1} \), \(\tilde{t} \rightarrow b {\tilde{\chi }^\pm _1} \) with \({\tilde{\chi }^\pm _1} \rightarrow W {\tilde{\chi }^0_1} \) and \(\tilde{t} \rightarrow t {\tilde{\chi }^0_2} \), with \({\tilde{\chi }^0_2} \rightarrow h/Z {\tilde{\chi }^0_1} \). Previous ATLAS [18, 19] and CMS [20,21,22,23,24,25,2632] analyses have set exclusion limits at 95% confidence level (CL) on the signal scenarios considered here. When considering simplified models including the \(\tilde{t} \rightarrow t {\tilde{\chi }^0_1} \) decay, top squark masses up to about 700 GeV have been excluded for a nearly massless lightest neutralino. For the same assumptions about the lightest neutralino mass, if the \(\tilde{t} \rightarrow b {\tilde{\chi }^\pm _1} \) decay is dominant, top squark masses up to about 500 GeV have been excluded.

Table 1 Summary of the sections dedicated to the two-body, three-body and four-body selections and signal types targeted by each selection

2 ATLAS detector

The ATLAS detector [33] at the LHC is a multi-purpose particle detector with a cylindrical forward–backward symmetric geometryFootnote 1 and an approximate \(4\pi \) coverage in solid angle. It consists of an inner tracking detector (ID) surrounded by a thin superconducting solenoid providing a 2 T axial magnetic field, electromagnetic and hadron calorimeters, and a muon spectrometer. The inner tracking detector covers the pseudorapidity range \(|\eta | < 2.5\). It consists of silicon pixel, silicon microstrip, and transition radiation tracking detectors. The newly installed innermost layer of pixel sensors [34] was operational for the first time during the 2015 data-taking. Lead/liquid-argon (LAr) sampling calorimeters provide electromagnetic (EM) energy measurements with high granularity. A hadron (steel/scintillator-tile) calorimeter covers the central pseudorapidity range (\(|\eta | < 1.7\)). The end-cap and forward regions are instrumented with LAr calorimeters for both the EM and hadronic energy measurements up to \(|\eta | = 4.9\). The muon spectrometer surrounds the calorimeters and features three large air-core toroid superconducting magnets with eight coils each. It includes a system of precision tracking chambers and fast detectors for triggering. The field integral of the toroids ranges between 2.0 and 6.0 Tm across most of the detector.

3 Data samples and event reconstruction

The data were collected by the ATLAS detector in 2015 and 2016 during pp collisions at a centre-of-mass energy of \(\sqrt{s} = 13\) TeV, with a peak instantaneous luminosity of \(\mathcal{L} =1.4 \times 10^{34}~\hbox {cm}^{-2}\hbox {s}^{-1}\), a bunch spacing of 25 ns, and an average number of pp interactions per bunch crossing (pile-up) of \(\langle \mu \rangle = 14\) in 2015 and \(\langle \mu \rangle = 24\) in 2016. Only events taken in stable beam conditions, and for which all relevant detector systems were operational, are considered in this analysis. The integrated luminosity of the resulting data set is \(36.1~\hbox {fb}^{-1}\), with an uncertainty of \(\pm \,3.2\)%. This uncertainty is derived, following a methodology similar to that detailed in Ref. [35], from a preliminary calibration of the luminosity scale using xy beam-separation scans performed in August 2015 and May 2016.

Candidate events are required to have a reconstructed vertex with at least two associated tracks with transverse momentum \(p_\mathrm{T} > 400~\hbox {MeV}\). The vertex with the highest scalar sum of the squared transverse momenta of the associated tracks is considered the primary vertex of the event.

Electron (baseline) candidates are reconstructed from three-dimensional electromagnetic calorimeter energy depositions matched to ID tracks, and are required to have pseudorapidity \(|\eta |<2.47\), \(p_\mathrm{T}>7\hbox { GeV}\), and to pass a loose likelihood-based identification requirement [36]. The likelihood input variables include measurements of calorimeter shower shapes and of track properties from the ID.

Muon (baseline) candidates are reconstructed in the pseudorapidity region \(|\eta |<2.4\) from muon spectrometer tracks matching ID tracks. They must have \(p_\mathrm{T}> 7\hbox { GeV}\) and must pass the medium identification requirements defined in Ref. [37], which are based on requirements on the number of hits in the different ID and muon spectrometer subsystems, and on the significance of the charge-to-momentum ratio (q / p) measurement  [37].

Jets are reconstructed from three-dimensional energy clusters in the calorimeter [38] with the anti-\(k_t\) jet clustering algorithm [39, 40] with a radius parameter \(R=0.4\). Only jet candidates with \(p_\mathrm{T}>20\hbox { GeV}\) and \(|\eta |<2.8\) are considered. Jets are calibrated as described in Refs. [41, 42], and the expected average energy contribution from pile-up clusters is subtracted according to the jet area [43]. Additional selections are applied to jets with \(p_\mathrm{T}< 60 \hbox { GeV}\) and \(|\eta |<2.4\) in order to reject jets produced in pile-up collisions [44]. The “medium” working point is used for the pile-up rejection, which has an efficiency of about 92% for jets produced by the hard scatter. Jets resulting from the hadronisation of b-quarks are identified using a multivariate b-tagging algorithm (MV2c10), which is based on quantities such as impact parameters of associated tracks and reconstructed secondary vertices [45, 46]. This algorithm is used at a working point that provides 77% b-tagging efficiency in simulated \(t\bar{t}\) events, and a rejection factor of 134 for light-quark flavours and gluons and 6 for charm jets. The jets satisfying the b-tagging requirements are referred to as b-jets.

Events are discarded if they contain any jet with \(p_\mathrm{T}>20 \hbox { GeV}\) failing to satisfy basic quality selection criteria that reject detector noise and non-collision backgrounds [47].

To resolve reconstruction ambiguities, an overlap removal algorithm is applied to candidate leptons and jets. Non-b-tagged jets which lie within \(\Delta R =\sqrt{(\Delta y)^2+(\Delta \phi )^2} < 0.2\) (here y stands for the rapidity) from an electron candidate are removed, and the same is done for jets which lie close to a muon candidate and are consistent with the characteristics of jets produced by muon bremsstrahlung. Finally, any lepton candidate which lies within \(\Delta R < 0.4\) from the direction of a surviving jet candidate is removed, in order to reject leptons from the decay of a b- or c-hadron. Electrons which share an ID track with a muon candidate are also removed.

Additional selections are then applied to the remaining lepton and jet candidates. Tighter requirements on the lepton candidates are imposed, which are then referred to as “signal” electrons or muons. Signal electrons must satisfy the medium likelihood-based identification requirement as defined in Ref. [36]. Signal electrons must have a transverse impact parameter with respect to the reconstructed primary vertex, \(d_0\), with a significance of \(\vert d_0\vert /\sigma (d_0) < 5\). For signal muons, the corresponding requirement is \(\vert d_0\vert /\sigma (d_0) < 3\). The tracks associated with the signal leptons must have a longitudinal impact parameter with respect to the reconstructed primary vertex, \(z_0\), satisfying \(\vert z_0 \sin \theta \vert < 0.5\) mm. Isolation criteria are applied to both electrons and muons by placing an upper limit on the sum of the transverse energy of the calorimeter energy clusters in a cone of \(\Delta R_\eta = \sqrt{(\Delta \eta )^2+(\Delta \phi )^2} = 0.2\) around the electron (excluding the deposit from the electron itself), and the scalar sum of the \(p_\mathrm{T}\) of tracks within a variable-size cone around the lepton (excluding its own track). The track isolation cone radius for electrons (muons) is given by the smaller of \(\Delta R = 10~\hbox {GeV}/p_\mathrm{T}\) and \(\Delta R_\eta = 0.2\,(0.3)\). The isolation criteria are optimised such that the isolation selection efficiency is uniform across \(\eta \), and it increases from 95% for \(p_\mathrm{T} = 25~\hbox {GeV}\) to 99% for \(p_\mathrm{T} = 60~\hbox {GeV}\) in \(Z\rightarrow \ell \ell \) events.

Jets are required to have \(| \eta | < 2.5\).

The missing transverse momentum (\(\mathbf { p}^{\mathrm {miss}}_{\mathrm {T}}\)), whose magnitude is denoted by \(E_{\text {T}}^{\text {miss}}\), is defined as the negative vector sum of the transverse momenta of all identified baseline objects (electrons, muons, jets) and an additional soft term. The soft term is constructed from all tracks that are not associated with any reconstructed electron, muon or jet, but which are associated with the primary vertex. In this way, the \(E_{\text {T}}^{\text {miss}}\) value is adjusted for the best calibration of the jets and the other identified objects above, while maintaining pile-up independence in the soft term [48, 49].

4 Event selection

For the two-body and three-body selections, events are accepted if they pass an online selection (trigger) requiring a minimum of two electrons, two muons or an electron and a muon matched to the trigger objects. The offline selection requires that the leading lepton has a \(p_\mathrm{T}\) larger than 25 GeV and the subleading lepton a \(p_\mathrm{T}\) larger than 20 GeV, ensuring that trigger efficiencies are constant in the relevant phase space. The four-body selection accepts events passing an \(E_{\text {T}}^{\text {miss}}\)-based trigger and having offline \(E_{\text {T}}^{\text {miss}}\) > 200 GeV. This ensures that the trigger efficiency is constant in the relevant phase space. Using this trigger permits the use of a reduced lepton \(p_\mathrm{T}\) threshold of 7 GeV, increasing acceptance for the low lepton \(p_\mathrm{T}\) produced in the four-body \(\tilde{t} \rightarrow b\ell \nu {\tilde{\chi }^0_1} \) decay.

Events are required to have exactly two signal leptons which must be of opposite charge (electrons, muons, or one of each) with an invariant mass (regardless of the flavour of the leptons in the pair) \(m_{\ell \ell }\) greater than \(20~\hbox {GeV}\) (\(10~\hbox {GeV}\) for the four-body selection) in order to remove leptons from low-mass resonances. Except for the four-body selection, events with same-flavour (SF) lepton pairs with \(m_{\ell \ell }\) between 71.2 and 111.2 GeV are rejected, in order to reduce the backgrounds with leptons produced by Z bosons. No additional selection is applied to the \(m_{\ell \ell }\) value of different-flavour (DF) lepton pairs. In the following, the requirements described in the preceding part of this section are referred to as “common selection”.

4.1 Discriminators and kinematic variables

For the different decay modes considered, dedicated sets of discriminating variables are used to separate the signal from the SM backgrounds.

The missing transverse momentum and the \(p_\mathrm{T}\) of the leading leptons and jets are used to define three useful ratio variables :

$$\begin{aligned} R_{2\ell 2j}= & {} E_{\mathrm{T}}^{\mathrm{miss}} / (E_{\mathrm{T}}^{\mathrm{miss}} + p_\mathrm{T}(\ell _1) + p_\mathrm{T}(\ell _2) + p_\mathrm{T}(j_1) + p_\mathrm{T}(j_2)),\\ R_{2\ell }= & {} E_{\mathrm{T}}^{\mathrm{miss}} / (p_\mathrm{T}(\ell _1) + p_\mathrm{T}(\ell _2)), \end{aligned}$$

and

$$\begin{aligned} R_{2\ell 4j} = {E_{\mathrm{T}}^{\mathrm{miss}}}/(E_{\mathrm{T}}^{\mathrm{miss}} + p_\mathrm{T}(\ell _1) + p_\mathrm{T}(\ell _2) + \sum _{i=1,\ldots ,N\le 4}p_\mathrm{T}(j_i)), \end{aligned}$$

where \(p_\mathrm{T}(\ell _1)\) and \(p_\mathrm{T}(\ell _2)\) are the leading and subleading lepton transverse momenta and \(p_\mathrm{T}(j_{i =1,\ldots ,N\le 4})\) are the transverse momenta in decreasing order of up to the four leading jets. The variables \(R_{2\ell 2j} \) and \(R_{2\ell } \) are used to reject backgrounds, e.g. \(Z/\gamma ^*+\mathrm {jets} \), which peak at lower values than the signal. Similarly, \(R_{2\ell 4j}\) is a powerful discriminant against multi-jet events.

Other variables employed are :

  • \(\mathbf p^{\ell \ell }_{\mathrm {T,boost}}\): defined as the vector

    $$\begin{aligned} \mathbf p^{\ell \ell }_{\mathrm {T,boost}} = \mathbf { p}^{\mathrm {miss}}_{\mathrm {T}} + \mathbf {p}_\mathrm{T}(\ell _1) + \mathbf {p}_\mathrm{T}(\ell _2). \end{aligned}$$

    The \(\mathbf p^{\ell \ell }_{\mathrm {T,boost}}\) variable, with magnitude \(p^{\ell \ell }_{\mathrm {T,boost}}\), can be interpreted as the opposite of the vector sum of all the transverse hadronic activity in the event.

  • \(\Delta \phi _\mathrm {boost}\): the azimuthal angle between the \( \mathbf { p}^{\mathrm {miss}}_{\mathrm {T}} \) vector and the \(\mathbf p^{\ell \ell }_{\mathrm {T,boost}}\) vector.

  • \(\Delta x\): defined as

    $$\begin{aligned} \Delta x = \frac{2\cdot \left( p_{\text {z}} (\ell _1)+p_{\text {z}} (\ell _2)\right) }{E_{\text {C}M}} \end{aligned}$$

    where \(E_{\text {C}M}=13\) TeV is used and \(p_{\text {z}} (\ell _1)\),\(p_{\text {z}} (\ell _2)\) are respectively the leading and subleading lepton longitudinal momenta. This variable helps to discriminate between gluon- and quark-initiated processes. The former tend to peak towards zero, while the latter tend to peak at higher values.

  • \(\cos \theta _b\): the cosine of the angle between the direction of motion of either of the two leptons and the beam axis in the centre-of-mass frame of the two leptons [50]. This variable is sensitive to the spin of the pair-produced particle, providing additional rejection against diboson backgrounds.

  • \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\): lepton-based “stransverse” mass. The stransverse mass defined in Refs. [51, 52] is a kinematic variable used to bound the masses of a pair of identical particles which have each decayed into a visible and an invisible particle. This quantity is defined as

    $$\begin{aligned} m_{\tiny {\text{ T2 }}} ( \mathbf {p}_{\mathrm {T,1}}, \mathbf {p}_{\mathrm {T,2}}, \mathbf {q}_{\mathrm {T}})\\ = \min _{\mathbf {q}_{\mathrm {T,1}} + \mathbf {q}_{\mathrm {T,2}} = \mathbf {q}_{\mathrm {T}} } \left\{ \max [\; m_{\mathrm {T}}( \mathbf {p}_{\mathrm {T,1}}, \mathbf {q}_{\mathrm {T,1}} ), m_{\mathrm {T}}( \mathbf {p}_{\mathrm {T,2}}, \mathbf {q}_{\mathrm {T,2}} ) \;] \right\} , \end{aligned}$$

    where \(m_{\mathrm T}\) indicates the transverse mass,Footnote 2 \(\mathbf {p}_{\mathrm {T,1}}\) and \(\mathbf {p}_{\mathrm {T,2}}\) are the transverse momentum vectors of two particles, and \(\mathbf {q}_{\mathrm {T,1}}\) and \(\mathbf {q}_{\mathrm {T,2}}\) are transverse momentum vectors with \( \mathbf {q}_{\mathrm {T}} = \mathbf {q}_{\mathrm {T,1}} + \mathbf {q}_{\mathrm {T,2}}\). The minimisation is performed over all the possible decompositions of \(\mathbf {q}_{\mathrm {T}}\). For \(t\bar{t}\) or WW decays with \(t\rightarrow b \ell \nu \) and \(W \rightarrow \ell \nu \), when the transverse momenta of the two leptons in each event are taken as \(\mathbf {p}_{\mathrm {T,1}}\) and \(\mathbf {p}_{\mathrm {T,2}}\), and \(\mathbf { p}^{\mathrm {miss}}_{\mathrm {T}}\) as \(\mathbf {q}_\mathrm {T}\), \(m_T2 (\mathbf {p}_\mathrm{T}(\ell _1), \mathbf {p}_\mathrm{T}(\ell _2), \mathbf { p}^{\mathrm {miss}}_{\mathrm {T}})\) is bounded sharply from above by the mass of the W boson [53, 54]. In the \(\tilde{t} \rightarrow b {\tilde{\chi }^\pm _1} \) decay mode the upper bound is strongly correlated with the mass difference between the chargino and the lightest neutralino. In this paper, \(m_T2 (\mathbf {p}_\mathrm{T}(\ell _1), \mathbf {p}_\mathrm{T}(\ell _2), \mathbf { p}^{\mathrm {miss}}_{\mathrm {T}})\) is referred to simply as \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\).

The three-body selection uses a number of “super-razor” variables that are defined in Ref. [55]. They are designed to identify events with two massive parent particles (i.e. top squarks) each decaying into a set of visible (only leptons are considered in this case, all other particles including jets are ignored) and invisible particles (i.e. neutrinos and neutralinos). These variables are:

  • \(R_{p_{\text {T}}}\): defined as

    $$\begin{aligned} R_{p_{\text {T}}} = \frac{|\vec {J}_{\text {T}}|}{|\vec {J}_{\text {T}}| + \sqrt{\hat{s}}_{\text {R}}/4}, \end{aligned}$$

    where \(\vec {J}_{\text {T}}\) is the vector sum of the transverse momenta of the visible particles and the missing transverse momentum, and \(\sqrt{\hat{s}_{\text {R}}}\) is a measure of the system’s energy in the razor frame R as defined in Ref. [55] as the frame in which the two visible leptons have equal and opposite \(p_{\text {z}}\). In the case where all possible visible particles are considered, the razor frame R becomes an approximation of the pair production centre-of-mass frame with the centre-of-mass energy \(\sqrt{\hat{s}}_{\text {R}}\). In this analysis, only leptons are considered in the visible system. Therefore, \(R_{p_{\text {T}}}\) tends towards zero in events that do not contain additional activity (i.e. dibosons) due to vanishing \(|\vec {J}_{\text {T}}|\), whereas in events that contain additional activity (i.e. \(t\bar{t}\)) this variable tends towards unity, thus providing separation power between the two cases.

  • \(\gamma _\text {R+1}\): The Lorentz factor associated with the boosts from the razor frame R to the approximations of the two decay frames of the parent particles. It is a measure of how the two visible systems are distributed, tending towards unity when the visible particles are back-to-back or have different momenta, while preferring lower values when they are equal in momenta and collinear.

  • \(M_{\Delta }^\text {R}\): defined as

    $$\begin{aligned} M_{\Delta }^\text {R}\ = \frac{\sqrt{\hat{s}_{\text {R}}}}{\gamma _{{\text {R}}+1}}. \end{aligned}$$

    This variable has a kinematic end-point that is proportional to the mass-splitting between the parent particle and the invisible particle. Therefore, it provides rejection against both the top quark and diboson production processes when it is required to be greater than the mass of the W boson, and in this case it also helps to reject the residual \(Z/\gamma ^*+\mathrm {jets} \) background.

  • \(\Delta \phi _{\beta }^\text {R}\): The quantity \(\Delta \phi _{\beta }^\text {R}\) is the azimuthal angle between the razor boost from the laboratory to the R frame and the sum of the visible momenta as evaluated in the R frame. For systems where the invisible particle has a mass that is comparable to the pair-produced massive particle, this variable has a pronounced peak near \(\pi \), making it, in general, a good discriminator in searches for models with small mass differences.

4.2 Two-body event selection

This selection targets the top squark two-body decays (Fig. 1a, b) into either a bottom quark and a chargino, with the chargino decaying into the lightest neutralino and a W boson, or a near-mass-shell top quark and a neutralino.

In these decays, the kinematic properties of signal events are similar to those of \(t\bar{t}\) events. In particular, when the top squarks are produced at rest the momenta carried by the neutralinos in the final state are small and the discrimination difficult. Better separation between signal events and the \(t\bar{t}\) background can be obtained for top squark pairs which recoil from initial-state radiation (ISR).

Three signal regions (SRs), summarised in Table 2 and denoted by \(\mathrm {SR(A,B,C)}^{\mathrm {2-body}}_{x}\), where x stands for the lower bound of the \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\) interval, were optimised to target different scenarios:

  • \(\text {SRA}^{\text {2-body}}_{180}\) targets the decays into \(b {\tilde{\chi }^\pm _1} \) in scenarios where \(m_{\tilde{t}_1}-m_{{\tilde{\chi }^\pm _1}}\) is below 10 GeV and the b-jets from the decay of the \(\tilde{t}_1\) are too low in energy to be reconstructed. For this reason, b-jets with \(p_\mathrm{T}>25\) GeV are vetoed to reduce the contamination from SM processes including top quarks. No further requirement is imposed on the hadronic activity of the event. Events with SF leptons are required to have \(m_{\ell \ell } >111.2\) GeV and \(R_{2\ell 2j} >0.3\) to reduce the contamination from \(Z/\gamma ^*+\mathrm {jets} \) events. The contribution from diboson production is expected to be the dominant background in the SR and it is reduced by requiring the events to have \(\Delta x <0.07\). Furthermore, events are required to have \(m_{\tiny {\text{ T2 }}}^{\ell \ell } >180\) GeV.

  • \(\text {SRB}^{\text {2-body}}_{140}\) targets the decays into \(b {\tilde{\chi }^\pm _1} \) in scenarios with a mass-splitting between the top squark and the chargino larger than 10 GeV, such that the jets from the hadronisation of b-quarks are expected to be detectable. At least two jets with \(p_\mathrm{T}>25\) GeV are required, with at least one of them being identified as a b-jet. Events from \(t\bar{t}\) and \(Z/\gamma ^*+\mathrm {jets} \) production are suppressed by requiring \(\Delta \phi _\mathrm {boost} <1.5\). The main expected SM processes satisfying this selection are \(t\bar{t}\) and \(t\bar{t}+Z\) with the \(Z\) boson decaying into neutrinos. A final selection of \(m_{\tiny {\text{ T2 }}}^{\ell \ell } >140\) GeV is applied. Because of the similar final state, this selection is the most sensitive to signal scenarios in which the \(\tilde{t}_1\) decays into \(t+{\tilde{\chi }^0_1} \), with large \(m_{\tilde{t}_1} - m_{{\tilde{\chi }^0_1}}\).

  • \(\text {SRC}^{\text {2-body}}_{110}\) targets the decays into \(t+{\tilde{\chi }^0_1} \), in scenarios where \(m_{\tilde{t}_1} \sim m_{{\tilde{\chi }^0_1}}+m_{t}\). Candidate events are required to have \(E_{\text {T}}^{\text {miss}} > 200\) GeV and at least three jets with \(p_\mathrm{T}>25\) GeV, where one of the jets is interpreted as ISR. The other two jets are expected to arise from the decay of the top quarks in the final state. One of the jets in the event is required to be b-tagged, effectively separating the signal events from SM diboson production. The \(Z/\gamma ^*+\mathrm {jets} \) background is suppressed by requiring \(R_{2\ell } \) to be larger than 1.2. Events are finally required to have \(m_{\tiny {\text{ T2 }}}^{\ell \ell } >110\) GeV.

For the model-dependent exclusion limits, a shape fit of the \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\) distribution is performed for the \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\) selections: the distribution is divided into bins of width 20 GeV, starting from \(m_{\tiny {\text{ T2 }}}^{\ell \ell } =120\) GeV; the last bin’s low boundary corresponds to the requirement on the same variable in the definitions of \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\); each bin is referred to as \(\mathrm {SR(A,B)}^{\mathrm {2-body}}_{x,y}\), where x and y denote the low and high edges of the bin.

Table 2 Two-body selection signal region definitions

4.3 Three-body event selection

This selection targets the top squark three-body decay mode (Fig. 1c), which is expected to be the dominant decay mode when the two-body decay mode into the lightest chargino or neutralino is kinematically forbidden, i.e. for \(m_{{\tilde{\chi }^0_1}}+m_{W}+m_{b}< m_{\tilde{t}_1} < m_{{\tilde{\chi }^0_1}} + m_{t}\) and \(m_{\tilde{t}_1} < m_{{\tilde{\chi }^\pm _1}} + m_{b}\).

Two orthogonal signal regions, SR\(^{\text {3-body}}_{W}\) and SR\(^{\text {3-body}}_{t}\), are summarised in Table 3. The SR\(^{\text {3-body}}_{W}\) targets the region where \(\Delta m(\tilde{t},{\tilde{\chi }^0_1}) \sim m_{W}\) in which the produced b-jets have low transverse momentum, and hence are often not reconstructed. The second signal region SR\(^{\text {3-body}}_{t}\) targets the region in which \(\Delta m(\tilde{t},{\tilde{\chi }^0_1}) \sim m_{t}\).

The two regions make use of a common set of requirements on \(R_{p_{\text {T}}}\), \(\gamma _\text {R+1}\), and in the two-dimensional (\(\cos \theta _b\), \(\Delta \phi _{\beta }^\text {R}\)) plane. In addition, SR\(^{\text {3-body}}_{W}\) requires that no b-jet is identified in the event and that \(M_{\Delta }^\text {R}\)  \(>95\) GeV. The large \(M_{\Delta }^\text {R}\) requirement suppresses the top quark and diboson backgrounds. In the case of SR\(^{\text {3-body}}_{t}\), the requirements are: at least one b-jet and \(M_{\Delta }^\text {R}\) \(>110\) GeV. The b-jet requirement makes the selection orthogonal to SR\(^{\text {3-body}}_{W}\), so that the two SRs can be statistically combined. Furthermore, a slightly tighter \(M_{\Delta }^\text {R}\) requirement is necessary to eliminate the background that originates from top quark production processes.

Table 3 Three-body selection signal region definitions

4.4 Four-body event selection

The selection described here targets the four-body decay mode of the top squark (Fig. 1d) for scenarios where \(m_{\tilde{t}_1} < m_{{\tilde{\chi }^0_1}} + m_{b} + m_{W}\) and \(m_{\tilde{t}_1} < m_{{\tilde{\chi }^\pm _1}} + m_{b}\). In this region the top squark decay into \(c{\tilde{\chi }^0_1} \) might be dominant, depending on various SUSY model parameters. The branching ratio into this final state is here assumed to be negligible. For these small mass splittings, the leptons in the final state, originating from the virtual W boson decays, are expected to have low \(p_\mathrm{T}\).

Signal events can be distinguished from SM processes if a high-\(p_\mathrm{T}\) jet from ISR leads to a large transverse boost of the sparticle pair system and enhances the \(E_{\text {T}}^{\text {miss}}\) value. At least two jets with \(p_\mathrm{T}>25~\hbox {GeV}\) are required in the event. The leading jet is considered to be the ISR jet and required to have \(p_\mathrm{T}>150\) GeV. Since the jets resulting from \(\tilde{t}\) decays tend to have low \(p_\mathrm{T}\) in this scenario, at most one more energetic jet with \(p_\mathrm{T}>25~GeV\) is permitted in the event and the transverse momentum of the third jet (if present) must satisfy \(p_\mathrm{T}(j_3)/E_{\text {T}}^{\text {miss}} < 0.14\).

In order to remove events originating from low-mass resonances, the invariant mass of the two leptons, \(m_{\ell \ell }\), is required to be greater than 10 GeV. Furthermore, upper limits on \(p_{\text {T}} (\ell _1)\) and \(p_{\text {T}} (\ell _2)\), respectively of 80 GeV and 35 GeV, are applied.

The signal region \(\text {SR}^{\mathrm {4-body}}\) is defined as summarised in Table 4. The two variables \(R_{2\ell 4j} \) and \(R_{2\ell } \) must be larger than 0.35 and 12 to reject multi-jet and \(t\bar{t}\) backgrounds, respectively. Finally, the two most energetic jets in the event must not be tagged as b-jets.

Table 4 Four-body selection signal region definition

5 Samples of simulated events

Monte Carlo (MC) simulated event samples are used to aid in the estimation of the background from SM processes and to model the SUSY signal. The event generator, parton shower and hadronisation generator, cross-section normalisation, parton distribution function (PDF) set and underlying-event parameter set (tune) of these samples are given in Table 5, and more details of the event generator configurations can be found in Refs. [56,57,58,59]. Cross-sections calculated at next-to-next-to-leading order (NNLO) in QCD including resummation of next-to-next-to-leading logarithmic (NNLL) soft gluon terms were used for top quark production processes. For production of top quark pairs in association with vector or Higgs bosons, cross-sections calculated at next-to-leading order (NLO) were used, and the event generator cross-sections calculated by Sherpa (at NLO for most of the processes) are used when normalising the multi-boson backgrounds. In all MC samples, except those produced by Sherpa, the EvtGen v1.2.0 program [60] was used to model the properties of the bottom and charm hadron decays. Additional MC samples are used when estimating systematic uncertainties, as detailed in Sect. 7.

Table 5 Simulated signal and background event samples: the corresponding event generator, parton shower generator, cross-section normalisation, PDF set and underlying-event tune are shown

SUSY signal samples were generated from leading-order (LO) matrix elements with up to two extra partons, using the MadGraph5_aMC@NLO  [61] event generator. The two-body signals used Pythia 8.186 [62] for the modelling of the SUSY decay chain, parton showering, hadronisation and the description of the underlying event. The three-body and four-body signals were decayed with Pythia8 + MadSpin [86] instead. Parton luminosities were provided by the NNPDF23LO PDF set. Jet–parton matching was realised following the CKKW-L prescription [87], with a matching scale set to one quarter of the pair-produced superpartner mass. In all cases, the mass of the top quark was fixed at 172.5 GeV. Signal cross-sections were calculated to next-to-leading order in the strong coupling constant, adding the resummation of soft gluon emission at next-to-leading-logarithmic accuracy (NLO + NLL) [67, 88, 89]. The nominal cross-sections and their uncertainties were taken from an envelope of cross-section predictions using different PDF sets and factorisation and renormalisation scales, as described in Ref. [68]. All two-, three- and four-body samples were generated assuming a 100% branching ratio into the respective final states.

For the pMSSM inspired models, the mass spectrum of sparticles was calculated using Softsusy 3.7.3 [90] and cross-checked with SPheno 3.3.8 [91, 92] and Suspect 2.5 [93]. Hdecay and Sdecay, included in Susy-Hit [94] were used to generate decay tables of the SUSY particles.

To simulate the effects of additional pp collisions in the same and nearby bunch crossings, additional interactions were generated using the soft QCD processes of Pythia 8.186 with the A2 tune [95] and the MSTW2008LO PDF set [96], and they were overlaid onto each simulated hard-scatter event. The MC samples were reweighted to the pile-up distribution observed in the data. The MC samples were processed through an ATLAS detector simulation [97] based on Geant4 [98] or, in the case of \(t\bar{t} t\) and the SUSY signal samples, a fast simulation using a parameterisation of the calorimeter response and Geant4 for the other parts of the detector [99]. All MC samples are reconstructed in the same manner as the data. Corrections derived from data control samples are applied to simulated events to account for differences between data and simulation in reconstruction efficiencies, momentum scale and resolution of leptons and in the efficiency and false positive rate for identifying jets resulting from the hadronisation of b-quarks.

6 Background estimation

The dominant SM background processes satisfying the SR requirements are estimated by simulation, which is normalised to data and verified in separate regions of the phase space. Dedicated control regions (CRs), described in Sects. 6.16.3, enhanced in a particular background component are used for the normalisation. Subdominant background yields are taken directly from MC simulation or from additional independent studies in data. For each signal region, a simultaneous “background fit” is performed to the number of events found in the CRs, using a statistical minimisation based on a likelihood implemented in the HistFitter package [100]. In each fit, the normalisations of the background contributions having dedicated CRs are allowed to float, while the MC simulation is used to describe the shape of distributions of kinematical variables. The level of agreement between the background prediction and data is compared in dedicated validation regions (VRs), which are not used to constrain the background normalisation or nuisance parameters in the fit.

In order to keep the background control region kinematically as close as possible to the SR, the two-body, three-body and four-body selections use different sets of CRs. The definitions of the regions used in each analysis and the results of the fits are described in the following subsections.

The background due to jets misidentified as leptons (hereafter referred to as “fake” leptons) and non-prompt leptons is collectively referred to as “FNP”: it consists of semileptonic \(t\bar{t}\), s-channel and t-channel single-top-quark, W + jets and light- and heavy-flavour multi-jet events. It is estimated from data with a method similar to that described in Refs. [101, 102]. Two types of lepton identification criteria are defined for this evaluation: “tight” and “loose”, corresponding to signal and baseline leptons described in Sect. 3. The method makes use of the number of observed events containing loose–loose, loose–tight, tight–loose and tight–tight lepton pairs in a given SR. The probability for prompt leptons satisfying the loose selection criteria to also pass the tight selection is measured using a \(Z\rightarrow \ell \ell \) (\(\ell = e, \mu \)) sample. The equivalent probability for fake or non-prompt leptons is measured in data from multi-jet- and \(t\bar{t}\)-enriched control samples. The number of events containing a contribution from one or two fake or non-prompt leptons is calculated from these probabilities.

Systematic uncertainties in the samples of simulated events affect the expected yields in the different regions and are taken into account to determine the uncertainties in the background predictions. The systematic uncertainties are described by nuisance parameters, which are not constrained by the fit, since the number of floating background normalisation parameters is equal to the number of CRs. Each uncertainty source is described by a single nuisance parameter, and all correlations between background processes and selections are taken into account. A list of systematic uncertainties considered in the fits is provided in Sect. 7.

6.1 Two-body selection background determination

The main background sources for the two-body selection are respectively diboson production in \(\text {SRA}^{\text {2-body}}_{180}\)and \(t\bar{t}\) and \(t\bar{t}\) + Z in \(\text {SRB}^{\text {2-body}}_{140}\) and \(\text {SRC}^{\text {2-body}}_{110}\). These processes are normalised to data in dedicated CRs, summarised in Table 6 together with the corresponding VRs: \(\text {CR}^{\text {2-body}}_{t\bar{t}}\) (included in the background fits of \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\)), \(\text {CR}^{\text {2-body}}_{t\bar{t},3j}\) (included in the background fit of \(\text {SRC}^{\text {2-body}}_{110}\)), \(\text {CR}^{\text {2-body}}_{VV\text {-SF}}\) (included in the background fits of \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\)), \(\text {CR}^{\text {2-body}}_{t\bar{t} Z}\) (included in the background fits of \(\text {SRA}^{\text {2-body}}_{180}\), \(\text {SRB}^{\text {2-body}}_{140}\) and \(\text {SRC}^{\text {2-body}}_{110}\)) and \(\text {CR}^{\text {2-body}}_{VZ}\) (included in the background fits of \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\)). The control and validation regions are labelled using the targeted background process as subscript, which can also include additional selection details, and the associated selection as superscript. For example, the “3j” subscript of \(\text {CR}^{\text {2-body}}_{t\bar{t},3j}\)refers to the minimum jet multiplicity which is required in this control region. In \(\text {CR}^{\text {2-body}}_{t\bar{t} Z}\) and \(\text {CR}^{\text {2-body}}_{VZ}\), events with three charged leptons including one same-flavour opposite-charge pair with \(|m_{\ell \ell }-m_{Z}| < 20\) GeV are selected. In order to mimic the kinematics of the \(t\bar{t}\) + Z events with invisible Z decays, a corrected \(E_{\text {T}}^{\text {miss}}\) variable, \(E_{\mathrm {T,corr}}^{\mathrm {miss}}\), is defined by vectorially adding the momentum of the same-flavour opposite-charge lepton pair to the \(\mathbf { p}^{\mathrm {miss}}_{\mathrm {T}}\) vector.

In order to test the reliability of the background prediction, the results of the simultaneous fit are cross-checked in VRs which are disjoint from both the corresponding control and signal regions. Overlapping regions, e.g. \(\text {CR}^{\text {2-body}}_{t\bar{t}}\) and \(\text {CR}^{\text {2-body}}_{t\bar{t},3j}\), are only included in independent background fits, so that no correlation is introduced. The expected signal contamination in the CRs is generally below 5%. The highest signal contamination in the VRs, of about \(18\%\), is expected in \(\text {VR}^{\text {2-body}}_{t\bar{t},3j}\) for a top squark mass of 400 GeV and a lightest neutralino mass of 175 GeV.

Table 6 Two-body selection control and validation regions definition. The common selection defined in Sect. 4 also applies to all regions except \(\text {CR}^{\text {2-body}}_{t\bar{t} Z}\) and \(\text {CR}^{\text {2-body}}_{VZ}\), which require three leptons including one same-flavour opposite-charge pair with \(|m_{\ell \ell }-m_{Z}| < 20\) GeV

Figure 2 shows the distributions of some of the kinematic variables used to define the four control regions after the \(\text {SRA}^{\text {2-body}}_{180}\) background fit, so that the plots illustrate the modelling of the shape of each variable. In general, good agreement is found between the data and the background model within uncertainties. The other selection variables are equally well described by the background prediction.

Fig. 2
figure 2

Two-body selection distributions of a \(n_{b\text {-jets}}\) in \(\text {CR}^{\text {2-body}}_{t\bar{t}}\), b \(R_{2\ell 2j} \) in \(\text {CR}^{\text {2-body}}_{VV\text {-SF}}\) and c, d \(E_{\mathrm {T,corr}}^{\mathrm {miss}}\) in \(\text {CR}^{\text {2-body}}_{t\bar{t} Z}\) and \(\text {CR}^{\text {2-body}}_{VZ}\) after the \(\text {SRA}^{\text {2-body}}_{180}\) background fit. The contributions from all SM backgrounds are shown as a histogram stack; the hatched bands represent the total uncertainty in the background predictions after the fit to the data has been performed. The counting uncertainty on data is also shown by the black error bars. The rightmost bin of each plot includes overflow events

The results of the background fits, as well as the MC expected background composition before the fit, are reported in Table 7 for the CRs used in the \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\) background fits, and in Table 8 for the CRs used in the \(\text {SRC}^{\text {2-body}}_{110}\) background fit. The normalisations for fitted backgrounds are found to be consistent with the theoretical predictions, when uncertainties are considered. By construction, in the CRs the yields observed and predicted by the fits are the same. Good agreement, within one standard deviation from the SM background prediction, is observed in the VRs and summarised in Fig. 5.

Table 7 Two-body selection background fit results for the CRs of the \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\) background fits. The nominal predictions from MC simulation, are given for comparison for those backgrounds (\(t\bar{t}\), VV-SF, \(t\bar{t} Z\) and VZ) that are normalised to data in dedicated CRs. The “Others” category contains the contributions from \(t\bar{t} W\), \(t\bar{t} h\), \(t\bar{t} WW\), \(t\bar{t} t\), \(t\bar{t} t\bar{t}\), Wh, ggh and Zh production. Combined statistical and systematic uncertainties are given. Entries marked “–” indicate a negligible background contribution
Table 8 Two-body selection background fit results for the CRs of the \(\text {SRC}^{\text {2-body}}_{110}\) background fit. The nominal predictions from MC simulation, are given for comparison for those backgrounds (\(t\bar{t}\) and \(t\bar{t} Z\)) that are normalised to data in dedicated CRs. The “Others” category contains the contributions from \(t\bar{t} W\), \(t\bar{t} h\), \(t\bar{t} WW\), \(t\bar{t} t\), \(t\bar{t} t\bar{t}\), Wh, ggh and Zh production. Combined statistical and systematic uncertainties are given. Entries marked “–” indicate a negligible background contribution

6.2 Three-body selection background determination

In the three-body signal regions defined in Sect. 4.3, the SM background is dominated by diboson and \(t\bar{t}\) production. A single control region is used for \(t\bar{t}\) production, while two CRs are defined to target diboson events with either same-flavour or different-flavour lepton pairs. The background predictions are tested in VRs that are defined to be kinematically adjacent to, yet disjoint from, the signal regions. The definitions of the control and validation regions are shown in Table 9. The overlap between \(\text {VR}^{\text {3-body}}_{t\bar{t}}\) and \(\text {VR}^{\text {3-body}}_{VV\text {-DF}}\) does not affect the final results as these regions are not used to constrain the background normalisations. The signal contamination in the CRs and VRs is generally small, with the maximum found to be about 12% in \(\text {VR}^{\text {3-body}}_{VV\text {-DF}}\) for a top squark mass of 220 GeV and a lightest neutralino mass of 110 GeV.

Table 9 Three-body selection control and validation regions definitions. The common selection defined in Sect. 4 also applies to all regions

Table 10 shows the expected and observed numbers of events in each of the control regions after the background fit. The total number of fitted background events in the validation regions is in agreement with the observed number of data events. Figure 3 shows three distributions in the control regions after the background fit, so that the plots illustrate the MC modelling of the shape of each variable. In general, good agreement between the data and the background model is found within uncertainties. The other selection variables are equally well described by the background prediction. Good agreement, within one standard deviation from the SM background prediction, is observed in the VRs and summarised in Fig. 5.

Table 10 Three-body selection background fit results for the CRs of the SR\(^{\text {3-body}}_{W}\) and SR\(^{\text {3-body}}_{t}\) background fit. The nominal predictions from MC simulation, are given for comparison for those backgrounds (\(t\bar{t}\), VV-DF and VV-SF) that are normalised to data in dedicated CRs. Combined statistical and systematic uncertainties are given. Entries marked “–” indicate a negligible background contribution
Fig. 3
figure 3

Three-body selection distributions of a \(R_{p_{\text {T}}}\) in \(\text {CR}^{\text {3-body}}_{t\bar{t}}\), b \(\cos \theta _b\) in \(\text {CR}^{\text {3-body}}_{VV\text {-DF}}\), and c \(M_{\Delta }^\text {R}\) in \(\text {CR}^{\text {3-body}}_{VV\text {-SF}}\) after the background fit. The contributions from all SM backgrounds are shown as a histogram stack; the hatched bands represent the total uncertainty in the background predictions after the fit to the data has been performed. The counting uncertainty on data is also shown by the black error bars. The rightmost bin of each plot includes overflow events

6.3 Four-body selection background determination

In the four-body SR, the largest SM background contributions stem from \(t\bar{t}\) and diboson production, as well as \(Z/\gamma ^*+\mathrm {jets} \) production with the Z boson decaying into \(\tau \tau \) with both \(\tau \) leptons decaying leptonically. Three dedicated control regions are defined: \(\text {CR}^{\text {4-body}}_{t\bar{t}}\), \(\text {CR}^{\text {4-body}}_{VV}\) and \(\text {CR}^{\text {4-body}}_{Z\tau \tau }\). The background predictions are tested in three validation regions that are defined to be kinematically similar to, but disjoint from, both the control and signal regions. The definitions of the control and validation regions are shown in Table 11. In the \(t\bar{t}\) control region the signal contamination is less than \(\sim 6\%\), while in \(\text {CR}^{\text {4-body}}_{VV}\) and \(\text {CR}^{\text {4-body}}_{Z\tau \tau }\) the highest signal contamination, for a top squark mass of 260 GeV and a lightest neutralino mass of 180 GeV, is respectively \(\sim 30\%\) and \(\sim 9\%\).

Table 11 Four-body selection control and validation regions definition. The common selection reported in Table 4 also applies to all regions

Table 12 shows the expected and observed numbers of events in each of the control regions after the background fit. Good agreement between data and the SM predictions is observed in the validation regions and shown in Fig. 5. Figure 4 shows three distributions in the control regions for this analysis after applying the normalisation factors provided by the background fit. Good agreement between data and the SM predictions is observed. The other selection variables are equally well described by the background prediction. The largest observed deviation (\(1.4 \sigma \)) from the SM background prediction is found in \(\text {VR}^{\text {4-body}}_{Z\tau \tau }\). The yields in the other VRs are found to be compatible with the SM predictions within one standard deviation.

Table 12 Four-body selection background fit results for the CRs of the \(\text {SR}^{\mathrm {4-body}}\) background fit. The nominal predictions from MC simulation, are given for comparison for those backgrounds (\(t\bar{t}\), VV and \(Z_{\tau \tau }\)) that are normalised to data in dedicated CRs. Combined statistical and systematic uncertainties are given
Fig. 4
figure 4

Four-body selection distributions of the a \(p_\mathrm{T}(j_1)\) in \(\text {CR}^{\text {4-body}}_{t\bar{t}}\), b \(R_{2\ell } \) in \(\text {CR}^{\text {4-body}}_{VV}\) and c \(E_{\text {T}}^{\text {miss}}\) in \(\text {CR}^{\text {4-body}}_{Z\tau \tau }\)after the background fit. The contributions from all SM backgrounds are shown as a histogram stack; the hatched bands represent the total uncertainty in the background predictions after the fit to the data has been performed. The counting uncertainty on data is also shown by the black error bars. The rightmost bin of each plot includes overflow events

7 Systematic uncertainties

The primary sources of systematic uncertainty are related to: the jet energy scale (JES), jet energy resolution (JER), and the theoretical and MC modelling uncertainties in the backgrounds. The statistical uncertainties of the simulated event samples are also taken into account. The effect of the systematic uncertainties is evaluated for all signal samples and background processes. Since the normalisation of the dominant background processes is extracted in dedicated control regions, the systematic uncertainties only affect the extrapolation to the signal regions in these cases. Statistical uncertainties due to the limited number of data events in the CRs are also included in the fit for each region.

The JES and JER uncertainties are derived as a function of the \(p_\mathrm{T}\) and \(\eta \) of the jet, as well as of the pile-up conditions and the jet flavour composition of the selected jet sample [43]. Uncertainties associated to the modelling of the b-tagging efficiencies for b-jets, c-jets and light-flavour jets [103, 104] are also considered.

The systematic uncertainties related to the modelling of \(E_{\text {T}}^{\text {miss}}\) in the simulation are estimated by propagating the uncertainties in the energy and momentum scale of electrons, muons and jets, as well as the uncertainties in the resolution and scale of the soft term [49].

Other detector-related systematic uncertainties, such as those in lepton reconstruction efficiency, energy scale, energy resolution and in the modelling of the trigger efficiency [36, 37], are found to have a small impact on the results and are generally negligible compared to the other detector-related uncertainties.

The uncertainties in the modelling of the \(t\bar{t}\) and single-top backgrounds in simulation are estimated by varying the renormalisation and factorisation scales by a factor of two, as well as the amount of initial- and final-state radiation used to generate the samples [56]. Uncertainties in the parton shower modelling are assessed as the difference between the predictions from Powheg showered with Pythia and Herwig, and those due to the event generator choice by comparing Powheg and MadGraph5_aMC@NLO  [56]. An uncertainty in the acceptance due to the interference between \(t\bar{t}\) and single top quark Wt production is assigned by comparing the predictions of dedicated LO MadGraph  2.5 samples. These samples are used to compare the predictions for \(t\bar{t}\) and Wtb with the inclusive WWbb process, where the same production diagrams are included, but top quarks are not required to be on-shell.

The diboson background MC modelling uncertainties are estimated by varying up and down by a factor of two the renormalisation, factorisation and resummation scales used to generate the sample [58]. For \(t\bar{t} Z\) production, the predictions from the MadGraph5_aMC@NLO and Sherpa event generators are compared and the full difference between the respective predictions is assigned as an uncertainty. Uncertainties related to the choice of renormalisation and factorisation scales are assessed by varying the corresponding event generator parameters up and down by a factor of two around their nominal values [105].

The uncertainties related to the choice of QCD renormalisation and factorisation scales in \(Z/\gamma ^*+\mathrm {jets} \) events are assessed by varying the corresponding event generator parameters up and down by a factor of two around their nominal values. Uncertainties due to our choice of the resummation scale and the matching scale between the matrix element and the parton shower are estimated by varying up and down by a factor of two the corresponding parameters in Sherpa.

The cross-sections used to normalise the MC samples are varied according to the uncertainty in the cross-section calculation, i.e., 5.3% uncertainty for single top quark Wt-channel [106], 6% for diboson, 13% for \(t\bar{t} W\) and 12% for \(t\bar{t} Z\) production [61]. For \(t\bar{t} WW\), tZ, tWZ, \(t\bar{t} h\), \(t\bar{t} t\), \(t\bar{t} t\bar{t}\), and triboson production processes, which constitute a small background, a 50% uncertainty in the event yields is assumed.

Systematic uncertainties are assigned to the FNP background estimate to account for potentially different compositions (heavy flavour, light flavour or photon conversions) between the signal and control regions, as well for the contamination from prompt leptons in the regions used to measure the probabilities for loose fake or non-prompt leptons to satisfy the tight signal criteria. Parameterisations of these probabilities are independently derived from \(t\bar{t}\)- and multi-jet-enriched same-charge dilepton samples. The \(t\bar{t}\)-enriched sample is used to derive the parameterisation from which the central prediction for the FNP background is obtained. The full difference between the predictions derived from the \(t\bar{t}\) and the multi-jet parameterisation is assigned as the systematic uncertainty in the central FNP prediction and symmetrised.

A 3.2% uncertainty in the luminosity measurement is also taken into consideration for all signal and background estimates that are directly derived from MC simulations.

Table 13 summarises the contributions of the different sources of systematic uncertainty in the total SM background predictions in the signal regions. The total systematic uncertainty ranges between 15% and 46%, with the dominant sources being the size of the MC event samples, the JES and \(E_{\text {T}}^{\text {miss}}\) modelling, the numbers of events in the CRs and the \(t\bar{t}\) theoretical uncertainties.

Theory uncertainties in the signal acceptance are taken into account. These are computed by varying the strong coupling constant \(\alpha _{s}\), the renormalization and factorization scales, the CKKW scale used to match the parton shower and matrix element descriptions and the parton shower tunes. These uncertainties are mostly relevant for the four-body selection and range between 10% and 30% depending on the mass difference \(m_{\tilde{t}_1}-m_{{\tilde{\chi }^0_1}}\).

Table 13 Sources of systematic uncertainty in the SM background estimates, estimated after the background fits. The values are given as relative uncertainties in the total expected background event yields in the SRs. Entries marked “–” indicate either a negligible contribution or an uncertainty that does not apply (for example the normalisation uncertainty for a background whose normalisation is not fitted for that specific signal region). MC statistics refer to the statistical uncertainty from the simulated event samples. The individual components can be correlated and therefore do not necessarily add up in quadrature to the total systematic uncertainty

8 Results

The data are compared to background predictions in the signal regions of the different selections. The number of observed events and the predicted number of SM background events from the background-only fits in all SRs and VRs are shown in Fig. 5. In all SRs, good agreement is observed between data and the SM background predictions. A detailed discussion of the results is given in the following sections.

Fig. 5
figure 5

Comparison of the observed data (\(n_{\mathrm{obs}}\)) with the predicted SM background (\(n_{\mathrm{exp}}\)) in the SRs and associated VRs. The background predictions are obtained using the background-only fit configuration, and the hatched bands represent the total uncertainty in the background predictions after the fit to the data has been performed. The counting uncertainty on data is also shown by the black error bars. The bottom panel shows the difference between data and the predicted SM background divided by the total uncertainty (\(\sigma _{\mathrm{tot}}\))

8.1 Two-body results

Figure 6 shows the \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\) distribution in each of the two-body signal regions, split between the same- and different-flavour lepton channels, omitting the selection on \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\) itself. The estimated SM yields in \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\) are determined with a background fit simultaneously determining the normalisations of the background contributions from \(t\bar{t}\), diboson with a SF lepton pair, \(t\bar{t} + Z\) and diboson with more than two charged leptons by including \(\text {CR}^{\text {2-body}}_{t\bar{t}}\), \(\text {CR}^{\text {2-body}}_{VV\text {-SF}}\), \(\text {CR}^{\text {2-body}}_{t\bar{t} Z}\) and \(\text {CR}^{\text {2-body}}_{VZ}\) in the likelihood minimisation. The estimated SM yields in \(\text {SRC}^{\text {2-body}}_{110}\) are determined with a background fit simultaneously determining the normalisations of the background contributions from \(t\bar{t}\) and \(t\bar{t} + Z\) by including \(\text {CR}^{\text {2-body}}_{t\bar{t},3j}\) and \(\text {CR}^{\text {2-body}}_{t\bar{t} Z}\) in the likelihood minimisation. No significant excess over the SM prediction is observed, as can be seen from the background-only fit results which are shown in Table 14 for \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\), and Table 15 for the \(\text {SRC}^{\text {2-body}}_{110}\). Table 16 reports the observed and expected yields for the SRs used for the computation of the exclusion limits.

Fig. 6
figure 6

Two-body selection distributions of \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\) for events satisfying the selection criteria of the six SRs, except for the one on \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\), after the background fit. The contributions from all SM backgrounds are shown as a histogram stack; the hatched bands represent the total uncertainty in the background predictions after the fit to the data has been performed. The counting uncertainty on data is also shown by the black error bars. The rightmost bin of each plot includes overflow events. Reference top squark pair production signal models are overlayed for comparison. Red arrows indicate the signal region selection criteria

Table 14 Two-body selection background fit results for \(\text {SRA}^{\text {2-body}}_{180}\) and \(\text {SRB}^{\text {2-body}}_{140}\). The nominal predictions from MC simulation, are given for comparison for those backgrounds (\(t\bar{t}\), VV-SF, \(t\bar{t} Z\) and VZ) that are normalised to data in dedicated CRs. The “Others” category contains the contributions from \(t\bar{t} W\), \(t\bar{t} h\), \(t\bar{t} WW\), \(t\bar{t} t\), \(t\bar{t} t\bar{t}\), Wh, ggh and Zh production. Combined statistical and systematic uncertainties are given. Entries marked “–” indicate a negligible background contribution. The “Others” contribution to \(\text {SRB}^{\text {2-body}}_{140}\) is dominated by \(t\bar{t} W\)
Table 15 Two-body selection background fit results for \(\text {SRC}^{\text {2-body}}_{110}\). The nominal predictions from MC simulation, are given for comparison for those backgrounds (\(t\bar{t}\) and \(t\bar{t} Z\)) that are normalised to data in dedicated CRs. The “Others” category contains the contributions from \(t\bar{t} W\), \(t\bar{t} h\), \(t\bar{t} WW\), \(t\bar{t} t\), \(t\bar{t} t\bar{t}\), Wh, ggh and Zh production. Combined statistical and systematic uncertainties are given. Entries marked “–” indicate a negligible background contribution
Table 16 Two-body selection background fit results for \(\mathrm {SR(A,B)}^{\mathrm {2-body}}_{x,y}\) regions, where x and y denote the low and high edges of the bin. Combined statistical and systematic uncertainties are given. Uncertainties in the predicted background event yields are quoted as being symmetric

8.2 Three-body results

Figure 7 shows the distributions of \(R_{p_{\text {T}}}\) and \(M_{\Delta }^\text {R}\) in each of the signal regions, split between the same- and different-flavour channels, omitting the requirement on \(R_{p_{\text {T}}}\) and on \(M_{\Delta }^\text {R}\). The estimated SM yields in SR\(^{\text {3-body}}_{W}\) and SR\(^{\text {3-body}}_{t}\) are determined with a background fit simultaneously determining the normalisations of \(t\bar{t}\), SF diboson production and DF diboson production by including \(\text {CR}^{\text {3-body}}_{t\bar{t}}\), \(\text {CR}^{\text {3-body}}_{VV\text {-SF}}\) and \(\text {CR}^{\text {3-body}}_{VV\text {-DF}}\) in the likelihood minimisation. No excess over the SM prediction is observed. Table 17 shows the background fit results.

Fig. 7
figure 7

Three-body selection distributions of \(R_{p_{\text {T}}}\) in a same-flavour and b different-flavour events that satisfy all the SR\(^{\text {3-body}}_{W}\) selection criteria except for the one on \(R_{p_{\text {T}}}\), and of \(M_{\Delta }^\text {R}\) in the c same-flavour and d different-flavour events that satisfy all the SR\(^{\text {3-body}}_{t}\) selection criteria except for the one on \(M_{\Delta }^\text {R}\) after the background fit. The contributions from all SM backgrounds are shown as a histogram stack; the hatched bands represent the total uncertainty in the background predictions after the fit to the data has been performed. The counting uncertainty on data is also shown by the black error bars. The rightmost bin of each plot includes overflow events. Reference top squark pair production signal models are overlayed for comparison. Red arrows indicate the signal region selection criteria

Table 17 Three-body selection background fit results for SR\(^{\text {3-body}}_{W}\) and SR\(^{\text {3-body}}_{t}\). The nominal predictions from MC simulation, are given for comparison for those backgrounds (\(t\bar{t}\), VV-DF and VV-SF) that are normalised to data in dedicated CRs. Combined statistical and systematic uncertainties are given. Entries marked “–” indicate a negligible background contribution

8.3 Four-body results

Figure 8 shows the distributions of \(R_{2\ell 4j}\) and \(R_{2\ell } \) for events satisfying all the \(\text {SR}^{\mathrm {4-body}}\) selections. No significant excess over the SM prediction is visible. The estimated SM yields in \(\text {SR}^{\mathrm {4-body}}\) are determined with a background fit simultaneously determining the normalisations of \(t\bar{t}\), diboson production, and \(Z/\gamma ^*+\mathrm {jets} \) where \(Z \rightarrow \tau \tau \), by including \(\text {CR}^{\text {4-body}}_{t\bar{t}}\), \(\text {CR}^{\text {4-body}}_{VV}\) and \(\text {CR}^{\text {4-body}}_{Z\tau \tau }\) in the likelihood minimisation. The background fit results are shown in Table 18. The observed yield is less than one standard deviation from the background prediction in the SR.

Fig. 8
figure 8

Four-body selection distributions of a \(R_{2\ell 4j}\) and b \(R_{2\ell } \) for events satisfying all the \(\text {SR}^{\mathrm {4-body}}\) selections except for the one on the variable shown in the figure, after the background fit. The contributions from all SM backgrounds are shown as a histogram stack; the hatched bands represent the total uncertainty in the background predictions after the fit to the data has been performed. The counting uncertainty on data is also shown by the black error bars. The rightmost bin of each plot includes overflow events. Reference top squark pair production signal models are overlayed for comparison. Red arrows indicate the signal region selection criteria

Table 18 Four-body selection background fit results for \(\text {SR}^{\mathrm {4-body}}\). The nominal predictions from MC simulation, are given for comparison for those backgrounds (\(t\bar{t}\), VV and \(Z_{\tau \tau }\)) that are normalised to data in dedicated CRs. The “Others” category contains the contributions from \(t\bar{t} W\), \(t\bar{t}h\), \(t\bar{t}WW\), \(t\bar{t}t\), \(t\bar{t}t\bar{t}\), Wh, ggh and Zh production. Combined statistical and systematic uncertainties are given

8.4 Interpretation

Two different sets of exclusion limits are derived for models of new physics beyond the SM. A model-independent upper limit on the visible cross-section \(\sigma _{\mathrm {vis}}\) of new physics, defined as the ratio between the upper limit at 95% CL on the number of signal events \(S^{95}\) and the integrated luminosity, is derived in each SR by performing a fit which includes the observed yield in the SR as a constraint, and a free signal yield in the SR as an additional process. The CL\(_\mathrm {s}\) method [107] is used to derive all the exclusion confidence levels. These limits assume negligible signal contamination in the CRs. This assumption leads to conservative results when comparing with model-dependent limits for models that predict a sizeable contamination in the CRs. Model-independent upper limits are presented in Table 19.

Table 19 Model-independent 95% CL upper limits on the visible cross-section (\(\sigma _{\mathrm {vis}}\)) of new physics, the visible number of signal events (\(S^{95}_{\text {o}bs}\)), the visible number of signal events (\(S^{95}_{\text {e}xp}\)) given the expected number of background events (and \(\pm 1\sigma \) excursions of the expected number), and the discovery p-value (\(p(s = 0)\)), all calculated with pseudo-experiments, are shown for each SR

Model-dependent limits are computed for various \(\tilde{t}_1\) pair production scenarios. Profile likelihood fits are performed including the expected signal yield and its associated uncertainties in the CRs and SRs. All limits are quoted at 95% CL. When setting limits, the regions included in the \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\) shape fits (\(\mathrm {SRA}^{\mathrm {2-body}}_{x,y}\) and \(\mathrm {SRB}^{\mathrm {2-body}}_{x,y}\)) are statistically combined. Similarly, the SR\(^{\text {3-body}}_{W}\) and SR\(^{\text {3-body}}_{t}\) signal regions are statistically combined as well. For each signal model, the SR with the best expected limit is used for setting the final limit.

Limits for simplified models in which pair-produced \(\tilde{t}_1\) decay with 100% branching ratio into a top quark and \(\tilde{\chi }^0_1\) are shown in the \(\tilde{t}_1\)\(\tilde{\chi }^0_1\) mass plane in Fig. 9. The various SRs cover the different \(\tilde{t}_1\) mass ranges, as described in Table 1. Top squark masses up to 720 GeV are excluded for a massless lightest neutralino. Neutralino masses up to 300 GeV are excluded for \(m_{\tilde{t}_1}=645\) GeV. In the three-body decay hypothesis, top squark masses are excluded up to 430 GeV for \(m_{\tilde{t}_1}-m_{{\tilde{\chi }^0_1}}\) close to the W boson mass. In the four-body decay hypothesis, top squark masses are excluded up to 400 GeV for \(m_{\tilde{t}_1}-m_{{\tilde{\chi }^0_1}}=40\) GeV.

Fig. 9
figure 9

Exclusion contour for a simplified model assuming \(\tilde{t}_1\) pair production, decaying via \(\tilde{t}_1 \rightarrow t^{(*)}{\tilde{\chi }^0_1} \) with 100% branching ratio. The dashed grey line and the shaded yellow band are the expected limit and its \(\pm 1\sigma \) uncertainty. The thick solid red line is the observed limit for the central value of the signal cross-section. The expected and observed limits do not include the effect of the theoretical uncertainties in the signal cross-section. The dotted lines show the effect on the observed limit when varying the signal cross-section by \(\pm 1\sigma \) of the theoretical uncertainty. The shaded blue areas show the observed exclusion from the ATLAS \(\sqrt{s}=8\) TeV analyses [18]

Limits are shown for a class of simplified models in which only pair-produced \(\tilde{t}_1\) decaying with 100% branching ratio into the lightest chargino and a b-quark are considered. Figure 10 shows the interpretation in the \(\tilde{t}_1\)\(\tilde{\chi }^0_1\) mass plane assuming that \(m_{\tilde{t}_1} - m_{{\tilde{\chi }^\pm _1}} = 10\) GeV. Top squark masses up to 700 GeV are excluded for an LSP mass up to 200 GeV.

Fig. 10
figure 10

Exclusion contour for a simplified model assuming \(\tilde{t}_1\) pair production, decaying via \(\tilde{t}_1 \rightarrow b{\tilde{\chi }^\pm _1} \) with 100% branching ratio. The lightest chargino mass is assumed to be close to the stop mass, \(m_{{\tilde{\chi }^\pm _1}} = m_{\tilde{t}_1}-10\) GeV. The dashed grey line and the shaded yellow band are the expected limit and its \(\pm 1\sigma \) uncertainty. The thick solid red line is the observed limit for the central value of the signal cross-section. The expected and observed limits do not include the effect of the theoretical uncertainties in the signal cross-section. The dotted lines show the effect on the observed limit when varying the signal cross-section by \(\pm 1\sigma \) of the theoretical uncertainty. The shaded blue area shows the observed exclusion from the ATLAS \(\sqrt{s}=8\) TeV analysis [18]

Finally, limits are set on a pMSSM model where the wino and bino mass parameters, \(M_1\) and \(M_2\), are set to \(M_2 = 2M_1\) and \(m_{\tilde{t}_1} > m_{{\tilde{\chi }^\pm _1}}\). The remaining pMSSM parameters [16, 17] have the following values: \(M_3 = 2.2\) TeV (gluino mass parameter), \(M_{S} = \sqrt{m_{\tilde{t}_1} m_{\tilde{t}_2}} = 1.2\) TeV (product of top squark masses), \(X_{t}/M_{S} = \sqrt{6}\) (mixing parameter between the left- and right-handed states), and \(\tan {\beta } = 20\) (ratio of vacuum expectation values of the two Higgs doublets). The values of \(M_3\) and \(M_{S}\) have been chosen in order to avoid the current gluino and top squark mass limits, while the value of \(X_{t}/M_{S}\) is assumed to obtain a low-mass lightest top squark while maintaining the models consistent with the observed Higgs boson mass of 125 GeV. Limits are set for both the positive and negative values of \(\mu \) (the Higgs mass parameter) as a function of \(m_{\tilde{t}_1}\) and \(m_{{\tilde{\chi }^0_1}}\), and are shown in Fig. 11. Top squark masses up to about 700 GeV are excluded for a lightest neutralino of about 280 GeV. The sensitivity for low values of \(m_{{\tilde{\chi }^0_1}}\) is limited by the \(m_{\tiny {\text{ T2 }}}^{\ell \ell }\) selection acceptance, since \(m_{{\tilde{\chi }^\pm _1}} - m_{{\tilde{\chi }^0_1}}\) is reduced by assuming \(M_2 = 2M_1\).

Fig. 11
figure 11

Exclusion contour as a function of \(m_{\tilde{t}_1}\) and \(m_{{\tilde{\chi }^0_1}}\) in the pMSSM model described in the text. Pair production of \(\tilde{t}_1\) and \(\tilde{b}_1\) are considered. Limits are set for both the positive (red in the figure) and negative (blue in the figure) values of \(\mu \). The dashed and dotted grey lines indicate constant values of the \(\tilde{b}_1\) mass. The signal models included within the shown contours are excluded at 95% CL. The dashed lines and the shaded band are the expected limit and its \(\pm 1\sigma \) uncertainty. The thick solid line is the observed limit for the central value of the signal cross-section. The expected and observed limits do not include the effect of the theoretical uncertainties in the signal cross-section

9 Conclusion

This article reports a search for direct top squark pair production in final states containing two opposite-charge leptons and large missing transverse momentum, based on a \(36.1~\hbox {fb}^{-1}\) dataset of \(\sqrt{s} = 13\) TeV proton–proton collisions recorded by the ATLAS experiment at the LHC in 2015 and 2016. Good agreement was found between the observed events in the data and the expected Standard Model yields.

Model-independent 95% CL upper limits on the visible cross-section for new phenomena were computed. The results are also interpreted in terms of simplified models assuming a range of top squark and lightest neutralino masses, with the former decaying into the latter via either a direct two-, three- or four-body decay or via an intermediate chargino state. In the case of top squark decays into \(t^{(*)}{\tilde{\chi }^0_1} \), top squark masses below 720 GeV are excluded for a massless lightest neutralino. In the three-body decay hypothesis, top squark masses are excluded up to 430 GeV for \(m_{\tilde{t}_1}-m_{{\tilde{\chi }^0_1}}\) close to the W boson mass. In the four-body decay hypothesis, top squark masses are excluded up to 400 GeV for \(m_{\tilde{t}_1}-m_{{\tilde{\chi }^0_1}}=40\) GeV. Both these results extend the coverage of previous searches by about 100 GeV. The chargino decay mode, \(\tilde{t}_1 \rightarrow b {\tilde{\chi }^\pm _1} \), is excluded for top squark masses up to 700 GeV, assuming that \(m_{\tilde{t}_1} - m_{{\tilde{\chi }^\pm _1}} = 10\) GeV, extending the previous results by almost 200 GeV. When considering a pMSSM-inspired model including multiple decay chains, top squark masses up to about 700 GeV are excluded for a lightest neutralino of about 280 GeV. These results extend the region of supersymmetric parameter space excluded by previous LHC searches.