1 Introduction

The measurement of the production of \(W\)-boson pairs through interactions of quarks and gluons probes the electroweak (EW) gauge structure of the Standard Model (SM) and allows further tests of the strong interaction between quarks and gluons. The \(WW\) production process is also important as it constitutes large irreducible backgrounds in searches for physics beyond the SM and to \(H\rightarrow WW^*\) production. Its large production cross-section combined with the large sample of proton–proton (pp) collision data delivered by the Large Hadron Collider (LHC), enables this process to be studied differentially with a better statistical precision than was possible in previous measurements.

The first measurements of \(WW\) production were carried out at the LEP electron–positron collider [1]. At the Tevatron this process was measured in proton–antiproton collisions by the CDF [2, 3] and DØ [4] Collaborations. In pp collisions at the LHC, \(WW\) production cross-sections were determined for centre-of-mass energies of \(\sqrt{s}=7\) \(\text {Te} \text {V}\) and \(\sqrt{s}=8\) \(\text {Te} \text {V}\) by the ATLAS [5, 6] and CMS [7, 8] Collaborations. In addition, a dedicated measurement of the \(WW +1\)-jet final state was carried out by the ATLAS Collaboration [9] at \(\sqrt{s}=8\) \(\text {Te} \text {V}\). At \(\sqrt{s}=13\) \(\text {Te} \text {V}\), the total cross-section for \(WW\) production was measured by the ATLAS Collaboration [10], albeit only for the small 2015 data sample, which did not allow any differential studies.

The cross-section measurements at \(\sqrt{s}=7\) and \(\sqrt{s}=8\) \(\text {Te} \text {V}\) revealed discrepancies between data and theory that have since been addressed through the inclusion of higher-order corrections in perturbative quantum chromodynamics (QCD) [11,12,13,14,15,16]. This has remedied the mismatch between the total measured and predicted cross-sections, but some discrepancies in the differential distributions persist. The high-energy behaviour of the \(WW\) cross-section and the angular distributions of the \(WW\) decay products could be affected by new physical phenomena at higher partonic centre-of-mass energies, such as EW doublet or triplet scalars [17, 18] or degenerate and non-degenerate top-quark superpartners (stops) in supersymmetry (SUSY) scenarios [19, 20]. These specific models can be constrained by their contribution to dimension-six operators in an effective Lagrangian at tree level [17]. At lower partonic centre-of-mass energies, \(WW\) production can also be used to provide complementary constraints on compressed EW SUSY scenarios with low stop masses [21].

The \(WW\) signal is composed of two leading sub-processes: \(q\bar{q} \rightarrow W W \) productionFootnote 1 (in the t- and s-channels) and gluon–gluon fusion production (both non-resonant \(gg \rightarrow W W \) and resonant \(gg \rightarrow H \rightarrow W W \)). Figure 1 shows representative sub-processes. To allow for a proper treatment and inclusion of the interference, which is especially relevant in the tails of kinematic distributions, the resonant production is kept as part of the signal. The fiducial phase space is defined to be orthogonal to the \(H\rightarrow WW\) measurements by the ATLAS Collaboration [22, 23] using a requirement on the dilepton invariant mass. Therefore the Higgs boson contribution included in the signal definition is dominated by off-shell production and interference effects. The production of two W bosons from the decay of top–antitop quark pairs is not considered part of the signal.

Fig. 1
figure 1

Feynman diagrams for SM WW production at tree level (from left to right): \(q\bar{q}\) initial-state t-channel, \(q\bar{q}\) initial-state s-channel, gg initial-state non-resonant and gg initial-state resonant production. The s-channel production contains the WWZ and \(WW\gamma \) triple-gauge-coupling vertices. The gluon–gluon fusion processes are mediated either by a quark loop (\(gg \rightarrow W W \)) or the resonant production of a Higgs boson with subsequent decay into WW (\(gg \rightarrow H \rightarrow W W \))

The different sub-processes for \(WW\) production are known theoretically at different orders in the strong coupling constant \(\alpha _{\mathrm {s}}\). The \(q\bar{q} \rightarrow W W \) production cross-section is known to \({\mathcal {O}}(\alpha _{\mathrm{s}}^2)\), next-to-next-to-leading order (NNLO) [11, 15]. Recently, also a NNLO prediction matched to a parton shower has become available [15, 24, 25]. The non-resonant \(gg \rightarrow W W \) production cross-section is known to \({\mathcal {O}}(\alpha _{\mathrm{s}}^3)\), next-to-leading order (NLO) [26], and its interference with the resonant \(gg \rightarrow W W \) production cross-section is known to \({\mathcal {O}}(\alpha _{\mathrm{s}}^2)\).

This paper presents a measurement of the fiducial cross-section for \(WW\) production at \(\sqrt{s}=13~\text {Te} \text {V}\) using pp collision data recorded in 2015 and 2016 by the ATLAS experiment, corresponding to an integrated luminosity of \(36.1\,\mathrm {fb}^{-1}\). The \(WW\rightarrow e^\pm \nu \mu ^\mp \nu \) decay channel is studied (denoted in the following by \(WW \rightarrow e \mu \)). The measurement is performed in a phase space close to the geometric and kinematic acceptance of the experimental analysis. This includes a veto on the presence of jets with transverse momenta (\(p_{\text {T}}\)) above a series of thresholds, with a \(p_{\text {T}} =35~\text {Ge} \text {V}\) threshold used as a baseline. Measuring the fiducial cross-section as a function of the jet veto \(p_{\text {T}}\) threshold provides an indirect measure of the jet \(p_{\text {T}}\) spectrum in WW events, without removing the jet veto that is necessary for background suppression.

Six differential distributions involving kinematic variables of the final-state charged leptons are measured in the baseline phase space. Three of them characterize the energy of the process: the transverse momentum of the leading lepton \(p_{\text {T}}^{\text {lead}~\ell }\), the invariant mass of the dilepton system \(m_{e\mu }\) and the transverse momentum of the dilepton system \(p_{\mathrm {T}}^{e\mu }\). Three further distributions probe angular correlations and the spin state of the \(WW\) system. These are the rapidity of the dilepton system \(|y_{e\mu }|\), the difference in azimuthal angle between the decay leptons \(\Delta \phi _{e\mu }\), and \(|\cos {\theta ^*}|\) defined as:

$$\begin{aligned} |\cos \theta ^{*}|\ = \left| \tanh \left( \frac{\Delta \eta _{e\mu }}{2} \right) \right| ~, \end{aligned}$$

where \(\Delta \eta _{e\mu }\) is the difference between the pseudorapidities of the leptons.Footnote 2 This variable is longitudinally boost-invariant and sensitive to the spin structure of the produced diparticle pairs as discussed in Ref. [27]. The unfolded \(p_{\text {T}}^{\text {lead}~\ell }\) distribution is used to set limits on anomalous triple-gauge-boson couplings, since this distribution was identified as the most sensitive to the effect of these couplings.

2 ATLAS detector

The ATLAS detector [28] at the LHC is a multipurpose particle detector with a forward–backward symmetric cylindrical geometry and nearly \(4\pi \) coverage in solid angle. It consists of inner tracking devices surrounded by a superconducting solenoid, electromagnetic (EM) and hadronic calorimeters, and a muon spectrometer. The inner detector (ID) provides charged-particle tracking in the pseudorapidity region \(|\eta |<2.5\) and vertex reconstruction. It comprises a silicon pixel detector, a silicon microstrip tracker, and a straw-tube transition radiation tracker. The ID is placed inside a solenoid that produces a 2 T axial magnetic field. Lead/liquid-argon (LAr) sampling calorimeters provide EM energy measurements with high granularity. A steel/scintillator-tile hadronic calorimeter covers the central pseudorapidity range \(|\eta |<1.7\). The endcap and forward regions are instrumented with LAr calorimeters for both the EM and hadronic energy measurements up to \(|\eta |=4.9\). The muon spectrometer (MS) is operated in a magnetic field provided by air-core superconducting toroids and includes tracking chambers for precise muon momentum measurements up to \(|\eta |=2.7\) and trigger chambers covering the range \(|\eta |<2.4\).

A two-level trigger system [29] selects the events used in the analysis. The first level is implemented in custom electronics, while the second trigger level is a flexible software-based system.

3 Data and simulated event samples

The data were collected at a centre-of-mass energy of 13 \(\text {Te} \text {V}\) during 2015 and 2016, and correspond to an integrated luminosity of \(36.1\,\mathrm {fb}^{-1}\). Only high-quality data with all detectors in normal operating conditions are analysed. The average number of interactions per bunch crossing was estimated to be \(\left<\mu \right>=24\).

Simulated event samples are used for most of the background estimates, for the correction of the signal yield due to detector effects, and for comparison with the measured cross-sections.

The \(WW\) signal was modelled using the NLO perturbative QCD Powheg-Box v2 event generator [30,31,32,33,34] for \(q{\bar{q}}\) initial states. The \(gg\rightarrow WW\) contribution was generated using the Sherpa 2.1.1+OpenLoops framework [35, 36] at leading order (LO) with up to one additional parton and includes non-resonant and resonant Higgs boson production and interference terms. The Sherpa 2.1.1+OpenLoops framework also allows these contributions to be generated and studied separately. In both cases, the CT10[37] parton distribution functions (PDF) were used. Powheg-Box was interfaced to Pythia 8.210 [38] for the modelling of parton showers and hadronization as well as underlying-event simulation, using the AZNLO[39] set of tuned parameters (‘tune’) and the CTEQ6L1[40] PDF set. Sherpa used its own parton shower, fragmentation and underlying-event model. Alternative signal samples for the quark-induced production were generated using Powheg-Box interfaced to Herwig++ 2.7.1 [41] with the UEEE5 tune [42], and using the Sherpa 2.2.2 generator with its own model for parton showering, hadronization and the underlying event. The Sherpa 2.2.2 prediction was obtained at NLO with up to one additional parton emission and up to three at LO and employs the NNPDF3.0 [43] PDF set. The \(WW\) signal predictions were normalized to the NNLO cross-section [11]; the \(gg\rightarrow WW\) process was normalized to its inclusive NLO cross-section [26].

The background processes considered are: top-quark pair production (\(t\bar{t}\)), associated production of a top quark with a W boson (Wt), single vector-boson production (W or Z, in association with jets), multijet production, other diboson production (WZ, ZZ, \(W\gamma \) and \(Z\gamma \)) and triboson production (WWW, WWZ, WZZ and ZZZ), where Z stands for \(Z/\gamma ^{*}\).

For the generation of \(t\bar{t}\) and Wt processes at NLO, Powheg-Box v2 [44] and Powheg-Box v1 [30] respectively were used with the CT10 PDF set. For the parton shower, hadronization and underlying event, simulated events were interfaced to Pythia 8.186 for \(t\bar{t}\) and Pythia 6.425 [45] for single-top production, using the A14 tune [46] and the Perugia 2012 [47] tune, respectively. The top-quark mass was set to 172.5 \(\text {Ge} \text {V}\). In the \(t\bar{t}\) sample, the \(h_{\mathrm {damp}}\) parameter that regulates the high-\(p_{\text {T}}\) emission, against which the \(t\bar{t}\) system recoils, was set to 1.5 times the top-quark mass following studies reported in Ref. [48]. Alternative samples were generated with different settings to assess the uncertainty in modelling top-quark events. To estimate uncertainties in additional QCD radiation in top-quark processes, a pair of samples was produced with the alternative sets of A14 (\(t\bar{t}\)) or Perugia 2012 (Wt) parameters for higher and lower radiation, as well as with different renormalization and factorization scales which were both varied either by a factor of 2 or 0.5. For the higher-radiation samples, the value of the \(h_{\mathrm {damp}}\) parameter was doubled. Two alternative Monte Carlo (MC) programs were used to estimate the impact of the choice of hard-scatter generator and hadronization algorithm in top-quark events; for each of these samples one of the two components was replaced by an alternative choice. The alternative choices are MadGraph5_aMC@NLO 2.3 [49] for the hard-scatter generator and Herwig 7 [50] (Herwig++ 2.7.1) for the hadronization algorithm in \(t\bar{t}\) (Wt) events. In addition, the modelling of the overlap at NLO between Wt and \(t{\bar{t}}\) diagrams [51] was studied. The effect was assessed by generating Wt events with different schemes for overlap removal using the Powheg-Box event generator interfaced to Pythia 6.425 for the simulation of parton showering and non-perturbative effects. The top-quark events were normalized using the NNLO+next-to-next-to-leading-logarithm (NNLL) QCD cross-section [52] for \(t\bar{t}\), and the NLO+NNLL cross-section [53] for Wt production.

The Z+jets process (with \(Z\rightarrow ee/\mu \mu /\tau \tau \)) was modelled using Sherpa 2.2.1 [54] with the NNPDF3.0 PDF set. This process was calculated with up to two additional partons at NLO and up to four additional partons at LO. The W+jets and alternative Z+jets events were produced with the Powheg-Box generator at NLO accuracy using the CT10 PDF set, interfaced to Pythia 8.186 for parton showering, hadronization and the underlying event. As in the \(WW\) samples, the AZNLO tune was used for the underlying event together with the CTEQ6L1 PDF set. The Z+jets and W+jets events were normalized using their respective NNLO cross-section calculations [55].

The background from diboson production processes (WZ, ZZ, \(W\gamma \) and \(Z\gamma \)) was simulated using the Sherpa 2.2.2 generator with the NNPDF 3.0 PDF set. The samples include up to one additional parton emission at NLO and up to three at LO. Alternative samples for WZ and ZZ processes were produced using the same Powheg-Box +Pythia 8 set-up as the \(q{\bar{q}}\)-initiated \(WW\) signal samples discussed above. The background from triboson production was modelled using the Sherpa+OpenLoops generator with the CT10 PDF set, calculated at NLO for inclusive production and including up to two hard parton emissions at LO. The WZ, ZZ and triboson samples produced with Sherpa were normalized to the cross-section calculated by Sherpa, with hard parton emissions at NLO or LO as discussed, and thus already capturing some of the NNLO effects. The WZ and ZZ backgrounds simulated with Powheg-Box were normalized to their NNLO cross-sections [56,57,58,59,60].

EvtGen 1.2.0 [61] was used for the properties of the bottom and charm hadron decays after hadronization in all samples generated with Powheg-Box and MadGraph5_aMC@NLO.

Additional interactions in the same or nearby bunch crossings (pile-up) were simulated using Pythia 8.186 using the A2 tune [62] and the MSTW2008LO PDF [63] set and were overlaid on the simulated signal and background events.

All simulated event samples were produced using the ATLAS simulation infrastructure [64], using the full Geant 4 [65] simulation of the ATLAS detector. Simulated events were then reconstructed with the same software as used for the data and were corrected with data-driven correction factors to account for differences in lepton and jet reconstruction and identification between data and simulation. These corrections are of the order of 1–3%.

4 Event reconstruction and selection

The \(WW\) event candidates are selected by requiring each event to contain exactly one electron and exactly one muon of opposite charge, each passing the selections described below. Events with a same-flavour lepton pair are not used because they have a larger background from the Drell–Yan process.

Candidate events are required to have at least one vertex with at least two associated tracks with \(p_{\text {T}} > 400\) \(\text {Me} \text {V}\). The vertex with the highest \(\sum p_{\text {T}} ^2\) of the associated tracks is considered to be the primary vertex.

4.1 Trigger

Candidate events were recorded by either a single-muon or a single-electron trigger that imposed a minimum lepton transverse momentum threshold that varied during data-taking. The \(p_{\text {T}}\) threshold of the leptons required by triggers in 2015 was 24 \(\text {Ge} \text {V}\) for electrons and 20 \(\text {Ge} \text {V}\) for muons, both satisfying loose isolation requirements. Due to the higher instantaneous luminosity in 2016 the trigger threshold was increased to 26 \(\text {Ge} \text {V}\) for both the electrons and the muons, and more restrictive isolation for both the leptons as well as more restrictive identification requirements for electrons were applied. Additionally, single-lepton triggers with higher \(p_{\text {T}}\) thresholds but with no isolation or with loosened identification criteria were used to increase the efficiency. The trigger efficiency for events satisfying the full selection criteria described below is about 99% and is determined using a simulated signal sample that is corrected to reflect the data efficiencies with corrections measured using \(Z\rightarrow ee\) [66] and \(Z\rightarrow \mu \mu \) [67] events. These data-driven corrections are of the order of 2% with permille level uncertainties.

4.2 Leptons

Electron candidates are reconstructed from the combination of a cluster of energy deposits in the EM calorimeter and a track in the ID [66]. Candidate electrons must satisfy the TightLH quality definition described in Ref. [66]. Signal electrons are required to have \(E_{\text {T}} > 27\) \(\text {Ge} \text {V}\) and the pseudorapidity of electrons is required to be \(|\eta | < 2.47\), excluding the transition region between the barrel and endcaps in the LAr calorimeter (\(1.37< |\eta | < 1.52\)). In addition, a requirement is added to reject electrons that potentially stem from photon conversions to reduce the \(W\gamma \) background [66]. This uses a simple classification based on the candidate electron’s E / p and \(p_{\text {T}} \), the presence of a hit in the pixel detector, and the secondary-vertex information, to determine whether the electron could also be considered as a photon candidate and rejected.

Muon candidates are reconstructed by combining a track in the ID with a track in the MS [67]. The Medium quality criterion, as defined in Ref. [67], is applied to the combined tracks. Signal muons are required to have \(p_{\text {T}} >27\) \(\text {Ge} \text {V}\) and \(|\eta | < 2.5\).

Leptons are required to originate from the primary vertex. The longitudinal impact parameter of each lepton track, calculated relative to the primary vertex and multiplied by \(\sin \theta \) of the track, is required to be smaller than 0.5 mm. Furthermore, the significance of the transverse impact parameter, defined by the transverse impact parameter (\(d_0\)) of a lepton track relative to the beam line, divided by its estimated uncertainty (\(\sigma _{d_0}\)), is required to satisfy \(|d_0/\sigma _{d_0}| <3.0~(5.0)\) for muons (electrons). Leptons are also required to be isolated using information from ID tracks and energy clusters in the calorimeters in a cone around the lepton. The expected isolation efficiency is at least 90% (99%) at a \(p_{\text {T}}\) of 25 (60) \(\text {Ge} \text {V}\) using the Gradient working point defined in Refs. [66, 67].

4.3 Jets

Jet candidates are reconstructed within the calorimeter acceptance using the anti-\(k_{t}\)  jet clustering algorithm [68] using the FastJet code [69] with a radius parameter of \(R = 0.4\), which combines clusters of topologically connected calorimeter cells [70, 71]. The jet energy is calibrated by applying a \(p_{\text {T}}\)- and \(\eta \)-dependent correction derived from MC simulation with additional corrections based on data [72]. As part of the jet energy calibration a pile-up correction based on the concept of jet area is applied to the jet candidates [73]. Jets are required to have a pseudorapidity \(|\eta |<4.5\).

The jet-vertex-tagger (JVT) technique [74] is used to separate hard-scatter jets from pile-up jets within the acceptance of the tracking detector by requiring a significant fraction of the jets’ summed track \(p_{\text {T}}\) to come from tracks associated with the primary vertex. For jets with \(2.5<|\eta |<4.5\), a forward-JVT selection is applied to suppress pile-up jets [75].

Candidate jets are discarded if they are within a cone of size \(\Delta R = 0.2\) around an electron candidate, or if they have fewer than three associated tracks and are within a cone of size \(\Delta R = 0.2\) around a muon candidate. However, if a jet with three or more associated tracks is within a cone of size \(\Delta R = 0.4\) around a muon candidate, or any jet is within a region \(0.2< \Delta R < 0.4\) around an electron candidate, the corresponding electron or muon candidate is discarded.

Within the ID acceptance, jets originating from the fragmentation of b-hadrons (b-jets) are identified using a multivariate algorithm (MV2c10 BDT) [76, 77]. The chosen operating point has an efficiency of 85% for selecting jets containing b-hadrons, as estimated from a sample of simulated \(t\bar{t}\) events and validated with data [77].

4.4 Missing transverse momentum

The missing transverse momentum is computed as the negative of the vectorial sum of the transverse momenta of tracks associated with jets and muons, as well as tracks in the ID that are not associated with any other component. The \(p_{\text {T}}\) of the electron track is replaced by the calibrated transverse momentum of the reconstructed electron [78]. This definition has been updated for Run 2 data-taking conditions [79], and denoted by \(\vec {E}_{\mathrm {T}}^{\mathrm {miss, track}}\) with its absolute value denoted by \(E_{\mathrm {T}}^{\mathrm {miss, track}}\). The tracks are required to be associated with the primary vertex and to satisfy the selection criteria described in Ref. [79].

The \(E_{\mathrm {T}}^{\mathrm {miss, track}}\) takes advantage of the excellent vertex resolution of the ATLAS detector and gives a missing transverse momentum estimate that is robust in the presence of pile-up, but it neglects the contribution of neutral particles, which do not form tracks in the ID. The pseudorapidity coverage of \(E_{\mathrm {T}}^{\mathrm {miss, track}}\) is also limited to the tracking volume of \(|\eta | < 2.5\), which is smaller than the calorimeter coverage of \(|\eta | < 4.9\). For events without any reconstructed jets, the \(E_{\mathrm {T}}^{\mathrm {miss, track}}\) provides a small improvement of the \(E_{\text {T}}^{\text {miss}}\) resolution compared with the standard reconstruction algorithms [79].

4.5 Signal region definition

The signal region (SR), in which the measurement is performed, is defined as follows. To reduce the background from other diboson processes, events are required to have no additional electrons or muons with \(p_{\text {T}}\) \(> 10\) \(\text {Ge} \text {V}\) fulfilling loosened selection criteria. For this looser selection, the GradientLoose isolation requirement [66, 67] is used for both the electrons and the muons, which has an expected isolation efficiency of at least 95% (99%) at a \(p_{\text {T}}\) of 25 (60) \(\text {Ge} \text {V}\). Moreover, a less stringent MediumLH requirement [66] is applied for electron identification.

To suppress the background contribution from top-quark production, events are required to have no jets with \(p_{\text {T}}\) \(> 35\) \(\text {Ge} \text {V}\) and \(|\eta |< 4.5\), and no b-jets with \(p_{\text {T}}\) \(> 20\) \(\text {Ge} \text {V}\) and \(|\eta |< 2.5\). The jet \(p_{\text {T}}\) requirement is optimized to minimize the total systematic uncertainty in the measurement. The additional b-jet veto requirement allows the background from top-quark production to be suppressed by a factor of three, while keeping 97% of the \(WW\) signal events. For the remaining top-quark background events that pass all selection criteria, the b-jets are mainly produced outside the acceptance of the detector (\(p_{\text {T}}\) \(< 20\) \(\text {Ge} \text {V}\) or \(|\eta |> 2.5\)), according to MC simulation.

In addition, the requirements of \(E_{\mathrm {T}}^{\mathrm {miss, track}}\) \(> 20\) \(\text {Ge} \text {V}\) and \(p_{\text {T}} ^{e\mu }>30\) \(\text {Ge} \text {V}\) suppress the Drell–Yan background contributions. A further requirement on the invariant mass of the lepton pair (\(m_{e\mu } > 55\) \(\text {Ge} \text {V}\)) reduces the \(H\rightarrow WW^*\) contribution to a level below 1% of the expected signal. This last requirement is inverted compared with the one used in the recent measurement of \(H\rightarrow WW^*\) production at 13 \(\text {Te} \text {V}\) by ATLAS [23], making the two measurements statistically independent. Otherwise, both measurements use similar selections for events in the 0-jet category, although with lower lepton \(p_{\text {T}}\) requirements in the \(H\rightarrow WW^*\) analysis.

The lepton, jet, and event selection criteria are summarized in Table 1.

Table 1 Summary of lepton, jet, and event selection criteria for WW candidate events. In the table \(\ell \) stands for e or \(\mu \). The definitions of lepton identification and isolation are detailed in Refs. [66] and [67]

5 Background estimation

After applying all selection requirements described in Section 4, the dominant background is from top-quark production. This includes \(t{\bar{t}}\) and W-associated single-top production, which both yield two real leptons in the final state.

The non-prompt lepton background originates from leptonic decays of heavy quarks, hadrons misidentified as leptons, and electrons from photon conversions. Such lepton-like objects are collectively referred to as fake leptons. Events with fake leptons are mainly due to the production of \(W+\)jets, s- and t-channel single-top production, both with leptonic W-boson decay and a jet misidentified as a lepton, or from multijet production with two jets misidentified as leptons. Other processes can contribute as well, but are negligible in the signal region. Since most of these events – more than \(98\%\) – correspond to W+jets production, this background is referred to as W+jets background in the following.

Drell–Yan production of \(\tau \)-leptons (\(Z\rightarrow \tau \tau \)) can also give rise to the \(e\mu \) final state. Other diboson (WZ, ZZ, \(W\gamma \) and \(Z\gamma \)) and triboson (VVV, where \(V=W,\ Z\)) production processes constitute a smaller background contribution. A summary table comparing the number of observed candidate events in data to the respective numbers of predicted signal and background events in the signal region can be found in Section 5.5.

5.1 Background from top-quark production

Background from top-quark production is estimated using a partly data-driven method [6, 80], in which the top-quark contribution is extrapolated from a control region (top CR) to the signal region. The top CR is selected by applying the WW signal selection except for the b-jet and jet-veto requirements. To reduce the \(WW\) signal contamination in this control region, an additional requirement on the scalar sum of the transverse momenta of leptons and jets, \(H_{\text {T}} > 200\) \(\text {Ge} \text {V}\), is applied. The remaining non-top-quark contribution estimated by MC simulation is subtracted and the resulting number of top-quark events, \(N^{\text {top}}_{\text {CR}}\), is corrected for the \(H_{\text {T}}\) cut efficiency, \(\epsilon _{H_{\text {T}}}\), using top-quark MC samples. With the efficiency for top-quark events to satisfy the jet-veto requirement, \(\epsilon _{\text {jet-veto}}\), the top-quark background contribution in the signal region can be calculated as:

$$\begin{aligned} N^{\text {top}}_{\text {SR}} = \frac{N^{\text {top}}_{\text {CR}}}{\epsilon _{H_{\text {T}}}} \times \epsilon _{\text {jet-veto}}~. \end{aligned}$$

The jet-veto efficiency, which mainly quantifies the fraction of top events with jets below the jet-veto and b-jet-veto \(p_{\text {T}}\) thresholds, is calculated from simulation, with an extra correction factor [6, 80]:

$$\begin{aligned} \epsilon _{\text {jet-veto}} = \epsilon _{\text {jet-veto}}^\text {MC}\times \left( \frac{\epsilon _\text {single-jet-veto}^\text {Data}}{\epsilon _\text {single-jet-veto}^\text {MC}}\right) ^{\langle n_\text {jets} \rangle } \end{aligned}$$
(1)

where \(\epsilon _\text {single-jet-veto}\) is defined as the fraction of top-quark events that contain no jets other than the b-tagged jet, and \(\epsilon _{\text {jet-veto}}^\text {MC}\) extrapolates the top-quark MC prediction from the top CR (without \(H_{\text {T}}\) requirement) to the signal region. The \(\epsilon _\text {single-jet-veto}\) is determined both in data and simulation using events with two leptons, the same requirements on \(E_{\mathrm {T}}^{\mathrm {miss, track}}\), \(p_{\text {T}} ^{e\mu }\) and \(m_{e\mu }\) as for the signal selection, and at least one b-tagged jet. The small contributions to this region of the signal and other background contributions, mainly W+jets production, are subtracted before the calculation of \(\epsilon _\text {single-jet-veto}^\text {Data}\). The ratio \(\epsilon _\text {single-jet-veto}^\text {Data}/\epsilon _\text {single-jet-veto}^\text {MC}\) then corrects for differences in the veto efficiency for a single jet between data and simulation. It is found to be consistent with one. The exponent \(\langle n_\text {jets} \rangle \) represents the average number of jets in the top CR and is measured to be approximately 2.5 in both data and top-quark background simulation. It is varied by \(\pm 1.0\) as part of the uncertainty in the method to conservatively cover \(\langle n_\text {jets} \rangle \) variations in different control regions as well as variations due to detector uncertainties and modelling, with a small impact (1.8%) on the total uncertainty in the top-quark background estimate.

Fig. 2
figure 2

Inputs to the partly data-driven method for the top background estimate as a function of the \(p_{\text {T}}\) of the leading lepton: (upper left) events selected in data and in simulation in the top CR, with a requirement of \(H_{\text {T}}>200\,~\text {Ge} \text {V}\) applied, (upper right) the \(H_{\text {T}}\) cut efficiency \(\epsilon _{H_{\text {T}}}\), (lower left) the MC-based jet-veto efficiency \(\epsilon _\text {jet-veto}^\text {MC}\), and (lower right) the efficiency ratio \(\epsilon _\text {single-jet-veto}^\text {Data} / \epsilon _\text {single-jet-veto}^\text {MC}\). The latter is constant within uncertainties, and therefore replaced by the inclusive efficiency ratio (dashed line). In all figures, statistical and systematic uncertainties are displayed as hatched bands

The top-quark background estimate includes detector uncertainties in addition to the uncertainties in the method. Modelling uncertainties are determined using alternative MC samples and include the modelling of the parton shower, extra QCD radiation and the effect of the choice of generators. Interference effects between Wt and \(t\bar{t}\) are also considered. These modelling uncertainties are estimated by comparing the results from different MC samples described in Section 3. The cross-section uncertainty is taken to be 6% for \(t\bar{t}\) [52, 81,82,83,84,85,86] and 10% for Wt production [53, 87]. The total uncertainty in the top-quark background estimate in the signal region is about 12% using this partly data-driven approach, making use of cancellations of systematic uncertainties in the ratio \(\epsilon _{\text {jet-veto}}^\text {MC}/(\epsilon _\text {single-jet-veto}^\text {MC})^{\langle n_\text {jets} \rangle }\) in Eq. (1). It is dominated by the b-tagging and modelling uncertainties. The contribution of the \(t{\bar{t}}\) and Wt background to the total expected yield in the signal region is about 25% (17% \(t{\bar{t}}\) and 8% Wt).

The differential top-quark background contribution and its uncertainties are evaluated by applying the same procedure in each bin of the measured observables. As an example, Figure 2 shows the relevant quantities used in this partly data-driven method, as a function of the transverse momentum of the leading (highest \(p_{\text {T}} \)) lepton. The systematic uncertainties in \(N^{\text {top}}_{\text {SR}}\) are significantly reduced due to the systematics cancellations compared with the uncertainty bands from Figure 2. The decrease of \(\epsilon _\text {jet-veto}^\text {MC}\) at high leading lepton \(p_{\text {T}}\) is due to an increase in the typical \(p_{\text {T}}\) of extra jets which recoil against the leptons, nearing the jet-veto \(p_{\text {T}}\) threshold, and hence reducing the probability to still pass the jet veto. Since the efficiency ratio, \(\epsilon _\text {single-jet-veto}^\text {Data} / \epsilon _\text {single-jet-veto}^\text {MC}\), is found to be independent of any kinematic variable, the single value of \(0.98 \pm 0.05\) is used for all differential distributions. This is shown as a dashed line in the lower right panel of Figure 2.

5.2 Background from Drell–Yan production

The estimate of the Drell–Yan background process is based on MC simulation, with a \(5\%\) theoretical cross-section uncertainty [88]. A validation region dominated by Drell–Yan events is defined with the same selections as for the signal region, but with the \(e\mu \) invariant mass required to be \(45~\text {Ge} \text {V}<m_{e\mu }<80\) \(\text {Ge} \text {V}\)  and with the events failing either the \(p_{\text {T}} ^{e\mu }\)- or the \(E_{\mathrm {T}}^{\mathrm {miss, track}}\)-requirement to make the sample orthogonal to that in the signal region. Good agreement between the data and the simulation is observed in this region. The shape uncertainty is evaluated by using an alternative MC event generator, as detailed in Section 3, and includes uncertainties due to the modelling of the acceptance. The total uncertainty in the Drell–Yan background is 11% and the contribution of this background in the signal region is found to be 4%.

5.3 Background from W+jets production

The yield of W+jets is estimated by comparing in data the number of events with leptons satisfying either of two alternative sets of selection requirements, together with the \(WW\) signal selection criteria, following the same procedure as that described in Ref. [6]. The loose lepton selection criteria are defined such that the signal sample is a subset of the loose lepton sample. For electrons, the loose selection corresponds to the MediumLH quality definition [66] and no isolation requirements are imposed. For muons, the loose selection is the same as for signal muons, except that the isolation requirement is omitted. The tight selection criteria are the same as those used for the signal selection. With the introduction of real-lepton and fake-lepton efficiencies, a system of four equations can be solved to estimate the number of W+jets events. Here, the number of events that have exactly one loose muon (electron) and one tight electron (muon), two loose leptons or two tight leptons, are used. The real-lepton (fake-lepton) efficiency used in these equations is defined as the probability for prompt (fake) leptons selected with the loose criteria to satisfy the tight selection criteria.

The efficiencies for real electrons (muons) are determined using MC simulation, with data-to-MC correction factors [66, 67] applied. The efficiencies for fake electrons (muons) are measured using a multijet data sample, in a control region with exactly one loose electron (muon) and between one and three jets.

Events in this control region are also required to have low \(E_{\mathrm {T}}^{\mathrm {miss, track}}\) and low transverse massFootnote 3 \(m_{\mathrm {T}}\), to fulfil angular requirements between \(E_{\mathrm {T}}^{\mathrm {miss, track}}\) and the jets in the event, and to have no b-tagged jets. Real-lepton contributions to the control region are estimated using MC simulation and are subtracted.

Both the real- and fake-lepton efficiencies are derived as functions of \(p_{\text {T}}\) and \(\eta \) of the lepton. This is sufficient to describe the most important correlations with the differential distributions studied. Moreover, as the loose lepton selection in the W+jets background estimate at low lepton-\(p_{\text {T}}\) (\(p_{\text {T}} <50\) \(\text {Ge} \text {V}\) for muons or \(p_{\text {T}} <60\) \(\text {Ge} \text {V}\) for electrons) is typically looser than in the trigger selection, the efficiencies are provided separately for low-\(p_{\text {T}}\) electrons or muons that satisfy or fail to satisfy the trigger selection requirements. The fake-lepton efficiency for the non-triggered leptons is estimated using events recorded with triggers that have lower muon-\(p_{\text {T}}\), only MediumLH electron quality and no lepton isolation requirements, but only record a fraction of the events satisfying these criteria.

The uncertainty in the \(W\)+jets background is directly related to the uncertainties in the real- and fake-lepton efficiencies. For real-lepton efficiencies, these take into account uncertainties in electron and muon reconstruction and isolation correction factors. Uncertainties in the fake lepton efficiencies include variations in the control region definition, as well as normalization and shape uncertainties in the subtracted contributions from other processes in the control region. The control region variations are designed to cover the uncertainty in the flavour composition of the jets faking leptons, and include variations of the \(m_{\mathrm {T}}\) requirement and the number of b-tagged jets.

The total uncertainty in the W+jets yield is 90% and is dominated by the uncertainty in the fake and real electron efficiencies, because of the greater contribution of electron fakes to the W+jets background. The W+jets background amounts to 3% of the expected yield in the signal region.

The differential W+jets distributions necessary for the differential cross-section measurements are also determined in a fully data-driven way, by evaluating the same system of linear equations [6] in each bin of the differential distributions.

The predicted contributions to the backgrounds from W+jets are validated using a data control sample in which the two selected leptons are required to have the same electric charge (same-sign) and satisfy all the other selection requirements. Figure 3 shows the pseudorapidity difference between the leptons and the transverse momentum of the sub-leading lepton for this same-sign control sample. The predictions and the data agree well.

Fig. 3
figure 3

Distributions of the pseudorapidity difference between the leptons (left) and the transverse momentum of the sub-leading lepton (right) for the same-sign validation region. The uncertainties shown include statistical and systematic uncertainties

5.4 Background from multi-boson production

The estimate of the diboson background from WZ, ZZ, \(W\gamma \) and \(Z\gamma \) processes is based on MC simulation. These processes contribute about 3% to the total number of events. The uncertainty in the cross-section for these diboson processes is taken as 10% [89, 90] and variations in the shape and the acceptance are considered for WZ and ZZ production by using alternative MC generators, as detailed in Section 3.

The \(V\gamma \) background simulation is validated in data using the events passing the same selection as for the signal region, except inverting the electron identification criteria and requiring the reconstructed electron track to have no hit in the innermost layer of the pixel detector. The WZ background simulation is validated in data using events that allow for the presence of a third loosely isolated lepton with \(p_{\text {T}} >10\) \(\text {Ge} \text {V}\) and require the same-flavour lepton pair to be of opposite sign and with invariant mass of \(80~\text {Ge} \text {V}<m_{ee/\mu \mu }<100~\text {Ge} \text {V}\), while otherwise passing the signal region selection. Good agreement between the data and the simulation is found in both regions.

The background from triboson production (WWW, WWZ, WZZ and ZZZ) is less than 0.1% and is evaluated using MC simulation. The cross-section uncertainty is taken as \(30\%\) [89].

5.5 WW candidate events and estimated background yields

After applying all the selection requirements, 12 659 events are observed in data, with a contribution of 65% from \(WW\) production, which is estimated using simulation (see Section 3). A summary of the data, signal, and background yields is shown in Table 2. Kinematic distributions comparing the selected data with the signal and backgrounds in the signal region are shown in Figure 4. Fair agreement between data and expectations is observed for the overall normalization and the shapes of various kinematic distributions. Small underpredictions in the peak region of the leading lepton \(p_{\text {T}}\) distribution, the low \(m_{e\mu }\) region and a small downward trend in the ratio of the data to expectations in the \(\Delta \phi _{e\mu }\) distribution have also been observed in the previous ATLAS measurement at \(\sqrt{s}=8\) \(\text {Te} \text {V}\)[6]. The trend in the \(\Delta \phi _{e\mu }\) distribution was also observed at \(\sqrt{s}=8\) \(\text {Te} \text {V}\) by CMS Collaboration [8]. For the \(\Delta \phi _{e\mu }\) distribution, the largest discrepancy between data and expectations (of about three standard deviations) is observed in the range \(1.3< \Delta \phi _{e\mu } < 1.6\).

Table 2 Number of events observed in data, compared with the numbers of predicted signal and background events in the signal region. The systematic uncertainties, described in Section 7, do not include the uncertainty in the integrated luminosity. The uncertainties in the total background and in the sum of signal and background are the sums in quadrature of the uncertainties in the various background and signal sources
Fig. 4
figure 4

Kinematic distributions of the selected data events after the full event selection (from left to right and top to bottom): \(p_{\text {T}}^{\text {lead}~\ell }\), \(m_{e\mu }\), \(p_{\mathrm {T}}^{e\mu }\), \(|y_{e\mu }|\), \(\Delta \phi _{e\mu }\) and \(|\cos {\theta ^*}|\). Data are shown together with the predictions of the signal and background production processes. Statistical and systematic uncertainties in the predictions are shown as hatched bands. The lower panels show the ratio of the data to the total prediction. An arrow indicates that the point is off-scale. The last bin includes the overflow

6 Fiducial cross-section determination

The \(WW\) cross-section is evaluated in the fiducial phase space of the \(e\mu \) decay channel, as defined in Table 3. In simulated events, electrons and muons are required to originate from one of the W bosons produced in the hard scatter, and the momenta of photons emitted in a cone \(\Delta R = 0.1\) around the lepton direction are added to the lepton momentum after QED final-state radiation to form ‘dressed’ leptons [91]. Final-state particles with lifetimes greater than 30 ps are clustered into jets (referred to as particle-level jets) using the same algorithm as for detector-level jets, i.e. the anti-\(k_t\) algorithm with radius parameter \(R = 0.4\). The selected charged leptons and any neutrino in the event are not included in the jet clustering. The fiducial phase space at particle level does not make any requirement on jets containing b-quarks. The missing transverse momentum is defined at particle level as the transverse component of the vectorial sum of the neutrino momenta. Its magnitude is denoted in Table 3 by \(E_{\text {T}}^{\text {miss}} \).

The fiducial cross-section is obtained as follows:

$$\begin{aligned} \sigma ^{\mathrm {fid}} _{WW \rightarrow e \mu } = \frac{N_{\mathrm{obs}} - N_{\mathrm{bkg}}}{C\, \times {{{\mathcal {L}}}} }, \end{aligned}$$

where \({{{\mathcal {L}}}}\) is the integrated luminosity, \(N_{\mathrm{obs}}\) is the observed number of events, \(N_{\mathrm{bkg}}\) is the estimated number of background events and C is a factor that accounts for detector inefficiencies, resolution effects and contributions from \(\tau \)-lepton decays. The C factor is defined as the ratio of the number of reconstructed WW events after the final selection with electrons or muons in the final state (including electrons or muons from \(\tau \)-lepton decays) to the number of WW events generated in the fiducial region where only direct decays of W bosons to electrons and muons are allowed. The C factor takes into account the contribution to the \(WW\) signal originating outside of the fiducial phase space. This contribution is estimated from MC simulation to be about 21% of the expected reconstructed signal, about 40% of which originates from \(\tau \)-lepton decays. The C factor has a value of 0.613 with an uncertainty of 3%, including experimental and unfolding method sources, as detailed in Section 7.

The fiducial cross-section as a function of the jet-veto \(p_{\text {T}}\) threshold is determined using the same method, but modifying the selection requirements to exclude events with jets above a transverse momentum of 30 \(\text {Ge} \text {V}\), 35 \(\text {Ge} \text {V}\), 40 \(\text {Ge} \text {V}\), 45 \(\text {Ge} \text {V}\), 50 \(\text {Ge} \text {V}\), 55 \(\text {Ge} \text {V}\), and 60 \(\text {Ge} \text {V}\), respectively. The values for C are determined for each threshold and increase from 0.598 to 0.625.

The differential cross-sections are determined using an iterative Bayesian unfolding method [92, 93] with one iteration for \(m_{e\mu }\), \(p_{\text {T}}^{\text {lead}~\ell }\), \(|y_{e\mu }|\), \(\Delta \phi _{e\mu }\) and \(|\cos {\theta ^*}|\), and two iterations for \(p_{\mathrm {T}}^{e\mu }\). The number of iterations is optimized to find a balance between too many iterations, causing high statistical uncertainties in the unfolded distributions, and too few iterations, which can bias the measurement towards the MC prediction. The unfolding procedure corrects for migrations between bins in the distributions during the reconstruction of the events, and applies fiducial as well as reconstruction efficiency corrections. The fiducial corrections take into account events that are reconstructed in the signal region, but originate from outside the fiducial region; the reconstruction efficiency corrects for events inside the fiducial region that are not reconstructed in the signal region due to detector inefficiencies. Tests with MC simulation demonstrate that the method is successful in retrieving the true distribution in the fiducial region from the reconstructed distribution in the signal region.

Table 3 Definition of the \(WW\rightarrow e\mu \) fiducial phase space

7 Systematic uncertainties

Systematic uncertainties in the WW cross-section measurements arise from the reconstruction of leptons and jets, the background determination, pile-up and integrated luminosity uncertainties, as well as the procedures used to correct for detector effects, and theoretical uncertainties in the signal modelling.

For leptons and jets, uncertainties in the momentum or energy scale and resolution are considered [67, 72, 94]. Uncertainties in the lepton reconstruction and identification efficiencies [66, 67] as well as the efficiency of the jet vertex tagging requirements [74, 75] in the simulation are taken into account. Uncertainties in the b-tagging, which mainly stem from the top-quark background contributions, are also taken into account based on the studies in Refs. [95, 96]. The impact of uncertainties in the scale and resolution of \(E_{\mathrm {T}}^{\mathrm {miss, track}}\) are estimated as discussed in Ref. [79]. The pile-up modelling uncertainty is evaluated by varying the number of simulated pile-up interactions by its uncertainty of 10% of the nominal value. The variations are designed to cover the uncertainty in the ratio of the predicted to the measured cross-section of non-diffractive inelastic events producing a hadronic system of mass \(m_X > 13~\text {Ge} \text {V}\) [97], where the nominal value of \(\sigma _{\text {inel}}=74\) mb is used in the simulation.

Uncertainties in MC-based background processes include variations of the shapes of predicted distributions, the normalization, and the statistical uncertainties in the simulation, in addition to the full set of detector-related uncertainties. The first two are estimated as discussed in Sections 3 and 5. The uncertainties in the background from top-quark and W+jets production are mitigated by the use of the data-driven methods described in Section 5.

Uncertainties due to the unfolding procedure and the modelling of the signal process are considered by repeating the cross-section extraction with modified inputs. The uncertainty due to the choice of generator for the hard interaction, the parton shower model and the underlying-event modelling for the MC-based unfolding inputs, is estimated by using Sherpa 2.2.2 instead of Powheg-Box +Pythia 8 for \(q\bar{q}\)-initiated WW production, with the samples detailed in Section 3. The impact of mismodelling of the data by Powheg-Box +Pythia 8 for each observable is estimated by reweighting the distribution at generator level to improve the agreement between data and simulation after event reconstruction. The obtained prediction at detector level, which is then very similar to data, is unfolded with the normal inputs and the difference from the reweighted prediction at generator level is considered as an uncertainty. The impact of statistical uncertainties in the nominal signal simulation is estimated using pseudo-data. The theory uncertainties cover PDF and scale variations of the unfolding inputs. The PDF uncertainty is estimated as the 68% confidence level (CL) envelope of the CT10[37] prediction. The uncertainty associated with higher-order QCD corrections is evaluated by varying the renormalization (\(\mu _\textsc {r}\)) and factorization (\(\mu _\textsc {f}\)) scales independently by factors of 2 and 0.5 with the constraint \(0.5\le \mu _\textsc {f}/\mu _\textsc {r} \le 2\).

The uncertainty in the combined 2015+2016 integrated luminosity is 2.1%. It is derived from the calibration of the luminosity scale using x-y beam-separation scans, following a methodology similar to that detailed in Ref. [98], and using the LUCID-2 detector for the baseline luminosity measurements [99]. The LHC beam energy uncertainty is estimated to be 0.1% [100]. It affects the signal cross-section by less than 0.2% and is not considered in the total uncertainty.

A summary of the systematic uncertainties in the fiducial cross-section measurement is shown in Table 4. The total uncertainty is dominated by the b-tagging uncertainty (3.4%), the jet energy scale uncertainty (3%), and the modelling of the W+jets (3.1%) and top-quark (2.6%) backgrounds.

Table 4 Relative uncertainties in the \(WW\) fiducial cross-section measurement

8 Theoretical predictions

Theoretical predictions are calculated for the fiducial and the differential cross-sections and include the \(q\bar{q} \rightarrow W W \) and \(gg\rightarrow WW\) sub-processes. The \(q{\bar{q}}\)-initiated production makes up 95% of the total cross-section, while the non-resonant and resonant gg-initiated sub-processes account for 5%.

NNLO predictions for the \(q\bar{q} \rightarrow W W \) production cross-sections are determined using the MATRIX program [101,102,103], including off-shell effects and the non-resonant and resonant gluon-initiated contributions at LO. For improved precision, the MATRIX prediction for \(q\bar{q}\)-initiated production is also complemented with NLO corrections to gluon-induced WW production [104] and with extra NLO EW corrections that also include the photon-induced (\(\gamma \gamma \rightarrow WW \)) contribution [105]. For all these predictions, the NNPDF 3.1 LUXqed PDF set is used [106, 107], the renormalization and factorization scales are set to \(m_{WW}/2\), and the scale uncertainties are evaluated according to Ref. [108]. The PDF uncertainty corresponds to the 68% CL variations of the NNPDF set. The MATRIX prediction itself does not include EW radiative effects from leptons in contrast to the MC simulation used to define leptons in the fiducial region, where photons from the parton shower outside a cone of \(\Delta R=0.1\) can be present. The application of NLO EW corrections compensates, at least partially, for this difference. It is observed that the NLO corrections to the \(gg\rightarrow WW\) sub-process increase the fiducial cross-section by 3%, whereas the NLO EW corrections, applied to the sum of \(q\bar{q}\)- and gg-initiated production, decrease it by 6%.

NLO predictions for \(q\bar{q} \rightarrow W W \) production, which are matched to a parton shower (\(q\bar{q}\) NLO+PS), are determined using either Powheg-Box interfaced to Pythia 8 or Herwig++, or Sherpa 2.2.2. They are combined with the Sherpa +OpenLoops calculation for the gluon-induced non-resonant and resonant \(WW\) production (gg LO+PS). These predictions are described in detail in Section 3. The NLO+PS predictions also include photon final-state radiation and thus already part of the EW effects. Therefore no additional EW correction is applied.

A summary of fiducial cross-section predictions for \(WW\) production is given in Table 5. Predictions from the different generators matched to parton showers agree well among each other and with the fixed-order predictions. For the Sherpa 2.2.2 prediction, scale uncertainties are larger than for the Powheg-Box predictions because the Sherpa calculation includes matrix elements with higher jet multiplicities, which results in a larger uncertainty estimate when varying the renormalization and factorization scales in the matrix element calculation. For fixed-order predictions, scale uncertainties are large because they are evaluated according to Ref. [108].

Table 5 Predictions of the \(WW\) fiducial cross-section. Predictions matched to parton showers are normalized to inclusive fixed-order calculations

9 Results

9.1 Cross-section measurements and comparisons with theoretical predictions

The measured fiducial cross-section for \(WW \rightarrow e \mu \) production at \(\sqrt{s}=13\,\text {Te} \text {V}\) is:

$$\begin{aligned} \sigma _\text {fid}=\left( 379.1\pm 5.0\text { (stat)}\pm 25.4\text { (syst)} \pm 8.0\text { (lumi)} \right) \text {fb}~. \end{aligned}$$

The combined statistical and systematic uncertainty of the measurement, including the uncertainty in the luminosity, is 7.1%.

A comparison between the fiducial cross-section measurement and fixed-order theoretical calculations is shown in Figure 5. The measurement is compared with the NNLO QCD MATRIX predictions including the full set of QCD and EW corrections, discussed in detail in Section 8. The predictions agree well with the measurement.

Fig. 5
figure 5

Comparison of the measured fiducial cross-section with various theoretical predictions. Theoretical predictions are indicated as points with inner (outer) error bars denoting PDF (PDF+scale) uncertainties. The central value of the measured cross-section is indicated by a vertical line with the narrow band showing the statistical uncertainty and the wider band the total uncertainty including statistical and systematic uncertainties

The measured fiducial cross-sections as a function of the jet-veto \(p_{\text {T}}\) thresholds are shown in Figure 6. The fiducial cross-section rises by about 30% when accepting events containing jets with a transverse momentum of up to 60 \(\text {Ge} \text {V}\), as compared with 30 \(\text {Ge} \text {V}\). The measurement is compared to NNLO predictions from MATRIX (Figure 6, left), and to NLO+PS predictions from Powheg-Box +Pythia 8, Powheg-Box +Herwig++ and Sherpa 2.2.2 for \(q\bar{q}\)-initiated states, combined with Sherpa +OpenLoops (LO+PS) for the gg initial states (Figure 6, right). All three \(q\bar{q}\) NLO+PS predictions are normalized to the NNLO theoretical prediction for the total cross-section, with the gg LO+PS contribution normalized to NLO. With increasing jet-veto \(p_{\text {T}}\) threshold, the fiducial cross-section rises as it becomes more inclusive. All predictions agree within uncertainties with the data, but are consistently at the lower bound of these.

Fig. 6
figure 6

Comparison of the measured fiducial cross-section as a function of the jet-veto \(p_{\text {T}}\) threshold with various theoretical predictions. The measurement is compared with NNLO predictions from MATRIX on the left. This calculation does not include the NLO EW correction and is Born-level, whilst the measurement is conducted using dressed leptons, which might account for some of the differences seen. On the right a comparison with NLO+PS predictions from Powheg-Box +Pythia 8, Powheg-Box +Herwig++ and Sherpa 2.2.2 for \(q\bar{q}\) initial states, combined with Sherpa +OpenLoops (LO+PS) for the gg initial states is shown. All three \(q\bar{q}\) NLO+PS predictions are normalized to the NNLO theoretical prediction for the total cross-section, with the gg LO+PS contribution normalized to NLO. The measured cross-section values are shown as points with error bars giving the statistical uncertainty and solid bands indicating the size of the total uncertainty. Theoretical predictions are indicated as markers with hatched bands denoting PDF+scale uncertainties

The measured fiducial cross-sections as a function of \(p_{\text {T}}^{\text {lead}~\ell }\), \(m_{e\mu }\), \(p_{\mathrm {T}}^{e\mu }\), \(|y_{e\mu }|\), \(\Delta \phi _{e\mu }\) and \(|\cos {\theta ^*}|\) are shown in Figures 7 and 8. They are compared with the NNLO QCD predictions from MATRIX, including NLO corrections for \(gg\rightarrow WW\) production and extra NLO EW corrections, as well as with the same \(q\bar{q}\) NLO+PS predictions as stated above (combined with gg LO+PS) normalized to the NNLO (NLO) theoretical prediction for the total cross-section. All of these predictions provide a fair description of the data, except for low values of the \(p_{\text {T}}\) of the leading lepton as well as low values of invariant mass \(m_{e\mu }\) and \(\Delta \phi _{e\mu } <1.8\). For the \(p_{\text {T}}^{\text {lead}~\ell }\) distribution, Powheg-Box +Pythia 8 and Sherpa 2.2.2 underestimate the cross-section by up to 15%–20%. For the other two distributions, all predictions display similar underestimates of the measured differential cross-section but to slightly varying degrees, depending on the size of their uncertainties. The most consistent difference is observed at around \(\Delta \phi _{e\mu } \approx 1.5\). A similar underprediction of the data, shifted slightly to lower \(\Delta \phi _{e\mu }\) values (around \(\approx 0.5\)–1.0) was seen in both the ATLAS and CMS measurements at 8 \(\text {Te} \text {V}\)[6, 8] when compared with the predictions from a variety of MC generators. Global \(\chi ^2\) comparisons are carried out for all the predictions. They do not display any significant differences between predictions and data with the largest \(\chi ^2\) per degree of freedom being 18.5/14 when comparing the Sherpa 2.2.2 +Sherpa +OpenLoops prediction with the measured \(p_{\text {T}}^{\text {lead}~\ell }\) distribution.

Fig. 7
figure 7

Measured fiducial cross-sections of \(WW \rightarrow e \mu \) production for four of the six observables (from left to right and top to bottom): \(p_{\text {T}}^{\text {lead}~\ell }\), \(m_{e\mu }\), \(p_{\mathrm {T}}^{e\mu }\), and \(|y_{e\mu }|\). The measured cross-section values are shown as points with error bars giving the statistical uncertainty and solid bands indicating the size of the total uncertainty. The results are compared with the NNLO prediction with extra NLO EW corrections and NLO corrections for \(gg\rightarrow WW \) production, and with NLO+PS predictions from Powheg-Box +Pythia 8, Powheg-Box +Herwig++ and Sherpa 2.2.2 for \(q\bar{q}\) initial states, combined with Sherpa +OpenLoops (LO+PS) for the gg initial states. All three \(q\bar{q}\) NLO+PS predictions are normalized to the NNLO theoretical prediction for the total cross-section, with the gg LO+PS contribution normalized to NLO. Theoretical predictions are indicated as markers with hatched bands denoting PDF+scale uncertainties

Fig. 8
figure 8

Measured fiducial cross-sections of \(WW \rightarrow e \mu \) production for two of the six observables: \(\Delta \phi _{e\mu }\) and \(|\cos {\theta ^*}|\). The measured cross-section values are shown as points with error bars giving the statistical uncertainty and solid bands indicating the size of the total uncertainty. The results are compared with the NNLO prediction with extra NLO EW corrections and NLO corrections for \(gg\rightarrow WW \) production, and with NLO+PS predictions from Powheg-Box +Pythia 8, Powheg-Box +Herwig++ and Sherpa 2.2.2 for \(q\bar{q}\) initial states, combined with Sherpa +OpenLoops (LO+PS) for the gg initial states. All three \(q\bar{q}\) NLO+PS predictions are normalized to the NNLO theoretical prediction for the total cross-section, with the gg LO+PS contribution normalized to NLO. Theoretical predictions are indicated as markers with hatched bands denoting PDF+scale uncertainties

9.2 Limits on anomalous gauge couplings

The self-couplings of the electroweak gauge bosons can be probed via the WWZ and \(WW\gamma \) vertices, present when the W bosons are produced via s-channel \(Z/\gamma ^*\) exchange, as shown in Figure 1. New physics processes at a high energy scale (\(\Lambda \)) that alter \(WW\) production can be described by operators with mass dimensions larger than four in an effective field theory (EFT) framework [109]. The higher-dimensional operators of the lowest order from purely EW processes have dimension six, and can generate anomalous triple-gauge-boson couplings (aTGC). A deviation from the SM in measured \(WW\) production rates or in certain kinematic distributions, as predicted by these theories, could provide evidence for physics beyond the SM. In the EFT framework employed, there are five dimension-six operators (\({\mathcal {O}}_i\)) and the relevant EFT coefficients (coupling constants) are: \(c_{WWW}\), \(c_{W}\), \(c_{B}\), \(c_{{\tilde{W}}WW}\) and \(c_{{\tilde{W}}}\) [109]. The dimensionless coefficients (\(c_i\)) parameterize the strength of the coupling between new physics and SM particles

$$\begin{aligned} {\mathcal {L}} = {\mathcal {L}}_{\text {SM}}+ \sum _i^{} \frac{c_i}{\Lambda ^2}{\mathcal {O}}_i~. \end{aligned}$$

Constraints on the EFT coefficients are determined by considering only one operator at a time using the unfolded leading lepton \(p_{\text {T}}\) (\(p_{\text {T}}^{\text {lead}~\ell }\)) fiducial cross-section, which was identified as the unfolded distribution most sensitive to the effect of the five operators.

Templates of the \(p_{\text {T}}^{\text {lead}~\ell }\) distribution representing the pure SM contribution, the aTGC contribution, and the interference between the SM and aTGC contributions at LO are prepared at generator level using MadGraph5_aMC@NLO version 2.6.3.2 [110], interfaced to Pythia 8.212 with the A14 tune for parton showering and hadronization. The relative size of the SM cross-section modification increases with \(p_{\text {T}}^{\text {lead}~\ell }\) so that the last measured bin is most sensitive to the aTGC effects. To ensure a good agreement of the MadGraph5_aMC@NLO prediction with the baseline SM prediction, a bin-wise correction, determined as the ratio of the pure SM contributions from Powheg-Box +Pythia 8 (normalized to the NNLO cross-section) and MadGraph5_aMC@NLO, is applied.

It is verified that the pure SM assumption used in the unfolding procedure introduces no bias to the extraction of limits from the unfolded cross-section. A reweighting procedure implemented in the MadGraph5_aMC@NLO [111] generator is used to obtain multiple signal predictions that include aTGCs of a magnitude corresponding to the upper limits set by the Run 1 analysis [6]. The simulation is interfaced to Herwig 6.5 [112] and passed through the ATLAS detector simulation. Neither the reconstruction efficiency and the fiducial corrections nor the bin-to-bin migrations are significantly different.

The measured \(p_{\text {T}}^{\text {lead}~\ell }\) cross-section and the MadGraph5_aMC@NLO prediction, interfaced to Pythia 8, as described above, are used to construct a likelihood function, in which statistical and systematic measurement uncertainties are modelled by a multivariate Gaussian distribution. Systematic uncertainties in the theory prediction are considered as nuisance parameters, each constrained with a Gaussian distribution. Since electroweak radiative effects are already partially taken into account in the parton shower of the MadGraph5_aMC@NLO prediction, the effect of applying NLO EW corrections to the \(p_{\text {T}}^{\text {lead}~\ell }\) distribution in addition is considered as a further systematic uncertainty.

Frequentist confidence intervals for the EFT coefficients are computed from values of a profile likelihood ratio test statistic [113]. Observed and expected 95% CL intervals for the EFT coefficients are summarized in Table 6. Due to the higher centre-of-mass energy, the limits reported here are more restrictive than those previously published by the ATLAS and CMS Collaborations in the \(WW\) final state [6, 8]. Compared to results from inclusive WZ production [114] and electroweak W and Z boson production in association with two jets [115], both at \(\sqrt{s}=13~\text {Te} \text {V}\), the limits on \(c_{B}/\Lambda ^2\) from this analysis are the most stringent (by about a factor 2), while those on \(c_{WWW}/\Lambda ^2\) and \(c_{W}/\Lambda ^2\) are weaker by factors of about 1.6 – 4. Limits on the CP-odd operators \({\mathcal {O}}_{{\tilde{W}}WW}\) and \({\mathcal {O}}_{{\tilde{W}}}\) are not provided by the other two measurements.

The sensitivity to dimension-six operators mostly stems from their direct effect on the \(WW\) cross-section as a function of \(p_{\text {T}}^{\text {lead}~\ell }\), except for the \(c_{W}\) coefficient where both the direct contribution and the interference between the SM and terms containing EFT operators contribute equally.

Table 6 The expected and observed 95% CL intervals for the anomalous coupling parameters of the EFT model [109]. There is a change in convention relative to Ref. [6] that changes the sign on some of these parameters

10 Conclusion

The cross-section for the production of \(W^{+}W^{-}\) pairs in pp collisions at \(\sqrt{s}=13\) \(\text {Te} \text {V}\) (with subsequent decays into \(WW\rightarrow e\nu _e\mu \nu _\mu \)) is measured in a fiducial phase space that excludes the presence of jets with transverse momentum above 35 \(\text {Ge} \text {V}\). The measurement is performed with data recorded by the ATLAS experiment at the LHC in 2015 and 2016, which correspond to an integrated luminosity of \(36.1\ \text {fb}^{-1}\). The measured fiducial cross-section is \(\sigma _\text {fid}=\left( 379.1\pm 5.0\text { (stat)}\pm 25.4\text { (syst)}\pm 8.0\text { (lumi)} \right) \text {fb}\), and is found to be consistent with theoretical predictions, including NNLO QCD and NLO EW corrections. The fiducial cross-section is also measured as a function of the transverse momentum threshold for the jet veto. Differential cross-sections are measured as a function of kinematic and angular variables of the final-state charged leptons and are compared with several predictions from perturbative QCD calculations. Data and theory show fair agreement for all differential distributions. The distribution of the transverse momentum of the leading lepton is used to investigate anomalous triple-gauge-boson coupling parameters. No evidence for anomalous WWZ and \(WW\gamma \) couplings is found, hence limits on their magnitudes are set. These limits are more restrictive than those derived at \(\sqrt{s}=8~\text {Te} \text {V}\).