Measurement of W boson angular distributions in events with high transverse momentum jets at √ s = 8 TeV using the ATLAS detector

The W boson angular distribution in events with high transverse momentum jets is measured using data collected by the ATLAS experiment from proton–proton collisions at a centre-of-mass energy √ s = 8 TeV at the Large Hadron Collider, corresponding to an integrated luminosity of 20 . 3 fb − 1 . The focus is on the contributions to W + jets processes from real W emission, which is achieved by studying events where a muon is observed close to a high transverse momentum jet. At small angular separations, these contributions are expected to be large. Various theoretical models of this process are compared to the data in terms of the absolute cross-section and the angular distributions of the muon from the leptonic W decay.


Introduction
Precision measurements of Standard Model processes at the Large Hadron Collider (LHC) are crucial for probing the fundamental structure of the strong and electroweak interactions. The data sample corresponding to an integrated luminosity of 20.3 fb −1 collected by the ATLAS experiment from proton-proton (pp) collisions at a centre-of-mass energy √ s = 8 TeV at the LHC allows detailed study of perturbative quantum chromodynamics (perturbative QCD, pQCD) and real and virtual electroweak (EW) corrections that impact measurements of W + jets production.
At high energies, real emission of weak bosons in dijet events can contribute significantly to inclusive W + jets measurements [1][2][3][4][5]. In leading-order (LO) calculations of W + 1-jet production, the W boson is balanced by the recoil hadronic jet, often referred to as back-to-back production. At next-to-leading order (NLO), QCD and EW corrections to W + 1-jet processes appear, both as real and virtual contributions. In the case of real W boson emission from an initial-or final-state quark, these contributions scale as O α ln 2 p T, j /m W , where α is the gauge coupling of the unified EW theory, p T, j is the transverse momentum of the jet and m W is the W boson mass, and have a collinear enhancement in the distribution of the angular distance between the W boson and the closest jet. The collinear enhancement arises from collinear and infrared divergences which would be present in the limit of E-mail address: atlas.publications@cern.ch.
vanishing W boson mass, but which are regulated by its finite mass. The procedures to correctly account for collinear parton radiation, such as massless gluon emission, are well known and led to the introduction of (Sudakov) parton showering of the initial-and final-state partons in Monte Carlo generators for QCD as well as quantum electrodynamics (QED) contributions. An analogous procedure is available for the emission of real W bosons [6]. The effect of real W boson emission can be probed by isolating events for which the cancellation between real and virtual corrections is incomplete, for example by studying the region of small angular separation between a jet and the W boson. This region also contains LO contributions from W + 2-jets, as well as corrections to that process, which must be included for accurate predictions.
Due to this complex mixture of W + 1-jet and W + 2-jet processes, and the relevant QCD and EW corrections to both, comparisons of measurements to predictions using multiple approaches for estimating those corrections are crucial. Comparisons of the measured angular spectra of the muon from the W boson with fixed-order predictions at NLO and next-to-next-to-leading-order (NNLO) and with programs with electroweak parton showers help in understanding the accuracy of these predictions. The measurements presented here focus on events that contain a muon and a jet with transverse momentum p T > 500 GeV. In this kinematic regime, contributions to W + jets processes from real W boson emission are enhanced in the region of small angular separation between the W boson decay products and the closest jet. The angular separation is defined as the distance between the muon and the closest jet, R(μ, jet) = ( φ) 2 + ( η) 2 , 1 hereafter referred to as R. Measurements of this angular separation thus provide precision tests of pQCD and electroweak predictions for the rate and pattern of real W boson emission. Real W boson emission, also termed collinear W production, is the dominant process for events with R < 2.4, and thus R < 2.4 is referred to as the collinear region. The significance of this higher-order contribution at small R is shown in Ref. [5]. For events with R > 2.4, the W boson is balanced by a hadronic recoil that may consist of one or more jets. These measurements of the R distribution probe a new region of phase space that has not been explicitly studied in detail.
Measurements of W + jets production by both the ATLAS and CMS experiments often remove portions of the collinear region by requiring that the lepton (e or μ) is separated from any jet by an angular distance of R > 0.5 [7,8]. By relaxing this requirement to R > 0.2 and focusing on the distribution of angular separation between the muon and the closest jet in events with at least one very high p T jet (p T > 500 GeV), it is possible to explicitly target real W emission with this measurement. Collinear W production may constitute an important background in searches for beyond the Standard Model physics that involve Lorentz-boosted top quarks [9], either in rare topologies or at high energies. If the W decay products are collinear with one of the jets, the structure of that jet can begin to resemble that of the three-pronged structure of a boosted top quark. While the rate for collinear W production is suppressed relative to dijet production with no W emission, hadronic W decays can cause a large increase in the measured jet mass. The result is that W emission from quarks at very high p T can yield single jets with definite substructure that resemble the boosted top-quark signals being searched for.

The ATLAS detector
The ATLAS detector [10,11] provides nearly full solid angle coverage around the pp collision point at the LHC.
The inner detector (ID) comprises a silicon pixel tracker closest to the beamline, a microstrip silicon tracker, and a straw-tube transition-radiation tracker at radii up to 108 cm. A thin solenoid surrounding the tracker provides a 2 T axial magnetic field enabling the measurement of charged-particle momenta. The overall ID acceptance spans the full azimuthal range in φ, and the range |η| < 2.5 for particles originating near the nominal LHC interaction region [12].
The electromagnetic (EM) and hadronic calorimeters are composed of multiple subdetectors spanning |η| < 4.9. The EM barrel calorimeter uses a liquid-argon (LAr) active medium, together with lead absorbers, and covers |η| < 1.45. In the region |η| < 1.7, the hadronic calorimeter is constructed from steel absorber and scintillator tiles and is separated into barrel (|η| < 1.0) and extendedbarrel (0.8 < |η| < 1.7) sections. The endcap (1.375 < |η| < 3.2) and forward (3.1 < |η| < 4.9) regions are instrumented with LAr calorimeters for EM as well as hadronic energy measurements. A muon spectrometer with three large air-core toroid magnet systems surrounds the calorimeters. The muon spectrometer measures the momentum of muons from their tracks, which are reconstructed with three layers of high-precision tracking chambers. 1 ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector and the z-axis along the beam pipe. The x-axis points from the IP to the centre of the LHC ring, and the y-axis points upward. Cylindrical coordinates (r, φ) are used in the transverse plane, φ being the azimuthal angle around the z-axis. The pseudorapidity is defined in terms of the polar angle θ as η = − ln tan(θ/2). These chambers provide coverage in the range |η| < 2.7, while dedicated fast chambers allow triggering in the region |η| < 2.4.
A three-level trigger system is used to record events for analysis. The different parts of the trigger system are referred to as the Level-1 trigger, the Level-2 trigger, and the Event Filter [13]. The Level-1 trigger is implemented in hardware and uses a subset of detector information to reduce the event rate to a design value of at most 75 kHz. The Level-1 trigger is followed by two software-based triggers, the Level-2 trigger and the Event Filter, which together reduce the event rate to a few hundred Hz.

Data and simulated samples
The measurement presented here is based on the entire 2012 pp dataset at a centre-of-mass energy of √ s = 8 TeV. Events are required to meet baseline quality criteria during stable LHC running periods. These data quality criteria primarily reject data with significant contamination from detector noise or issues in the readout [14] based upon individual assessments for each subdetector. The resulting dataset corresponds to an integrated luminosity of 20.3 fb −1 . The absolute luminosity scale is derived from beamseparation scans performed in November 2012. The uncertainty in the integrated luminosity is ±1.9% [15].
Simulated events from Monte Carlo (MC) generators are used for calculating the signal efficiency and estimating background in the signal region. The events are simulated using a GEANT4-based [16] full detector simulation [17]. In addition to the hard scatter, each event is overlaid with a number of additional pp collisions (pile-up) extracted from the distribution of the average number of pp interactions per bunch crossing μ observed in data. These additional pp collisions are generated with PYTHIA v8.160 [18] using the ATLAS A2 set of tuned parameters (A2 tune) [19] and the MSTW2008LO parton distribution function (PDF) set [20].
Events containing W + jets are generated with ALPGEN 2.14 [21], which implements MLM matching [22] of the matrix element calculation with parton showering. The W boson is produced as part of the matrix element calculations, allowing simulation of both collinear and back-to-back W + jets production. In the latter, the W boson is balanced by the hadronic recoil system. The matrix elements provided by ALPGEN are configured to allow up to five partons in the final state in addition to the W boson, including heavy-flavour production as well. The generator is interfaced with PYTHIA v6.427 [23] for parton showering and fragmentation. The CTEQ6L1 PDF set [24] is used. A K -factor is applied to these samples to correct the normalisation to a NNLO pQCD inclusive cross-section calculated with FEWZ [25] and the MSTW2008NNLO PDF set. A sample of events is also generated with PYTHIA v8.210 and using the CT10 NLO PDF set [26] in which W boson radiation can be produced via a weak parton shower.
Dijet events are generated with PYTHIA v8.165. Top-quark pair production is simulated with POWHEG-r2129 [27][28][29][30] interfaced with PYTHIA v6.426 with the P2011C [31] tune for parton showering and fragmentation. Diboson production is simulated with MC@NLO v4.07 [32]. Additional samples of diboson production are generated using SHERPA v1.43 [33] and these are used to estimate theoretical uncertainties in the diboson background estimation. The above samples are all generated using the CT10 NLO PDF set. Events containing Z + jets are generated with ALPGEN using the same configuration as the W + jets simulation above. Single top-quark production is a negligible background for this analysis and is not included.
All samples are normalised to their calculated inclusive crosssections. However, for the W + jets, dijets, tt and Z + jets samples, there is an additional correction applied to the normalisation, derived from the comparison of data and Monte Carlo simulations in the signal region and control regions. The process of deriving this correction is explained in detail in Section 4.

Baseline event selection
The topology of collinear W production involves two back-toback high-p T jets, one of which emits a nearby W boson. Events are required to contain at least one jet with p T > 500 GeV, as this is found to be sufficient to probe the kinematic region of interest. The probability of a collinear W emission from such a jet is estimated by PYTHIA v8.210 to be 0.15%. Over half of the production of W + jets in the phase space probed in this measurement is in the collinear region. A requirement for a second high-p T jet is not applied. Although both jets initially recoil from each other and have similar p T , the jet that emits the collinear W boson can lose a significant amount of energy to the muon and neutrino, neither of which are reconstructed as part of the jet energy. Requiring a second high-p T jet would impose an implicit maximum on the energy carried by the W boson and its decay products.
The analysis focuses on the leptonic decays of W bosons to muons in order to ensure a high reconstruction purity, and thus events are required to have exactly one muon. Events that contain an electron are rejected, which reduces the background by removing mixed-flavour dileptonic (electron plus muon) tt decays. Control regions are used to establish the normalisation of MC simulations of several background processes. These regions are defined by inverting various selection criteria used in the final measurement.
To reject non-collision background [34], events are required to contain at least one primary vertex consistent with the beaminteraction region, reconstructed from at least two tracks each with p track T > 400 MeV. The primary hard-scatter vertex is defined as the vertex with the highest (p track T ) 2 . To reject rare events contaminated by spurious signals in the detector, all anti-k t [35,36] jets with radius parameter R = 0.4 and p jet T > 20 GeV (see below) are required to satisfy the loosest jet-quality requirements discussed in Ref. [34]. These criteria are designed to reject non-collision background and significant transient noise in the calorimeters while maintaining an efficiency for good-quality events greater than 99.8% with as high a rejection of contaminated events as possible. In particular, this selection is very efficient in rejecting events that contain fake jets due to calorimeter noise.

Trigger selection
Events used in this analysis are selected by requiring that they pass at least one of two single-muon triggers [37]. The first trigger requires an isolated muon with p T > 24 GeV and the second trigger requires a muon with p T > 36 GeV with no isolation criteria applied. The track-based isolation used in the trigger requires that the scalar sum of the p T of all tracks within a cone of radius R = 0.2 around the muon is less than 12% of the muon p T .

Object reconstruction
Muons are reconstructed by combining tracks in the ID with tracks in the muon spectrometer [38]. They are required to have p T > 25 GeV and |η| < 2.4. To reduce contamination from semileptonic b-decays, in-flight pion and kaon decays and cosmic muons, their longitudinal impact parameter with respect to the primary vertex z 0 must satisfy |z 0 | sin θ < 0.5 mm and their transverse impact parameter with respect to the primary vertex d 0 must satisfy |d 0 |/σ (d 0 ) < 3. The selected offline reconstructed muon must also match the online muon that passed the trigger.
Jets are built using the anti-k t algorithm with a radius parameter of R = 0.4 from locally calibrated three-dimensional topological energy clusters [39]. The resulting jets are required to have p T > 100 GeV and |η| < 2.1.
The number of b-tagged jets for a given event is calculated using the MV1 tagger [40] on jets built using the anti-k t algorithm with R = 0.4. The jets considered for b-tagging have p T > 25 GeV and are reconstructed within |η| < 2.1. The MV1 tagger is configured to have a b-tagging efficiency of 70% in semileptonic tt events.
Electrons are reconstructed from a combination of a calorimeter energy cluster and a matched ID track [41,42]. They must meet a set of identification criteria (the so-called medium criteria of Ref. [41]). They are also required to have p T > 20 GeV and |η| < 2.47, excluding the transition region between the barrel and the endcap calorimeters (1.37 < |η| < 1.52). To reduce the contamination from semileptonic b-decays and misidentification, the same impact parameter requirements used for muons are applied along with an isolation requirement. This isolation is track-based and requires that the scalar sum of the p T of all tracks in a cone of radius R = 0.2 around the electron be less than 15% of the electron p T .

Measurement selection
To select the W + jets signal, events are required to contain at least one jet with p T > 500 GeV, exactly one muon, no b-tagged jets, a primary vertex and no electrons. Any additional jets with p T > 100 GeV are included in the analysis. The leading jet, defined as the jet with the highest p T , is not necessarily the one closest to the muon. The R distance is always measured with respect to the closest jet. The muon is required to be isolated using both track-based and calorimeter-based isolation criteria. The track isolation requires that the scalar sum of the p T of all tracks in a cone of radius R = 0.2 around the muon be less than 10% of the muon p T . The calorimeter isolation requires that the scalar sum of the p T in all calorimeter cells in a cone of radius R = 0.2 around the muon be less than 40% of the muon p T . Applying these isolation criteria significantly reduces the background from dijet events, where muons mostly originate from heavy-flavour or inflight decays and are non-isolated. The b-tag veto also reduces the background from tt, which generates two b-quarks in their decay, by over 80%, while only 10% of the W + jets signal is rejected. Requirements on missing transverse momentum were not found to improve the signal selection or background rejection. The efficiency of the isolation requirement was studied both in simulated samples and in situ using data events containing high-p T top quarks, and the results from the two studies were in agreement. However, in the extremely collinear region, where the distance between the muon and the closest jet is R < 0.2, the limited size of the event sample did not allow the same conclusion. As a result, events where R < 0.2 are also excluded. This causes approximately 2% of the W + jets signal to be rejected.

Control region definitions and background estimation
For the final state with at least one high-p T jet and a single muon, the dominant background processes that contribute to the signal region are dijets, tt and Z + jets. In addition, there is a small background contribution from diboson production. These are all modelled using the simulated samples described in Section 3.
For each of the three main background processes, a control region utilising an event selection different from the signal region is defined such that most of the events in this control region are from the chosen background. Control Region 1 is enriched in dijets, with a 93% purity of dijet events, by applying the inverse of the signal region isolation selection. It uses events that pass the muon trigger without an isolation requirement and requires the muon to have p T > 38 GeV, as events with a non-isolated muon of lower p T are mostly rejected by the trigger, together with a distance R > 0.2 between the muon and the closest jet. Control Region 2 is enriched in tt, with 91% of events originating from tt production, by requiring at least two b-tagged jets. Control Region 3 is enriched in Z + jets, which constitute 94% of events in this region, by using events with exactly two muons, with both muons passing the signal region isolation. It is further required that the dimuon invariant mass in Control Region 3 satisfies 60 GeV < m μμ < 120 GeV. In this case, the muon with the higher p T is chosen to define R.
Using data from these control regions and the signal region, a scale factor is derived for each main background process and the W + jets signal to correct the normalisation of the MC sample to that observed in data. To ensure the scale factor is not affected by contamination from other backgrounds and the W + jets signal, it is necessary to subtract the MC prediction for the con-tamination from the control region data. As there is a circular dependency in using scaled MC predictions to derive new scalings, an iterative approach is applied. First, the scale factors are derived with the contamination subtracted using the uncorrected normalisations. Then the normalisations are updated with the scale factor corrections and the procedure to derive them is repeated. Since the contamination in each of the regions is quite small, the scale factors converge very rapidly. The dijet sample is scaled by 1.134 ± 0.054, the tt sample is scaled by 0.861 ± 0.061, the Z + jets sample is scaled by 0.705 ± 0.052 and the W + jets sample is scaled by 0.711 ± 0.016. These uncertainties in the scale factors are due to the statistical uncertainty of the data and MC samples and are part of the overall uncertainties in the measurement detailed in Section 6. However, the uncertainty in the W + jets scale factor has no effect on the results of the measurement. After the scale factors are applied, the MC predictions and observed distributions of the distance between the muon and the closest jet for each control region are shown in Fig. 1. The systematic uncertainties shown in Fig. 1 correspond to those described in Section 6.

Definition of observable and correction for detector effects
The estimated background is subtracted from the data in the signal region and the resultant distribution of the distance R between the muon and the closest jet is unfolded using an iterative Bayesian technique [43] to correct for detector effects including both the efficiency of the selection criteria and the resolution of the angular separation between the muon and the nearest jet, where the former effect is dominant. This technique is implemented within the RooUnfold framework [44]. A response matrix derived from MC simulation is used to correct the distribution from detector-level to particle-level. The particle-level prediction from MC simulation is used as an initial prior during the first iteration of the unfolding. Subsequent iterations use the previous iteration's unfolded distribution as a new prior. A single iteration step is used, as this was found to be the optimal choice that minimised the combination of statistical fluctuation and the bias introduced by the prior of unfolded results.
The detector response and the combined efficiency of the trigger, reconstruction and the analysis selection for the W + jets signal is obtained from MC simulation. The fiducial selection applied to MC simulation is similar to the kinematic selection of the analysis. Particle-level jets, built from stable final-state particles (defined as those with a proper lifetime τ corresponding to cτ ≥ 10 mm [45]) excluding muons and neutrinos, must satisfy p T > 100 GeV and |η| < 2.1. Events are required to have at least one particle-level jet with p T > 500 GeV and a particle-level muon with a dressed 2 p T > 25 GeV and |η| < 2.4. No requirements on promptness are applied to the muons or the dressing photons. Any additional muons that pass these requirements cause the event to be rejected. Events where the distance between the muon and the closest jet R < 0.2 are also rejected. Unlike the analysis selection, there are no requirements on b-jets or electrons for the fiducial selection. The unfolding to the fiducial region also corrects for events that do not pass the particle-level selection, but pass the detector-level selection. Events in the fiducial signal region that arise from W → τ ν are also removed so that the cross-section is quoted exclusively for the muon decay channel. 2 Photons that are contained in a cone of size R = 0.1 around the muon are summed and included as part of the muon energy.

Systematic uncertainties
The dominant systematic uncertainties in the cross-section measurement arise from the uncertainties in the jet energy scale and the b-tagging efficiency. For each systematic uncertainty, the selection criteria are re-applied, the control region normalisations are reassessed, and the unfolding procedure is repeated with the quantity under consideration varied by ±1 standard deviation. The average of the up and down variations of the final cross-section measurement are summed in quadrature, as the variations are independent and not correlated. This sum is then used as the full systematic uncertainty. The systematic uncertainties in the measurement, grouped by source, are summarised in Table 1 for the inclusive cross-section, the collinear region (0.2 < R < 2.4) and the back-to-back region ( R > 2.4).
Since the dijet, tt and Z + jets simulated samples are scaled to data in their respective control regions, there is a systematic uncertainty in the scaling that arises from the statistical uncertainty in the data and the MC simulations in these control regions. As the control region for dijets does not have the same kinematic selection as the signal region, there could be some bias due to mismodelling of the dijet kinematics in the simulated sample. An uncertainty accounting for this is derived by varying the kinematic selection of the control region. The uncertainty in the jet energy scale comprises 17 independent components [46]. Six of these are derived from various in situ analyses and two are related to the η intercalibration of the jets.
There are also four components that account for the mismodelling of the p T response with respect to pile-up and three topology components that account for the dependence of the p T -response uncertainty on the relative fractions of jets initiated by light quarks, gluons and b-quarks.
To correct the b-tagging efficiency in simulation to that observed in data, scale factors derived from in situ analyses are applied to the simulated samples [47,48]. These have associated uncertainties. The uncertainties for b-, c-and τ -jets are assessed independently from those for light jets and the uncertainties in the efficiency scale factors are fully anti-correlated with those in the inefficiency scale factors.
In each control region, any disagreement between the R distributions for data and MC simulations is taken as a systematic uncertainty for the R prediction from that specific background in the signal region. This introduces an additional data-driven sys- Table 2 The number of events in the signal region observed in data, along with the composition of these events as predicted by MC simulation, split by the distance between the muon and the closest jet. The dijet, tt and Z + jets backgrounds have been scaled according to their respective control regions. tematic uncertainty to the dijet, tt and Z + jets estimates for the R distribution. Since the diboson background prediction is not constrained by data from a control region, an alternative prediction is obtained from a different simulated sample generated using SHERPA. The difference between these two predictions is taken as an uncertainty in the diboson background estimate.
The systematic uncertainty due to the dependence of the unfolding on the prior signal distribution, as obtained from MC simulations, is evaluated through a data-driven closure test. The simulated signal sample is reweighted at particle-level such that the distribution of the fully simulated detector-level R more closely matches the observed data. This reweighted simulated detector-level distribution is then unfolded and compared with the reweighted particle-level distribution. Differences observed in this comparison are taken as a systematic uncertainty in the unfolding. The uncertainty due to the dependence on the number of unfolding iteration steps was negligible.
Other smaller uncertainty contributions arise from the uncertainty in the integrated luminosity, the uncertainties in the muon momentum scale and resolution, muon reconstruction efficiency and trigger efficiency and the uncertainties in the jet energy resolution [49]. Uncertainties in the electron energy scale and resolution were evaluated but found to be negligible.

Results
The number of events in the signal region observed in data is listed in Table 2, along with the composition of these events as predicted by MC simulation. Numbers are given for the collinear region (0.2 < R < 2.4), the back-to-back region ( R > 2.4), and the inclusive sample. The uncorrected distributions of the reconstructed distance between the muon and the closest jet observed in data and predicted by MC simulations are shown in Fig. 2 for the signal region. In general the distributions agree within the uncertainties, except around R = 2.8 where there is a deficit and around the most collinear region of R < 0.5 where there is a slight excess in the prediction from MC simulations.

Differential cross-section measurement
The differential cross-section of W → μν as a function of R(μ, closest jet), obtained from the unfolded data of the signal region, is shown in Fig. 3. The measured total cross-sections for the inclusive case, in the collinear region and the back-to-back region are also listed in Tables 3-5.
The measurements are compared to several theory predictions.
The ALPGEN+PYTHIA6 W + jets calculation and the normalisation K -factor used for this prediction are described in Section 3 and the quoted uncertainties are the statistical uncertainties. The W + j and jj + weak shower calculation provided by PYTHIA v8.210, described in Section 3, is shown as well. In this case, the W boson      An NNLO QCD calculation, which includes up to O (α 3 S ), for the angular separation between the lepton from the W boson decay and the nearest jet in W + jets events has recently become available [57,58]. This calculation, obtained from Ref. [5], is denoted 'W + ≥ 1 jet N jetti NNLO' here. It uses a new technique based on N-jettiness [59] to split the phase space for the real emission corrections. It relies on the theoretical formalism provided in soft-collinear effective theory. The calculation uses the CT14 NNLO Table 6 Fiducial W + jets cross-sections for the selection criteria of (1) at LO, NLO and NNLO in QCD from Ref. [5]. The uncertainties shown are the scale uncertainties.
are the same as the ones used for the measurement except for the muon pseudorapidity (|η| < 2.5 instead of |η| < 2.4). The effect of this difference in muon pseudorapidity is evaluated using the ALPGEN+PYTHIA6 W + jets sample and a correction factor accounting for this, which is less than 4% across the entire distribution, is applied. The calculated cross-sections obtained at LO, NLO and NNLO without the muon pseudorapidity correction are shown in Table 6. The scale uncertainty decreases from ∼±20% at NLO to +3%/−7% at NNLO.
The comparison of the data to ALPGEN+PYTHIA6 in Fig. 3 shows good shape agreement to within uncertainties, except at very low R, but ALPGEN+PYTHIA6 predicts a significantly higher integrated cross-section. The comparison to PYTHIA8 at high R, where it is dominated by back-to-back W + jets production in which the W boson is balanced by the hadronic recoil system, shows much better agreement. At smaller R, where the collinear process dominates, neither the shape nor the overall cross-section agree. The comparisons to SHERPA+OpenLoops and W + ≥ 1 jet N jetti NNLO show much better agreement across the entire distribution.

Enhancement of the collinear fraction with jet p T
The events in the signal region are further divided into two categories based on the transverse momentum of the leading jet: 500 GeV < p leading jet T < 600 GeV and p leading jet T > 650 GeV. For each of these two categories, the data distribution is unfolded. The 50 GeV gap between the two categories reduces the migration of events from one category to the other during unfolding. The resulting normalised differential W + jets cross-section is shown in Fig. 4. As the leading-jet p T increases, the fraction of events in the lower R (collinear) region increases and the fraction in the higher R (back-to-back W + jets) region decreases. This may be interpreted as an increase in the collinear W emission probability as the jets become more energetic. With higher p T the collinear peak is shifted to smaller R. This is also understood since the mass of the W boson becomes proportionally smaller compared to the energy of the jet. The full measurement results are shown in Fig. 5. The comparison to theory predictions shows results similar to the ones obtained for p leading jet T > 500 GeV in Section 7.1.

Conclusions
The cross-section for W → μν in association with at least one very high transverse momentum jet is measured as a function of the angular distance between the muon from the W boson decay and the closest jet. This measurement utilises data recorded by the ATLAS detector from pp collisions at Comparisons to a variety of MC generators and theoretical calculations show varying levels of agreement. ALPGEN+PYTHIA6 overestimates the total cross-section, whereas PYTHIA8, which is modified to explicitly include the process of W boson emission, disagrees with the measurement in the collinear region ( R < 2.4). On the other hand, agreement with the SHERPA+OpenLoops NLO QCD+EW calculation and the W + ≥ 1 jet N jetti NNLO calculation in Ref. [5] is well within the systematic and statistical uncertainties of the predictions and the measurement.
This measurement has implications for Monte Carlo programs that incorporate real W boson emission, a process which is only just now being probed directly at the energy of the LHC. The rate of this process increases with jet p T and thus also with centre-ofmass energy, and will therefore play a significant role in W + jets measurements at high p T , vector-boson scattering measurements, and even QCD multijet measurements at very large dijet invariant masses where the corrections due to real boson emission are significant.
Lastly, the potential is high for this process to mimic the signatures of a highly Lorentz-boosted top quark. The importance of such signatures in the search for new physics at the LHC necessitates a thorough understanding of processes such as the one measured in detail in this paper. As the physics programmes of the LHC experiments extend into new territories in terms of both the centre-of-mass energy and integrated luminosity, these once rare processes will become a ubiquitous consideration.

Acknowledgements
We thank CERN for the very successful operation of the LHC, as well as the support staff from our institutions without whom ATLAS could not be operated efficiently.
We The crucial computing support from all WLCG partners is acknowledged gratefully, in particular from CERN, the ATLAS Tier-