1 Introduction

Collimated sprays of energetic hadrons, known as jets, are the dominant final-state objects of high-energy proton–proton (\(pp\) ) interactions at the Large Hadron Collider (LHC) located at CERN. They are key ingredients for many physics measurements and for searches for new phenomena. This paper describes the reconstruction of jets in the ATLAS detector [1] using 2012 data. Jets are reconstructed using the anti-\(k_{t}\) [2] jet algorithm, where the inputs to the jet algorithm are typically energy depositions in the ATLAS calorimeters that have been grouped into “topological clusters” [3]. Jet radius parameter values of \(R=0.4\), \(R=0.6\), and \(R=1.0\) are considered. The first two values are typically used for jets initiated by gluons or quarks, except top quarks. The last choice of \(R=1.0\) is used for jets containing the hadronic decays of massive particles, such as \(W\)/\(Z\)/Higgs bosons and top quarks. The same jet algorithm can also be used to form jets from other inputs, such as inner-detector tracks associated with charged particles or simulated stable particles from the Monte Carlo event record.

Calorimeter jets, which are reconstructed from calorimeter energy depositions, are calibrated to the energy scale of jets created with the same jet clustering algorithm from stable interacting particles. This calibration accounts for the following effects:

  • Calorimeter non-compensation  Different energy scales for hadronic and electromagnetic showers.

  • Dead material  Energy lost in inactive areas of the detector.

  • Leakage  Showers reaching the outer edge of the calorimeters.

  • Out-of-calorimeter jet  Energy contributions which are included in the stable particle jet but which are not included in the reconstructed jet.

  • Energy depositions below noise thresholds  Energy from particles that do not form calorimeter clusters or have energy depositions not included in these clusters due to the noise suppression in the cluster formation algorithm.

  • Pile-up  Energy deposition in jets is affected by the presence of multiple \(pp\) collisions in the same \(pp\) bunch crossing as well as residual signals from other bunch crossings.

A first estimate of the jet energy scale (JES) uncertainty of \(5\%{-}9\%\) was based on information available prior to \(pp\) collision data and initial analysis of early data taken in 2010 [4]. An improved jet calibration with an uncertainty evaluated to be about \(2.5\%\) for jets with pseudorapidityFootnote 1\(|\eta |<0.8\) over a wide range of transverse momenta (\(p_{{\text {T}}}\) ) was achieved with the full 2010 dataset using test-beam measurements, single-hadron response measurements, and in situ techniques [5]. A much larger dataset, recorded during the 2011 data-taking period, improved the precision of JES measurements to \(1{-}3\%\) for jets with \(p_{{\text {T}}} >40~{\text {Ge}} {\text {V}}\) within \(|\eta |<2.5\) using a statistical combination of several in situ techniques [6].

This paper describes the derivation of the ATLAS jet calibration and jet energy resolution using the full 2012 \(pp\) collision dataset, which is more than four times larger than the 2011 dataset used for the previous calibration [6]. Due to the increased instantaneous luminosity, the beam conditions in 2012 were more challenging than those in 2011, and the ability to mitigate the effects of additional \(pp\) interactions is of major importance for robust performance, especially for jets with low \(p_{{\text {T}}}\) . The jet calibration is derived using a combination of methods based both on Monte Carlo (MC) simulation and on in situ techniques. The jet energy resolution (JER), which previously was studied using events with dijet topologies [7], is determined using a combination of several in situ JER measurements for the first time. A subset of these jet calibration techniques were subsequently used for \(R=0.4\) jets recorded during the 2015 data-taking period [8], and for \(R=1.0\) jets recorded during the 2015-2016 data-taking period [9].

The outline of the paper is as follows. Section 2 describes the ATLAS detector and the dataset used. The MC simulation framework is presented in Sect. 3, and the jet reconstruction and calibration strategy is summarized in Sect. 4. Section 5 describes the global sequential calibration method, which exploits information from the tracking system (including the muon chambers) and the topology of the energy depositions in the calorimeter to improve the JES uncertainties and the JER. The in situ techniques based on a \(p_{{\text {T}}}\) balance are described in Sects. 68. First, the intercalibration between the central and forward detector, using events with dijet-like topologies, is presented in Sect. 6. The methods based on the \(p_{{\text {T}}}\) balance between a jet and a well-calibrated photon or Z boson are discussed in Sect. 7, while the study of the balance between a high-\(p_{{\text {T}}}\) jet and a system of several low-\(p_{{\text {T}}}\) jets is presented in Sect. 8. The combination of the JES in situ results and the corresponding uncertainties are discussed in Sect. 9, while the in situ combination and the results for the JER are presented in Sect. 10.

2 The ATLAS detector and data-taking conditions

The ATLAS detector consists of an inner tracking detector, sampling electromagnetic and hadronic calorimeters, and muon chambers in a toroidal magnetic field. A detailed description of the ATLAS detector is in Ref. [1].

The inner detector (ID) has complete azimuthal coverage and spans the pseudorapidity range of \(|\eta |<2.5\). It consists of three subdetectors: a high-granularity silicon pixel detector, a silicon microstrip detector, and a transition radiation tracking detector. These are placed inside a solenoid that provides a uniform magnetic field of 2 T. The ID reconstructs tracks from charged particles and determines their transverse momenta from the curvature in the magnetic field.

Jets are reconstructed from energy deposited in the ATLAS calorimeter system. Electromagnetic calorimetry is provided by high-granularity liquid argon (LAr) sampling calorimeters, using lead as an absorber, which are split into barrel (\(|\eta |<1.475\)) and endcap (\(1.375<|\eta |<3.2\)) regions, where the endcap is further subdivided into outer and inner wheels. The hadronic calorimeter is divided into the barrel (\(|\eta |<0.8\)) and extended barrel (\(0.8<|\eta |<1.7\)) regions, which are instrumented with tile scintillator/steel modules, and the endcap region (\(1.5<|\eta |<3.2\)), which uses LAr/copper modules. The forward calorimeter region (\(3.1<|\eta |<4.9\)) is instrumented with LAr/copper and LAr/tungsten modules to provide electromagnetic and hadronic energy measurements, respectively. The electromagnetic and hadronic calorimeters are segmented into layers, allowing a determination of the longitudinal profiles of showers. The electromagnetic barrel, the electromagnetic endcap outer wheel, and tile calorimeters consist of three layers. The electromagnetic endcap inner wheel consists of two layers. The hadronic endcap calorimeter consists of four layers. The forward calorimeter has one electromagnetic and two hadronic layers. There is also an additional thin LAr presampler, covering \(|\eta | <1.8\), dedicated to correcting for energy loss in material upstream of the calorimeters.

The muon spectrometer surrounds the ATLAS calorimeter. A system of three large air-core toroids with eight coils each, a barrel and two endcaps, generates a magnetic field in the pseudorapidity range \(|\eta | < 2.7\). The muon spectrometer measures muon tracks with three layers of precision tracking chambers and is instrumented with separate trigger chambers.

Events are retained for analysis using a trigger system [10] consisting of a hardware-based level-1 trigger followed by a software-based high-level trigger with two levels: level-2 and subsequently the event filter. Jets are identified using a sliding-window algorithm at level-1 that takes coarse-granularity calorimeter towers as input. This is refined with an improved jet reconstruction based on trigger towers at level-2 and on calorimeter cells in the event filter [11].

The dataset consists of \(pp\) collisions recorded from April to December 2012 at a centre-of-mass energy (\(\sqrt{s}\)) of \(8~{\text {Te}} {\text {V}}\). All ATLAS subdetectors were required to be operational and events were rejected if any data quality issues were present, resulting in a usable dataset with a total integrated luminosity of \(20~{\hbox {fb}}^{-1}{}\). The LHC beams were operated with proton bunches organized in bunch trains, with bunch crossing intervals (bunch spacing) of 50 ns. The average number of \(pp\) interactions per bunch crossing, denoted \(\mu \), was typically between 10 and 30 [12].

The typical electron drift time within the ATLAS LAr calorimeters is 450 ns [13]. Thus, it is not possible to read out the full detector signal from one event before the next event occurs. To mitigate this issue, a bipolar shaper [14] is applied to the output, creating signals with a pulse sufficiently short to be read between bunch crossings. After bipolar shaping, the average energy induced by \({\text {pile-up}}\) interactions should be zero in the ideal situation of sufficiently long bunch trains with the same luminosity in each pair of colliding bunches. A bunch-crossing identification number dependent offset correction is applied to account for the finite train length such that the average energy induced by pileup is zero for every crossing. However, fluctuations in \({\text {pile-up}}\) activity, both from in-time and out-of-time collisions, contribute to the calorimeter energy read out of the collision of interest. Multiple methods to suppress the effects of \({\text {pile-up}}\) are discussed in subsequent sections.

3 Simulation of jets in the ATLAS detector

Monte Carlo event generators simulate the type, energy, and direction of particles produced in pp collisions. Table 1 presents a summary of the various event generators used to determine the ATLAS jet calibration. A detailed overview of the MC event generators used in ATLAS analyses can be found in Ref. [15].

Table 1 Summary of the simulated samples used to derive the jet calibration and to assess systematic uncertainties

The baseline simulation samples used to obtain the MC-based jet calibration were produced using Pythia version 8.160 [24]. Pythia uses a \(2\rightarrow 2\) matrix element interfaced with a parton distribution function (PDF) to model the hard process. Additional radiation was modelled in the leading-logarithm approximation using \(p_{{\text {T}}}\) -ordered parton showers. Multiple parton–parton interactions (MPI), also referred to as the underlying event (UE), were also simulated, and modelling of the hadronization process was based on the Lund string model [25].

Separate samples produced using other generators were used to derive the final jet calibration and resolution and associated uncertainties using in situ techniques. The Herwig [26] and Herwig++ [27] event generators use a \(2 \rightarrow 2\) matrix element convolved with a PDF for the hard process just as Pythia8 does, but use angle-ordered parton showers and a different modelling of the UE and hadronization. The Sherpa event generator [28] was used to produce multi-leg \(2\rightarrow N\) matrix elements matched to parton showers using the CKKW [29] prescription. Fragmentation was simulated using the cluster-hadronization model [30], and the UE was modelled using the Sherpa AMISIC model based on Ref. [21]. Samples were also produced using the Powheg Box  [31,32,33,34] software that is accurate to next-to-leading order (NLO) in perturbative QCD. Parton showering and modelling of the hadronization and the UE were provided by either Pythia8 or Herwig, resulting in separate samples referred to as Powheg + Pythia8 and Powheg + Herwig, respectively. Tuned values of the modelling parameters affecting the parton showering, hadronization, and the UE activity were determined for each generator set-up to match various distributions in data as summarized in Table 1 and references therein.

The generated stable particles, defined as those with a lifetime \(\tau \) such that \(c \tau > 10~\hbox {mm}\), were input to the detector simulation that models the particles’ interactions with the detector material. Such particles are used to build jets as explained in Sect. 4. Most MC samples were generated with a full detector simulation of the ATLAS detector [35] based on Geant4 [36], in which hadronic showers are simulated with the QGSP BERT model [37]. Alternative samples were produced using the Atlfast-II (AFII) fast detector simulation based on a simplified modelling of particle interactions with the calorimeter, yielding a factor of ten more events produced for the same CPU time [38]. The output of the detector simulation were detector signals with the same format as those from real data.

\({\text {Pile-up}}\) events, i.e. additional pp interactions that are not correlated with the hard-scatter event of interest, were simulated as minimum-bias events produced with Pythia8 using the AM2 tuned parameter set [23] and the MSTW2008LO PDF [22]. The simulated detector signals from these events were overlaid with the detector signals from the hard-scatter event based on the \({\text {pile-up}}\) conditions of the 2012 data-taking period. \({\text {Pile-up}}\) events were overlaid both in the hard-scatter bunch crossing (in-time \({\text {pile-up}}\) ) and in nearby bunch crossings (out-of-time \({\text {pile-up}}\) ) with the detector signals offset in time accordingly. These out-of-time \({\text {pile-up}}\) signals are overlaid in such a manner as to cover the full read-out window of each of the ATLAS calorimeter sub-detectors. The number of \({\text {pile-up}}\) events to overlay in each bunch crossing was sampled from a Poisson distribution with a mean \(\mu \) corresponding to the expected number of additional \(pp\) collisions per bunch crossing.

4 Overview of ATLAS jet reconstruction and calibration

4.1 Jet reconstruction and preselection

Jets are reconstructed with the anti-\(k_{t}\) algorithm [2] using the FastJet software package [39, 40] version 2.4.3. Jets are formed using different inputs: stable particles from the event generator record of simulated events resulting in truth-particle jets; reconstructed calorimeter clusters, producing calorimeter jets; or inner-detector tracks to form track jets.

The generated stable particles used to define truth-particle jets are required to originate (either directly or via a decay chain) from the hard-scatter vertex, and hence do not include particles from \({\text {pile-up}}\) interactions. Muons and neutrinos are excluded to ensure that the truth-particle jets are built from particles that leave significant energy deposits in the calorimeters.

Calorimeter jets are built from clusters of adjacent calorimeter read-out cells that contain a significant energy signal above noise levels, referred to as topological clusters or topo-clusters. Details of the formation of topo-clusters are provided in Ref. [3]. In its basic definition, a topo-cluster is assigned an energy equal to the sum of the associated calorimeter cell energies calibrated at the electromagnetic scale (EM-scale) [41,42,43,44], which is the basic signal scale accounting correctly for the energy deposited in the calorimeter by electromagnetic showers. The direction (\(\eta \) and \(\phi \)) of a topo-cluster is defined from the centre of the ATLAS detector to the energy-weighted barycentre of the associated calorimeter cells, and the mass is set to zero. Topo-clusters can further be calibrated using the local cell signal weighting (LCW) method [3] designed to give the correct scale for charged pions produced in the interaction point. The LCW method reduces fluctuations in energy due to the non-compensating nature of the ATLAS calorimeters, out-of-cluster energy depositions, and energy deposited in dead material, improving the energy resolution of the reconstructed jets in comparison with jets reconstructed using EM-scale clusters [5].

The calorimeter jet four-momentum directly after jet finding is referred to as the constituent scale four-momentum \(p^{\text {const}}\) and is defined as the sum of the constituent topo-cluster four-momenta \(p^{\text {topo}}_i\):

$$\begin{aligned} p^{\,\text {const}}= & {} \left( E^{\,\text {const}},{\mathbf {p}}^{~\text {const}} \right) = \sum _{i=1}^{N_{\text {const}}} p^{\text {topo}}_i \nonumber \\= & {} \left( \sum _{i=1}^{N_{\text {const}}} E^{\text {topo}}_i, \sum _{\text {i=1}}^{N_{\text {const}}} {\mathbf {p}}^{~\text {topo}}_i \right) . \end{aligned}$$
(1)

The constituent scales considered in this paper are EM or LCW depending on the calibration of the constituent topo-clusters. At this stage, all angular coordinates are defined from the centre of the ATLAS detector, and the detector pseudorapidity \(\eta _{\text {det}}{}\equiv \eta ^{\text {const}}\) and detector azimuth \(\phi _{\text {det}}{} \equiv \phi ^{\text {const}}\) are recorded for each jet. The most common choice in ATLAS analyses of the anti-\(k_{t}\) radius parameter is \(R=0.4\), but \(R=0.6\) is also used frequently. Analyses that search for hadronic decays of highly boosted (high \(p_{{\text {T}}}\) ) massive objects often use larger values of R than these since the decay products of the boosted objects can then be contained within the resulting large-R jets. Due to the larger radius parameter, this class of jets spans a larger solid angle and hence are more sensitive to \({\text {pile-up}}\) interactions than jets with \(R\le 0.6\). To mitigate the influence of \({\text {pile-up}}\) and hence improve the sensitivity of the analyses, several jet grooming algorithms have been designed and studied within ATLAS [45,46,47,48]. In this paper, the trimming algorithm [49] (one type of grooming method) is applied to anti-\(k_{t}\) jets built with \(R=1.0\). This grooming procedure starts from the constituent topo-clusters of a given \(R=1.0\) anti-\(k_{t}\) jet to create subjets using the \(k_t\) jet algorithm [50] with radius parameter \(R_{\text {sub}}=0.3\). The topo-clusters belonging to subjets with \(f_{\text {cut}} \equiv p_{{\text {T}}} {}^{\text {subjet}} / p_{{\text {T}}} {}^{\text {jet}} < 0.05\) are discarded, and the jet four-momentum is then recalculated from the remaining topo-clusters.

For each in situ analysis, jets within the full calorimeter acceptance \(|\eta _{\text {det}}{}|<{4.5}\) with calibrated \(p_{{\text {T}}} >8~{\text {Ge}} {\text {V}}\) (\(p_{{\text {T}}} >25~{\text {Ge}} {\text {V}}\) in case of the multijet analysis) are considered. These \(p_{{\text {T}}}\) thresholds do not bias the kinematic region of the derived calibration, which is \(p_{{\text {T}}} \ge 17~{\text {Ge}} {\text {V}}\) (\(p_{{\text {T}}} \ge 300~{\text {Ge}} {\text {V}}\) for the multijet analysis). The jets are also required to satisfy “Loose” quality criteria, designed to reject fake jets originating from calorimeter noise bursts, non-collision background, or cosmic rays [6], and to fulfil a requirement designed to reject jets originating from \({\text {pile-up}}\) vertices. The latter criterion is based on the jet vertex fraction (\({\mathrm {JVF}}\)), computed as the scalar sum \(\sum p_{{\text {T}}} ^{{\mathrm {track}}}\) of the tracks matched to the jet that are associated with the hard-scatter primary vertex divided by \(\sum p_{{\text {T}}} ^{{\mathrm {track}}}\) using all tracks matched to the jet (see Ref. [51] for further details). The default hard-scatter vertex is the primary vertex with the largest \(\sum _{\text {tracks}}{p_{\text {T}}^2}\), but other definitions are used for certain analyses [52]. Each jet with \(p_{{\text {T}}} <50~{\text {Ge}} {\text {V}}\) within the tracking acceptance \(|\eta _{\text {det}}{}|<{2.4}\) is required to have \({\mathrm {JVF}}> 0.25\), which effectively rejects \({\text {pile-up}}\) jets in ATLAS 2012 pp data [51].

Jets with a radius parameter of \(R=0.4\) or \(R=0.6\) have been built using both EM- and LCW-scale topo-clusters as inputs. These four jet reconstruction options have been studied in similar levels of detail, but for brevity the paper will focus on presenting the results for jets built using EM-scale topo-clusters with a radius parameter of \(R=0.4\), which better demonstrates the importance of the GS calibration as described in Sect. 5. Key summary plots will present the results for all four jet definitions thus showing the final performance of each of the different options. In contrast, jets with a radius parameter of \(R=1.0\) have only been studied in detail using LCW-scale topo-clusters as inputs. This choice is motivated by the common usage of such jets for tagging of hadronically-decaying particles, where the energy and angular distribution of constituents within the jet is important. For such a situation, LCW topo-clusters are advantageous because they flatten the detector response, and thus the tagging capabilities are less impacted by where a given energy deposit happens to be within the detector.

4.2 Matching between jets, jet isolation, and calorimeter response

To derive a calibration based on MC simulation, it is necessary to match a truth-particle jet to a reconstructed jet. Two methods are used for this: a simple, angular matching as well as a more sophisticated approach known as jet ghost association [53]. For the angular matching, a \(\Delta R < 0.3\) requirement is used, where \(\Delta R\) is the pseudorapidity and azimuthal angle separation between the two jets added in quadrature, i.e. \(\Delta R = \Delta \eta \,\oplus \, \Delta \phi \equiv \sqrt{(\Delta \eta )^2 \,+\, (\Delta \phi ) ^2}\). The angular criterion \(\Delta R < 0.3\) is chosen to be smaller than the jet radius parameter used for ATLAS analyses (\(R=0.4\) or larger) but much larger than the jet angular resolution (Sect. 4.3.2). Jet matching using ghost association treats each MC simulated particle as a ghost particle, which means that they are assigned an infinitesimal \(p_{{\text {T}}}\) , leaving the angular coordinates unchanged. The calorimeter jets can now be built using both the topo-clusters and ghost particles as input. Since the ghost particles have infinitesimal \(p_{{\text {T}}}\) , the four-momenta of the reconstructed jets will be identical to the original jets built only from topo-clusters, but the new jets will also have a list of associated truth particles for any given reconstructed jet. A truth-particle jet is matched to a reconstructed jet if the sum of the energies of the truth-particle jet constituents which are ghost-associated with the reconstructed jet is more than 50% of the truth-particle jet energy, i.e. the sum of the energies of all constituents. This ensures that only one reconstructed jet is matched to any given truth-particle jet. If several truth-particle jets fulfil the matching requirement, the truth-particle jet with the largest energy is chosen as the matched jet. Matching via ghost association results in a unique match for each truth-particle jet and hence performs better than the simple angular matching in cases where several jets have small angular separation from each other.

The simulated jet energy response is defined by

$$\begin{aligned} {\mathcal {R}}_{E} = \left\langle \frac{E_{\text {reco}}}{E_{\text {truth}}}\right\rangle , \end{aligned}$$

where \(E_{\text {reco}}\) is the reconstructed energy of the calorimeter jet, \(E_{\text {truth}}\) is the energy of the matching truth-particle jet, and the brackets denote that \({\mathcal {R}}_E\) is defined from the mean parameter of a Gaussian fit to the response distribution \(E_{\text {reco}}/E_{\text {truth}}\). The \(p_{{\text {T}}}\) and mass responses are defined analogously as the Gaussian means \(\langle p_{{\text {T}}} {}_{\text {,reco}} / p_{{\text {T}}} {}_{\text {,truth}}\rangle \) and \(\langle m_{\text {reco}}/m_{\text {truth}}\rangle \) of the reconstructed quantity divided by that of the matching truth-particle jet. When studying the jet response for a population of jets, both the reconstructed and the truth-particle jets are typically required to fulfil isolation requirements. For the analyses presented in this paper, reconstructed jets are required to have no other reconstructed jet with \(p_{{\text {T}}} {}>7~{\text {Ge}} {\text {V}}\) within \(\Delta R < 1.5R\), where R is the anti-\(k_{t}\) jet radius parameter used. Truth-particle jets are similarly required to have no jets with \(p_{{\text {T}}} {}>7~{\text {Ge}} {\text {V}}\) within \(\Delta R < 2.5 R\). After requiring the particle and reconstructed jets to be isolated, the jet energy response distributions for jets with fixed \(E_{\text {truth}}\) and \(\eta \) have nearly Gaussian shapes, and \({\mathcal {R}}_E\) and the jet resolution \(\sigma _{\mathcal {R}}\) are defined as the mean and width parameters of Gaussian fits to these distributions, respectively. For all results presented in this paper, the mean jet response is defined from the mean parameter of a fit to a jet response or momentum balance distribution as appropriate rather than the mean or median of the underlying distribution, as the fit mean is found to be significantly more robust against imperfect modelling of the tails of the underlying distribution.

Fig. 1
figure 1

Overview of the ATLAS jet calibration described in this paper. All steps are derived and applied separately for jets built from EM-scale and LCW calibrated calorimeter clusters, except for the global sequential calibration, which is only partially applied to LCW-jets (Sect. 5). The notations EM + JES and LCW + JES typically refer to the fully calibrated jet energy scale; however, in the sections of this paper that detail the derivations of the GS and the in situ corrections, these notations refer to jets calibrated by all steps up to the correction that is being described

4.3 Jet calibration

An overview of the ATLAS jet calibration applied to the \(8~{\text {Te}} {\text {V}}\) data is presented in Fig. 1. This is an extension of the procedure detailed in Ref. [6] that was applied to the \(7~{\text {Te}} {\text {V}}\) data collected in 2011. The calibration consists of five sequential steps. The derivation and application of the first three calibration steps are described in this section, while the global sequential calibration (GS) is detailed in Sect. 5, and the relative in situ correction and the associated uncertainties are described in Sects. 69.

4.3.1 Jet origin correction

The four-momentum of the initial jet is defined according to Eq. (1) as the sum of the four-momenta of its constituents. As described in Sect. 4.1, the topo-clusters have their angular directions \((\eta ,\phi )\) defined from the centre of the ATLAS detector to the energy-weighted barycentre of the cluster. This direction can be adjusted to originate from the hard-scatter vertex of the event. The jet origin correction first redefines the \((\eta ,\phi )\) directions of the topo-clusters to point to the selected hard-scatter vertex, which results in a updated set of topo-cluster four-momenta. The origin-corrected calorimeter jet four-momentum \(p^{\text {orig}}\) is the sum of the updated topo-cluster four-momenta,

$$\begin{aligned} p^{\text {orig}} = \sum _{i=1}^{N_{\text {const}}} p^{\text {topo,orig}}_i. \end{aligned}$$

Since the energies of the topo-clusters are not affected, the energy of the jet also remains unchanged. Figure 2 presents the impact of the jet origin correction on the jet angular resolution by comparing the axis of the calorimeter jet (\(\eta _{\mathrm {reco}},\phi _{\mathrm {reco}}\)) with the axis of the matched truth-particle jet (\(\eta _{\mathrm {truth}},\phi _{\mathrm {truth}}\)). A clear improvement can be seen for the pseudorapidity resolution, while no change is seen for the azimuthal resolution. This is expected as the spread of the beamspot is significantly larger along the beam axis (\(\sim \)50 mm) than in the transverse plane (\(\ll 1~\hbox {mm}\)).

4.3.2 \({\text {Pile-up}}\) correction

The reconstruction of the jet kinematics is affected by \({\text {pile-up}}\) interactions. To mitigate these effects, the contribution from \({\text {pile-up}}\) is estimated on an event-by-event and jet-by-jet basis as the product of the event \(p_{{\text {T}}}\) -density \(\rho \) [53] and the jet area A in \((y,\phi )\)-space, where y is the rapidity of the jet [54]. The jet area is determined with the FastJet 2.4.3 program [39, 40] using the active-area implementation, in which the jets are rebuilt after adding randomly distributed ghost particles with infinitesimal \(p_{{\text {T}}}\) and randomly selected y and \(\phi \) from uniform distributions. The active area is estimated for each jet from the relative number of associated ghost particles (Sect. 4.2). As can be seen in Fig. 3a, the active area for a given anti-\(k_{t}\) jet tends to be close to \(\pi R^2\). The event \(p_{{\text {T}}}\) -density \(\rho \) is estimated event-by-event by building jets using the \(k_t\) jet-finding algorithm [50] due to its tendency to naturally include uniform soft background into jets [53]. Resulting \(k_t\) jets are only considered within \(|\eta |<2\) to remain within the calorimeter regions with sufficient granularity [51]. No requirement is placed on the \(p_{{\text {T}}}\) of the jets, and the median of the \(p_{{\text {T}}} {}/A\) distribution is taken as the value of \(\rho \). The median is used to reduce the sensitivity of the method to the hard-scatter activity in the tails. The \(\rho \) distributions of events with average interactions per bunch crossing \(\mu \) in the narrow range of \(20<\mu <21\) and several fixed numbers of primary vertices \(N_{\text {PV}}\) are shown in Fig. 3b. It can be seen that \(\rho \) increases with \(N_{\text {PV}}\) as expected, but for a fixed \(N_{\text {PV}}\), \(\rho \) still has sizeable event-by-event fluctuations. A typical value of the event \(p_{{\text {T}}}\) -density in the 2012 ATLAS data is \(\rho = 10~{\text {Ge}} {\text {V}}\), which for a \(R=0.4\) jet corresponds to a subtraction in jet \(p_{{\text {T}}}\) of \(\rho \, A \approx 5~{\text {Ge}} {\text {V}}\).

Fig. 2
figure 2

Jet angular resolution as a function of transverse momentum for anti-\(k_{t}\) jets with \(R = 0.4\). The resolutions are defined by the spread of the difference between the reconstructed jet axis (\(\eta _{\mathrm {reco}},\phi _{\mathrm {reco}}\)) and the axis of the matched truth-particle jet (\(\eta _{\mathrm {truth}},\phi _{\mathrm {truth}}\)) (see Sect. 4.2 for matching details) in simulated events and are shown both with (circles) and without (triangles) the jet origin correction, which adjusts the direction of the reconstructed jet to point to the hard-scatter vertex instead of the geometrical centre of the detector

Fig. 3
figure 3

a Ratio of the jet active area to \(\pi R^2\), where R is the jet radius parameter and b the event \(p_{{\text {T}}}\) -density \(\rho \). The jet area ratio is shown separately for \(R=0.4\) and \(R=0.6\) jets reconstructed with the anti-\(k_{t}\) algorithm, and \(\rho \) is shown for different numbers of reconstructed primary vertices \(N_{\text {PV}}\) in events with average number of pp interactions in the range \(20\le \mu {}<21\)

Fig. 4
figure 4

Jet energy and mass responses as a function of \(\eta _{\text {det}}\) for different truth-particle jet energies. The energy responses \({\mathcal {R}}_E\) for anti-\(k_{t}\) jets with \(R=0.4\) at the a EM scale and the b LCW scale and c for trimmed anti-\(k_{t}\)  \(R=1.0\) jets are presented. Also, d the jet mass response \({\mathcal {R}}_m\) for the latter kind of jets is given

After subtracting the \({\text {pile-up}}\) contribution based on \(\rho \, A\), the pileup dependence of \(p_{{\text {T}}} ^{\text {jet}}\) is mostly removed, especially within the region where the value of \(\rho \) is derived. However, the value of \(p_{{\text {T}}} ^{\text {jet}}\) has a small residual dependence on \(N_{\text {PV}}\) and \(\mu \), particularly in the region beyond where \(\rho \) is derived and where the calorimeter granularity changes. To mitigate this, an additional correction is derived, parameterized in terms of \(N_{\text {PV}}\) and \(\mu \), which is the same approach and parameterization as was used for the full \({\text {pile-up}}\) correction of the ATLAS 2011 jet calibration [6]. A typical value for this correction is \(\pm 1~{\text {Ge}} {\text {V}}\) for jets in the central detector region. The full \({\text {pile-up}}\) correction to the jet \(p_{{\text {T}}}\) is given by

$$\begin{aligned} p_{{\text {T}}} {} \, \mapsto \, p_{{\text {T}}} {} - \rho \, A - \alpha \, ( N_{\text {PV}} - 1) - \beta \,\mu , \end{aligned}$$
(2)

where the \(\alpha \) and \(\beta \) parameters depend on jet pseudorapidity and the jet algorithm, and are derived from MC simulation. Further details of this calibration, including evaluation of the associated systematic uncertainties, are in Ref. [51]. No \({\text {pile-up}}\) corrections are applied to the trimmed large-\(R\) jets since this is found to be unnecessary after applying the trimming procedure.

4.3.3 Monte Carlo-based jet calibration

After the origin and \({\text {pile-up}}\) corrections have been employed, a baseline jet energy scale calibration is applied to correct the reconstructed jet energy to the truth-particle jet energy. This calibration is derived in MC-simulated dijet samples following the same procedure used in previous ATLAS jet calibrations [5, 6]. Reconstructed and truth-particle jets are matched and required to fulfil the isolation criteria as described in Sect. 4.2. The jets are then subdivided into narrow bins of \(\eta _{\text {det}}\) of the reconstructed jet and energy of the truth-particle jet \(E_{\text {truth}}\), and \({\mathcal {R}}_E\) is determined for each such bin from the mean of a Gaussian fit (Sect. 4.2). The average reconstructed jet energy \(\langle E_{\text {reco}}\rangle \) (after \({\text {pile-up}}\) correction) is also recorded for each such bin. A calibration function \(c_{\text {JES,1}}(E_{\text {reco}})=1/{\mathcal {R}}_1(E_{\text {reco}})\) is determined for each \(\eta _{\text {det}}\) bin by fitting a smooth function \({\mathcal {R}}_1(E_{\text {reco}})\) to a graph of \({\mathcal {R}}_E\) versus \(\langle E_{\text {reco}}\rangle \) measurements for all \(E_{\text {truth}}\) bins within the given \(\eta _{\text {det}}\) bin. After applying this correction (\(E_{\text {reco}} \mapsto c_{\text {JES,1}}\,E_{\text {reco}}\)) and repeating the derivation of the calibration factor, the jet response does not close perfectly. The derived calibration factor from the second iteration \(c_{\text {JES,2}}\) is close to but not equal to unity. The calibration improves after applying three such iterative residual corrections \(c_{{\text {JES,}}i}\) (\(i\in \{2,3,4\}\)) such that the final correction factor \(c_{\text {JES}} = \prod _{i=1}^4 c_{{\text {JES}},i}\) achieves a jet response close to unity for each \((E_{\text {truth}},\eta _{\text {det}}{})\) bin.

Fig. 5
figure 5

Jet energy, \(p_{{\text {T}}}\) , and mass response after the MC-based jet calibration has been applied for \(R=0.4\) and \(R=1.0\) anti-\(k_{t}\) jets reconstructed from LCW calibrated topo-clusters

For the large-\(R\) jets (trimmed anti-\(k_{t}\)  \(R=1.0\)), a subsequent jet mass calibration is also applied, derived analogously to the energy calibration. Figure 4 shows the energy and jet mass responses for jets with \(R=0.4\) and \(R=1.0\). Jets reconstructed from LCW-calibrated topo-clusters have a response closer to unity than jets built from EM-scale topo-clusters. Figure 5 shows the jet E, \(p_{{\text {T}}}\) , and m response plots after the application of the MC-based jet calibration. Good closure is demonstrated across the pseudorapidity range, but there is some small non-closure for low-\(p_{{\text {T}}}\) jets primarily due to imperfect fits arising from the non-Gaussian energy response and threshold effects.

A small, additive correction \(\Delta \eta \) is also applied to the jet pseudorapidity to account for biased reconstruction close to regions where the detector technology changes (e.g. the barrel–endcap transition region). The magnitude of this correction is very similar to that of the previous calibrations (Figure 11 of Ref. [5]) and can reach values as large as 0.05 near the edge of the forward calorimeters around \(|\eta |=3\), but is typically much smaller in the well-instrumented detector regions.

4.4 Definition of the calibrated jet four momentum

For small-\(R\) jets, i.e. jets built with a radius parameter of \(R=0.4\) or \(R=0.6\), the fully calibrated jet four-momentum is specified by

$$\begin{aligned} \left( E,\eta ,\phi ,m\right) = \left( \,c_{\text {calib}} \, E^{\text {orig}},\,\eta ^{\text {orig}} \,+\, \Delta \eta ,\,\phi ^{\text {orig}} ,\, c_{\text {calib}} \, m^{\text {orig}}\right) , \nonumber \\ \end{aligned}$$
(3)

where the quantities denoted “orig” are the jet four-vector after the origin correction discussed in Sect. 4.3.1, \(\Delta \eta \) is the MC-based pseudorapidity calibration reported in Sect. 4.3.3, and \(c_{\text {calib}}\) is a four-momentum scale factor that combines the other calibration steps:

$$\begin{aligned} c_{\text {calib}} = {\left\{ \begin{array}{ll} c_{\text {PU}} \cdot c_{\text {JES}} \cdot c_{\text {GS}} \cdot c_\eta \cdot c_{\text {abs}} &{} {\text {for data}} \\ c_{\text {PU}} \cdot c_{\text {JES}} \cdot c_{\text {GS}} &{} {\text {for MC simulation}}. \end{array}\right. } \end{aligned}$$
(4)

Here, the \({\text {pile-up}}\) correction factor is defined as

$$\begin{aligned} c_{\text {PU}} = \frac{p_{{\text {T}}} {} - \rho \ A - \alpha (N_{\text {PV}}-1) - \beta \mu }{p_{{\text {T}}}} \end{aligned}$$

in accordance with Eq. (2) (\(p_{{\text {T}}} \mapsto c_{\text {PU}}\,p_{{\text {T}}} \)), \(c_{\text {JES}}\) is derived as explained in Sect. 4.3.3, \(c_{\text {GS}}\) is the global sequential calibration that is discussed in Sect. 5, and the pseudorapidity intercalibration \(c_\eta \) and the absolute in situ calibration \(c_{\text {abs}}\) are detailed in Sects. 69. As given in Eq. (4), the MC-derived calibrations \(c_{\text {JES}}\) and \(c_{\text {GSC}}\) correct simulated jets to the truth-particle jet scale, but jets in data need the in situ corrections \(c_\eta \) and \(c_{\text {abs}}\) to reach this scale. JES systematic uncertainties are evaluated for the in situ terms.

The calibration procedure is slightly different for the large-\(R\) jets used in this paper (Sect. 4.1). These jets do not receive any origin correction or global sequential calibration as the precision needs of the overall scale are not the same as for \(R=0.4\) and \(R=0.6\) jets. Further, no \({\text {pile-up}}\) correction is applied since the trimming algorithm detailed in Sect. 4.1 mitigates the \({\text {pile-up}}\) dependence. However, large-\(R\) jets do receive a MC-derived jet mass calibration \(c_{\text {mass}}\). The calibrated large-\(R\) jet four-momentum is given by

$$\begin{aligned} \left( E,\eta ,\phi ,m\right) = \left( \,c_{\text {JES}} \, E^{\text {const}},\,\eta ^{\text {const}} \,+\, \Delta \eta ,\,\phi ^{\text {const}} ,\, c_{\text {mass}} \, m^{\text {const}}\right) . \nonumber \\ \end{aligned}$$
(5)

By expressing the jet transverse momentum in terms of energy, mass, and pseudorapidity, it can be seen that all calibration terms of Eqs. (3) and (5) affect \(p_{{\text {T}}}\) , for example

$$\begin{aligned} p_{{\text {T}}} = \frac{E \ominus m}{\cosh {\eta }} = \frac{c_{\text {JES}} \, E^{\text {const}} \ominus c_{\text {mass}} \, m^{\text {const}} }{\cosh \left( \eta ^{\text {const}} \,+\, \Delta \eta \right) }, \end{aligned}$$

where the symbol \(\ominus \) denotes subtraction in quadrature, i.e. \(a \ominus b \equiv \sqrt{a^2-b^2}\).

5 Global sequential calibration

The global sequential (GS) calibration scheme exploits the topology of the energy deposits in the calorimeter as well as tracking information to characterize fluctuations in the jet particle content of the hadronic shower development. Correcting for such fluctuations can improve the jet energy resolution and reduce response dependence on the so-called “jet flavour”, meaning dependence on the underlying physics process in which the jet was produced. Jets produced in dijet events tend to have more constituent particles, a wider transverse profile and a lower calorimeter energy response than jets with the same \(p_{{\text {T}}}\) and \(\eta \) produced in the decay of a W boson or in association with a photon (\(\gamma \) + jet) or Z boson (Z + jet). This can be attributed to differences in fragmentation between “quark-initiated” and “gluon-initiated” jets. The GS calibration also exploits information related to the activity in the muon chamber behind uncontained calorimeter jets, for which the reconstructed energy tends to be smaller with a degraded resolution. The calibration is applied in sequential steps, each designed to flatten the jet energy response as a function of a jet property without changing the mean jet energy.

5.1 Description of the method

Any variable x that carries information about the jet response can be used for the GS calibration. A multiplicative correction to the jet energy measurement is derived by inverting the jet response as a function of this variable: \(c(x) = k/R(x)\), where the constant k is chosen to ensure that the average energy is not affected by the calibration, and the average jet response R(x) is determined using MC simulation as described in Sect. 4.2. After a successful application, the jet response should no longer depend on x. As a result, the spread of reconstructed jet energy is reduced, thus improving the resolution.

Each correction is performed separately in bins of \(\eta _{\text {det}}\), in order to account for changes in the jet \(p_{{\text {T}}}\) response in different detector regions and technologies. The corrections are further parameterized as a function of \(p_{{\text {T}}}\) and jet property x: \(c({p_{{\text {T}}}},x)\), except for the correction for uncontained calorimeter jets, which is constructed as a function of jet energy E and the logarithm of the number of muon segments reconstructed in the muon chambers behind the jet: \(c(E,\log N_{\mathrm {segments}})\). The uncontained calorimeter jet correction is constructed using the jet E rather than the \(p_{{\text {T}}}\) to better represent the probability of a jet penetrating the full depth of the calorimeter, which depends on \(\log E\). The two-dimensional calibration function is constructed using a two-dimensional Gaussian kernel [6] for which the kernel-width parameters are chosen to capture the shape of the response across \(\eta _{\text {det}}\) and \(p_{{\text {T}}}\), and at the same time provide stability against statistical fluctuations.

Several variables can be used sequentially to achieve the optimal resolution. The jet \(p_{{\text {T}}}\) after N GS calibration steps is given by the initial jet \(p_{{\text {T}}}\) multiplied by the product of the N corrections:

$$\begin{aligned}&p_{\text {T}}^{\text {GS}} = p_{{\text {T}},0}\,c_{\mathrm {GS}} = p_{{\text {T}},0}\,\prod \nolimits _{j=1}^{N}c_j(\,p_{{\mathrm {T}},j-1},\,x_j\,), \nonumber \\&p_{{\text {T}},i} = p_{{\text {T}},i-1} \,c_i(p_{{\text {T}},i-1},x_i), \end{aligned}$$
(6)

where \(p_{{\text {T}},0}\) is the jet \(p_{{\text {T}}} \) prior to the GS calibration. Hence, when deriving correction j, one needs to start by calibrating the jets with the previous \(j-1\) correction factors. This method assumes there is little to gain from non-linear correlations of the variables used and this has been demonstrated in simulation.

5.2 Jet observables sensitive to the jet calorimeter response

The GS calibration relies on five jet properties that were identified empirically to have a significant effect on the jet energy response. This empirical study was conducted primarily using EM jets, while a reduced scan was performed for LCW jets given that they already exploit some of the following variables as part of the LCW procedure. Two of the variables characterize the longitudinal shower structure of a jet, namely the fractions of energy deposited in the third electromagnetic calorimeter layer, \(f_{{\text {LAr}}3}\)Footnote 2, and in the first hadronic Tile calorimeter layer, \(f_{{\text {Tile}}0}\). These fractions are defined according to

$$\begin{aligned} f_{{\text {LAr}}3}= E^{{\text {LAr}}3}_{\mathrm {EM}} {\big /} E^{\mathrm {jet}}_{\mathrm {EM}},&{\mathrm {and}}~ f_{{\text {Tile}}0}= E^{{\text {Tile}}0}_{\mathrm {EM}} {\big /} E^{\mathrm {jet}}_{\mathrm {EM}}, \end{aligned}$$
(7)

where the subscript EM refers to the electromagnetic scale. The next two of the five jet properties rely on reconstructed tracks from the selected primary vertex that are matched to the calorimeter jets using ghost association (Sect. 4.2). The tracks are required to fulfil quality criteria relating to their impact parameter and the number of hits in the different inner-detector layers, and to have \(p_{{\text {T}}} >1~{\text {Ge}} {\text {V}}\) and \(|\eta |<2.5\). The track-based observables are the number of tracks associated with a given jet \(n_{\mathrm {trk}}\), and the jet width \({\mathcal {W}}_{\mathrm {trk}}\) defined as

$$\begin{aligned} {\mathcal {W}}_{\mathrm {trk}}= \sum _{i=1}^{N_{\text {trk}}}p_{{\text {T}},i}\,\Delta R(i,{\text {jet}}) {\big /} \sum _{i=1}^{N_{\text {trk}}}p_{{\text {T}},i}, \end{aligned}$$
(8)

where \(N_{\text {trk}}\) are the number of tracks associated with the jet, \(p_{{\text {T}},i}\) is the \(p_{{\text {T}}}\) of the \(i{\text {th}}\) track, and \(\Delta R(i,{\text {jet}})\) is the \(\Delta R\) distance in \((\eta ,\phi )\)-space between the \(i{\text {th}}\) track and the calorimeter jet axis. The jet width \({\mathcal {W}}_{\mathrm {trk}}\) quantifies the transverse structure of the jet, which is sensitive to the “jet flavour”. The final variable used in the GS calibration is \(N_{\mathrm {segments}}\), the number of muon segments behind the jet, which quantifies the activity in the muon chambers. Muon segments are partial tracks constructed from hits in the muon spectrometer chambers [55], and are matched to the jet of interest in two stages. Based on jets built using anti-\(k_{t}\) with \(R=0.6\), \(N_{\mathrm {segments}}\) is defined by the number of matching muon segments within a cone of size \(\Delta R=0.4\) around the jet axis. For anti-\(k_{t}\)  \(R=0.4\) jets, the closest \(R=0.6\) jet is found (fulfilling \(\Delta R<0.3\)), and \(N_{\mathrm {segments}}\) is assigned to the \(R=0.4\) jets according to the corresponding value for the \(R=0.6\) jet.

Fig. 6
figure 6

Normalized distributions of \(f_{{\text {Tile}}0}\), \(f_{{\text {LAr}}3}\), \(n_{\mathrm {trk}}\), and \({\mathcal {W}}_{\mathrm {trk}}{}\) for jets \(|\eta _{\text {det}}|<0.6\) in dijet events with \({80}\,{{\text {Ge}} {\text {V}}}<p_{{\text {T}}} ^{\text {avg}}< {110}\,{{\text {Ge}} {\text {V}}}\) in data (filled circles) and Pythia8 MC simulation with both full (empty circles) and fast (empty squares) detector simulation. All jets are reconstructed with anti-\(k_{t}\) \(R=0.4\) and calibrated with the EM + JES scheme. The quantity \(p_{{\text {T}}} ^{\text {avg}}\) is the average \(p_{{\text {T}}}\) of the leading two jets in an event, and hence represent the \(p_{{\text {T}}}\) scale of the jets being probed. \(N_{\mathrm {segments}}\) is not shown since the vast majority of jets in this \(p_{{\text {T}}}\) range have \(N_{\mathrm {segments}}= 0\)

Figures 6 and 7 show distributions comparing data with MC simulations for \(f_{{\text {Tile}}0}\), \(f_{{\text {LAr}}3}\), \(n_{\mathrm {trk}}\), \({\mathcal {W}}_{\mathrm {trk}}\) and \(N_{\mathrm {segments}}\) for jets with \(|\eta _{\text {det}}|<0.6\) produced in dijet events selected as described in Sect. 6.3. Predictions are provided using the default Pythia8 sample with full detector simulation from which the GS calibration is derived, and also using the AFII fast simulation, which is often used in physics analyses (Sect. 3). For the AFII detector simulation, there is no complete implementation of the muon segments produced behind high-energy uncontained jets. Therefore, this correction is not applied to AFII samples, and no AFII prediction is provided in Fig. 7e. It can be seen that the simulation predicts the general shapes of the data, although there are visible differences. Similar results are found in the other \(\eta _{\text {det}}\) regions. Disagreements in the distributions of the jet properties have little impact on the GS calibration performance as long as the response dependence R(x) of the jet properties x is well described by the simulation (Sect. 5.6).

Fig. 7
figure 7

Normalized distributions of \(f_{{\text {Tile}}0}\), \(f_{{\text {LAr}}3}\), \(n_{\mathrm {trk}}\), \({\mathcal {W}}_{\mathrm {trk}}{}\) and \(N_{\mathrm {segments}}\) for jets \(|\eta _{\text {det}}|<0.6\) in dijet events with \({600}\,{{\text {Ge}} {\text {V}}}<p_{{\text {T}}} ^{\text {avg}}< {800}\,{{\text {Ge}} {\text {V}}}\) in the data (filled circles) and Pythia8 MC simulation with both full (empty circles) and fast (empty squares) simulation. All jets are reconstructed with anti-\(k_{t}\) \(R=0.4\) and calibrated with the EM + JES scheme. The quantity \(p_{{\text {T}}} ^{\text {avg}}\) is the average \(p_{{\text {T}}}\) of the leading two jets in an event, and hence represent the \(p_{{\text {T}}}\) scale of the jets being probed

5.3 Derivation of the global sequential jet calibration

The jet observables used for the GS calibration and their order of application are summarized in Table 2. The first four corrections are determined separately in \(\eta _{\text {det}}\)-bins of width 0.1 and are parameterized down to \(p_{{\text {T}}} {}=15~{\text {Ge}} {\text {V}}\). For jets at the LCW + JES scale, only the tracking and uncontained calorimeter jets corrections are applied since the LCW calibration already takes into account shower shape information. No further improvement in resolution is thus achieved through the use of \(f_{{\text {Tile}}0}\) and \(f_{{\text {LAr}}3}\) for LCW jets.

The calorimeter response for EM + JES calibrated anti-\(k_{t}\) \(R=0.4\) jets with \(p_{{\text {T}}} ^{{\mathrm {truth}}}\) in three representative intervals is presented as a function of the different jet property variables used by the GS calibration in Fig. 8. For all properties, a strong dependence of the response as a function of the property is observed. The \(n_{\mathrm {trk}}\) and \({\mathcal {W}}_{\mathrm {trk}}\) show a stronger \(p_{{\text {T}}} \) dependence than the other properties and this is extensible for other \(p_{{\text {T}}} \) and \(\eta _{\text {det}}\) bins and jet collections. The corresponding distributions after the GS calibration are shown in Fig. 9. The jet response dependence on the jet properties is removed to within 2% after applying the GS calibration for all observables. Deviations from unity are expected since the correlations between the variables are not accounted for in the GS calibration procedure.

5.4 Jet transverse momentum resolution improvement in simulation

Figure 10 shows the jet transverse momentum resolution as a function of \(p_{{\text {T}}} ^{\mathrm {truth}}\) in simulated Pythia8 dijet events. While the response remains unchanged, the jet resolution improves as more corrections are added. The relative improvementFootnote 3 for EM + JES calibrated anti-\(k_{t}\)  \(R=0.4\) jets with central rapidity is found to be 10% at \(p_{{\text {T}}} =30~{\text {Ge}} {\text {V}}\), rising to 40% at \(400~{\text {Ge}} {\text {V}}{}\). This is equivalent to removing an absolute uncorrelated resolution source \(\Delta \sigma \) of 10% or 5%, respectively, as can be seen in the lower part of Fig. 10a. The quantity \(\Delta \sigma \) is calculated by subtracting in quadrature the relative jet \(p_{{\text {T}}}\) resolution:

$$\begin{aligned} \Delta \sigma = {\left\{ \begin{array}{ll} -\left( \left( \sigma _{p_{{\text {T}}}}/p_{{\text {T}}} \right) _{\text {no-GS}} \ominus \left( \sigma _{p_{{\text {T}}}}/p_{{\text {T}}} \right) _{\text {GS}}\right) &{} {\text {if}}~ \left( \sigma _{p_{{\text {T}}}}/p_{{\text {T}}} \right) _{\text {no-GS}} > \left( \sigma _{p_{{\text {T}}}}/p_{{\text {T}}} \right) _{\text {GS}} \\ \,+\,\left( \left( \sigma _{p_{{\text {T}}}}/p_{{\text {T}}} \right) _{\text {GS}} \ominus \left( \sigma _{p_{{\text {T}}}}/p_{{\text {T}}} \right) _{\text {no-GS}}\right) &{} {\text {otherwise.}} \end{array}\right. } \end{aligned}$$
(9)
Table 2 Sequence of GS corrections used to improve the jet performance in each \(\eta _{\text {det}}\) region. For jets at the LCW + JES scale, only the tracking and uncontained calorimeter jet corrections are applied

The improvement observed for jets initially calibrated with the LCW + JES scheme is found to be smaller, which is expected as only tracking and non-contained jet corrections are applied to these jets. For both EM + JES and LCW + JES calibrated jets, improvements to the JER is observed across the full \(p_{{\text {T}}}\) range probed (\({25}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} <{1200}\,{{\text {Ge}} {\text {V}}}\)). The fact that JER reduction is observed at high jet \(p_{{\text {T}}}\) means that also the constant term of the calorimeter resolution (Eq. (24)) is reduced by the GS calibration. This improvement can be explained by considering the jet resolution distributions for different values of the jet properties. As is evident in Fig. 8, the mean of these distributions have a strong dependence on the jet property, while the width of the distributions (JER) are not expected to have any such dependence at high jet \(p_{{\text {T}}}\) . The GS calibration can hence be seen as aligning several similarly shaped response distributions, which each have a biased mean, towards the desired truth-particle jet scale.

The conclusions from this section can generally be extended to the whole \(\eta _{\text {det}}\) range, although close to the calorimeter transition regions where the detector instrumentation is reduced (Fig. 4), the track-based observables introduce an even stronger improvement. The enhancement in JER due to the GS calibration is found to be similar for different MC generators.

Only a small improvement is observed after applying the last GS correction for uncontained calorimeter jets in the inclusive jet sample since only a small fraction of energetic jets are uncontained. Figure 11 presents a measure of the improvement in jet energy resolution from applying the fifth GS correction both to inclusive jets and to jets with at least 20 associated muon segments, which are less likely to be fully contained in the calorimeters. The resolution metric is the standard deviation (RMS) of the jet response distribution divided by the arithmetic mean. This quantity is used instead of the normal resolution definition (from the \(\sigma \) of a Gaussian fit as described in Sect. 4.2) since it gives information about the reduction in the low response tail. While the improvement observed is small for an inclusive jet sample, the impact is significant for uncontained jets. A relative resolution improvement of 10% is seen for jets with \(p_{{\text {T}}} {}\approx 100~{\text {Ge}} {\text {V}}\), while the improvement is 20% for jets with \(p_{{\text {T}}} {}\approx 1~{\text {Te}} {\text {V}}\). This corresponds to removing an absolute resolution source of 8% or 4%, respectively.

Fig. 8
figure 8

Jet \(p_{{\text {T}}}\) response as a function of \(f_{{\text {Tile}}0}\), \(f_{{\text {LAr}}3}\), \(n_{\mathrm {trk}}\), \({\mathcal {W}}_{\mathrm {trk}}{}\) and \(N_{\mathrm {segments}}\) for jets with \(|\eta _{\text {det}}|<0.3\) (\(|\eta _{\text {det}}|<1.3\) for \(N_{\mathrm {segments}}\)) in different \(p_{{\text {T}}} ^{{\mathrm {truth}}}\) ranges. All jets are reconstructed with anti-\(k_{t}\) \(R=0.4\) and calibrated with the EM + JES scheme without global sequential corrections. The horizontal line associated with each data point indicates the bin range, and the position of the marker corresponds the centroid within this bin. The underlying distributions of the jet properties for each \(p_{{\text {T}}} ^{{\mathrm {truth}}}\) bin normalized to the same area are also shown as histograms at the bottom of the plots

Fig. 9
figure 9

Jet \(p_{{\text {T}}}\) response as a function of \(f_{{\text {Tile}}0}\), \(f_{{\text {LAr}}3}\), \(n_{\mathrm {trk}}\), \({\mathcal {W}}_{\mathrm {trk}}{}\) and \(N_{\mathrm {segments}}\) for jets with \(|\eta _{\text {det}}|<0.3\) (\(|\eta _{\text {det}}|<1.3\) for \(N_{\mathrm {segments}}\)) in different \(p_{{\text {T}}} ^{{\mathrm {truth}}}\) ranges. All jets are reconstructed with anti-\(k_{t}\) \(R=0.4\) and calibrated with the EM + JES scheme including global sequential corrections. The horizontal line associated with each data point indicates the bin range, and the position of the marker corresponds the centroid within this bin. The underlying distributions of the jet properties for each \(p_{{\text {T}}} ^{{\mathrm {truth}}}\) bin normalized to the same area are also shown as histograms at the bottom of the plots

Fig. 10
figure 10

Jet \(p_{{\text {T}}}\) resolution as a function of \(p_{{\text {T}}} ^{\mathrm {truth}}\) in the nominal Pythia8 MC sample for jets with a \(|\eta _{\text {det}}|<0.3\) and b \(2.8<|\eta _{\text {det}}|<3.2\). The jets are reconstructed with anti-\(k_{t}\) \(R=0.4\). Curves are shown after the EM + JES calibration without global sequential corrections (black circles), with calorimeter-based global sequential corrections only (red squares), with calorimeter- and track-based corrections only (green upward triangles) and including all the global sequential corrections (blue downward triangles). The lower panels show the improvement relative to the EM + JES scale without global sequential corrections obtained using subtraction in quadrature (Eq. (9))

Fig. 11
figure 11

Standard deviation over arithmetic mean of the jet energy response as a function of \(E^{\text {truth}}\) for \(|\eta _{\text {det}}|<1.3\) before (filled circles) and after (empty circles) the fifth global sequential correction for a all jets and b calorimeter jets with \(N_{\mathrm {segments}}>20\) in the nominal Pythia8 dijet MC sample. All jets are reconstructed with anti-\(k_{t}\) \(R=0.4\) and initially calibrated at the EM + JES scale. The requirement \(N_{\mathrm {segments}}>20\) selects a large fraction of “uncontained” jets, i.e. jets for which some of the particles produced in the hadronic shower travel into the muon spectrometers behind the calorimeters. The bottom panels show the improvement introduced by the corrections quantified using subtraction in quadrature (Eq. (9))

5.5 Flavour dependence of the jet response in simulation

The internal structure of a jet, and thereby also its calorimeter response, depends on how the jet was produced. Jets produced in dijet events are expected to originate from gluons more often than jets with the same \(p_{{\text {T}}}\) and \(\eta \) produced in the decay of a W boson or in association with a photon or Z boson. The hadrons of a quark-initiated jet will tend to be of higher energy and hence penetrate further into the calorimeter, while the less energetic hadrons in a gluon-initiated jet will bend more in the magnetic field in the inner detector. It is desirable that such flavour dependence of the calibrated jet should be as small as possible to mitigate sample-specific systematic biases in the jet energy scale (Sect. 9.2.3 for discussion of the associated uncertainty).

Fig. 12
figure 12

The \(p_{{\text {T}}}\) response for anti-\(k_{t}\)\(R=0.4\) jets as a function of \(p_{{\text {T}}} ^{{\mathrm {truth}}}\) for light quark (LQ) jets (top), gluon jets (middle) and b-jets (bottom) with \(|\eta _{\text {det}}{}|<0.3\) (left) and \(2.1<|\eta _{\text {det}}{}|<2.4\) (right) regions in the Pythia8 MC sample. The \(p_{{\text {T}}}\) response after the EM + JES calibration without GS corrections (circles), with calorimeter-based GS corrections only (squares) and including all the GS corrections (triangles) are shown. The lower box of each plot shows the impact on the jet response, subtracting the response before the GS corrections (R) from the response after applying the GS corrections (\(R'\))

The flavour dependence of the response is studied in simulated dijet events by assigning a flavour label to each calorimeter jet using an angular matching to the particles in the MC event record. If the jet matches a b- or a c-hadron, it is labelled a \(b{\text {-jet}}\) or c jet, respectively. If it matches both a b- and a c-hadron, it is labelled a \(b{\text {-jet}}\) . If it does not match any such heavy hadron, the jet is labelled “light quark” (LQ) or gluon initiated, based on the type of the highest-energy matching parton. The matching criterion used is \(\Delta R < R\), where R is the radius parameter of the jet algorithm (0.4 or 0.6). The \(p_{{\text {T}}}\) responses before and after GS calibration for jets in different flavour categories are presented in Fig. 12. For each flavour category, results are shown for two representative pseudorapidity regions. The response for LQ jets is larger than unity since the MC-derived baseline calibration (Sect. 4.3) is derived in dijet events that contain a large fraction of gluon jets. For gluon-initiated jets the response is lower than that of LQ jets, as expected, and \(b{\text {-jets}}\) have a \(p_{{\text {T}}}\) response between that of LQ and gluon jets. In all cases, the GS calibration brings the response closer to unity and hence reduces the flavour dependence, which is important as analyses do not know the flavour of each jet. The change in \(p_{{\text {T}}}\) response introduced by the GS calibration for jets with \(p_{{\text {T}}} {}=80~{\text {Ge}} {\text {V}}\) with \(|\eta |<0.3\) is \(-4\%\), \(\,+\,1\%\) and \(-2\%\) for LQ jets, gluon jets and b-jets, respectively.

5.6 In situ validation of the global sequential calibration

The GS correction is validated in situ with dijet events using the tag-and-probe technique, using the event selection described in Sect. 6, with only one modification: both jets are required to be in the same \(|\eta _{\text {det}}{}|\) region to avoid biases from any missing \(\eta \)-dependent calibration factors. The jet whose response dependence is studied is referred to as the probe jet, while the other is referred to as the reference jet. The choice of reference jet and probe jet is arbitrary when studying the response dependence on the jet properties, and the events are always used twice, alternating the roles of reference and probe. The response for the probe jet is measured through the dijet \(p_{{\text {T}}} \) asymmetry variable \(\mathcal {A}\) (Eq. (10) and Sect. 6.1) in bins of the average \(p_{{\text {T}}}\) of the probe and the reference jet \(p_{{\text {T}}} ^{\text {avg}}\), and is studied as a function of the jet property of the probe jet.

Fig. 13
figure 13

Dijet in situ validation of jet response as a function of \(f_{{\text {Tile}}0}\), \(f_{{\text {LAr}}3}\) and \(n_{\mathrm {trk}}\) for jets with \({80}\,{{\text {Ge}} {\text {V}}}<p_{{\text {T}}} ^{\text {avg}}<{110}\,{{\text {Ge}} {\text {V}}}\) and \(|\eta _{\text {det}}{}|<0.6\) (top) and for jets with \({600}\,{{\text {Ge}} {\text {V}}}<p_{{\text {T}}} ^{\text {avg}}<{800}\,{{\text {Ge}} {\text {V}}}\) and \(|\eta _{\text {det}}{}|<0.6\) (middle) and the same quantity as a function of \({\mathcal {W}}_{\mathrm {trk}}\) and \(N_{\mathrm {segments}}\) (bottom). Each set of measurements are shown for data (filled circles) and for Pythia8 MC simulation with both full (empty circles) and fast (empty squares) detector modelling. All jets are reconstructed with anti-\(k_{t}\) \(R=0.4\) and calibrated with the EM + JES scheme without any global sequential corrections

Results for all variables used in the GS calibration are shown in Fig. 13 for jets with \(|\eta _{\text {det}}|<0.6\) in two representative \(p_{{\text {T}}} \) ranges. No GS calibration is applied to either the probe or the reference jet. It can be seen that the reference Pythia8 dijet MC sample agrees with the data within 1% (4%) for \({600}\,{{\text {Ge}} {\text {V}}}<p_{{\text {T}}} ^{\text {avg}}<{800}\,{{\text {Ge}} {\text {V}}}\) (\({80}\,{{\text {Ge}} {\text {V}}}<p_{{\text {T}}} ^{\text {avg}}<{110}\,{{\text {Ge}} {\text {V}}}\)) for the calorimeter-based variables, and slightly better for the track-based observables. A similar level of agreement is seen in other jet \(p_{{\text {T}}}\) and \(\eta _{\text {det}}\) ranges. These differences impact the average jet \(p_{{\text {T}}}\) weighted by the fraction of jets with corresponding values of the GS property in question; given that these differences occur in the tails of the distributions, the impact on the average jet \(p_{{\text {T}}}\) is thus minimal. Results using MC samples produced with the AFII fast detector simulation are also shown and demonstrate similar agreement with data, although these samples have larger statistical uncertainties. The relative data–MC agreement stays the same after the GS calibration is applied for both full and fast detector simulation.

Fig. 14
figure 14

Jet \(p_{{\text {T}}}\) resolution (top) and difference in jet response between gluon and light quark (LQ) initiated jets (bottom) as a function of \(p_{{\text {T}}} ^{\mathrm {truth}}\) for two representative \(|\eta _{\text {det}}{}|\) regions. Results are shown both before (closed markers) and after (open markers) the global sequential corrections is applied, and separately for jets in the Pythia8 (circles), Sherpa (squares), and Herwig++ (triangles) dijet MC samples. All jets are reconstructed with anti-\(k_{t}\) \(R=0.4\). Jets are labelled LQ- or gluon-initiated, based on the highest-energy parton in the MC event record which fulfils an angular matching to the jet as further detailed in Sect. 5.5

5.7 Comparison of jet resolution and flavour dependence between different event generators

Figure 14 presents comparisons of \(p_{{\text {T}}}\) resolution and response dependence on jet flavour between three MC event generators, namely Pythia8, Herwig++, and Sherpa, each with a different implementation of parton showering, multiple parton–parton interactions and hadronization (Sect. 3). These quantities are shown as a function of jet \(p_{{\text {T}}}\) both with and without GS calibration applied in two representative \(\eta _{\text {det}}\) regions. Pythia8 tends to predict a slightly worse jet \(p_{{\text {T}}}\) resolution for jets with \(p_{{\text {T}}} <50~{\text {Ge}} {\text {V}}\) compared with the jet resolution in Herwig++ and Sherpa, but the improvement introduced by the GS correction is compatible between the different generators. The reduction of jet flavour dependence is studied by taking the difference between the jet responses for LQ and gluon jets, determined as discussed in Sect. 5.5 and as used for light-quark vs gluon discrimination [56]. The overall flavour dependence of the jet response is found to be smaller for Herwig than for Pythia8 and Sherpa, and in general, the LQ jet response is quite similar between the generators, while the response for gluon jets varies more. For jets with \(p_{{\text {T}}} >40~{\text {Ge}} {\text {V}}\), the response difference between LQ and gluon jets is reduced by at least a factor of two after applying the GS correction.

6 Intercalibration and resolution measurement using dijet events

Following the determination and application of MC-based jet calibration factors, it is important to measure the jet response and resolution in situ, quantify the level of agreement between data and simulation, and correct for any discrepancy. The first step is to investigate the jet response dependence across the detector in terms of pseudorapidity. All results presented in this section are obtained with jets calibrated with the calibration chain up to, and including, the GS calibration (Sect. 4.4).

6.1 Techniques to determine the jet calibration and resolution using dijet asymmetry

The jet energy resolution (JER) and the relative response of the calorimeter as a function of pseudorapidity are determined using events with dijet topologies [6, 7]. The \(p_{{\text {T}}}\) balance is quantified by the dijet asymmetry

$$\begin{aligned} \mathcal {A}= \frac{p_{{\text {T}}} ^{{\mathrm {probe}}}- p_{{\text {T}}} ^{{\mathrm {ref}}}}{p_{{\text {T}}} ^{\text {avg}}}, \end{aligned}$$
(10)

where \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) is the transverse momentum of a jet in a well-calibrated reference region, \(p_{{\text {T}}} ^{{\mathrm {probe}}}\) is the transverse momentum of the jet in the calorimeter region under investigation, and \(p_{{\text {T}}} ^{\text {avg}}= (p_{{\text {T}}} ^{{\mathrm {probe}}}\,+\, p_{{\text {T}}} ^{{\mathrm {ref}}})/2\). The average calorimeter response relative to the reference region, 1/c, is then defined as

$$\begin{aligned} \frac{1}{c} \equiv \frac{2\,+\,\left\langle \mathcal {A}\right\rangle }{2-\left\langle \mathcal {A}\right\rangle } \approx \frac{\left\langle p_{{\text {T}}} ^{{\mathrm {probe}}}\right\rangle }{\left\langle p_{{\text {T}}} ^{{\mathrm {ref}}}\right\rangle }, \end{aligned}$$
(11)

where \(\left\langle \mathcal {A}\right\rangle \) is the mean of the asymmetry distribution in a given bin of \(p_{{\text {T}}} ^{\text {avg}}\) and \(\eta _{\text {det}}\), and the last equality of Eq. (11) can be obtained by inserting the expectation value of a first-order Taylor expansion of Eq. (10), giving \(\left\langle \mathcal {A}\right\rangle = \left( \langle p_{{\text {T}}} ^{{\mathrm {probe}}}\rangle -\langle p_{{\text {T}}} ^{{\mathrm {ref}}}\rangle \right) /\langle p_{{\text {T}}} ^{\text {avg}}\rangle \).

Two versions of the analysis are performed. In the central reference method, the calorimeter response is measured as a function of \(p_{{\text {T}}} ^{\text {avg}}\) and \(\eta _{\text {det}}\) relative to the region defined by \(|\eta _{\text {det}}|<0.8\). Jets in this region are precisely calibrated using \(Z \) + jet, \(\gamma \) + jet and multijet data (Sects. 7 and 8). In the matrix method, multiple \(\eta _{\text {det}}\) regions are chosen and the calorimeter response in a given region is measured relative to all other regions. For a given \(p_{{\text {T}}} ^{\text {avg}}\) bin, \(\left\langle \mathcal {A}\right\rangle \) is determined for each of a large number of combinations of \(\eta _{\text {det}}\) regions of the two jets involved. The calorimeter response relative to the central region is then obtained by solving a set of linear equations based on this matrix of dijet asymmetries [6]. A constraint is applied that sets the average response for jets with \(|\eta _{\text {det}}|<0.8\) to unity. The advantage of the matrix method is that a much larger fraction of events can be used, since events with both jets outside \(|\eta _{\text {det}}|<0.8\) are considered, thus reducing the statistical uncertainty of the final result. Statistical uncertainties in the matrix method result are estimated using pseudo-experiments. Each pseudo-experiment generates a new matrix of dijet asymmetries by sampling the average asymmetry \(\left\langle \mathcal {A}\right\rangle \) of each bin (matrix element) according to their statistical uncertainty. The intercalibration factors are then derived for each pseudo-experiment, and the statistical uncertainty of the calibration is obtained from the spread. In this paper, the main results are obtained using the matrix method, and the simpler central reference method is used for validation.

The asymmetry distribution also probes the jet energy resolution. The standard deviation of the asymmetry distribution \(\sigma _{\mathcal {A}}^{\text {probe}}\) in a given \(\left( p_{{\text {T}}} ^{\text {avg}}{},\eta ^{\text {probe}}_{\text {det}}\right) \) bin can be expressed as

$$\begin{aligned} \sigma _{\mathcal {A}}^{\text {probe}}= & {} \frac{\sigma _{p_{{\text {T}}}}^{\text {probe}}\oplus \, \sigma _{p_{{\text {T}}}}^{\text {ref}}}{p_{{\text {T}}} ^{\text {avg}}} = \left\langle \frac{\sigma _{p_{{\text {T}}}}}{p_{{\text {T}}}}\right\rangle _{\text {probe}} \oplus \, \left\langle \frac{\sigma _{p_{{\text {T}}}}}{p_{{\text {T}}}}\right\rangle _{\text {ref}} \nonumber \\= & {} \left\langle \frac{\sigma _{E}}{E}\right\rangle _{\text {probe}} \oplus \, \left\langle \frac{\sigma _{E}}{E}\right\rangle _{\text {ref}}, \end{aligned}$$
(12)

where \(\sigma _{p_{{\text {T}}}}^{\text {probe}}\) and \(\sigma _{p_{{\text {T}}}}^{\text {ref}}\) are the standard deviations of \(p_{{\text {T}}} ^{{\mathrm {probe}}}\) and \(p_{{\text {T}}} ^{{\mathrm {ref}}}\), respectively. The first two equalities of Eq. (12) follow from error propagation of Eq. (10) and from the fact that after calibration \(\langle p_{{\text {T}}} ^{{\mathrm {probe}}}{}\rangle = \langle p_{{\text {T}}} ^{{\mathrm {ref}}}{}\rangle = \langle p_{{\text {T}}} ^{\text {avg}}{} \rangle \) in a given \(p_{{\text {T}}} ^{\text {avg}}\) bin. The energy and \(p_{{\text {T}}}\) resolutions are approximately the same since contributions of the jet angular resolution are negligible (Fig. 2). The standard deviation of the asymmetry distribution \(\sigma _{\mathcal {A}}\) is obtained from a Gaussian fit to the core of the distribution.

The standard deviation of the probe jet \(p_{{\text {T}}}\) is derived from Eq. (13) as

$$\begin{aligned} \left\langle \frac{\sigma _{p_{{\text {T}}}}}{p_{{\text {T}}}}\right\rangle _{\mathrm {probe}} = \sigma _{\mathcal {A}}^{\text {probe}} \,\ominus \, \left\langle \frac{\sigma _{p_{{\text {T}}}}}{p_{{\text {T}}}}\right\rangle _{\mathrm {ref}}, \end{aligned}$$
(13)

where the latter term is derived from events where both jets fall in the central reference region (\(|\eta _{\text {det}}{}|<0.8\)). In this case, the reference region is being probed, and the first and last terms in Eq. (12) are hence equal, which gives \(\left\langle \sigma _{p_{{\text {T}}}}/p_{{\text {T}}} \right\rangle _{\mathrm {ref}} = \sigma _{\mathcal {A}}{\big /} \sqrt{2}\). When calculating the asymmetry, the jets are fully calibrated including all data-driven correction factors.

The \(p_{{\text {T}}}\) balance strictly holds only for \(2\rightarrow 2\) partonic events. In reality, the \(p_{{\text {T}}}\) balance between two jets is affected on an event-by-event basis by additional quark/gluon radiation outside of the jets, as well as hadronization and MPI effects that cause particle losses and additions to the jets, respectively. To account for the impact of such effects, the dijet asymmetry standard deviation \(\sigma _{\mathcal {A}}\) is measured separately for reconstructed and truth-particle jets, and the standard deviation due to detector smearing \(\sigma _{\mathcal {A}}^{\text {det}}\) is obtained by subtracting the truth-particle quantity from the observed quantity in quadrature:

$$\begin{aligned} \sigma _{\mathcal {A}}^{\text {det}} = \sigma _{\mathcal {A}}^{\text {reco}} \ominus \sigma _{\mathcal {A}}^{\text {truth}}. \end{aligned}$$
(14)

This final jet energy resolution measurement \(\left\langle \sigma _{p_{{\text {T}}}}/p_{{\text {T}}} \right\rangle \) is calculated according to Eq. (13) after first correcting the asymmetry width \(\sigma _{\mathcal {A}}\) according to Eq. (14).

6.2 Determining the jet resolution using the dijet bisector method

The bisector method attempts to separate the desired part of the dijet \(p_{{\text {T}}}\) imbalance that is due to fluctuations in the jet calorimeter response from contributions from other effects such as soft parton radiation and the underlying event. In the same way as for the central reference method, selection criteria are applied to select events with dijet topology, and at least one of the two jets is required to have \(|\eta _{\text {det}}{}|<0.8\). This jet is referred to as the “reference jet”, while the other jet is labelled “probe jet”. If both jets fulfil \(|\eta _{\text {det}}{}|<0.8\), the labels are assigned randomly. The \(p_{{\text {T}}}\) (imbalance) of the dijet system in the transverse plane is defined as the vectorial sum of the \(p_{{\text {T}}}\) vectors of the leading two jets: \({\mathbf {p}}_{\text {T}}^{\,jj} = {\mathbf {p}}^{\,\text {probe}}_{\text {T}} \,+\, {\mathbf {p}}^{\,\text {ref}}_{\text {T}}\). This vector is projected onto a Cartesian coordinate system in the transverse plane \((\psi ,\upsilon )\), where the \(\upsilon \)-axis is defined to be along the direction that bisects the angle \(\Delta \phi _{12}\) between the two jets, and the \(\psi \)-axis is defined to have a direction that minimizes the angle to the probe jet as illustrated in Fig. 15. Both effects from the detector (response and resolution) and from physics (e.g. radiation) are present in the \(\psi \) component of the \(p_{{\text {T}}}\) balance that is oriented “towards” the probe jet axis, whereas detector effects should be significantly smaller than physics effect in the \(\upsilon \) component, oriented “away” from both the probe and the reference jet. As a result [7], the jet energy resolution for events in a given \(p_{{\text {T}}} ^{\text {avg}}\) bin where both jets are in the reference region (\(|\eta _{\text {det}}^{\text {probe}}|<0.8\)) is given by

$$\begin{aligned} \left\langle \frac{\sigma _{p_{{\text {T}}}}}{p_{{\text {T}}}}\right\rangle _{\text {ref}} \, = \, \frac{\sigma _\psi \,\ominus \, \sigma _\upsilon }{p_{{\text {T}}} ^{\text {avg}}\sqrt{2 \,\langle \left| \cos \Delta \phi _{12}\right| \rangle }}, \end{aligned}$$
(15)

and for events where the probe jet is outside the reference region it is given by

$$\begin{aligned} \left\langle \frac{\sigma _{p_{{\text {T}}}}}{p_{{\text {T}}}}\right\rangle _{\text {probe}} \, = \, \frac{\sigma _\psi \,\ominus \, \sigma _\upsilon }{p_{{\text {T}}} ^{\text {avg}}\sqrt{\langle \left| \cos \Delta \phi _{12}\right| \rangle }} \ominus \left\langle \frac{\sigma _{p_{{\text {T}}}}}{p_{{\text {T}}}}\right\rangle _{\text {ref}}. \end{aligned}$$
(16)

The standard deviations \(\sigma _\upsilon \) and \(\sigma _\psi \) are evaluated as the width parameters of Gaussian fits to the \(p_{{\text {T}}\upsilon }\) and \(p_{{\text {T}}\psi }\) distributions, respectively.

Although the bisector observables in Eqs. (15) and (16) have less dependence on soft quark or gluon emission than the asymmetry-based jet resolution measurement of Eq. (12), the approach relies on the assumption that the physics effects are the same in the \(\psi \) and \(\upsilon \) components. Corrections to the measured \(\sigma _\psi \) and \(\sigma _\upsilon \) are made by subtracting the corresponding quantities derived using truth-particle jets in quadrature from the measured quantity, analogously to what is done for the central reference method (Eq. (14)).

Fig. 15
figure 15

Illustration of observables used in the dijet bisector technique. The \((\psi ,\upsilon )\)-coordinate system is defined such that the \(\upsilon \)-axis bisects the azimuthal angle \(\Delta \phi _{12}\) between the leading two jets while the \(\psi \)-axis minimizes the angle to the probe jet. The vectorial sum of the transverse momenta of the probe and the reference jets define the dijet transverse momentum \({\mathbf {p}}_{\text {T}}^{\,jj}\). Its components along the \(\psi \)- and \(\upsilon \)-axes (\(p_{{\text {T}}\psi }\) and \(p_{{\text {T}}\upsilon }\)) are used to extract a measurement of the jet energy resolution

6.3 Dijet selection

Dijet events are selected using a combination of central (\(|\eta _{\text {det}}|<3.2\)) and forward (\(|\eta _{\text {det}}|>3.2\)) jet triggers. For this selection, the trigger efficiency for each region of \(p_{{\text {T}}} ^{\text {avg}}{}\) is greater than \(99\%\) and approximately independent of the pseudorapidity of the probe jet. The jet triggers used have different prescales, downscaling factors used to meet bandwidth constraints on the recording of data. Larger prescales are used for data recorded when the instantaneous luminosity is high or for triggers that require lower jet \(p_{{\text {T}}}\) . Due to the different prescales for the central and forward jet triggers, the data collected by the different triggers correspond to different integrated luminosities. Each data event is assigned a trigger based on the \(p_{{\text {T}}} ^{\text {avg}}\) and \(\eta _{\text {det}}\) of the more forward jet. The data is hence split into different categories, and each is weighted according to the integrated luminosity of the dedicated trigger used following the “exclusion method” [57]. Events are selected in which there are at least two jets with \(p_{{\text {T}}} >25~{\text {Ge}} {\text {V}}\) and \(|\eta _{\text {det}}|<4.5\). To select events with a dijet topology, the azimuthal angle between the two leading jets (i.e., the reference and probe jets) is required to be \(\Delta \phi _{12} > 2.5\) and events are rejected if they contain a third jet with \(p_{\text {T}}^{\text {j3}} > 0.4\,p_{{\text {T}}} ^{\text {avg}}\). The jets are also required to fulfil the preselection described in Sect. 4.1.

6.4 Method for evaluating in situ systematic uncertainties

The in situ techniques rely on assumptions that are only approximately fulfilled, and simulation is used to account for these approximations. For example, the momentum balance between the jet and the reference object is altered to varying degrees by the presence of additional radiation. The impact from such radiation is reduced by event topology selection criteria. Since the choice of the exact threshold values is arbitrary, systematic uncertainties are evaluated by rederiving the final result, which is a data-to-MC ratio, after varying these selection criteria. Other systematic uncertainties are evaluated by altering choices used by the method, such as a parameter used in a fit or changing the MC generator. In the case of the \(\gamma \) + jet, \(Z \) + jet, and multijet techniques (Sects. 7 and 8), uncertainties are also established by adjusting the kinematic properties (energy, \(p_{{\text {T}}}\) , etc.) of the reference object according to their associated uncertainties. These variations test the ability of the MC simulation to model the physics effects since they either reduce or amplify their importance.

Many potential effects are considered as systematic uncertainty sources. As explained above, each of these is evaluated by introducing a variation to the analysis. However, due to limited statistics in both the data and MC samples, these variations have an associated statistical uncertainty (i.e. an “uncertainty on the uncertainty”). For example, an introduced variation that has no impact on the measured calibration factor (or resolution) still produces changes consistent with statistical fluctuations. Thus, it is important to only include statistically significant variations as systematic uncertainties. This is achieved with a two-step procedure outlined below.

In the first step, the statistical uncertainty of the systematic variations is evaluated in each \(p_{{\text {T}}}\) bin using pseudo-experiments, following the “bootstrapping” method [58]. Each such pseudo-experiment is constructed by altering the data and MC samples. Each event is counted n times, where n is sampled from a Poisson distribution with a mean of unity. For each pseudo-experiment i, both the nominal \(c_{{\text {nom}},i}\) and varied \(c_{{\text {var}},i}\) results are extracted, and the uncertainty is evaluated as the difference between these results \(\Delta c_{{\text {var}},i} = c_{{\text {var}},i}-c_{{\text {nom}},i}\). If the variation is a change in the selection criteria or a change of the calibration or resolution smearing of any of the objects, the random fluctuations of the events that stay in the same bin are the same between the nominal and varied samples, while the events that migrate between bins will have independent fluctuations. The statistical uncertainty of the systematic uncertainty amplitude is evaluated as the standard deviation of the systematic uncertainty (difference between varied and nominal result) of the pseudo-experiments.

In a second step, adjacent \(p_{{\text {T}}}\) bins might iteratively be combined until the observed variation is statistically significant. If the variation already is significant with the original binning, it is recorded as a systematic uncertainty. Otherwise, neighbouring bins are merged, which results in improved statistical precision. After each bin-merging, it is checked if the systematic variation is significant, and if so, it is recorded as a systematic uncertainty. If after all bins are merged, the variation is still not significant, the systematic effect is considered consistent with zero and is discarded.

For some systematic variations, there are physics reasons for the response to depend on \(p_{{\text {T}}}\) , such as the out-of-cone effects being relatively larger at low \(p_{{\text {T}}}\) . In such cases, the bin merging step is not performed for the nominal uncertainty evaluation, but it is considered within alternative uncertainty scenarios (Sect. 9.4).

The use of the pseudo-experiments and the bin merging procedure strongly reduces the effect of statistical fluctuations when evaluating systematic uncertainties. This procedure is used for all the in situ methods discussed in this paper.

6.5 Relative jet energy scale calibration using dijet events

The following sections detail the determination of the intercalibration aimed at achieving a uniform scale for jets as a function of pseudorapidity.

6.5.1 Comparison of matrix and central reference methods

Figure 16 compares the relative jet response calculated using the matrix method with that obtained from the central reference method. The relative response obtained from the matrix method differs slightly from that from the central reference method, most notably in the forward regions where the difference is up to 4%. This is not surprising since the matrix method uses a significantly larger pool of events that have different kinematics (smaller rapidity separation) than the ones used by the central reference method. The same shift appears in both data and MC simulation, resulting in consistent data-to-MC ratios between the two methods. For \(25~{\text {Ge}} {\text {V}}~ \le p_{{\text {T}}} ^{\text {avg}}{}<40~{\text {Ge}} {\text {V}}\) the statistical precision of the matrix method generally exhibits a \(40\%\) improvement compared with the precision of the central reference method. The level of improvement decreases with increasing \(p_{{\text {T}}} ^{\text {avg}}\) and is typically less than \(10\%\) for \(p_{{\text {T}}} ^{\text {avg}}{}>400~{\text {Ge}} {\text {V}}\). Since the final \(\eta \) intercalibration is derived using data-to-MC ratios that are found to be consistent between the methods and the matrix method gives significantly smaller uncertainties, the matrix method is chosen, and hereafter all \(\eta \) intercalibration results presented are derived using this method.

Fig. 16
figure 16

Relative jet response measured using the matrix and central reference methods for anti-\(k_{t}\) jets with \(R=0.4\) calibrated with the EM + JES scheme as a function of the probe jet pseudorapidity. Results are presented for \(65{\text {Ge}} {\text {V}}\le p_{{\text {T}}} ^{\text {avg}}{}<85\,{\text {Ge}} {\text {V}}\) and 270\({\text {Ge}} {\text {V}}\) \( \le p_{{\text {T}}} ^{\text {avg}}{}<330{\text {Ge}} {\text {V}}\) for data (circles) and Sherpa (triangles) using the central reference method (empty symbols) and the matrix method (filled symbols). Only statistical uncertainties are shown. The dashed lines in the lower panels indicate \(1\pm 0.02\) and \(1\pm 0.05\)

6.5.2 Comparison of data with simulation

Figure 17 shows the relative response as a function of \(\eta _{\text {det}}\) for data and the MC simulations for four \(p_{{\text {T}}} ^{\text {avg}}\) regions. Figure 18 shows the relative response as a function of \(p_{{\text {T}}} ^{\text {avg}}\) for two representative \(\eta _{\text {det}}\) bins, namely \(-1.5\le \eta _{\text {det}}<-1.2\) and \(2.1\le \eta _{\text {det}}<2.4\). The general features of the response in data are reproduced reasonably well by the Sherpa and Powheg + Pythia8 predictions. Furthermore, the theoretical predictions are in good agreement with each other, with a much smaller spread than that observed in the previous studies using Pythia8 and Herwig++ [6], because the new theoretical predictions are accurate to leading order in perturbative QCD for variables sensitive to the third jet’s activity, such as the dijet balance, whereas the Pythia8 and Herwig++ predictions rely on the leading-logarithm accuracy of the parton shower algorithms.

Fig. 17
figure 17

Relative jet response, 1/c, as a function of the jet pseudorapidity for anti-\(k_{t}\) jets with \(R=0.4\) calibrated with the EM + JES scheme, for data (black circles), Sherpa (blue triangles) and Powheg + Pythia8 (red squares). Results are shown separately for \(25~{\text {Ge}} {\text {V}}~<p_{{\text {T}}} ^{\text {avg}}{}<40~{\text {Ge}} {\text {V}}\), \(85~{\text {Ge}} {\text {V}}~<p_{{\text {T}}} ^{\text {avg}}{}<115~{\text {Ge}} {\text {V}}\), \(220~{\text {Ge}} {\text {V}}~<p_{{\text {T}}} ^{\text {avg}}{}<270~{\text {Ge}} {\text {V}}\) and \(760~{\text {Ge}} {\text {V}}~<p_{{\text {T}}} ^{\text {avg}}{}<1200~{\text {Ge}} {\text {V}}\) with associated statistical uncertainties. The lower part of each figure shows the ratio of relative response in MC simulation to that in data, while the thick line indicates the resulting residual correction. The dashed part of this line represents the extrapolation of the ratio into regions which are either statistically limited or probe \(|\eta _{\text {det}}|>2.7\). These measurements are performed using the matrix method. The dashed lines in the lower panels indicate \(1\pm 0.02\) and \(1\pm 0.05\)

Fig. 18
figure 18

Relative jet response, 1/c, as a function of the jet \(p_{{\text {T}}}\) for anti-\(k_{t}\) jets with \(R=0.4\) calibrated with the EM + JES scheme, separately for \(-1.5\le \eta _{\text {det}}<-1.2\) and \(2.1\le \eta _{\text {det}}<2.4\), for data (black circles), Sherpa (blue triangles) and Powheg + Pythia8 (red squares). The lower parts of the figures show the ratios of MC simulation to data relative response, while the thick line indicates the resulting residual correction. The dashed part of this line represents the extrapolation of the ratio into regions which are statistically limited. The dashed lines in the lower panels indicate \(1\pm 0.02\) and \(1\pm 0.05\)

6.5.3 Derivation of residual jet energy scale correction

The residual calibration factor \(c_\eta \) is derived from the ratio of data and Sherpa \(\eta \)-intercalibration factors, i.e. \(c_{\eta ,i} = c^{\text {data}}_i/c^{\textsc {Sherpa}}_i\). The calibration factors from many bins of \(p_{{\text {T}}} ^{\text {avg}}\) and \(\eta _{\text {det}}\) are combined into a smooth function using a two-dimensional Gaussian kernel [6]. The kernel-width parameters of this function are chosen to capture the shape of the MC-to-data ratio across \(p_{{\text {T}}}\) and \(\eta _{\text {det}}\), and at the same time provide stability against statistical fluctuations. The resulting residual correction \(c_\eta \) is shown as a black line in the lower panels of Figs. 17 and 18. In these panels, it can also be seen that the calibration function is fixed for \(\eta _{\text {det}}\) and \(p_{{\text {T}}}\) regions that extend beyond the data measurements. The same freezing of the calibration is also done for \(|\eta _{\text {det}}|>2.7\) since the generator dependence becomes larger in this region. Measurements in these forward regions are not used to derive the intercalibration but are used when assessing the uncertainty.

6.5.4 Systematic uncertainties

All intercalibration systematic uncertainties are derived as a function of \(p_{{\text {T}}}\) and \(|\eta _{\text {det}}|\) with no uncertainty assigned in the reference region (\(|\eta _{\text {det}}|<0.8\)). No statistically significant difference is observed for positive and negative \(\eta _{\text {det}}\) for any of the uncertainties, justifying the parameterization versus \(|\eta _{\text {det}}|\), which increases the statistical power in the uncertainty evaluation.

The difference between Sherpa and Powheg + Pythia8 is used to assess the physics modelling uncertainty. Both of these generators are accurate to leading order in QCD for variables sensitive to the modelling of the third jet (such as the dijet balance). Since there is no a priori reason to trust one generator over the other, the difference between the two predictions is used to estimate the modelling uncertainty. For \(0.8\le |\eta _{\text {det}}|<2.7\), where data are corrected to the Sherpa predictions, the full difference between Powheg + Pythia8 and Sherpa is taken as the uncertainty.Footnote 4 For \(|\eta _{\text {det}}|\ge 2.7\), where the calibration is frozen, the uncertainty is taken as the maximum difference between the extrapolated calibration and the prediction from either Powheg + Pythia8 or Sherpa. The use of these event generators results in a substantial improvement in the agreement between the theoretical predictions, thus reducing the modelling-based uncertainty by a factor of approximately two relative to the previous result [6]. Despite the improvement, this modelling uncertainty remains the largest systematic uncertainty in the measurement.

The physics modelling uncertainty in the relative response is cross-checked using truth-particle jets by varying the Powheg + Pythia8 predictions. The QCD renormalization and factorization scales in the Powheg Box are each varied by factors of 0.5 and 2.0, which has a significantly smaller impact on the relative response than the difference between the Powheg + Pythia8 and Sherpa predictions. A comparison of the relative response between the Powheg + Herwig sample and the Powheg + Pythia8 sample is also performed and is similar to the truth-particle jet relative response between Powheg + Pythia8 and Sherpa. The assigned uncertainty from the difference between Sherpa and Powheg + Pythia8 is a good reflection of the underlying physics modelling uncertainty.

The event topology selection requires \(\Delta \phi _{12}>2.5\) and \(p_{\text {T}}^{\text {j3}} < 0.4\,p_{{\text {T}}} ^{\text {avg}}\). To assess the influence of these selection criteria on the MC modelling of the \(p_{{\text {T}}}\) balance, the residual calibration is rederived after shifting the \(\Delta \phi _{12}\) selection by \(\pm 0.3\) radians and the radiation criteria based on the fractional \(p_{{\text {T}}}\) of a potential third jet by \(\pm 0.1\). The maximum difference between the rederived calibration after the up and down shifts to the nominal is taken as uncertainty. To assess the impact of \({\text {pile-up}}\) , the calibration is rederived in subsets split into high and low \(\mu \) (\(\mu < 14 \) and \(\mu \ge 17 \)), and high and low \(N_{\text {PV}}\) subsets (\(N_{\text {PV}}<9\) and \( N_{\text {PV}} \ge 11\)). The uncertainty due to \({\text {pile-up}}\) effects is taken to be the maximum fractional difference between the varied and nominal calibrations. Similarly, an uncertainty due to the JVF requirement is derived by redoing the calibration after tightening and loosening the JVF criteria following the procedure defined in Ref. [51]. These variations account for the extent to which JVF is mis-modelled for jets originating from the primary interaction vertex. An uncertainty due to imperfect modelling of the jet energy resolution is also assigned by smearing the jet four-momenta in MC simulation using Gaussian random sampling with a standard deviation calculated from the JER data-to-MC difference. The difference between the calibrations obtained with nominal and smeared simulation is taken as the uncertainty due to JER effects.

The total systematic uncertainty is the sum in quadrature of the various components mentioned. Figure 19 presents a summary of the uncertainties as a function of \(|\eta _{\text {det}}|\) for two representative values of jet transverse momentum, namely \(p_{{\text {T}}} = 35~{\text {Ge}} {\text {V}}\) and \(p_{{\text {T}}} = 300~{\text {Ge}} {\text {V}}\). The uncertainties have a strong pseudorapidity dependence, increasing with \(\eta _{\text {det}}\), and have a weaker \(p_{{\text {T}}}\) -dependence, decreasing with increasing jet \(p_{{\text {T}}}\) .

Fig. 19
figure 19

Summary of uncertainties in the intercalibration as a function of the jet \(\eta _{\text {det}}\) for anti-\(k_{t}\) jets with \(R=0.4\) calibrated with the EM + JES scheme, separately for \(p_{{\text {T}}} {} = 35~{\text {Ge}} {\text {V}}\) (left) and \(p_{{\text {T}}} = 300~{\text {Ge}} {\text {V}}\) (right). The individual components are added in quadrature to obtain the total uncertainty. The MC modelling uncertainty is the dominant component for jets with \(|\eta _{\text {det}}{}|>1.5\)

6.6 Jet energy resolution determination using dijet events

Figure 20 shows the measured relative jet energy resolution as a function of \(p_{{\text {T}}} ^{\text {avg}}\) for EM + JES calibrated jets in different \(\eta _{\text {det}}\) regions of the calorimeter. The results are presented for both the dijet balance and bisector methods, and there is good agreement between the methods for all values of \(p_{{\text {T}}} ^{\text {avg}}\) and \(|\eta _{\text {det}}|\). The jet energy resolution in simulated events, determined as described in Sect. 4.2, is also shown as a dotted line and is in agreement with the measured JER in data.

6.6.1 Systematic uncertainties

The JER is determined in data by subtracting the truth-particle jet asymmetry from the measured asymmetry as discussed in Sect. 6.1. The truth-particle jet asymmetry is defined as the weighted average of the truth-particle jet asymmetries obtained for each of the Sherpa, Powheg + Pythia8, Pythia8, and Herwig++ event samples. The uncertainty in this weighted average is taken to be the RMS deviation of the truth-particle jet asymmetries obtained from the four event generators. This source of uncertainty is typically 0.02 at low \(p_{{\text {T}}} ^{\text {avg}}\) for both methods, falling to less than 0.01 at the highest \(p_{{\text {T}}} ^{\text {avg}}\).

Non-closure is defined as the difference between the jet resolution measured by the in situ method and the truth-particle jet resolution obtained by matching truth-particle and calorimeter jets (Sect. 4.2). This is treated as a systematic uncertainty in the method. The weighted average of the truth-particle jet asymmetries predicted by Sherpa, Powheg + Pythia8, Pythia8, and Herwig++ is subtracted in quadrature from the weighted average of the asymmetries evaluated for reconstructed calorimeter jets. The non-closure is typically about 10–15% for the bisector method, but it is larger for the dijet balance method, reaching \(25\%\) in some regions.

Finally, there are a number of systematic uncertainties that arise from experimental sources. The uncertainty in the JES calibration is investigated by shifting the energy of the jets by the \(\pm 1\sigma \) uncertainty, with a typical effect between 5% and 10% at low \(p_{{\text {T}}} ^{\text {avg}}\). The uncertainty due to the choice of JVF selection has a less than \(2\%\) effect for both methods. The uncertainty due to the criterion on the azimuthal angle between the jets is investigated by changing the requirement by \(\pm 0.3\), with a negligible effect at high \(p_{{\text {T}}} ^{\text {avg}}\) for both methods, a small (\(<4\%\)) effect on the dijet balance results at low \(p_{{\text {T}}} ^{\text {avg}}\) and a larger effect (5–15%) on the bisector results at low \(p_{{\text {T}}} ^{\text {avg}}\). The impact of the veto on the third jet is investigated by changing the selection criteria by \(\pm 4~{\text {Ge}} {\text {V}}\), and is found to have a 10–15% effect at low \(p_{{\text {T}}} ^{\text {avg}}\) for both methods, falling to a few percent at higher \(p_{{\text {T}}} ^{\text {avg}}\) values.

The total systematic uncertainty, which is taken as the sum in quadrature of all sources discussed above, is shown as a dashed band around the points in Fig. 20.

Fig. 20
figure 20

Relative jet energy resolution obtained for EM + JES calibrated jets using the bisector (filled circles) and dijet balance (filled squares) methods, respectively. The MC simulated resolution derived from matching truth-particle jets with calorimeter jets is presented by the open triangles connected by dashed lines. The error bars reflect the statistical uncertainty while the hashed band indicates the total systematic uncertainty. Results are shown as a function of the jet \(p_{{\text {T}}}\) in four regions of detector pseudorapidity: \(|\eta _{\text {det}}|<0.8\), \(0.8 \le |\eta _{\text {det}}|<1.2\), \(1.2 \le |\eta _{\text {det}}|< 2.1\), and \(2.1 \le |\eta _{\text {det}}|<2.8\). The lower panels show the data-to-MC ratio, and the thin dashed lines indicate \(1\pm 0.2\) and \(1\pm 0.4\)

7 Calibration and resolution measurement using \(\gamma \) + jet and \(Z \)  +  jet events

This section describes the determination of the final jet calibration that corrects the absolute energy scale of the jets to achieve a data-to-MC agreement within the associated uncertainties. The jet calibration is based on measurements conducted by in situ techniques that exploit the transverse momentum balance between a well-calibrated object and the hadronic recoil (jet). The well-calibrated object is either a photon or a \(Z\)  boson that decays leptonically, either \(Z \) \(\rightarrow \) \(ee\) or \(Z \) \(\rightarrow \) \(\mu \mu \). Three separate datasets are used: (\(Z \) \(\rightarrow \) \(ee\)) + jet, (\(Z \) \(\rightarrow \) \(\mu \mu \)) + jet and \(\gamma \) + jet, and two different in situ techniques are used for each dataset, namely the direct balance technique (DB) and the missing projection fraction method (MPF). The three independent datasets and the two analysis methods provide six separate measurements of the jet calorimeter response that can be cross-checked with each other, allowing detailed studies of systematic uncertainties. For each dataset, the method that gives the smallest overall uncertainty is chosen and is used as input to the final combination of the absolute jet calibration (Sect. 9).

Due to the steeply falling \(Z\)  boson \(p_{{\text {T}}}\) spectrum, the \(Z \) + jet data provide sufficient statistics to calibrate jets at lower \(p_{{\text {T}}}\) and are used in the range \({17}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} <{250}\,{{\text {Ge}} {\text {V}}}\). The \(Z\)  boson four-momentum is reconstructed by four-vector addition of its decay products (leptons). The \(\gamma \) + jet process has a higher cross section and covers the jet \(p_{{\text {T}}}\) range \({25}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} <{800}\,{{\text {Ge}} {\text {V}}}\). However, at low \(p_{{\text {T}}}\) the photon sample has a large contamination from events that do not contain any true prompt photon and hence a sizeable systematic uncertainty. As discussed in Sect. 9.1, a combination of both the \(Z \) + jet and \(\gamma \) + jet channels covers the full momentum range \({17}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} <{800}\,{{\text {Ge}} {\text {V}}}\).

7.1 The direct balance and missing projection fraction methods

Both the DB and MPF methods exploit the momentum balance in events with \(\gamma \) + jet or \(Z \) + jet topology to study the jet calorimeter response. Both methods benefit from accurate knowledge of the energy scale and resolution of the boson (i.e. the photon or the dilepton system). The calibration of electrons and photons is accurately known through measurements using \(Z \rightarrow ee\) data and other final states [44], while the muon reconstruction is determined to high precision through studies of \(J/\Psi \rightarrow \mu \mu \), \(Z \rightarrow \mu \mu \), and \(\Upsilon \rightarrow \mu \mu \) [59].

The DB response \(R_{{\text {DB}}}\) is

$$\begin{aligned} R_{{\text {DB}}}= \left<p_{{\text {T}}} ^{\text {j1}}\,/\,p_{{\text {T}}} ^{{\mathrm {ref}}}\right> ,&\text{ where } p_{{\text {T}}} ^{{\mathrm {ref}}}= p_{{\text {T}}} ^{Z/\gamma }\,\left| \cos {\Delta \phi }\right| , \end{aligned}$$
(17)

where \(p_{{\text {T}}} ^{\text {j1}}\) is the \(p_{{\text {T}}}\) of the leading jet being probed and \(\Delta \phi \) is the azimuthal angle between this jet and the boson (\(Z\) or \(\gamma \)). If the jet includes all the particles that recoil against the \(Z\)  boson or \(\gamma \) and all particles are perfectly measured, then \(p_{{\text {T}}} ^{\text {j1}}\,/\,p_{{\text {T}}} ^{{\mathrm {ref}}}=1\) and \(\cos {\Delta \phi }=\cos {\pi }=-1\). In reality, there is always additional QCD radiation not captured by the jet, which skews the balance. This radiation, referred to as out-of-cone radiation (OOC), tends to be in the same hemisphere as the jet and hence biases the DB to values below unity. The reference transverse momentum \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) used in the denominator of \(R_{{\text {DB}}}\) is the boson momentum projected onto the jet axis in the transverse plane in order to attempt to at least partially reduce OOC effects. The DB is also affected by uncertainties in the reconstructed photon, electron, or muon momenta, as well as contributions from \({\text {pile-up}}\) and multiple parton–parton interactions (the underlying event).

The MPF method [5, 60] is an alternative to the DB technique. Rather than balancing the jet object itself against the well-measured boson, the whole hadronic recoil is used. The MPF measures the response for the full hadronic recoil, which is significantly less sensitive to OOC radiation and effects due to \({\text {pile-up}}\) and the underlying event. The logic of the MPF method is detailed below for \(\gamma \) + jet events. The case of \(Z \) + jet is the same with the \(Z\)  boson replacing the photon.

From conservation of transverse momentum, the \(p_{{\text {T}}}\) vector of the system of all hadrons produced in a \(\gamma \) + jet event, \({\mathbf {p}}_{\text {T}}^{\,{\mathrm {recoil}}}{}\), will perfectly balance the photon \({\mathbf {p}}_{\text {T}}^{\,{\mathrm {\gamma }}}{}\) at the truth-particle level. In a perfect \(2\rightarrow 2\) process, \({\mathbf {p}}_{\text {T}}^{\,{\mathrm {recoil}}}\) would be equal to the \({\mathbf {p}}_{\mathrm {T}}\) of the parton, which in turn is that of the jet. At reconstruction level, the \(p_{{\text {T}}}\) of the photon (or \(Z\) boson) is well calibrated and hence accurately reconstructed, while the hadronic response is low prior to calibration, primarily due to the non-compensating nature of the ATLAS calorimeters.Footnote 5. There is hence a momentum imbalance, which defines the missing transverse momentum \({{\mathbf {E}}}^{\,\mathrm {miss}}_{\mathrm {T}}\):

$$\begin{aligned} {\mathbf {p}}^{\,\gamma }_{\text {T,truth}} \,+\, {\mathbf {p}}^{\,\text {recoil}}_{\text {T,truth}} = {\mathbf {0}}&({\text {truth-particle level}}) \nonumber \\ {\mathbf {p}}_{\text {T}}^{\,{\mathrm {\gamma }}}\,+\, {\mathbf {p}}_{\text {T}}^{\,{\mathrm {recoil}}}\,+\, {{\mathbf {E}}}^{\,\mathrm {miss}}_{\mathrm {T}}= {\mathbf {0}}&({\text {detector level}}). \end{aligned}$$
(18)

Projecting the vector quantities of Eq. (18) onto the direction of the photon \({\hat{n}}_{\gamma }\) and dividing the result by \(p_{\text {T}}^{\gamma }\) gives the MPF observable \(r_{\text {MPF}}\), whose mean is the MPF response \(R_{\text {MPF}}\), where

$$\begin{aligned}&r_{\text {MPF}} = -\frac{{\hat{n}}_\gamma \cdot {\mathbf {p}}_{\text {T}}^{\,\text {recoil}}}{p_{\text {T}}^\gamma } = 1 \,+\, \frac{{\hat{n}}_{\gamma } \cdot {{\mathbf {E}}}^{\,\mathrm {miss}}_{\mathrm {T}}}{p_{{\mathrm {T}}}^{{\mathrm {\gamma }}}} \qquad {\text { and }} \nonumber \\&R_{\text {MPF}} = \left\langle 1 \,+\, \frac{{\hat{n}}_{\gamma } \cdot {{\mathbf {E}}}^{\,\mathrm {miss}}_{\mathrm {T}}}{p_{{\mathrm {T}}}^{{\mathrm {\gamma }}}}\right\rangle . \end{aligned}$$
(19)

The \(E_{{\text {T}}}^{{\text {miss}}}\) definition used in Eqs. (18) and (19) is based on the calibrated momentum of the photon (or dilepton system for \(Z \) + jet data) using topo-clusters at the constituent scale, either at the EM-scale when studying the EM + JES calibration or at the LCW-scale for the LCW + JES calibration. Details of the ATLAS \(E_{{\text {T}}}^{{\text {miss}}}\) reconstruction are in Ref. [61].

The MPF response \(R_{\text {MPF}}\) provides a measure of the \(p_{{\text {T}}}\) response of the calorimeter to the hadronic recoil for a given \(p_{\text {T}}^\gamma \). A feature of this method is that it is almost independent of the jet algorithm as the jet definition enters only in the event selection criteria applied (Sect. 7.2). Except for two relatively small corrections known as the topology and showering corrections (Sect. 7.3.4.2), the \(R_{\text {MPF}}\) determined in \(\gamma \) + jet or \(Z \) + jet events can be used as an estimator of the calorimeter jet response at \({\text {pile-up}}\) -subtracted scale (Sect. 4.3.2). This is because \({\text {pile-up}}\) is independent of both the hard interaction and the azimuth \(\phi \), and so its contribution to \({\hat{n}}_\gamma \cdot E_{{\text {T}}}^{{\text {miss}}} {}\) will be zero on average, meaning that \(R_{\text {MPF}}\) already effectively subtracts the \({\text {pile-up}}\) as is done for jets using Eq. (2). Since \(R_{\text {MPF}}\) is an approximation of the \({\text {pile-up}}\) -subtracted jet response, it can be compared with the corresponding quantity of the MC-derived calibration in Fig. 4 that defines \(1/c_{\text {JES}}\).

The \(R_{\text {DB}}\) and \(R_{\text {MPF}}\) parameters are determined in bins of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) (Eq. (17)) from the mean parameter extracted from fits to the balance distributions (\(p_{{\text {T}}} ^{\text {j1}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) and \(r_{\text {MPF}}\)) using a Modified Poisson distribution, which was also used in the previous ATLAS jet calibration [6]. This distribution starts from a standard Poisson distribution \(f_{\text {P}}(n;\nu )\) and is extended to non-integer values using a Gamma function \(\Gamma (n\,+\,1)\), followed by the introduction of a new parameter s used to redefine the argument using \(x=s^2\,n\) and defining \(\mu \equiv E[x] = s^2\,\nu \), giving

$$\begin{aligned} f_{\text {MP}}(x;\mu ,s) = \frac{(\mu /s^2)^{x/s^2}}{\Gamma (x/s^2\,+\,1)} \frac{{\text {e}}^{-\mu /s^2}}{s^2}. \end{aligned}$$
(20)

This distribution has the same shape as a “smoothed” Poisson distribution with \(\nu = \mu /s^2\) and has mean \(\mu \) and standard deviation \(\sqrt{\mu }\,s\). For larger values of \(\mu /s^2\) (\(\gtrapprox 15\)), it is very similar to a Gaussian distribution, while for lower values (\(\mu /s^2\lessapprox 5\)) the longer upper tail of a Poisson distribution is prominent. The Modified Poisson function better describes the balance distributions and is motivated by the Poisson nature of sampling calorimeters.

The MPF and DB methods probe the calorimeter response to jets in a different way and are sensitive to different systematic effects. They therefore provide complementary measurements of the jet-energy scale. The explicit use of jets in the measurement of the jet response from DB makes this technique dependent on the jet reconstruction algorithm while the MPF technique is mostly independent of the jet algorithm, as explained above. Thus, in the following, when presenting MPF results, no jet algorithm is explicitly mentioned.

7.2 Event and object selection

This section outlines the event selection used for the DB and MPF analyses separately for the \(\gamma \) + jet and \(Z \) + jet datasets. The two methods have similar selections, but the restriction on the subleading jet \(p_{{\text {T}}}\) (Sect. 7.2.3) is less stringent for the MPF method because it is less sensitive to QCD radiation.

7.2.1 Photon selection

The \(\gamma \) + jet data was collected using six different single-photon triggers, each with a different associated photon \(p_{{\text {T}}}\) threshold. The five lower-threshold triggers were prescaled, while only the highest-threshold trigger was not prescaled. A given \(\gamma \) + jet event was assigned to one of these triggers, based on the \(p_{{\text {T}}}\) of the leading photon reconstructed by the algorithm used in the high-level trigger. This mapping was created such that the trigger efficiency for each \(p_{{\text {T}}}\) range was at least 99%. The lowest-threshold trigger data has the largest associated prescale factor and is used for photons between 20 and \(40~{\text {Ge}} {\text {V}}\), while the highest-threshold trigger, which was not prescaled, is used for \(p_{{\text {T}}} >120~{\text {Ge}} {\text {V}}\).

Reconstructed photons are required to satisfy strict identification criteria ensuring that the pattern of energy deposition in the calorimeter is consistent with that expected for a photon [62]. Photons are calibrated following the procedure in Ref. [44] and are required to have \(p_{{\text {T}}} > 25~{\text {Ge}} {\text {V}}\) and \(|\eta |<1.37\) such that they are fully contained within the electromagnetic barrel calorimeter. In order to suppress backgrounds from calorimeter signatures of hadrons misidentified as photons, an isolation criterion is further applied to all photons. The isolation transverse energy of a photon \(E_{\text {T}}^{\text {iso}}\) is calculated from calorimeter energy deposits in a cone of size \(\Delta R=0.4\) around the photon, excluding the photon itself and the expected contribution from \({\text {pile-up}}\) [62]. Photons are initially required to fulfil \(E_{\text {T}}^{\text {iso}}<3.0~{\text {Ge}} {\text {V}}\). However, for events where the leading photon has \(p_{{\text {T}}}\) below \(85~{\text {Ge}} {\text {V}}\), the contamination from misidentified photons is still large. Hence, more stringent criteria are applied as follows: \(E_{\text {T}}^{\text {iso}}<0.5~{\text {Ge}} {\text {V}}\) if \(p_{{\text {T}}} ^\gamma <45~{\text {Ge}} {\text {V}}\), or else \(E_{\text {T}}^{\text {iso}}<1.0~{\text {Ge}} {\text {V}}\) if \(p_{{\text {T}}} ^\gamma <65~{\text {Ge}} {\text {V}}\), and otherwise \(E_{\text {T}}^{\text {iso}}<2.0~{\text {Ge}} {\text {V}}\) if \(p_{{\text {T}}} ^\gamma <85~{\text {Ge}} {\text {V}}\). Each event is required to have at least one photon fulfilling these criteria. In the very rare case of two such photons, the leading one is used.

7.2.2 \(Z\)  boson selection

A typical \(Z\)  boson selection is applied, starting by requiring dilepton triggers that were not prescaled during the data-taking period. For the \(Z \) \(\rightarrow \) \(ee\) channel, the dielectron triggers requires two “loose” electrons, defined in Ref. [63], with \(E_{{\text {T}}} > 12~{\text {Ge}} {\text {V}}\) and \(|\eta |<2.5\). For the \(Z \) \(\rightarrow \) \(\mu \mu \) channel, the trigger requires one “tight” and one “loose” muon, defined in Ref. [64], with \(p_{{\text {T}}} >18~{\text {Ge}} {\text {V}}\) and \(p_{{\text {T}}} >8~{\text {Ge}} {\text {V}}\), respectively. The reconstructed electrons are required to fulfil “medium” quality requirements [65] and are calibrated as detailed in Ref. [44]. Electrons are required to have \(p_{{\text {T}}} {}>20~{\text {Ge}} {\text {V}}\) and \(|\eta |<2.37\), excluding the barrel–endcap transition region \(1.37<|\eta |<1.52\). Muons are reconstructed through the combination of trajectories and energy-loss information in several detector systems [59] and are required to have \(|\eta |<2.5\) and \(p_{{\text {T}}} >20~{\text {Ge}} {\text {V}}\). Each event is required to have exactly two reconstructed electrons or two muons with opposite charge. The invariant mass of the dilepton system \(m_{\ell \ell }\) must then fulfil \(80~{\text {Ge}} {\text {V}}<m_{\ell \ell }<116~{\text {Ge}} {\text {V}}\), ensuring high \(Z\rightarrow \ell \ell \) purity.

7.2.3 Jet and boson + jet topology selection

Jets are reconstructed and a preselection is applied, including standard \({\mathrm {JVF}}\) requirements, as described in Sect. 4.1. They are calibrated with all steps prior to the absolute in situ correction (Sect. 4.3). To avoid double counting of energy depositions, jets are required to be \(\Delta R>0.35\) from a photon for the \(\gamma \) + jet analysis or from any of the leptons in the \(Z \) + jet analysis for jets reconstructed with \(R=0.4\). The corresponding criterion is \(\Delta R>0.5\) for \(R=0.6\) jets.

Fig. 21
figure 21

Distributions of \(p_{{\text {T}}} ^{\text {j1}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) for a \({17}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{20}\,{{\text {Ge}} {\text {V}}}\), b \({60}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{80}\,{{\text {Ge}} {\text {V}}}\), c \({160}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{210}\,{{\text {Ge}} {\text {V}}}\), and d \({500}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{600}\,{{\text {Ge}} {\text {V}}}\) using anti-\(k_{t}\) jets with \(R=0.4\) calibrated with the EM + JES scheme in data from the a, b \(Z \) + jet and c, d \(\gamma \) + jet analyses. The dashed lines in a, b show the fitted Modified Poisson distributions of Eq. (20), from which the means are taken as the DB response measurements \(R_{{\text {DB}}}\). The solid lines indicate the fitting ranges. The lack of data at low \(p_{{\text {T}}} ^{\text {j1}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) visible in a is due to the \(p_{{\text {T}}} ^{\text {j1}}> 10~\hbox {GeV}\) criterion. The markers are the data counts with error bars corresponding to the statistical uncertainties

The leading jet is required to have \(|\eta _{\text {det}}|<0.8\) and \(p_{{\text {T}}}\) greater than \(10~{\text {Ge}} {\text {V}}\) for the \(Z \) + jet analysis and \(12~{\text {Ge}} {\text {V}}\) for the \(\gamma \) + jet analysis. To enforce a “boson + jet” topology and hence suppress additional QCD radiation, criteria are imposed on the azimuthal angle \(\Delta \phi (Z/\gamma ,{\text {j1}})\) between the \(Z\)  boson or photon and the leading jet and on the subleading jet transverse momentum \(p_{\text {T}}^{\text {j2}}\), if such a jet is present. The DB analysis requires \(\Delta \phi (Z/\gamma ,{\text {j1}})>2.8\) and \(p_{{\text {T}}} ^{\text {j2}}<{\text {max}}(8~{\text {Ge}} {\text {V}},0.1\,p_{{\text {T}}} ^{{\mathrm {ref}}})\) while the MPF analysis uses the criteria \(\Delta \phi (Z/\gamma ,{\text {j1}})>2.9\) and \(p_{{\text {T}}} ^{\text {j2}}<{\text {max}}(8~{\text {Ge}} {\text {V}},0.3\,p_{{\text {T}}} ^{{\mathrm {ref}}})\). The subleading jet \(p_{{\text {T}}}\) is always defined using the jet collection reconstructed using \(R=0.4\), even when studying jets built using \(R=0.6\) or \(R=1.0\). For jets built using \(R=1.0\), \(p_{{\text {T}}} ^{\text {j2}}\) is defined as the \(p_{{\text {T}}}\) of the leading \(R=0.4\) jet that fulfils \(\Delta R({\text {j1}},{\text {j2}})>0.8\), where “j1” refers to the leading \(R=1.0\) jet, i.e. the jet that is being probed. This ensures that the “j2” jet will have a significant proportion of its energy depositions outside of the large-\(R\) jet.

7.3 Jet response measurements using \(Z \) + jet and \(\gamma \) + jet data

Measurements of \(R_{{\text {DB}}}\) and \(R_{\text {MPF}}\) using both of the individual \(Z \rightarrow \ell \ell \) datasets and the \(\gamma \) + jet dataset are discussed below. The subsequent combination of the \(Z \rightarrow \ell \ell \) and \(\gamma \) + jet results into the final in situ calibration is detailed in Sect. 9.1.

The \(Z \) \(\rightarrow \) \(ee\) and \(Z \) \(\rightarrow \) \(\mu \mu \) analyses probe jets over the same kinematic space and use exactly the same \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) binning. Within each bin, the balance distributions \(p_{{\text {T}}} ^{\text {j1}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) and \(r_{\text {MPF}}\) agree between the channels for both the cores of the distributions (including their means) and their tails (including their standard deviations). The two datasets are combined into a \(Z \rightarrow \ell \ell \) channel, which increases the statistical power of the measurement. This combination is done consistently for data and MC simulation, and also for systematic variations.

7.3.1 Direct balance results

Figure 21 presents four DB \(p_{{\text {T}}} ^{\text {j1}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) distributions in representative \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) bins from the \(Z \) + jet and \(\gamma \) + jet analyses using anti-\(k_{t}\) jets with \(R=0.4\) calibrated with the EM + JES scheme. Good fit quality using the Modified Poisson parameterization of Eq. (20) is observed. This is true for the other \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) bins considered, both for data and for all MC samples considered.

The value of \(R_{{\text {DB}}}\) is extracted for each \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) bin for both data and simulation. Figure 22 shows the measurements of \(R_{{\text {DB}}}\) in data compared with predictions from the different MC generators as a function of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) for anti-\(k_{t}\)\(R=0.4\) jets calibrated with the EM + JES scheme. The different MC generators agree with data within 1% for \(p_{{\text {T}}} >40~{\text {Ge}} {\text {V}}\) with slightly worse agreement at lower \(p_{{\text {T}}}\) . The worst data-to-MC agreement is for EM + JES calibrated \(R=0.4\) jets in the \({17}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{20}\,{{\text {Ge}} {\text {V}}}\) bin (Fig. 22a), for which the Powheg MC sample predicts \(\sim \)5% higher \(R_{{\text {DB}}}\) than what is observed in data. For LCW + JES calibrated \(R=0.4\) jets and \(R=0.6\) jets using both calibration schemes, the data-to-MC agreement is within 3% across the full \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) range probed.

Fig. 22
figure 22

\(R_{{\text {DB}}}\) for anti-\(k_{t}\) jets with \(R=0.4\) and calibrated with the EM + JES scheme from the a \(Z \) + jet and b \(\gamma \) + jet analyses in the data and for two MC simulations. Only statistical uncertainties are shown

For the \(\gamma \) + jet analysis, the measured responses agree within 1% with the MC predictions for \(p_{{\text {T}}} ^{{\mathrm {ref}}}{}<100~{\text {Ge}} {\text {V}}\) for both \(R=0.4\) and \(R=0.6\) jets using both calibration schemes. For jets with \(p_{{\text {T}}} ^{{\mathrm {ref}}}>100~{\text {Ge}} {\text {V}}\), the MC simulation systematically tends to overestimate the measured response by approximately \(1\%\).

7.3.2 Validation of intercalibration of forward jets using \(Z \) + jet data

The derived intercalibration in Sect. 6 corrects jets with forward rapidities \(|\eta _{\text {det}}|>0.8\) by about \(1{-}3\%\) (Fig. 17). The total uncertainty in this calibration is presented in Fig. 19 and is typically below \(1\%\) for jets with \(|\eta _{\text {det}}|<3\), increasing to about \(3.5\%\) for low-\(p_{{\text {T}}}\) jets with \(|\eta _{\text {det}}|>4\).

To validate this calibration, the DB analysis is repeated for the jets with \({0.8}\le |\eta _{\text {det}}{}|<{4.5}\) using \(Z \) + jet events. As in the standard analysis (Sect. 7.2.3), the intercalibration is applied to the jets, and hence the data-to-MC ratio of \(R_{{\text {DB}}}\) is expected to be uniform versus \(\eta _{\text {det}}\) within the uncertainty assigned to the intercalibration. Results of this analysis for EM + JES calibrated anti-\(k_{t}\)\(R=0.4\) jets are presented in Fig. 23. The prediction of both MC generators agree with the data within the assigned uncertainties for jets with \({0.8}\le |\eta _{\text {det}}{}|<{2.8}\). For the region \(|\eta _{\text {det}}|>2.8\), differences can be up to 7% as shown in Fig. 23; however, the statistical uncertainties of the \(Z \rightarrow \ell \ell \) measurements are of similar magnitude. Hence, the results validate the derived dijet intercalibration.

Fig. 23
figure 23

\(R_{{\text {DB}}}\) as a function of \(\eta _{\text {det}}\) for anti-\(k_{t}\) jets with \(R=0.4\) calibrated with the EM + JES scheme in data (black), as well as in Sherpa (red) and Powheg + Pythia8 (blue) simulation, for a \({45}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{65}\,{{\text {Ge}} {\text {V}}}\) and b \({110}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{160}\,{{\text {Ge}} {\text {V}}}\). Only statistical uncertainties are shown

7.3.3 MPF results

Figure 24 presents \(R_{\text {MPF}}\) calculated at both the EM and LCW scales as a function of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) extracted using both the \(Z \) + jet and \(\gamma \) + jet events. As mentioned in Sect. 7.1, \(R_{\text {MPF}}\) is a measurement of the hadronic response of the calorimeter and does not include the MC-derived calibration \(c_{\text {JES}}\) nor the GS calibration \(c_{\text {GS}}\) (see Sects. 7.3.4.2 and 7.3.4.6 for further discussion on this). The “upturn” of \(R_{\text {MPF}}\) at low values of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) visible in Fig. 24 is an expected artefact of the jet reconstruction threshold. Since a jet is required to be present in the event (Sect. 7.2.3), when this jet’s \(p_{{\text {T}}}\) fluctuates low the event might fail the selection. For such an event, \(r_{\text {MPF}}\) will also tend to be low. And similarly, events with jets that fluctuate high in \(p_{{\text {T}}}\) will have high \(r_{\text {MPF}}\) and will pass the selection.

For the \(Z \) + jet analysis, the \(R_{\text {MPF}}\) measured in data is systematically about 1% below the prediction of Powheg + Pythia8, considered the reference MC sample. For the \(\gamma \) + jet analysis, the predictions of \(R_{\text {MPF}}\) from both MC simulations agree across the full \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) range within the statistical precision. Both simulations systematically overestimate the \(R_{\text {MPF}}\) value measured in data by about 1% for \(p_{{\text {T}}} ^{{\mathrm {ref}}}{}>85~{\text {Ge}} {\text {V}}\) at the EM scale and by about 1% for \(p_{{\text {T}}} ^{{\mathrm {ref}}}>50~{\text {Ge}} {\text {V}}\) at the LCW scale.

Fig. 24
figure 24

The MPF response \(R_{\text {MPF}}\) in a, b \(Z \) + jet and c, d \(\gamma \) + jet events for jets calibrated a, c at the EM scale and b, d using the LCW scheme, for both data and MC simulation, as a function of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\). Only statistical uncertainties are shown

7.3.4 Systematic uncertainties

The final JES calibration that is described in Sect. 9 is based on the data-to-MC ratio of the response measurements. As explained in detail in Sect. 6.4, systematic uncertainties in this quantity are evaluated by introducing variations to the analysis. The following seven sections present the evaluation of in total 17 uncertainty sources that affect the data-to-MC ratio of \(R_{{\text {DB}}}\) and \(R_{\text {MPF}}\). These uncertainties assess various effects that can affect the response measurement, such as impact of additional QCD radiation, choice of MC generator, effects from out-of-cone radiation and \({\text {pile-up}}\) , and the precision of the \(p_{{\text {T}}}\) scale of the reference objects (photons, electrons, or muons).

7.3.4.1 Suppression of additional radiation

As explained in Sect. 7.2.3, a “boson + jet” topology is selected by imposing constraints on the azimuthal angle \(\Delta \phi \) between the boson and the jet and by restricting the \(p_{{\text {T}}}\) of any subleading jet. These criteria reduce the impact from additional QCD radiation on the momentum balance between the jet and the boson. Systematic uncertainties from two sources are evaluated, one through varying the \(\Delta \phi \) requirement and one through variations of the \(p_{{\text {T}}} ^{\text {j2}}\) selection. Constructing a single uncertainty component from variation in simulations of the two criteria is also considered (as was done previously [5]); however, the two-component approach is sufficient.

The \(\Delta \phi \) selection is varied by \(\pm 0.1\) around the nominal values of 2.9 for MPF and 2.8 for DB (Sect. 7.2.3). The \(p_{{\text {T}}} ^{\text {j2}}\) requirement for the DB analysis is similarly tightened to \({\text {max}}(7~{\text {Ge}} {\text {V}},0.05\,p_{{\text {T}}} ^{{\mathrm {ref}}})\) and loosened to \({\text {max}}(9~{\text {Ge}} {\text {V}},0.15\,p_{{\text {T}}} ^{{\mathrm {ref}}})\). The MPF selection is varied by similar amounts around the nominal selection. The resulting uncertainty from the \(\Delta \phi \) variation is generally negligible. The uncertainty due to the \(p_{{\text {T}}} ^{\text {j2}}\) requirement is 0.4% or smaller for \(p_{{\text {T}}} ^{{\mathrm {ref}}}{} < 50~{\text {Ge}} {\text {V}}\) and is negligible above this value.

7.3.4.2 Systematic uncertainties due to out-of-cone radiation

For the DB method, the \(p_{{\text {T}}}\) of a jet, even if perfectly calibrated, will always tend to be smaller than that of the photon or \(Z\)  boson due to the out-of-cone radiation (Fig. 21). The impact of the out-of-cone radiation on \(R_{{\text {DB}}}\) is studied in both data and simulation by measuring the average \(p_{{\text {T}}}\) density of tracks as a function of the angular distance (\(\Delta R\)) between the track direction and the leading jet axis. Based on this \(p_{{\text {T}}}\) profile, the fraction of the radiation outside the jet cone is estimated (see Section 9.4 of Ref. [6] for details), and an out-of-cone systematic uncertainty is evaluated on the basis of the simulation’s ability to model the measured value of this quantity. The resulting uncertainty is as large as 2% at \(p_{{\text {T}}} ^{{\mathrm {ref}}}=40~{\text {Ge}} {\text {V}}\) and is smaller at higher \(p_{{\text {T}}} ^{{\mathrm {ref}}}\).

In principle, the MPF technique does not depend on the OOC correction because the calorimeter response is integrated over the whole detector. However, two effects related to the OOC contribution must be considered. The “showering correction” quantifies the migration of energy across the jet boundary of the calorimeter jet relative to the truth-particle jet and is difficult to measure with data. This effect is included in the DB analysis by design since it is based on reconstructed jets but is not included for the MPF method since it measures the entire hadronic recoil. In addition, the hadronic response in the periphery of the jet is different than in the core because of the different energy densities and particle compositions. This “topology correction” is also difficult to extract from data but is expected to be small since the average \(p_{{\text {T}}}\) density around the jet axis decreases fairly rapidly, and only a small fraction of the \(p_{{\text {T}}}\) is outside the jet radius. MC studies have shown that the uncertainty in each of these corrections is significantly smaller than the DB OOC uncertainty. As a conservative approach, the OOC uncertainty measured in data for the DB case is used to estimate the contributions from showering and jet topology to the uncertainty in the JES determined using the MPF technique. The use of this larger uncertainty does not significantly affect the total systematic uncertainty in the JES from this analysis over most of the \(p_{{\text {T}}}\) range.

7.3.4.3 Impact of the Monte Carlo generator

For each final state, predictions of the response observables (\(R_{{\text {DB}}}\) and \(R_{\text {MPF}}\)) are produced with two different MC generators: Powheg and Sherpa for \(Z \) + jet and Pythia8 and Herwig++ for \(\gamma \) + jet. As detailed in Sect. 3, these generators use different modelling of the parton shower, jet fragmentation, and multiple parton interactions. The difference in the data-to-MC ratio of the response between the generators is taken as a “generator” systematic uncertainty source. This is a reasonable estimate of the dependence of the pp collision modelling on \(R_{{\text {DB}}}\) and \(R_{\text {MPF}}\), but a possible compensation by competing modelling effects cannot be excluded. This generator modelling constitutes the largest systematic uncertainty source for \(Z \) + jet for \(p_{{\text {T}}} ^{{\mathrm {ref}}}{} \lesssim 50~{\text {Ge}} {\text {V}}\), where it can be as large as 2.5%.

7.3.4.4 Uncertainties associated with the reference objects

The definitions of \(R_{{\text {DB}}}\) and \(R_{\text {MPF}}\) both have the \(p_{{\text {T}}}\) of the reference object in the denominator and are hence sensitive to knowledge of its energy scale and resolution. For the \(Z \) + jet analyses, uncertainties in \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) arise from the precision of the electron energy scale (EES) and energy resolution (EER) and from the muon momentum scale (MMS) and resolution (MMR), while for \(\gamma \) + jet, uncertainties arise from the precision of the photon energy scale (PES) and energy resolution (PER).

The EES is measured in data [44] and has three uncertainty components: MC modelling of the \(Z\rightarrow ee\) decay; the material description in simulation; and the response of the calorimeter’s presampler. These are treated statistically independent of each other. The EER uncertainty is parameterized by a single component. The MMS and MMR are determined in data [59] and have one and two associated uncertainty components, respectively. Finally, the PES and PER are evaluated using extrapolation of EES and EER, and are hence affected by the same uncertainty sources. Hence, they have the same four uncertainty sources, but these affect photons and electrons quite differently.

Each of the 11 uncertainty sources are propagated to the simulated samples by adjusting the four-momenta of the reconstructed electron, muon, or photon. The uncertainties in \(R_{{\text {DB}}}\) and \(R_{\text {MPF}}{}\) are then evaluated following the procedure described in Sect. 6.4. For all objects, the resolution uncertainties are found to be negligible (0.1% or less). For \(\gamma \) + jet, the PES uncertainties are reasonably independent of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) and their sum in quadrature amounts to 0.5–0.6%. The magnitudes of the EES and MMS uncertainties are less than 0.3%.

7.3.4.5 Impact of additional \({\text {pile-up}}\)interactions

Jets produced in additional \({\text {pile-up}}\) interactions are present in both data and simulation and might impact the response measurements. To study this effect, the \({\mathrm {JVF}}\) criterion is varied around its nominal value of 0.25 (Sect. 4.1). The \({\mathrm {JVF}}\) criteria used for this variation are based on studies presented in Ref. [51] and are 0.24 and 0.27 for EM + JES calibrated jets and are 0.21 and 0.28 for jets calibrated using LCW + JES.

Studies of the dependence of \(R_{{\text {DB}}}\) and \(R_{\text {MPF}}\) on the number of primary vertices \(N_{{\text {PV}}}\) in the event and on the average number of interactions per beam bunch crossing \(\mu \) were also performed. Figure 25 presents results from these studies for the MPF method for a representative selection of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) bins. The data-to-MC ratio of \(R_{\text {MPF}}\) is found to be independent of both \(N_{{\text {PV}}}\) and \(\mu \) for all \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) bins. The same conclusion is reached for the DB analysis. Hence, only one \({\text {pile-up}}\) uncertainty component is assigned, due to the \({\text {pile-up}}\) mitigation using the \({\mathrm {JVF}}\) criterion.

Fig. 25
figure 25

MPF response \(R_{\text {MPF}}\) in \(\gamma \) + jet events for jets calibrated at the EM scale for both data (black filled circles) and MC simulation (red empty circles), as a function of a, b \(N_{{\text {PV}}}\) and c, d \(\mu \) for a reference \(p_{{\text {T}}}\) range of a, c \({45}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{65}\,{{\text {Ge}} {\text {V}}}\) and b, d \({110}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{200}\,{{\text {Ge}} {\text {V}}}\). Only statistical uncertainties are shown

7.3.4.6 Impact of lack of GS correction for the MPF method

The GS correction (Sect. 5) is based on the properties of jets. Since the MPF does not use jets directly, applying the GS correction in the standard way will have no impact on \(R_{\text {MPF}}\). Instead, the GS correction factor \(c_{\text {GSC}}\) is extracted from the leading jet in each event and is used to adjust \(R_{\text {MPF}}\). Two methods were tested: simply scaling \(R_{\text {MPF}}\) with \(c_{\text {GSC}}\) and recalculating \(R_{\text {MPF}}\) after adjusting \({\mathbf {E}}_{{\mathrm {T}}}^{{\mathrm {miss}}}\) by adding the change of the jet momentum vector due to the GS correction \((c_{\text {GSC}}-1){\mathbf {p}}_{\text {T}}^{\,j1}\). Both methods result in a negligible change to the data-to-MC ratio of \(R_{\text {MPF}}\), and no uncertainty is assigned for this effect.

7.3.4.7 Impact of background in the \(\gamma \) + jet sample

The \(\gamma \) + jet dataset suffers from non-negligible contamination from dijet events where one of the jets is misreconstructed as a photon. The purity, i.e. the fraction of actual \(\gamma \) + jet events, after the nominal selection is estimated using a “sideband” technique based on the event yields in different control regions defined by alternative photon isolation and identification criteria. This technique is described in detail in Refs. [5, 62]. The purity increases with \(p_{{\mathrm {T}}}^{{\mathrm {\gamma }}}\), being about \(80\%\) at \(p_{{\mathrm {T}}}^{{\mathrm {\gamma }}}= 85~{\text {Ge}} {\text {V}}\) and rising above \(90\%\) for \(p_{{\mathrm {T}}}^{{\mathrm {\gamma }}}\gtrsim 200~{\text {Ge}} {\text {V}}\). The misreconstructed events tend to have higher \(p_{{\text {T}}} ^{\text {j1}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) and \(r_{\text {MPF}}\). The difference in DB and MPF response between true \(\gamma \) + jet events and misreconstructed events is estimated by varying the photon identification criteria. The \(\gamma \) + jet MC samples used have perfect purity by definition. The uncertainty due to the contamination from dijet events in the \(\gamma \) + jet analyses was estimated by multiplying the fraction of misreconstructed events by the relative difference in response between \(\gamma \) + jet and misreconstructed events. The resulting uncertainty decreases with \(p_{{\text {T}}} ^{{\mathrm {ref}}}\). For the DB analysis, it is \(\sim \)3.5% at \(p_{{\text {T}}} ^{{\mathrm {ref}}}{}=35~{\text {Ge}} {\text {V}}\), decreasing to \(~1\%\) at \(p_{{\text {T}}} ^{{\mathrm {ref}}}{}=100~{\text {Ge}} {\text {V}}\) and to \(<0.4\%\) for \(p_{{\text {T}}} ^{{\mathrm {ref}}}>250~{\text {Ge}} {\text {V}}\). For MPF, this uncertainty is smaller by approximately a factor of 2. This reduction is due to the definition of the observable, where \(R_{\text {MPF}}\) is inherently more stable against stochastically-oriented effects (pileup, fake photons, etc) compared to \(R_{{\text {DB}}}\) due to such contributions cancelling in \(R_{\text {MPF}}\) when averaged over many events.

7.3.4.8 Summary of the systematic uncertainties

Figure 26 presents the JES uncertainties from various sources, evaluated for both the DB and MPF methods with anti-\(k_{t}\)\(R=0.4\) jets calibrated with the EM + JES using the \(Z \) + jet dataset. The total uncertainty is obtained by addition in quadrature of the uncertainties from different sources. Overall, the DB and MPF methods achieve similar levels of precision. The MC generator uncertainties dominate for \(p_{{\text {T}}} ^{{\mathrm {ref}}}{} \lesssim 50~{\text {Ge}} {\text {V}}\) and the out-of-cone uncertainty is also significant for \(p_{{\text {T}}} ^{{\mathrm {ref}}}{} \lesssim 80~{\text {Ge}} {\text {V}}\). The statistical uncertainty is the major uncertainty for the \(Z \) + jet analyses at \(p_{{\text {T}}} ^{{\mathrm {ref}}}{} > 200~{\text {Ge}} {\text {V}}\).

Fig. 26
figure 26

Summary of the JES statistical and systematic uncertainties evaluated for the \(Z \) + jet a DB and b MPF analyses for anti-\(k_{t}\) jets with \(R=0.4\) and calibrated with the EM + JES scheme. The total uncertainty, shown as a shaded region, is obtained from the addition in quadrature of all uncertainty sources. EES/EER denotes the electron energy scale/resolution, while MMS/MMR denotes the muon momentum scale/resolution

Figure 27 shows the uncertainties for the corresponding \(\gamma \) + jet analyses. Here, the photon purity systematic uncertainty is the dominant uncertainty for the DB method for \(p_{{\text {T}}} ^{{\mathrm {ref}}}< 100~{\text {Ge}} {\text {V}}\), while it is significantly smaller and subdominant for MPF. The other systematic uncertainties are of similar magnitude for the two methods. For the range \({100}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{400}\,{{\text {Ge}} {\text {V}}}\), the photon energy scale contributes the dominant uncertainty.

Fig. 27
figure 27

Summary of the JES statistical and systematic uncertainties evaluated for the \(\gamma \) + jet a DB and b MPF analyses for anti-\(k_{t}\) jets with \(R=0.4\) and calibrated with the  EM + JES scheme. The total uncertainty, shown as a shaded region, is obtained from the addition in quadrature of all uncertainty sources. PES denotes the photon energy scale, while PER denotes the photon energy resolution

7.4 Calibration of large-\(R\) jets

For analyses based on pre-2012 data, the JES uncertainty of large-\(R\) jets has been evaluated in situ using track jets (Sect. 4.1) [45]. This method, discussed further in Sect. 9.5, is limited to 2–7% precision due to tracking uncertainties and the uncertainty in the charged-particle component of the jet. Furthermore, this method is restricted to the central calorimeter region \(|\eta _{\text {det}}{}|<{1.2}\), since at more forward \(\eta _{\text {det}}\), the large-\(R\) jet will not be fully contained in the acceptance of tracking detectors. This section presents an improved large-\(R\) jet JES uncertainty evaluation using \(\gamma \) + jet events.

7.4.1 \(R_{{\text {DB}}}\) measurements using \(\gamma \) + large-\(R\) jet events

The DB analysis is performed for large-\(R\) jets using the same approach as for small-\(R\) jets. The binning in \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) and \(\eta _{\text {det}}\) is different, chosen to account for the available data statistics, and \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) is defined simply as \(p_{{\text {T}}} ^{{\mathrm {ref}}}\equiv p_{{\mathrm {T}}}^{{\mathrm {\gamma }}}\) instead of projecting onto the jet axis (Eq. (17)). Examples of \(p_{{\text {T}}} ^{\text {j1}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) distributions fitted with the Modified Poisson function are shown in Fig. 28.

Fig. 28
figure 28

\(p_{{\text {T}}} ^{\text {j1}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) distributions for events with a \({85}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{110}\,{{\text {Ge}} {\text {V}}}\) and b \({260}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} ^{{\mathrm {ref}}}<{310}\,{{\text {Ge}} {\text {V}}}\) for trimmed anti-\(k_{t}\) jets with \(R=1.0\) calibrated with the LCW + JES scheme in data. The lines present fits to the data of a Modified Poisson function (Eq. (20)). The markers show data with error bars corresponding to the statistical uncertainties

Figure 29 presents \(R_{{\text {DB}}}\) as a function of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) for large-\(R\) jets in two \(\eta _{\text {det}}\) ranges, both for data and MC simulations. The response in the central calorimeter region, \(|\eta _{\text {det}}{}|<0.8\), is modelled within 1% by the simulation, with simulations tending to overestimate the response by \(\sim \)0.5%. For large-\(R\) jets with \(0.8<|\eta _{\text {det}}{}|<1.2\), this deviation grows to \(\sim \)2%. Rather than using this deviation as a calibration to correct for the difference between data and MC simulation, this difference is taken as an additional uncertainty. As detailed in Sect. 4.4, large-\(R\) jets do not receive any intercalibration \(c_\eta \).

Fig. 29
figure 29

\(R_{{\text {DB}}}\) for trimmed anti-\(k_{t}\) jets with \(R=1.0\) calibrated with the LCW + JES scheme for a \(|\eta _{\text {det}}{}|<{0.8}\) and b \({0.8}\le |\eta _{\text {det}}{}|<{1.2}\) for both data (filled circles) and MC simulation (empty circles), as a function of the \(p_{{\text {T}}} ^{{\mathrm {ref}}}\). Only statistical uncertainties are shown

7.4.2 Systematic uncertainties

Most of the systematic uncertainties are evaluated in the same way as for small-\(R\) jets as detailed in Sect. 7.3.4. Additional uncertainties specific to large-\(R\) jets and changes to the evaluation of some of the uncertainty sources are detailed below.

  • Rather than using the difference of the data-to-MC ratio of \(R_{{\text {DB}}}\) from unity as a calibration, it is instead taken as an uncertainty. This allows a straightforward combination with the procedure used to derive uncertainties outside of the kinematic range for which the \(\gamma \) + large-\(R\) jet \(R_{{\text {DB}}}\) can be derived. This is a significant uncertainty source, especially for jets with \({0.8}\le |\eta _{\text {det}}{}|<{1.2}\).

  • The OOC uncertainty is evaluated only for large-\(R\) jets with \(|\eta _{\text {det}}{}|<{0.8}\), since for other \(\eta _{\text {det}}\) bins the large-\(R\) jets are not always fully contained within the tracking acceptance. The uncertainty derived in this central \(\eta _{\text {det}}\) range is also applied to the more forward \(|\eta _{\text {det}}{}|\) bins. Due to the large radius (\(R=1.0\)), out-of-cone effects are very small, and the uncertainty is negligible for \(p_{{\text {T}}} {}>100~{\text {Ge}} {\text {V}}\).

  • As mentioned in Sect. 7.2.3, the subleading jet (labelled “j2”) that is used to suppress additional QCD radiation is required to have an angular separation of \(\Delta R({\text {j1}},{\text {j2}})>0.8\) from the large-\(R\) jet (“j1”). Since the leading jet has \(R=1.0\) while the subleading jet has \(R=0.4\), this means that there is a significant overlap in terms of solid angle, but since the \(p_{{\text {T}}}\) profile of jets tend to be narrow (see the \({\mathcal {W}}_{\mathrm {trk}}\) distribution of Figs. 6 and 7), the amount of energy sharing is still expected to be small. The assigned uncertainty component is evaluated by changing the \(\Delta R\) requirement from 0.8 to 1.4 to ensure that there is strictly no overlap between the two jets.

  • Since the small-\(R\) jets are reconstructed independently of the large-\(R\) jets using the same topo-clusters as input, a large-\(R\) jet will sometimes contain two small-\(R\) jets close to the large-\(R\) jet axis. It is possible that events with such topologies have additional uncertainties due to the QCD modelling. To assess this effect, an alternative subleading jet selection was applied in which “j2” is defined simply as the subleading \(R=0.4\) jet, without any restriction based on the angle to the large-\(R\) jet. This means that “j2” will sometimes be within the large-\(R\) jet and sometimes not (the leading \(R=0.4\) jet tends to be aligned very close to the large-\(R\) jet axis). With this definition, the event selection \(p_{{\text {T}}} ^{\text {j2}}{}<0.1\,p_{{\text {T}}} ^{{\mathrm {ref}}}\) was applied in place of the standard \(p_{{\text {T}}} ^{\text {j2}}\) selection, and an uncertainty component was derived from the impact of this variation.

  • An additional dependence of the jet response for large-\(R\) jets on the ratio of the jet mass to the \(p_{{\text {T}}}\) , \(m/p_{{\text {T}}} \), is observed, particularly for large \(|\eta _{\text {det}}|\). A systematic uncertainty is assigned to account for this dependence, derived as a triple ratio. The data/MC ratios of the \(R_{{\text {DB}}}\) ratios are evaluated in the two \(m/p_{{\text {T}}} {}\) ranges shown in Fig. 30, corresponding to \(m/p_{{\text {T}}} {}<0.15\) and \(m/p_{{\text {T}}} {}>0.15\). The systematic uncertainty is given by the ratio of the double ratios obtained for the two \(m/p_{{\text {T}}} {}\) ranges.

Fig. 30
figure 30

\(R_{{\text {DB}}}\) as a function of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) measured for trimmed anti-\(k_{t}\) jets with \(R=1.0\) in \(\gamma \) + jet events shown separately for jets with a \(|\eta _{\text {det}}{}|<{0.8}\) and b \({0.8}\le |\eta _{\text {det}}{}|<{1.2}\). Separate results are shown for jets with \(m/p_{{\text {T}}} <0.15\) and \(m/p_{{\text {T}}} >0.15\), displayed with circles and triangles, respectively. Measurements in data are shown as filled markers and MC predictions as open makers. Statistical uncertainties are shown for each point. The lower parts of the figures show the systematic uncertainty evaluated as the data-to-MC ratio of the ratios of \(R_{{\text {DB}}}\) extracted in the two \(m/p_{{\text {T}}} \) ranges

7.4.3 \({\text {Pile-up}}\) uncertainty for large-\(R\) jets

As discussed in Sect. 4.1, large-\(R\) jets do not receive any \({\text {pile-up}}\) correction. Due to the trimming algorithm applied, large-\(R\) jets are significantly less sensitive to \({\text {pile-up}}\) than standard small-\(R\) jets. To study the impact of \({\text {pile-up}}\) on the large-\(R\) jet \(p_{{\text {T}}}\) , it is measured as a function of \(N_{{\text {PV}}}\) and \(\mu \) in bins of \(p_{{\text {T}}}\) of track jets that are matched to the large-\(R\) jets being probed. Track jets are resilient to \({\text {pile-up}}\) since they are built from inner-detector tracks that are matched only to the primary vertex, and do not contain contributions (tracks) from \({\text {pile-up}}\) vertices (in most cases). The track jets are reconstructed using the same algorithm as the calorimeter large-\(R\) jets (trimmed anti-\(k_{t}\)\(R=1.0\)).

Fig. 31
figure 31

Large-\(R\) jet \(p_{{\text {T}}}\) as a function of a \(N_{{\text {PV}}}\) in \(\gamma \) + jet events with \(20<\mu <22\), and as a function of b \(\mu \) in events with \(10<N_{{\text {PV}}}<12\). The \(p^{{\mathrm {trackjet}}}_{\mathrm {T}}\) is required to be within \(55< p^{{\mathrm {trackjet}}}_{\mathrm {T}}<65~{\text {Ge}} {\text {V}}\). The lines are linear fits. The markers show data with error bars corresponding to the statistical uncertainties. The lowest point shown in b corresponds to special data-taking conditions, and is thus not expected to match the other points on the plot

Within a given track jet \(p_{{\text {T}}}\) bin, the large-\(R\) jets are expected to have a similar truth-particle jet \(p_{{\text {T}}}\). The reconstructed \(p_{{\text {T}}}\) is studied as a function of \(N_{{\text {PV}}}\) and \(\mu \). The results for a representative track jet \(p_{{\text {T}}}\) bin is presented in Fig. 31. As expected (Eq. (2) and Ref. [6]) there is a linear dependence of the jet \(p_{{\text {T}}}\) on both \(N_{{\text {PV}}}\) and \(\mu \). For each track jet \(p_{{\text {T}}}\) bin, the “gradients” \(\partial p_{{\text {T}}}/\partial N_{{\text {PV}}}\) and \(\partial p_{{\text {T}}}/\partial \mu \) are extracted from the slopes of a linear fits of \(p_{{\text {T}}}\) vs \(N_{{\text {PV}}}\) and \(p_{{\text {T}}}\) vs \(\mu \) (Fig. 31). Figure 32 shows these gradients as a function of the average \(p_{{\text {T}}}\) of the large-\(R\) jets. Both of these graphs are well described with a function of the form \(a\,+\,b\,\log {(p_{{\text {T}}}/p_{\text {T,0}})}\), where the parameters a and b are extracted from a fit and \(p_{\text {T,0}}\) is a constant chosen to be \(50~{\text {Ge}} {\text {V}}\).

Fig. 32
figure 32

The gradients a \(\partial p_{{\text {T}}}/\partial N_{{\text {PV}}}\) and b \(\partial p_{{\text {T}}}/\partial \mu \) extracted in \(\gamma \) + jet data (filled circles) and the difference of the gradients measured in data and MC simulations (empty circles) as functions of the large-\(R\) calorimeter jet \(p_{{\text {T}}}\) . The lines correspond to fits of a constant to the uncertainty (red dashed line) and a function of the form \(a\,+\,b\,\log (p_{{\text {T}}}/p_{\text {T,0}})\) with \(p_{\text {T,0}}=50~{\text {Ge}} {\text {V}}\) for the gradients vs \(p_{{\text {T}}}\)  (black solid line). The error bars shown on the markers reflect the statistical uncertainty

Based on the \(p_{{\text {T}}}\) parameterization of the gradients from the fits to data described in the previous paragraph (Fig. 32), two uncertainty components are derived that have the following impact on the jet \(p_{{\text {T}}}\)

$$\begin{aligned} \Delta _{N_{{\text {PV}}}}= & {} ( \partial p_{{\text {T}}}/\partial N_{{\text {PV}}})\left( N_{{\text {PV}}}- N_{{\text {PV}}}^{\text {ref}}\right) \quad {\text { and }} \nonumber \\ \Delta _{\mu }= & {} ( \partial p_{{\text {T}}}/\partial \mu )(\mu - \mu ^{\text {ref}}), \end{aligned}$$
(21)

where \(\partial p_{{\text {T}}}/\partial N_{{\text {PV}}}\) and \(\partial p_{{\text {T}}}/\partial \mu \) are the gradients parameterized as a function of large-\(R\) jet \(p_{{\text {T}}}\) according to the fitted functions, \(\mu \) is the average number of interaction per bunch crossing, and \(N_{{\text {PV}}}\) is the number of primary vertices of the event, and \({\mu }^{\text {ref}}=20.7\) and \(N_{{\text {PV}}}^{\mathrm {ref}}=11.8\) are the average values for the full 2012 \(\gamma \) + jet dataset. As can be seen from Eq. (21), the impact on the jet \(p_{{\text {T}}}\) from the two uncertainty components can change sign depending on the amount of \({\text {pile-up}}\) , and become zero for jets produced in events with \({\text {pile-up}}\) conditions matching the 2012 average values. The resulting fractional \(p_{{\text {T}}}\) uncertainties are presented for two values of large-\(R\) jet \(p_{{\text {T}}}\) in Fig. 33.

Fig. 33
figure 33

Systematic uncertainties due to the \(N_{{\text {PV}}}\) and \(\mu \) dependence of the large-\(R\) jet \(p_{{\text {T}}}\) presented in percentage as functions of \(\mu \) (dashed lines) and \(N_{{\text {PV}}}\) (solid lines) for large-\(R\) jets with a \(p_{{\text {T}}} = 200~{\text {Ge}} {\text {V}}\) and b \(p_{{\text {T}}} {}=300~{\text {Ge}} {\text {V}}\). Horizontal lines are added at 0% and ±2% to guide the eye

Fig. 34
figure 34

Statistical and systematic uncertainties in the data-to-MC ratio of \(R_{{\text {DB}}}\) for trimmed \(R=1.0\) anti-\(k_{t}\) jets calibrated with the LCW + JES scheme with a, c \(|\eta _{\text {det}}{}|<{0.8}\) and b, d \({0.8}\le |\eta _{\text {det}}{}|<{1.2}\). The uncertainties are evaluated in bins of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\), converted to jet \(p_{{\text {T}}}\) , and translated into a function using a Gaussian kernel smoothing. a, b highlight the systematic uncertainty components derived analogously (and hence due to the same effects) to those for small-\(R\) jets (Fig. 27). In c and d, these uncertainties are added in quadrature, and the uncertainties specific to large-\(R\) jets are instead displayed separately. In all figures, the difference of the data-to-MC \(R_{{\text {DB}}}\) from unity is taken as an additional uncertainty rather than being applied as a calibration; this uncertainty is shown as a solid line

7.4.3.1 Summary of systematic uncertainties

Figure 34 presents a summary of the statistical and systematic uncertainties in the large-\(R\) jet \(p_{{\text {T}}}\) from the DB analysis, including a detailed breakdown of the uncertainty components that are in common with the small-\(R\) jet \(\gamma \) + jet measurements presented in Sect. 7.3.4, while the additional uncertainty sources specific to large-\(R\) jets are presented in Sect. 7.4.2. The total uncertainty for \(|\eta |<0.8\) is found to be \(\sim \)1% above \(150~{\text {Ge}} {\text {V}}{}\), rising to \(\sim \)2% at \(1~{\text {Te}} {\text {V}}{}\). At larger \(|\eta |\), the uncertainty increases to \(\sim \)2% at low \(p_{{\text {T}}} ^{{\mathrm {ref}}}\), rising to \(\sim \)3% at \(1~{\text {Te}} {\text {V}}\). The uncertainties are dominated by the photon energy scale uncertainty, the uncertainty coming from the large-\(R\) jet response dependence on the ratio of \(m/p_{{\text {T}}} \), and the difference of the data-to-MC \(R_{{\text {DB}}}\) from unity. The generator systematic uncertainty becomes dominant for \(|\eta |>1.2\).

7.5 Measurement of the jet energy resolution using the DB method

The width of the DB distribution in a given \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) bin is used to probe the JER. The detector resolution of the reference object is negligible compared with that of the jet, so the method to measure the JER using \(Z \) + jet and \(\gamma \) + jet events is significantly simpler than that for dijets described in Sect. 6. The event selection and binning is the same as for the \(R_{{\text {DB}}}\) measurements, but instead of determining the mean \(R_{{\text {DB}}}\) of the \(p_{{\text {T}}} ^{\text {j1}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) distribution within each \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) bin, the width \(\sigma _{\text {DB}}^{\text {reco}}\) is extracted as the standard deviation of the same Modified Poisson fit. The relative JER \(\sigma _E/E\) is then estimated using

$$\begin{aligned} \frac{\sigma _E}{E} = \frac{\sigma _{p_{{\text {T}}}}}{p_{{\text {T}}}} = \sigma _{\text {DB}}^{\text {reco}} \ominus \sigma _{\text {DB}}^{\text {truth}}, \end{aligned}$$
(22)

where the first equality holds to a good approximation since the contribution from the angular resolution is negligible, and the second relation follows from the same reasoning as for Eq. (14) (Sect. 6.1). The parameter \(\sigma _{\text {DB}}^{\text {truth}}\) is obtained using a fit to the \(p_{{\text {T}}} ^{{\mathrm {truth}}}/p_{{\text {T}}} ^{{\mathrm {ref}}}\) distribution extracted using MC simulation with same selection (applied to reconstructed jets) as for the DB measurement. For each simulated event, \(p_{{\text {T}}} ^{{\mathrm {truth}}}\) is defined from the truth-particle jet that is ghost-matched (Sect. 4.2) to the leading reconstructed jet. The simulated JER is also extracted from the MC samples with fits to \(p_{{\text {T}}} ^{\text {reco}}/p_{{\text {T}}} ^{{\mathrm {truth}}}\) (Sect. 4.2).

Fig. 35
figure 35

Comparison of the jet energy resolution determined in situ (triangles) with the MC jet energy resolution (circles) measured for anti-\(k_{t}\) EM + JES jets with \(R=0.4\), in a \(Z \) + jet and b \(\gamma \) + jet events. The bottom frame shows the ratio of the two

Figure 35 presents a MC-based comparison between the relative JER obtained using the in situ technique applied to the simulated events (Eq. (22)) and the relative JER extracted from truth-particle jet matching. Over most of the \(p_{{\text {T}}}\) range probed, the in situ extracted JER agrees with the expectation from simulation within 10%, but for the first and last bins probed, the agreement is worse (20–40%). The difference between the measured and expected JER is taken as a “non-closure” systematic uncertainty.

The truth-particle jet DB width \(\sigma _{\text {DB}}^{\text {truth}}\), used in the JER measurement (Eq. (22)) depends on details of the physics model implemented in the MC generator. A systematic “generator” uncertainty is evaluated to assess this dependence through taking the difference between the extracted JER using different MC generators. Other systematic uncertainty sources considered for the JER measurement are the same as those considered for the JES measurements discussed in Sect. 7.3.4 and are derived for the JER analogously. Figure 36 presents the resulting JER uncertainties for anti-\(k_{t}\)\(R=0.4\) jets and also the relative data-to-MC difference of the JER measurement. Results are reported as a function of \(p_{{\text {T}}} ^{{\mathrm {ref}}}\), separately for the measurements performed using the \(Z \) + jet and \(\gamma \) + jet datasets. The total, relative JER uncertainty evaluated for the \(Z \) + jet JER measurements varies between 20 and 40%, depending on the algorithm and \(p_{{\text {T}}} ^{{\mathrm {ref}}}\) values. Dominant sources of uncertainties include the choice of MC generator for the modelling of \(\sigma _{\text {DB}}^{\text {truth}}\), the non-closure, and limited statistics. The JER uncertainty of the \(\gamma \) + jet measurement is slightly smaller than that from \(Z \) + jet events, varying between 10 and 30%. The dominant sources of systematic uncertainties are the choice of MC generator and the suppression of additional radiation. For \(p_{{\text {T}}} ^{{\mathrm {ref}}}{}<50~{\text {Ge}} {\text {V}}\), the data-to-MC differences can be as large as \(\sim \)40%.

Fig. 36
figure 36

Summary of systematic uncertainties of the JER measured using the DB method in a \(Z \) + jet and b \(\gamma \) + jet events for anti-\(k_{t}\) jets with \(R=0.4\) calibrated with the EM + JES scheme. The difference between the JER measured in data and MC simulation is presented as an uncertainty (“data/MC difference”). The total uncertainty, shown as a shaded area, is obtained by addition in quadrature of all uncertainty components (including the data-to-MC difference). EES/EER denotes the electron energy scale/resolution, MMS/MMR denotes the muon momentum scale/resolution, and PES denotes the photon energy scale

8 High-\(p_{{\text {T}}}\) -jet calibration using multijet balance

The \(Z \) + jet and \(\gamma \) + jet analyses described in the previous section probe the jet calibration in the range \({17}\,{{\text {Ge}} {\text {V}}} \le p_{{\text {T}}} <{800}\,{{\text {Ge}} {\text {V}}}\). For jets with \(p_{{\text {T}}}\) above \(800~{\text {Ge}} {\text {V}}\), there are an insufficient number of \(Z \) + jet and \(\gamma \) + jet events, and the multijet balance (MJB) technique is used instead. This method exploits the \(p_{{\text {T}}}\) balance of events in which the leading (highest-\(p_{{\text {T}}}\) ) jet is produced back-to-back with a recoil system composed of multiple lower-\(p_{{\text {T}}}\) jets. The jets in the recoil system are fully calibrated including the in situ corrections described in the previous sections, while the leading jet, which is being probed, is calibrated with all corrections except the absolute in situ correction (Sect. 7).

The multijet balance observable \(R_{{\text {MJB}}}\) is defined as:

$$\begin{aligned} R_{{\text {MJB}}} = \left\langle \frac{p_{{\text {T}}} ^{\text {j1}}}{p_{{\text {T}}} ^{\mathrm {recoil}}}\right\rangle , \end{aligned}$$

where \(p_{{\text {T}}} ^{\text {j1}}\) is the \(p_{{\text {T}}}\) of the leading jet and \(p_{{\text {T}}} ^{\mathrm {recoil}}\) is that from the vectorial sum of the subleading jet four-momenta. The parameter \(R_{{\text {MJB}}}\) is measured in both data and MC simulations in bins of \(p_{{\text {T}}} ^{\mathrm {recoil}}\). The multijet balance observable \(R_{{\text {MJB}}}\) is not an unbiased estimator of the leading jet response. It has a value below unity even at particle level due to the effects of soft quark/gluon emission outside of the jets. The largest deviation is at low \(p_{{\text {T}}}\) , with data and MC simulations exhibiting similar dependence. This underlying bias is reduced in the double ratio \(R_{{\text {MJB}}}^{\text {data}} / R_{{\text {MJB}}}^{\text {MC}}\), allowing the response of high-\(p_{{\text {T}}}\) jets to be estimated. Mis-modelling in the simulation is evaluated as a systematic uncertainty of the double ratio.

As mentioned above, the jets used in the construction of \(p_{{\text {T}}} ^{\mathrm {recoil}}\) are fully calibrated, including all in situ calibrations. However, the in situ corrections from the \(Z/\gamma \)  +  jet analyses are only available for \(p_{{\text {T}}} <800~{\text {Ge}} {\text {V}}\) (Sect. 7). An iterative procedure is used to calibrate all jets that are used in the calculation of \(p_{{\text {T}}} ^{\mathrm {recoil}}\). For the first iteration of the MJB, an upper limit is imposed on the \(p_{{\text {T}}}\) of the recoil jets such that the second highest-\(p_{{\text {T}}}\) jet in the event has a \(p_{{\text {T}}} <800~{\text {Ge}} {\text {V}}\). This initial selection allows corrections to \(R_{\text {MJB}}\) to be derived, but limits the overall statistical accuracy of the measurements at high \(p_{{\text {T}}}\) . To improve the statistical accuracy, \(R_{\text {MJB}}\) is recalculated after the application of the correction factors from the first iteration to jets in the recoil system with \(p_{{\text {T}}} {}>800~{\text {Ge}} {\text {V}}\).

8.1 Event selection

Multijet events were obtained using single-jet triggers that are fully efficient for a given bin of \(p_{{\text {T}}} ^{\mathrm {recoil}}\). The triggers used for \(300~{\text {Ge}} {\text {V}}~<p_{{\text {T}}} ^{\mathrm {recoil}}<600~{\text {Ge}} {\text {V}}\) were prescaled, whereas a non-prescaled jet trigger was used for \(p_{{\text {T}}} ^{\mathrm {recoil}}~> 600~{\text {Ge}} {\text {V}}\). Events are required to contain at least three jets with \(p_{{\text {T}}} > 25~{\text {Ge}} {\text {V}}\). The leading jet is required to have \(|\eta _{\text {det}}|<1.2\), and the subleading jets that constitute the recoil system are required to have \(|\eta _{\text {det}}|<2.8\). To select non-dijet events, the leading jet in the recoil system \(p_{{\text {T}}} ^{\text {j2}}\) is required to have less than \(80\%\) of the total \(p_{{\text {T}}}\) of the recoil system (\(p_{{\text {T}}} ^{\text {j2}}/p_{{\text {T}}} ^{\mathrm {recoil}}< 0.8\)). Furthermore, the angle \(\alpha \) in the azimuthal plane between the leading jet three-momentum and the vector defining the recoil system is required to satisfy \( |\alpha - \pi | < 0.3\) radians, and the angle \(\beta \) in the azimuthal plane between the leading jet and the nearest jet from the recoil system is required to be greater than 1 radian.

8.2 Results

Figure 37 shows \(R_{\text {MJB}}\) for data and MC simulation using the EM + JES calibration scheme. The MJB method provides inputs for the in situ jet calibration in the \(p_{{\text {T}}}\) range between \({300}\,{{\text {Ge}} {\text {V}}}\) and \({1900}\,{{\text {Ge}} {\text {V}}}\). The data and MC simulation agree to within \(1\%\) across the \(p_{{\text {T}}}\) range probed, a feature that is reproduced by the \(Z/\gamma \)  +  jet analyses (Sect. 7).

Fig. 37
figure 37

a Multijet balance \(R_{{\text {MJB}}}\) in data (circles) and MC simulation (triangles) for anti-\(k_{t}\)  \(R=0.4\) jets calibrated with the EM + JES scheme. The bottom frame compares \(R_{\text {MJB}}^{\text {MC}}/R_{\text {MJB}}^{\text {data}}\) (triangles) with the corresponding \(\gamma \)/Z + jet results (magenta solid line). b The impact of in situ, event selection (topology), physics modelling, and jet flavour systematic uncertainties on \(R_{\text {MJB}}\). The error bars on the \(R_{\text {MJB}}\) measurements only show statistical uncertainties

8.3 Systematic uncertainties

Since the jets entering \(R_{\text {MJB}}\) have been calibrated using the other in situ approaches, the uncertainty in the energy scale of the jets in the recoil system is defined by the systematic and statistical uncertainties of each in situ procedure. To propagate the uncertainty to \(R_{\text {MJB}}\), all input components are individually varied by \(\pm 1\sigma \) and the full iterative analysis procedure is repeated for each such variation. Changes in \(R_{\text {MJB}}\) due to the statistical uncertainties of the \(\gamma \) + jet and Z + jet calibrations are typically much smaller than 1%.

Also, the event selection criteria and the modelling in the event generators affect the \(p_{{\text {T}}}\)  balance \(R_{\text {MJB}}\). The impact of the event selection is investigated by shifting each selection criterion up and down by a specified amount and observing the change in \(R_{\text {MJB}}\). The \(p_{{\text {T}}}\) threshold for jets is shifted by \(\pm 5~{\text {Ge}} {\text {V}}\), the requirement on the ratio \(p_{{\text {T}}} ^{\text {j2}}/p_{{\text {T}}} ^{\mathrm {recoil}}\) is shifted by \(\pm 0.1\), the angle \(\alpha \) by \(\pm 0.1\), and the angle \(\beta \) by \(\pm 0.5\). The uncertainty due to MC modelling of multijet events is estimated from the symmeterized envelope of MJB corrections obtained by comparing the nominal results obtained from Sherpa with those obtained from Powheg + Pythia8, Pythia8, and Herwig++.

The unknown flavour of each jet is also a source of systematic uncertainty. The uncertainty in \(R_{\text {MJB}}\) due to the jet flavour response is evaluated using a correlated propagation of the jet flavour response uncertainties, i.e. all jets in the recoil system are shifted simultaneously. The jet flavour composition uncertainty is propagated to \(R_{\text {MJB}}\) for the first, second, and third recoil jets independently, with the final composition uncertainty obtained from the quadrature sum of the three variations. The total uncertainty due to the unknown parton flavour is taken as the sum in quadrature of the flavour response and composition uncertainties.

Examples of the impact of systematic uncertainties are shown in Fig. 37b for anti-\(k_{t}\)  \(R=0.4\) jets using the EM + JES calibration scheme. The uncertainties are grouped together into in situ, event topology, physics modelling, and jet flavour categories. Uncertainties for anti-\(k_{t}\)  \(R=0.6\) jets or the LCW + JES scheme are comparable.

The uncertainty accounting for the difference of the jet energy resolution between data and simulation was not propagated to the recoil system of the multi-jet balance as the scale was derived before the resolution. However, the impact of this effect on the multi-jet balance was checked after the resolution was derived, and was found to introduce per-mille level differences on the extraction of the scale. This effect is therefore negligible compared to the existing uncertainties on the multi-jet balance shown in Fig. 37b.

9 Final jet energy calibration and its uncertainty

As detailed in Sects. 7 and 8, response observables that are directly proportional to the JES are constructed using in situ techniques by exploiting the transverse momentum balance in \(\gamma \) + jet, Z + jet, and multijet events. These response observables are determined in both data and MC simulations. The final residual jet calibration \(c_{\text {abs}}\), which accounts for effects not captured by the MC calibration, is defined through the ratio of the responses measured in data and MC simulation by

$$\begin{aligned} \frac{1}{c_{\text {abs}}} = \frac{R_{\text {data}}}{R_{\text {MC}}}. \end{aligned}$$
(23)

As explained in Sect. 4.3, the absolute in situ correction \(c_{\text {abs}}\) is applied last in the calibration chain following the origin, \({\text {pile-up}}\) , MC-based, and dijet in situ calibrations. Just as for the dijet intercalibration (Sect. 6), the absolute correction is applied only to data to remove any residual differences in the jet response following the MC calibration. The dijet \(\eta \) intercalibration is referred to as a relative in situ calibration, as it quantifies the balance between a pair of jets in different detector regions without evaluating the absolute scale of either jet. The absolute calibration is done for the \(Z \) + jet, \(\gamma \) + jet, and MJB techniques, which all balance the probe jet against a well-known reference quantity, thus providing a measure of the absolute scale of the jet and are known as absolute in situ calibrations.

Figure 38 summarizes the results of the Z + jet, \(\gamma \) + jet, and multijet balance analyses, showing the ratio of jet response in data to jet response in MC simulations. In the \(p_{{\text {T}}} \) range \(20{-}2000~{\text {Ge}} {\text {V}}\), the response agrees between MC simulations and data at the 1% level. The deviation of the response from unity defines the absolute in situ calibration which is applied to jets in data. There is good agreement and little tension between the three different in situ methods in the regions of phase space where they overlap.

Fig. 38
figure 38

Ratio of response measured in data to response measured in MC samples for Z + jet (empty squares), \(\gamma \) + jet (filled triangles) and multijet balance (empty triangles) in situ analyses. The method giving the smallest overall uncertainty is used, corresponding to the DB approach for Z + jet and the MPF approach for \(\gamma \) + jet. Each measurement has two error bars: the smaller interval corresponds to the statistical uncertainty, while the outer error interval corresponds to the total uncertainty. Also shown is the combined correction (line) with its associated total uncertainty (wider band) and statistical uncertainty (narrower band) as discussed in Sect. 9.1

9.1 Combination of absolute in situ measurements

The separate measurements from Z + jet, \(\gamma \) + jet, and multijet balance are combined using the procedure outlined in Ref. [6]. For the Z + jet and \(\gamma \) + jet measurements, the method giving the smallest overall uncertainty is used, corresponding to the DB approach for Z + jet and the MPF approach for \(\gamma \) + jet (see Sect. 7 for details on the methods). The choice of DB for Z + jet is a compromise between the precision of the \(R=0.4\) and \(R=0.6\) jet calibrations in the low \(p_{{\text {T}}}\) regime where the Z + jet final state is most relevant: it was found that the DB and MPF techniques give similar uncertainties for \(R=0.4\) jets, while the DB technique provides improved precision for \(R=0.6\) jets. In contrast, the MPF technique is used for \(\gamma \) + jet events as it is found to generally provide better uncertainties across its kinematic range of relevance. This combination uses the compatibility of the three in situ measurements and their associated systematic uncertainties to produce a combined measurement of the response ratio with associated uncertainties.

Table 3 presents the 26 systematic uncertainty sources that affect \(c_{\text {abs}}\). These are evaluated as detailed in Sects. 7.3.4 and 8.3. The electron and photon energy scale uncertainties are each split into four sources that are fully correlated. These are treated as four \(e/\gamma \) energy scale sources, yielding a list of 22 systematic uncertainty components. Each source is further classified into one of the following four categories:

  • detector description (det.),

  • physics modelling (model),

  • statistics and method (stat./meth.), and

  • mixed detector and modelling (mixed).

Table 3 Summary of the uncertainty components propagated through to the combination of absolute in situ jet energy scale measurements from \(Z \) + jet, \(\gamma \) + jet, and multijet balance studies. These are discussed in more detail in Sects. 6 and 7

The combination is carried out using the absolute in situ measurements (Eq. (23)) in bins of \(p_{{\text {T}}} ^{\text {ref}}\) and evaluated at \(\langle p_{{\text {T}}} ^{\text {ref}}\rangle \). The data-to-MC response ratio is defined in fine \(p^{\text {ref}}_{\text {T}}\) bins for each method using interpolating second-order polynomial splines. The combination is then carried out using a weighted average of the absolute in situ measurements based on a \(\chi ^2\)-minimization. This local \(\chi ^2\) is used to define the level of agreement between measurements.

Each uncertainty source in the combination is treated as being fully correlated across \(p_{{\text {T}}} \) and \(\eta \) and independent of one another. All the uncertainty components are propagated to the combined results using pseudo-experiments [5]. To determine the correlations between different phase-space regions, it is necessary to understand the contribution of each uncertainty component to the final uncertainty. Therefore, each individual source is propagated separately to the combined result by coherently shifting all the correction factors by one standard deviation. Comparison of this shifted combination result with the nominal result provides an estimate of the propagated systematic uncertainty.

One exception is the jet flavour uncertainty of the recoil in the multijet balance method (Sect. 8). It is correlated in a non-trivial way with the additional uncertainties due to flavour composition and response considered in analyses. Including this uncertainty does not change the overall absolute in situ uncertainty by a significant amount after combination with the other in situ methods, so it is dropped.

To take tensions between measurements into account, each uncertainty source is increased by rescaling it by \(\sqrt{\chi ^2/n_{\text {dof}}}\) if \(\chi ^2/n_{\text {dof}}\) is larger than unity [66], where \(n_{\text {dof}}\) is the number of degrees of freedom. The number of degrees of freedom varies with \(p_{{\text {T}}}\) and corresponds to the number of in situ methods \(n_{\text {in-situ}}\) that contribute to the combination minus one, i.e. \(n_{\text {dof}} = n_{\text {in-situ}}-1\). The local \(\sqrt{\chi ^2/n_{\text {dof}}}\) of the final combination (Fig. 39) for both jet collections is below unity for most of the \(p_{{\text {T}}} \) range and barely exceeds 2 anywhere. The combined in situ factor is the final calibration factor to be applied to data after reducing statistical fluctuations using a sliding Gaussian kernel.

Fig. 39
figure 39

\(\chi ^2/n_{\text {dof}}\) for the combination of absolute in situ measurements illustrating the compatibility of the included in situ calibrations as a function of jet \(p_{{\text {T}}}\) . At any given point, there are at most two in situ results being combined: \(Z \) + jet and \(\gamma \) + jet at low \(p_{{\text {T}}}\) , or \(\gamma \) + jet and MJB at high \(p_{{\text {T}}}\) , which means that the number of degrees of freedom \(n_{\text {dof}}\) is equal to one. For a small \(p_{{\text {T}}}\) range near \(300~{\text {Ge}} {\text {V}}\), only one measurement (\(\gamma \) + jet) contributes, and there is a gap (\(n_{\text {dof}}=0\)). The points where the curve touches zero correspond to where the two in situ calibrations cross

Figure 40 shows the uncertainty sources for the three absolute in situ analyses used in the combination as a function of \(p_{{\text {T}}}\) . In the combination, the \(Z \) + jet measurement is most important at low \(p_{{\text {T}}}\) , the \(\gamma \) + jet measurement at medium \(p_{{\text {T}}}\) , and the multijet balance at high \(p_{{\text {T}}}\) .

The combined jet response, shown as a line in Fig. 38, is observed to have a general offset of 0.5% between data and MC simulation (with data below the MC prediction). The total uncertainty from the combination of absolute in situ techniques is shown as the wider band around the measured response and is about 3.5% (2.5%) for jets with \(p_{{\text {T}}} {} \approx 25~{\text {Ge}} {\text {V}}\) for EM (LCW) jets and decreases to about 1% (1%) for \(p_{{\text {T}}}\) above \(200~{\text {Ge}} {\text {V}}\).

As mentioned a spline-based combination procedure, with a local averaging within fine \(p_{{\text {T}}}\) intervals followed by a global smoothing, is used for the in-situ JES combination. This method avoids assumptions on the jet energy response dependence that are implicitly present in procedures based on global fits using a functional form, which can further reduce the uncertainties (see e.g. Ref. [67]).

Fig. 40
figure 40

Individual uncertainty sources used in the combination for the three absolute in situ calibration methods. The systematic uncertainties displayed correspond to those in Table 3

9.2 Jet energy scale uncertainties

In addition to the uncertainties coming from the combination of in situ methods detailed above, there are several other uncertainties that account for other potential systematic effects or expand the kinematic reach. These additional uncertainties are described below, and summarized in Sect. 9.2.4

9.2.1 Single-hadron response

The jet energy response measured by the in situ methods can also be compared with results from a method where the jet energy scale is estimated from the calorimeter response to single hadrons measured in test beam studies. This provides a cross-check of the direct balance in situ methods, albeit with a larger uncertainty, and also allows the extension of the in situ measurements of the jet energy scale to higher energies beyond the reach of balance methods due to limited data. In this “single hadron” method, jets are treated as a superposition of the individual energy deposits of their constituent particles [68]. In some cases, highly energetic jets contain constituents beyond test-beam energies. When this occurs, a constant 10% uncertainty is applied to each of these constituents.

In the previous jet energy scale measurements based on data taken in 2011 [6], the absolute in situ methods and the single-hadron response studies gave consistent results, indicating that MC simulation overestimated the jet response in data by approximately 2%. However, since the in situ methods are more precise (approximately 2% uncertainty compared to 5%) the single-particle response method is only used at high \(p_{{\text {T}}} \,\) (\(>1500~{\text {Ge}} {\text {V}}\)) where the statistical power of in situ methods becomes limited. The single-hadron response measurements from the 2011 data [6] are propagated to high \(p_{{\text {T}}} \,\) jets to provide an uncertainty where it is beyond the reach of the absolute in situ analyses.

9.2.2 \({\text {Pile-up}}\) uncertainties

There are four uncertainties sources associated with the mitigation of the \({\text {pile-up}}\) contributions to the jet momentum (Eq. (2)) that are evaluated by comparing data with simulation using in situ techniques. Two of the uncertainties are in the values of the slope parameters \(\alpha \) and \(\beta \) that determine the dependence on the number of reconstructed \({\text {pile-up}}\) vertices and the average interactions per crossing, respectively. The third uncertainty accounts for jet \(p_{{\text {T}}}\) dependence of the \(\alpha \) and \(\beta \) parameters. These uncertainties are evaluated using momentum balance in \(Z \) + jet events. The fourth uncertainty is associated with a topology dependence of the event \(p_{{\text {T}}}\) -density \(\rho \). It is evaluated as the largest difference in measured average \(p_{{\text {T}}}\) density \(\langle \rho \rangle \) at a given \({\text {pile-up}}\) condition \(\mu \) between dijet, \(\gamma \) + jet, and \(Z \) + jet events. As shown in Eq. (2), this uncertainty is directly proportional to the jet area and is larger by approximately a factor of \(0.6^2/0.4^2=2.25\) for \(R=0.6\) jets compared with \(R=0.4\) jets. For \(R=0.6\) jets, this tends to be the dominant uncertainty component with a typical magnitude of 2% for jets with \(p_{{\text {T}}}\) around \(40~{\text {Ge}} {\text {V}}\). For \(R=0.4\) jets in events with moderate \({\text {pile-up}}\) , the \(N_{\text {PV}}\)-dependent uncertainty component tends to be largest for jets in the central calorimeter region while the \(\mu \) component is largest in the forward calorimeter region (\(|\eta _{\text {det}}|>2.8\)).

9.2.3 Flavour-based uncertainties

The in situ methods used to derive final corrections and uncertainties of the jet energy scale make use of event samples with particular fractions of jets initiated by quarks and gluons. The event samples in physics analyses may have jet flavour compositions which differ from that of the calibration sample.

The response for quark-initiated jets is considerably higher than that for gluon-initiated jets (Sect. 5.5). Therefore, if the flavour composition of final states in a given analysis is unknown, it has an impact on the JES uncertainty. The degree to which the flavour of jets is known in an analysis can be specified in order to evaluate the corresponding uncertainty. Alternatively, analyses can be conservative and use a completely unknown flavour composition.

While the response for light-quark-initiated jets is found to be in good agreement between different generators, shifts are seen in the gluon jet response for different generators due to differences in the jet fragmentation. There is therefore an additional uncertainty for gluon-initiated jets, which is subdominant in the \(Z \) + jet and \(\gamma \) + jet regions used to constrain the uncertainty, as defined by the difference between the gluon jet response in Pythia8 and Herwig++. These differences are typically reduced using LCW topo-clusters as inputs, and this is visible in the central region of the detector when comparing to jets built using EM topo-clusters. However, this is less true in the forward region of the detector where the LCW correction is less robust due to the different properties of the more forward calorimeters.

Further details of this uncertainty are given in Ref. [6], and additional discussion of how the GS correction reduces the jet flavour uncertainties are presented in Sects. 5.5 and 5.7.

9.2.4 Summary of jet energy scale uncertainties

The total jet energy scale uncertainty is compiled from multiple sources:

  • 22 systematic sources from absolute in situ methods,

  • 34 statistical sources from absolute in situ methods,

  • a single-hadron response uncertainty which only affects the highest-\(p_{{\text {T}}}\) jets beyond the reach of in situ techniques (Sect. 9.2.1),

  • two \(\eta \)-intercalibration uncertainties (one systematic, one statistical),

  • four sources from uncertainties associated with the \({\text {pile-up}}\) corrections:

    • \(\mu \)-dependent uncertainty in the \({\text {pile-up}}\) correction,

    • \(N_{{\text {PV}}}\)-dependent uncertainty in the \({\text {pile-up}}\) correction,

    • \(p_{{\text {T}}}\) dependence of \({\text {pile-up}}\) corrections, and

    • \(\rho \) topology dependence,

    as outlined in Ref. [51] (Sect. 9.2.2), and

  • two sources due to jet flavour (Sect. 9.2.3).

The last two terms are assumed to be independent, resulting in a jet energy scale uncertainty defined in terms of 65 components (nuisance parameters). The resulting, total jet energy scale uncertainty is shown as a function of jet \(p_{{\text {T}}}\) in Fig. 41 and versus jet \(\eta \) in Fig. 42.

Fig. 41
figure 41

The total jet energy scale uncertainty as a function of \(p_{{\text {T}}}\) for central jets. Two flavour compositions are shown, one for dijet events, where the quark/gluon composition is taken from MC simulations and an associated uncertainty from generator comparisons, and one for an unknown flavour composition (assuming 50:50 quark:gluon jets with a 100% uncertainty). “Absolute in situ JES” refers to the uncertainty arising from \(Z \) + jet, \(\gamma \) + jet, and multijet measurements, including also the single-hadron response uncertainty at high \(p_{{\text {T}}}\). “Relative in situ JES” refers to the uncertainty arising from the dijet \(\eta \) intercalibration. “Punch-through” refers to the uncertainty in the final (muon-based) stage of the global sequential correction

Fig. 42
figure 42

The total jet energy scale uncertainty as a function of \(|\eta |\) for \(p_{{\text {T}}} = 40~{\text {Ge}} {\text {V}}\) jets. Two flavour compositions are shown, one for dijet events, where the quark/gluon composition is taken from MC simulations and an associated uncertainty from generator comparisons, and one for an unknown flavour composition (assuming 50:50 quark:gluon jets with a 100% uncertainty). “Absolute in situ JES” refers to the uncertainty arising from \(Z \) + jet, \(\gamma \) + jet, and multijet measurements. “Relative in situ JES” refers to the uncertainty arising from the dijet \(\eta \) intercalibration. “Punch-through” refers to the uncertainty in the final (muon-based) stage of the global sequential correction

9.2.5 Uncertainties in fast simulation

All uncertainties discussed in the previous section apply to MC samples produced using either the full or fast simulation. However, a small non-closure of the jet calibration was observed in fast simulation compared with full simulation. To account for this, an additional systematic uncertainty must be included in analyses using fast simulation since relative and absolute in situ methods are not used to validate this simulation. The size of this uncertainty compared with other systematic uncertainties is generally small for \(R=0.4\) jets (Fig. 43). However, as shown in Fig. 44, this uncertainty becomes sizeable for \(R=0.6\) jets.

Fig. 43
figure 43

Total uncertainty in the calibration of anti-\(k_t\), \(R=0.4\) jets in fast simulation as a function of \(p_{{\text {T}}} \) and \(\eta \). “Absolute in situ JES” refers to the uncertainty arising from \(Z \) + jet, \(\gamma \) + jet, and multijet measurements. “Relative in situ JES” refers to the uncertainty arising from the dijet \(\eta \) intercalibration. “MC non-closure, fast simulation” refers to the additional non-closure observed in fast simulation when comparing with full simulation

Fig. 44
figure 44

Total uncertainty in the calibration of anti-\(k_t\), \(R=0.6\) jets in fast simulation as a function of \(p_{{\text {T}}} \) and \(\eta \). The large “MC non-closure” term demonstrates the limitations of using \(R=0.6\) jets in fast simulation. “Absolute in situ JES” refers to the uncertainty arising from \(Z \) + jet, \(\gamma \) + jet, and multijet measurements. “Relative in situ JES” refers to the uncertainty arising from the dijet \(\eta \) intercalibration

9.3 Simplified description of uncertainty correlations

The list of uncertainties described in Sect. 9.2.4 requires an analysis to propagate a total of 65 JES uncertainty terms to correctly account for all correlations in the jet calibration. For many analyses it is preferable to describe such correlations using a reduced set of uncertainty components (nuisance parameters).

As detailed in Ref. [6], the total covariance matrix of the JES correction factors including all the in situ sources can be diagonalized, and then a new set of independent uncertainty sources can be derived from the eigenvectors and eigenvalues. A good approximation of the covariance matrix is then obtained by selecting a subset of the new uncertainty sources (those with the largest eigenvalues) and combining the remaining nuisance parameters into a residual term. Figure 45 demonstrates this procedure, showing the nominal correlation matrix and the difference between this and a similar matrix derived from a reduced set of nuisance parameters. Only uncertainties depending on a single parameter (in this case \(p_{{\text {T}}}\)) are combined in this way and any uncertainties with dependencies on other parameters are left separate. Including such uncertainty components with additional parameter dependencies in the combination would not result in any significant reduction of the correlation information into fewer nuisance parameters, as such components require additional dimensions to represent their correlations.

Two reduction schemes are provided. The first scheme reduces the number of central absolute in situ nuisance parameters, those shown in Fig. 40 and the statistical components of the \(\gamma \,+\,\)jet, \(Z\,+\,\)jet, and multijet balance, from 56 to 6 (“standard”). To preserve some knowledge of the uncertainty source in this procedure, a second scheme is provided where the reduction is done within categories (statistical, detector, modelling, or mixed). This “category based” reduction reduces the number of central absolute in situ parameters from 56 to 15. Retaining the separation of detector, statistical, and modelling components allows the correlation between experiments and different data-taking years to be assessed in combinations of measurements. No reduction is done for the other terms, and in addition to the 6 (15) nuisance parameters, nine additional parameters are required, resulting in 15 (24) parameters. This procedure gives a simpler propagation of the correlations and uncertainties associated with the jet energy scale with very little loss of information about the correlations.

Fig. 45
figure 45

The a, c JES correlation matrix and b, d difference between the full correlation matrix and that derived from a reduced number (6) of absolute in situ uncertainty components for anti-\(k_{t}\)  \(R=0.4\) jets calibrated with the a, b EM + JES and c, d LCW + JES schemes

A method has been developed for evaluating the correlations between the full set of 56 in situ JES uncertainty terms and a reduced set. This is especially useful for evaluating the correlations between the uncertainties obtained for two physics analyses that use different uncertainty configurations (e.g. the full set and a reduced set of JES uncertainty terms). In this method, each JES uncertainty term in the full set is projected, in the space of uncertainties, onto the direction of each uncertainty term in the reduced set. The corresponding projection coefficients allow expression of the uncertainties propagated by one analysis using a given configuration in terms of the components corresponding to another configuration. Therefore, this allows correlations to be assessed between analyses using different uncertainty configurations.

9.4 Alternative uncertainty configurations

Many physics analyses use “profiling” of uncertainties in the statistical analysis, such as the profile log-likelihood method, which improves the precision of the associated physics results. These methods may make significant use of the uncertainty amplitudes and correlation in different kinematic regions, and the exact parameterization of the JES systematic uncertainties might impact the result. Since the correlation between uncertainty sources often is unknown, the nominal uncertainty parameterization discussed in the previous sections corresponds to a “best guess”. Certain analyses could erroneously benefit from somewhat arbitrary choices made during the construction of this uncertainty scheme. To allow analyses to test if their results depend on these choices, two alternative uncertainty parameterizations are provided, one that results in stronger JES uncertainty correlations and one that gives weaker correlations. These are constructed by making alternative assumptions about the correlation between different effects and by employing a different rebinning prescription when propagating absolute in situ derived uncertainties to the combination.

In both the strong and weak correlation scenarios, a change is made in the rebinning procedure described in Sect. 6.4. The condition for stopping the merging of bins is altered such that the stronger (weaker) correlation scenario has more (less) bins merged. The effect of this procedure is particularly noticeable at low \(p_{{\text {T}}} \) and results in a reduction of the absolute in situ uncertainties for the stronger correlation scenario. In addition, both alternatives use a slower turn-on of the interpolation between multijet balance and single-particle uncertainties at \(p_{{\text {T}}} \approx 1.7~{\text {Te}} {\text {V}}\) (Fig. 41).

For the strong correlations alternative, certain uncertainty components that are treated as being uncorrelated with each other in the nominal parametrization are combined into a correlated component. This is only done for components that are suspected to have some correlation. The flavour composition uncertainty is also switched from using Pythia8 to derive the quark/gluon response to using Herwig++ to fully encompass generator dependence.

For the weak correlation alternative, several “2-point” systematic uncertainties are split into two subcomponents [69]. The term 2-point systematic uncertainties refers to uncertainties evaluated by comparison of the nominal result with only one alternative, e.g. a comparison between the predictions from two MC generators. The two constructed uncertainty components are defined such that their sum in quadrature equals the original component, thus the total uncertainty is retained. The split is performed by multiplying the original component by a factor varying linearly from 0 to 1 in either \(|\eta |\) or \(\log p_{{\text {T}}} {}\), forming the first subcomponent, while the second subcomponent is formed as the quadrature complement. Components treated this way in the alternative configurations include the \(\eta \)-intercalibration modelling term and flavour components.

9.5 Large-R jet uncertainties

Uncertainties in the large-R jet calibration are determined using in situ methods with the same principle as for \(R=0.4\) and \(R=0.6\) jets. Jet energy scale uncertainties are derived by combining direct balance measurements (Eq. (17)) performed in \(\gamma \) + jet events and are combined with uncertainties with track jets as reference objects. Uncertainties for the jet mass scale are derived only using track jets as reference objects. The track jet double-ratio method is discussed below along with an additional topological uncertainty similar to the flavour composition uncertainty in small-R jets. The \(\gamma \) + jet studies and uncertainties are discussed in Sect. 7.

9.5.1 Track jet double-ratio method

In the double-ratio method, track jets are used as reference objects since charged-particle tracks are both well measured and independent of the calorimeter and are associated with calorimeter jets using a geometrical matching in the \(\eta \)\(\phi \) plane. This method assumes that energy fluctuations measured using the calorimeter are independent of the charge-to-neutral fraction of the particle-level jet’s constituents. This is only approximately true because the calorimeter response is different for charged and neutral particles. The precision of the method requires that the track jet momentum resolution is much smaller than the calorimeter jet energy resolution, an excellent approximation for calorimeter jet momenta up to several hundred \({\text {Ge}} {\text {V}}\).

This approach was widely used in the measurement of the jet mass and substructure properties of jets in the 2011 data [45]. Performance studies [70] have shown that there is excellent agreement between the measured positions of clusters and tracks in data, indicating no systematic misalignment between the calorimeter and the inner detector. However, the use of track jets as reference objects is limited to a precision in the jet mass scale of around 3–7% in the central detector region due to systematic uncertainties arising from the inner-detector tracking efficiency [71] and confidence in MC modelling of the charged and neutral components of jets. The track jet double ratio is compared for two different MC generators: Pythia8 and Herwig++, and the larger disagreement between data and MC prediction is taken as the uncertainty. Figure 46 shows the jet mass scale uncertainty for anti-\(k_{t}\)  \(R = 1.0\) trimmed jets in different detector regions. The uncertainties are derived in bins of \(p_{{\text {T}}}\) , \(|\eta |\), and \(m/p_{{\text {T}}} \), and two examples are shown.

Fig. 46
figure 46

Jet mass scale (JMS) uncertainties for anti-\(k_{t}\)  \(R = 1.0\) trimmed jets (\(f_{\mathrm {cut}} = 0.05\) and \(R_{\mathrm {sub}} = 0.3\)) in different detector regions for a \(m/p_{\mathrm {T}} = 0.2\) and b \(m/p_{\mathrm {T}} = 0.6\)

9.5.2 Topological uncertainty

Similarly to the jet flavour composition uncertainty for small-R jets, an uncertainty in the jet energy response for different mixtures of quark/gluon jets, boosted top jets, and W jets is derived for large-R jets. Simulated \(t{\bar{t}}\) events are used to account for the different hard substructure and energy distributions within the W or top jets compared with quark/gluons jets which are taken from W + jet samples requiring exactly one lepton. The uncertainties are derived for anti-\(k_{t}\)  \(R = 1.0\) trimmed jets. Figure 47 shows the \(p_{{\text {T}}}\) dependance of the jet response in three \(\eta \) regions for four different kinds of jets: “full top” jets have the three quarks from the top decay contained within \(\Delta R = 0.8\) of the jet axis; “W-only” jets have the quarks from the W decay within \(\Delta R = 0.8\) of the jet axis but any b-quark must have \(\Delta R > 1.2\); “non-top” jets have the top quark separated from the jet axis by \(\Delta R > 2.0\); and, “QCD jets” are jets from a leptonically decaying W + jets sample. The topological uncertainty (Fig. 47) is determined by the envelope of the responses of these different types of jets.

Fig. 47
figure 47

One minus the jet \(p_{{\text {T}}}\) response (\(\langle p_{{\text {T}}} ^{{\mathrm {truth}}}- p_{{\text {T}}} ^{\text {reco}}\rangle /p_{{\text {T}}} ^{{\mathrm {truth}}}= 1-{\mathcal {R}}_{p_{{\text {T}}}}\)) for anti-\(k_{t}\)  \(R = 1.0\) trimmed jets with different flavour composition for a \(0.0< |\eta | < 0.8\), b \(0.8< |\eta | < 1.2\), c \(1.2< |\eta | < 2.0\). The categories in the plot are defined by (1) “\(t{\bar{t}}\) full top” jets (circles) that represent jets for which the three quarks from a hadronic top quark decay are contained within \(\Delta R = 0.8\) of the jet axis; (2) “\(t{\bar{t}}\) W-only” jets (squares), for which the quarks from the W boson decay are within \(\Delta R = 0.8\) of the jet axis while the b-quark fulfils \(\Delta R > 1.2\); (3) “\(t{\bar{t}}\) non-top” jets (lower triangles) that represent jets for which the top quark is \(\Delta R > 2.0\) from the jet; and, (4) “Wjets QCD” jets (upper triangles) representing jets from a leptonically decaying W boson in a W + jets MC sample. These are plotted as a function of reconstructed jet \(p_{{\text {T}}}\)\((p_{{\text {T}}} ^{\text {reco}})\), but due to the large bin size compared with the \(p_{{\text {T}}}\) resolution, the choice of plotting \(p_{{\text {T}}} ^{\text {reco}}\) or \(p_{{\text {T}}} ^{{\mathrm {truth}}}\) is of little significance

9.5.3 Combination

The jet \(p_{{\text {T}}}\) scale uncertainties are available within \(|\eta | < 2.0\) but the available data at high \(p_{{\text {T}}}\) (\(p_{{\text {T}}} > 800~{\text {Ge}} {\text {V}}\)) is limited for the direct \(\gamma \) + jet \(p_{{\text {T}}}\) balance method. By contrast, the uncertainties from the track jet double ratios cover \(p_{{\text {T}}} > 800~{\text {Ge}} {\text {V}}\). To benefit from the drastically reduced \(p_{{\text {T}}}\) scale uncertainties derived with \(\gamma \) + jet events, a linear interpolation is performed around \(p_{{\text {T}}} {} = 800~{\text {Ge}} {\text {V}}\) between the two methods. The uncertainty arising from the topological composition of the jet is added in quadrature to form the total uncertainty. This total uncertainty and its components are shown as a function of \(p_{{\text {T}}}\) in Fig. 48.

Fig. 48
figure 48

Combination of the uncertainties in the jet \(p_{{\text {T}}}\) scale for anti-\(k_{t}\)  \(R = 1.0\) trimmed jets for \(|\eta | = 0\) and two values of \(m/p_{{\text {T}}} \): a \(m/p_{{\text {T}}} = 0.20\) and b \(m/p_{{\text {T}}} = 0.60\)

10 Final jet energy resolution and its uncertainty

The measurement of the jet energy resolution (JER) in data is a multi-step process. As detailed in Sects. 6 and 7, the analyses employed to measure the JER are essentially the same as for the jet calibration, but the observable of interest is not the mean of the response observable but is its width. For the central rapidity region, the JER is measured with good precision using \(\gamma \) + jet and Z + jet events. In the forward pseudorapidity region and for high \(p_{{\text {T}}}\) , dijet events provide the most precise determination of the JER. For very low \(p_{{\text {T}}}\) jets there is a significant contribution to the jet energy resolution from \({\text {pile-up}}\) particles and electronic noise. Using the data taken in 2012, new methods have been developed to measure the \({\text {pile-up}}\) component.

The jet energy resolution is parameterized as a function of three terms [7],

$$\begin{aligned} \frac{\sigma _{p_{\text {T}}}}{p_{{\text {T}}}} = \frac{N}{p_{{\text {T}}}} \oplus \frac{S}{\sqrt{p_{{\text {T}}}}} \oplus C \ , \end{aligned}$$
(24)

where N parameterizes the effect of noise (electronic and \({\text {pile-up}}\) ), S parameterizes the stochastic effect arising from the sampling nature of the calorimeters, and C is a \(p_{{\text {T}}} \)-independent constant term. It is the determination of these terms in data that is the subject of this section.

In Sect. 10.1, the MC simulated jet energy resolution is discussed, followed by the determination of the noise term in data in Sect. 10.2. The combination of the measurements of the noise term and the Z + jet, \(\gamma \) + jet, and dijet measurements, described in Sects. 7 and 6, respectively, is detailed in Sect. 10.3. The uncertainty in the measurement of the jet energy resolution arising from the various in situ methods is propagated through the fit to the \(p_{{\text {T}}} \) dependence of the jet energy resolution.

10.1 JER in simulation

The jet energy resolution is measured in simulated event samples as described in Sect. 4.2, i.e. it is defined as the width parameter \(\sigma \) of a Gaussian fit to the jet energy response distribution restricted to the range \(\pm 1.5\,\sigma \) around the mean. Figure 49 shows the resolution determined using Pythia8 dijet MC samples both with full Geant4 detector simulation and with fast simulation. The two simulations generally agree very well, although there are some discrepancies in the very forward regions. The distribution is shown both with and without the GS correction, which significantly improves the resolution (decreasing the resolution of \(R=0.4\) EM + JES jets from 10% to 7% at 100 \({\text {Ge}} {\text {V}}\)), particularly for jets built from EM-scale clusters. The resolution is shown as functions of \(p_{{\text {T}}} ^{\text {truth}}\) and \(|\eta _{\text {det}}|\). As expected, the resolution improves quickly with increasing \(p_{{\text {T}}} ^{\text {truth}}\). The resolution for a fixed value of \(p_{{\text {T}}}\) gets better towards more forward regions (this is not the case for constant jet energy).

10.2 Determination of the noise term in data

Noise, both from the calorimeter electronics and from \({\text {pile-up}}\) , forms a significant component of the JER at low \(p_{{\text {T}}}\) . The noise term is not evaluated for \(R=1.0\) trimmed jets, as they are only used for \(p_{{\text {T}}} {}>200~{\text {Ge}} {\text {V}}{}\) at which point the noise term is negligible. It is quite challenging to measure the JER at low \(p_{{\text {T}}}\) with in situ techniques (Sect. 10.3) as uncertainties increase at low \(p_{{\text {T}}}\) and the stochastic and noise terms are correlated at intermediate \(p_{{\text {T}}}\) . Two alternative methods have hence been developed to target the noise term. These attempt to extract the noise at the constituent scale (the scale of the input topo-clusters) as explained in Sects. 10.2.1 and 10.2.2. They are translated into the effect on the jet resolution at the calibrated scale in Sect. 10.2.3. Good agreement is found between the methods, and a closure test is performed using MC simulations in Sect. 10.2.4, leading to a final value for the noise term in the jet energy resolution.

The JER noise term receives contributions from the cells inside the topo-clusters created by the actual truth-particle jet as well as from \({\text {pile-up}}\) . The noise term is significantly affected by the topo-cluster formation threshold as jets will contain a varying fraction of particles that have enough energy to form a topo-cluster. The noise term in data without \({\text {pile-up}}\) is denoted \(N^{\mu =0}\). As just mentioned, this term will be affected by a contribution corresponding to the number of constituent particles produced without enough energy to produce topo-clusters or that have been swept out of the cone by the magnetic field, and also by the electronic noise from the cells inside the topo-clusters. \({\text {Pile-up}}\) particles can result in increased noise of topo-clusters seeded by the truth-particle jet particles, and also create new topo-clusters that are included in the jet. The latter effects is assumed to dominate, and its contribution to the JER noise term is denoted \(N^{\mathrm {PU}}\). A third source of noise are topo-clusters created solely from electronic noise in the entire calorimeter. This is assumed to be a negligible effect as the topo-clusters require a calorimeter cell with \(4\,\sigma \) energy over noise, which is also confirmed in data from events without collisions. The following sections present two different measurements of \(N^{\mathrm {PU}}\).

Fig. 49
figure 49

Jet energy resolution measured in dijet MC samples as a function of \(p_{{\text {T}}} ^{\text {truth}}\) for a EM + JES and b LCW + JES jets (filled markers). The resolution in both events simulated with the full Geant4 toolkit (circles) and with fast simulation (squares) are shown. Additionally the improvement from the global sequential correction is shown (empty markers). Figures c and d show the dependence of the resolution on \(|\eta _{\text {det}}|\,\) for low-\(p_{{\text {T}}}\) (20–25 GeV) jets and the level of agreement between full simulation (circles) and fast simulation (squares)

10.2.1 \({\text {Pile-up}}\) noise measured using random cones in zero-bias data

In the random cone method, a cone of given size is formed at a random values of \(\eta \) and \(\phi \) in zero-bias data, and the energies of all clusters (at either EM or LCW scale) that fall within this cone are combined. The data was collected using a zero-bias trigger that records events occurring one LHC revolution after an event is accepted by a L1 electron/photon trigger. The total \(p_{{\text {T}}}\) of a random cone is hence expected to only capture contributions from \({\text {pile-up}}\) interactions. Since jets formed with the anti-\(k_{t}\) algorithm tend to be circular (Fig. 3a), fluctuations of the \(p_{{\text {T}}}\) in a random cone can be considered a measure of the expected \({\text {pile-up}}\) fluctuations that are captured by an anti-\(k_{t}\) jet with a radius parameter equal to the cone size.

The \(\eta \) of the cone is randomly sampled within the range for which the noise is being probed, and the random cone method proceeds by forming a second cone at \(\phi \,+\,\pi \) (“back-to-back” in azimuth to the first cone) but at a new random \(\eta \), also restricted to the \(\eta \) range probed. The effect of the noise in these cones is expected to be the same on averageFootnote 6, and the difference in the random cone \(p_{{\text {T}}} {}\), \(\Delta p_{{\text {T}}} {}\), is plotted. The difference between two cones is used to remove any absolute offset present as the jet calibration would remove any absolute bias affecting the jets. The noise is studied as a function of \(\eta \) by restricting the \(|\eta |\) values that can be chosen for the random cones as previously mentioned. Since the topo-clusters that enter the random cone have no origin correction applied (Sect. 4.3.1), the \(\eta \) of the random cone corresponds to \(\eta _{\text {det}}\) of a jet. An example of the distribution of this noise in data is shown in Fig. 50. Due to the random nature of the \({\text {pile-up}}\) energy deposits with significant energy over noise, the \(\Delta p_{{\text {T}}} {}\) distribution is not expected to be Gaussian. The 68% confidence interval of this distribution is defined as the width. Since \(\Delta p_{{\text {T}}} {}\) gives the fluctuations of two cones, this value is divided by \(\sqrt{2}\,\) to give an estimate of the noise term due to \({\text {pile-up}}\)\(N^{\text {PU}}\) at the constituent scale for a given jet.

The growth of this noise term at the constituent scale as a function of the average number of interactions per bunch crossing is shown in Fig. 51 separately for \(|\eta |<0.8\) and \(3.2<|\eta |<4.5\). From these results, it is clear that the MC simulations overestimate the influence of \({\text {pile-up}}\) events, and this effect is increased in the forward region. Also, the noise term at constituent scale is larger for LCW than EM topo-clusters, because the LCW weighting acts to increase the energy scale of the topo-clusters, which also increases the constituent-level noise term. The EM- and LCW-scale noise terms can only be fairly compared after applying the jet calibration factor, which is done later in Sect. 10.2.3. Figure 52 shows the average \({\text {pile-up}}\) noise fluctuations expected in different jets in 2012 for the different \(|\eta |\,\) regions. The data–MC agreement deteriorates in the more forward regions of the detector. This is likely to arise from poor modelling of the \({\text {pile-up}}\) being exacerbated in this region due to the change in detector granularity and noise thresholds.

To extract the \({\text {pile-up}}\) noise term for average 2012 conditions, the noise term in random cones is extracted from the total 2012 zero-bias dataset. To ensure that the \(\mu \) distribution used in other in situ measurements (dijet, \(Z \) + jet, and \(\gamma \) + jet) is identical to that in the zero-bias dataset, a reweighting is applied dependent on the \(\mu \) distribution. This reweighting has a very small effect as the zero-bias trigger and prescales are designed to produce a dataset which mimics the \(\mu \) distribution of the full dataset used for physics. In addition, to enable a direct comparison between data and MC simulations, the simulated \(\mu \) distribution is reweighted to that of the data.

Fig. 50
figure 50

The balance of random cones of size 0.4 (\(\Delta p_{{\text {T}}} {}=p_{{\text {T}}} ^1-p_{{\text {T}}} ^2\)) in the central region \(|\eta |<0.8\) in 2012 zero-bias data using EM and LCW clusters. The non-Gaussian shape of this distribution is demonstrated by the inclusion of a Gaussian fit (dashed lines) to the data (solid lines)

Fig. 51
figure 51

The magnitude of the expected fluctuations in different jet radii at the constituent scale derived using the random cone procedure as a function of \(\mu \), for data (filled markers) and MC simulations (open markers). The results are shown for \(R=0.4\) and \(R=0.6\) cones and at both the EM and LCW scales probing two calorimeter \(|\eta |\) regions, one central (\(|\eta |<0.8\)) and one forward (\(3.2<|\eta |<3.6\)). A scale factor of 1.09 has been applied to \(\mu \) in the MC simulations to correct for extra activity observed in the minimum-bias tune

Fig. 52
figure 52

The magnitude of the expected fluctuations within different jet radii at the constituent scale as a function of \(|\eta |\) for data (filled markers) and MC simulations (open markers). The results are shown for \(R=0.4\) and \(R=0.6\) cones and at both the EM and LCW scales

10.2.2 \({\text {Pile-up}}\) noise term measurements using the soft jet momentum method

As explained in Sect. 4.3.2, the event \(p_{{\text {T}}}\) -density \(\rho \) is obtained by reconstructing jets using the \(k_t\) algorithm without applying any jet \(p_{{\text {T}}}\) threshold and defining \(\rho \) to be the median of the jet \(p_{{\text {T}}}\) -density \(p_{{\text {T}}}/A\), where A is the area of the jet. Starting from this quantity, the noise term of the JER due to \({\text {pile-up}}\)\(N^{\text {PU}}\) is extracted by defining a new observable \(\sigma _\rho \) that is a measure of the fluctuations in \(p_{{\text {T}}}\) per unit area assuming a stochastic model of noise. Due to using the median (rather than the mean) in its definition, \(\rho \) is to first order insensitive to the hard process. Any type of data can in principle be used for the measurement. The results presented in this section are based on \(Z\rightarrow \mu \mu \) data. The following steps are performed:

  • Jets are reconstructed using the Cambridge–Aachen algorithm [72] with \(R=0.6\) and required to have \(|\eta |<2.1\). No \(p_{{\text {T}}} \) threshold is applied, and the jet \(p_{{\text {T}}}\) extends down to zero.

  • For each jet, the quantity \(r=(p_{{\text {T}}}-\rho \, A)/\sqrt{A}\) is calculated, where A is the jet area defined using the Voronoi procedure [54]. Since no jet \(p_{{\text {T}}}\) threshold is applied, many jets will be built from noise only. The distribution of r is expected to be centred at zero since after subtracting \(\rho \, A\) there should be as many jets above the \(p_{{\text {T}}}\) density as below.

  • The observable \(\sigma _\rho \) is defined event-by-event from the width of the r distribution of all jets in the event. To avoid complications of non-Gaussianity and the hard-scatter event biasing the upper side tail, \(\sigma _\rho \) is defined by half the difference between the 84% and 16% quantile points.

The size of the expected fluctuations at the constituent scale of a given jet is given by \(\sigma _\rho \,\sqrt{A}\). The distributions of \(\sigma _\rho \,\) for EM-scale and LCW-scale clusters in \(Z\rightarrow \mu \mu \) data and Pythia8 samples are shown in Fig. 53. \(Z\rightarrow \mu \mu \) events are used to select an unbiased set of events for data-to-MC comparison, thus avoiding the use of any jet-based trigger which would bias the jet distributions. As in the random cone method (Fig. 51), the \({\text {pile-up}}\) noise is overestimated in the MC simulations. An estimate of the noise term due to \({\text {pile-up}}\) is obtained by scaling the mean value of the \(\sigma _\rho \) distribution by \(\sqrt{\pi R^2}\).

Fig. 53
figure 53

Extracted values of \(\sigma _\rho \) in data (points) and MC simulations (histogram) for a sample of \(Z\rightarrow \mu \mu \,\) events at the EM scale (left) and the LCW scale (right). This observable quantifies the fluctuations of the \(p_{{\text {T}}}\) density \(\rho \), i.e. the \(p_{{\text {T}}}\) per area in \((y,\phi )\)-space

Table 4 Measurements of \(\langle \sigma _\rho \rangle \) and \(\langle \sigma _\rho \rangle \,\sqrt{A}\,\), where \(\langle \sigma _\rho \rangle \) is the mean of the \(\sigma _\rho \) distribution, and the random cone results, both using data and MC simulations. The area is defined by \(A=\pi R^2\), where R is the radius parameter. The \(\sigma _\rho \, \sqrt{A}\) results, which is a noise term measurement from the soft jet momentum method, is extracted using the region \(|\eta |<2.1\) while the noise term measurement using the random cone method is extracted for jet \(|\eta |<0.8\). Statistical uncertainties of both measurements are negligible

10.2.3 Comparison of methods and construction of the noise term

As described in the previous two sections, the random cone and the soft jet momentum methods can both be used to measure the noise term of the jet energy resolution. It is useful to compare their results and to contrast the two methods. As well as using different data samples, these methods make quite different assumptions about the underlying physics:

  • The soft jet momentum method implicitly assumes the \({\text {pile-up}}\) noise is stochastic (such that it grows with \(\sqrt{A}\)).

  • The random cone method measures the noise in several \(\eta \)-bins, while the soft jet momentum method does not consider any \(\eta \)-dependence of the noise within the probed detector region \(|\eta |<2.1\).

  • The symmetry assumption of the two cones back-to-back in azimuth in the zero bias events is not required by the soft jet momentum method.

Further, while the soft jet momentum method gives an estimate of the noise term in each event (as is done for the calculation of \(\rho \)), the random cone method gives the noise term over an event sample. Table 4 compares the measured noise term at the constituent scale using the two methods. The two sets of measurements agree at the level of 20%.

10.2.4 Closure test of the \({\text {pile-up}}\) noise measurement in MC simulation

A closure test is performed on the \({\text {pile-up}}\) noise measurements by comparing the random cone result with the \({\text {pile-up}}\) noise extracted using truth-particle jets in MC simulation. The \({\text {pile-up}}\) noise in MC simulations is extracted by measuring the MC JER (Sect. 4.2) in two Pythia8 dijet samples: one without \({\text {pile-up}}\) and one sample with 2012 \({\text {pile-up}}\) conditions. By subtracting the JER measured in the sample without \({\text {pile-up}}\) from the JER measured in the sample with \({\text {pile-up}}\) , the contribution from the \({\text {pile-up}}\) noise is isolated and can be compared with the measurement of the noise term using the random cone method. However, this comparison cannot be done directly since the random cone measures the noise at constituent scale (EM or LCW), while the JER is measured at the fully calibrated scale (EM + JES or LCW + JES). To account for this mismatch in scale, the random cone measurements are scaled by the average MC calibration factor \(\langle c_{\text {JES}}\rangle \) evaluated for the jets in the kinematic region of interest. The results of these tests are shown in Fig. 54 as a function of \(p_{{\text {T}}}\) for both EM and LCW jets. The relevant comparison is that of the estimated noise term \(N^{\text {PU}}\) and the quadrature difference of the MC JER measurements with and without \({\text {pile-up}}\) . In the central region \(|\eta |<0.8\), good closure is observed, both for EM + JES and LCW + JES. In this region, the calorimeters have high granularity, and as a consequence energy clusters from \({\text {pile-up}}\) and from the hard-scatter signal tend to form separately with little overlap. Slightly larger non-closure is observed towards the more forward regions, which is expected due to the coarser angular granularity and higher noise thresholds, which result in a larger overlap between energy deposits from \({\text {pile-up}}\) and the hard scatter.

The same closure test was performed for the \(N^{\text {PU}}\) measured with the soft jet momentum method, and the difference between the results is taken as a systematic uncertainty due to the arbitrariness of the selection of method. Additionally, the degree of non-closure of the method is taken as a systematic uncertainty.

Fig. 54
figure 54

Comparison between the \({\text {pile-up}}\) noise term \(N^{\text {PU}}\) extracted using the random cone method (upward triangles) with the expectation from MC simulation (downward triangles). Results are shown for jets built from EM (left) and LC (right) topo-clusters, for jets with \(|\eta _{\text {det}}{}|<0.8\) (top) and \(1.2<|\eta _{\text {det}}{}|<2.1\) (bottom). The expected \(N^{\text {PU}}\) is obtained by quadrature subtraction of the JER obtained from MC simulation of events with nominal \({\text {pile-up}}\) (circles) from that of events with no \({\text {pile-up}}\) (squares). Fits performed to the measured and expected \({\text {pile-up}}\) noise data are displayed as dotted curves. Quadrature differences corresponding to points where, due to statistical fluctuations, the resolution is worse in the no \({\text {pile-up}}\) scenario are not displayed

10.2.5 Noise term in the no \({\text {pile-up}}\) scenario

The random cone and soft jet momentum methods provide measurements of the part of the noise term arising from \({\text {pile-up}}\) activity \(N^{{\text {PU}}}\). In the dijet MC sample without \({\text {pile-up}}\) , for which \(\mu = 0\), the noise term does not have any \({\text {pile-up}}\) contribution but does include other effects such as electronic noise on the signal clusters and threshold effects. To get a handle on the additional noise terms not included in the random cone or soft jet methods, the \(\mu =0\) MC simulated resolution is fitted with the standard N, S and C parameterization of Eq. (24) to extract the no \({\text {pile-up}}\) noise term \(N^{\mu =0}\). The result of such fits are presented in Table 5.

Table 5 The noise term \(N^{\mu =0}\) in \({\text {Ge}} {\text {V}}\) extracted in a dijet MC sample without \({\text {pile-up}}\) . The values and uncertainties are extracted from a fit. For data, an additional 20% uncertainty is assigned, based on the 2010 measurements [7]

The total jet energy resolution (Eq. (24)) was measured in 2010 and agreed between data and MC simulations within 10% for jet \(p_{{\text {T}}}\) in the range \(30~{\text {Ge}} {\text {V}}{}<p_{{\text {T}}} <500~{\text {Ge}} {\text {V}}{}\) [7]. For \(p_{{\text {T}}} =30~{\text {Ge}} {\text {V}}\) in the central region, the noise term is responsible for more than half of the total resolution. Given that the dominant resolution source leads to a total resolution modelled to the level of 10%, this implies that the noise term itself agrees between data and MC simulation to the level of 20% in simulated samples without \({\text {pile-up}}\) . This conclusion is also supported by single-particle measurements [68]. This extrapolation includes some additional assumptions in the MC modelling of the detector as several settings changed between 2010 and 2012, most notably the topo-cluster noise thresholds; however, 20% is considered a conservative estimate of the uncertainty in this component.

The total JER noise term N is defined by combining the noise term extracted in the no \({\text {pile-up}}\) sample with that originating from \({\text {pile-up}}\) (measured above) using a sum in quadrature, i.e. \(N = N^{\text {PU}}\oplus N^{\mu =0}\).

Fig. 55
figure 55

The \(\chi ^2\,\) per degree of freedom for the jet energy resolution showing the compatibility of the three in situ measurements of the jet energy resolution (dijet asymmetry, Z + jet balance, and \(\gamma \) + jet balance) for jets calibrated with the a EM + JES and b LCW + JES schemes

Fig. 56
figure 56

The three eigenvectors after the eigenvector reduction of the nuisance parameters in the jet energy resolution measurement for LCW + JES \(R=0.4\) jets for four different \(|\eta |\) bins. These nuisance parameters fully describe the correlations

Fig. 57
figure 57

The jet resolution as a function of \(p_{{\text {T}}}\) for the four different jet collections in the central region. The three in situ inputs to the measurement, namely Z + jet (empty squares), \(\gamma \) + jet (filled triangles), and dijet balance (empty triangles) are shown displaying the compatibility of the measurements. The final fit using the function in Eq. (24) is included with its associated statistical and total uncertainty

Fig. 58
figure 58

The jet resolution as a function of \(p_{{\text {T}}}\) for LCW + JES anti-\(k_t\) \(R=0.4\) jets in three more-forward regions. The dijet in situ inputs (empty triangles) to the measurement are shown. The final fit using the function in Eq. (24) is included with its associated statistical and total uncertainty

10.3 Combined in situ jet energy resolution measurement

The JER measurements based on the bisector method in dijet events reported in Sect. 6.6 and the vector boson plus jet balance reported in Sect. 7.5 are statistically combined using a chi-squared minimization of the function in Eq. (24). In this fit, the noise term is held at the central value found in the previous section, while measurements of the \(S\,\) and \(C\,\) terms are extracted. The uncertainties in each term are evaluated in the same way they were in the JES determination in Sect. 9.1, i.e. by re-evaluating the JER measurement after a 1\(\sigma \) shifts of each individual uncertainty source. The degree of agreement between the three in situ measurements is in Fig. 55, which shows the \(\chi ^2\) per degree of freedom as a function of \(p_{{\text {T}}}\). The low values of the \(\chi ^2\,\) per degree of freedom across the \(p_{{\text {T}}}\) range demonstrates that the in situ methods agree well. As expected, there is a large anti-correlation between the S and C parameters of \(-0.25\) (\(-0.44\)) for EM + JES (LCW + JES) calibrated jets, and the \(\chi ^2\,\) per degree of freedom for the fit to find N, S and C is 8/35 (15/35) when correlations are not considered, and 71/35 (58/35) when correlations are considered. The relatively large size of the \(\chi ^2\) per degree of freedom when correlations are considered indicates a limitation in the fitting function used. It is a possible indication of the need for higher-order terms in the series to better describe the resolution dependence on \(p_{{\text {T}}}\). A similar effect is seen when looking at the fit to these three parameters in MC simulations.

When propagating the uncertainty in the noise term to the fit the resulting changes in the fitted values of N, \(S\,\) and \(C\,\) for anti-\(k_t\) \(R=0.4\,\) EM + JES (LCW + JES) jets are \(\genfrac{}{}{0.0pt}{}{\,+\,0.63}{-0.63}\), \(\genfrac{}{}{0.0pt}{}{-0.038}{\,+\,0.030}\), \(\genfrac{}{}{0.0pt}{}{\,+\,0.001}{-0.001}\) \(\Big (\genfrac{}{}{0.0pt}{}{\,+\,0.74}{-0.74}\), \(\genfrac{}{}{0.0pt}{}{-0.048}{\,+\,0.039}\), \(\genfrac{}{}{0.0pt}{}{\,+\,0.002}{-0.002}\Big )\). Again, correlations between the different components are observed, namely increasing \(N\,\) results in a reduced \(S\,\) and increased C.

To reduce the number of parameters which need to be propagated, the full set of eigenvectors is built according to the total effect on the JER measurement of each uncertainty component (rather than the effect of each component on the N, \(S\,\) and C terms individually). These uncertainty sources can then be reduced in number by using an eigenvector decomposition (diagonalization) as was done for the JES. This allows the full correlations to be retained and propagated to analyses. Figure 56 shows the three eigenvectors after this diagonalization. Combining in quadrature the results from varying \(N\,\) and propagating the in situ uncertainties gives \(N=3.33\pm 0.63~(4.12\pm 0.74)~{\text {Ge}} {\text {V}}\), \(S=0.71\pm 0.07~(0.74\pm 0.10)~\sqrt{{\text {Ge}} {\text {V}}}\), and \(C=0.030\pm 0.003~(0.023\pm 0.003)\) for anti-\(k_t\) \(R=0.4\) EM + JES (LCW + JES) jets. Figure 57 shows the individual measurements of the resolution in the central region, the result of the combination, and the associated uncertainty. The uncertainty in the jet energy resolution for anti-\(k_t\,\) \(R=0.4\,\) jets is less than 0.03 at \(20~{\text {Ge}} {\text {V}}\) and below 0.01 above \(100~{\text {Ge}} {\text {V}}\).

When considering the more forward \(|\eta |\,\) bins, the large statistical uncertainty in \(Z \) + jet and \(\gamma \) + jet events means that only dijet measurements are useful. These are combined with the measured noise term in data in the same way as in the central region. For LCW + JES anti-\(k_t\,\) \(R=0.4\,\) jets all the different regions are shown in Fig. 58 and the extracted N, S, and C parameters for all jet collections are shown in Tables 6 and 7.

Finally, to account for correlations between the measurements at different \(|\eta |\,\) a correlation matrix as a function of \(p_{{\text {T}}} \) and \(|\eta |\,\) is built. The systematic uncertainties of the noise term and dijet balance results are assumed to be correlated between \(|\eta |\,\) regions. The eigenvector reduction is performed, which results in, at most, 12 uncertainty components required to capture all the correlations between the \(p_{{\text {T}}}\) and \(|\eta |\,\) regions covered by the in situ studies.

Table 6 Extracted values of the N, S, and C terms from a combined fit to the jet energy resolution measurements for \(R=0.4\) and \(R=0.6\) jets, both calibrated with the EM + JES scheme. The quoted uncertainties of the N, S, and C terms are highly correlated with each other
Table 7 Extracted values of the N, S, and C terms from a combined fit to the jet energy resolution measurements for \(R=0.4\) and \(R=0.6\) jets, both calibrated with the LCW + JES scheme. The uncertainties shown are highly correlated between the N, S, and C terms

11 Conclusions

This article describes the determination of the jet energy scale (JES) and jet energy resolution (JER) for data recorded by the ATLAS experiment in 2012 at \(\sqrt{s} = 8~{\text {Te}} {\text {V}}\).

The calibration scheme used for anti-\(k_{t}\) jets reconstructed using radius parameter \(R=0.4\) or \(R=0.6\) corrects for \({\text {pile-up}}\) and the location of the primary interaction point before performing a calibration based on MC simulation. These initial steps in the calibration provide stability of the calibration as a function of \({\text {pile-up}}\) and improve the angular resolution of jets. Following the MC-simulation-derived baseline calibration, a global sequential correction is performed. It is also derived from MC simulations using information about how the jet deposits energy in the calorimeter, the tracks associated with the jet, and the activity in the muon chambers behind the jet (particularly important for high-\(p_{{\text {T}}} \) jets). This improves the resolution of jets and reduces the difference in energy scale between quark- and gluon-initiated jets.

Following these MC-based calibration steps, the data taken in 2012 are used to perform a residual calibration that constrains the uncertainties. This is performed for anti-\(k_{t}\) jets with \(R=0.4\) and \(R=0.6\) calibrated with both the EM + JES and LCW + JES schemes. Dijet events are used to calibrate jets in the forward region relative to the central region as a function of jet transverse momentum and pseudorapidity. The uncertainties of this calibration step have been significantly reduced compared with previous results primarily though the use of event generators with improved modelling of multijet production. The total uncertainties are typically below \(1\%\) for central jets, rising to \(3.5\%\) for low-\(p_{{\text {T}}}\) jets at high absolute pseudorapidity. Central jets are calibrated by exploiting the balance between jets recoiling against either a photon or a Z boson. In the pseudorapidity region \({0.8}\le |\eta _{\text {det}}{}|<{2.8}\), the jet energy scale is validated with \(Z \) + jet events using the direct \(p_{{\text {T}}}\) balance technique. The jet energy scale calibration for central jets with high \(p_{{\text {T}}}\) is determined using events in which an isolated high-\(p_{{\text {T}}}\) jet recoils against a system of lower-\(p_{{\text {T}}}\) jets. The final calibration is obtained through a statistical combination of the different measurements. This results in a correction at the level of 0.5% to the JES in data with an associated uncertainty of less than 1% for central anti-\(k_t\) \(R=0.4\) jets with \(100<p_{{\text {T}}} <1500~\hbox {GeV}\). At higher \(p_{{\text {T}}}\), the uncertainty is about 3% as in situ measurements become statistically limited, and instead the calibration relies on single-hadron response studies.

The jet energy scale of trimmed anti-\(k_{t}\) jets with \(R=1.0\) is derived using MC simulation in the same way as for \(R=0.4\) and \(R=0.6\) jets, thus calibrating the jets to the LCW + JES scale. In an additional step, a dedicated calibration of the jet mass for the \(R=1.0\) jets is derived. The MC-derived calibration is tested in situ using the direct balance method with \(\gamma \) + jet events. These studies are used to evaluate uncertainties in the calibration. The total uncertainty for \(|\eta _{\text {det}}{}|<{0.8}\) is found to be around \(3\%\) for jets with low \(p_{{\text {T}}}\) , falling to about \(1\%\) for jets with \(p_{{\text {T}}} \ge 150~{\text {Ge}} {\text {V}}\). At larger \(|\eta _{\text {det}}|\), the uncertainty increases to \(4{-}5\%\) at low jet \(p_{{\text {T}}}\) , decreasing to \(1{-}2\%\) for \(p_{{\text {T}}} > 150~{\text {Ge}} {\text {V}}\).

The JER is measured in 2012 data using several in situ methods. The JER \({\text {pile-up}}\) noise term is determined using novel techniques that exploit the increased level of \({\text {pile-up}}\) interactions in the 2012 data. Three measurements of the JER as a function of jet \(p_{{\text {T}}}\) and \(\eta _{\text {det}}\) are performed using \(\gamma \) + jet, Z + jet and dijet data. A final measurement of the JER is obtained using a statistical combination of these measurements, using a methodology similar to that used for the JES calibration. The different in situ inputs are found to be consistent with each other over the kinematic regions where they overlap. For anti-\(k_{t}\)  \(R=0.4\) jets in the central calorimeter region \(|\eta |<0.8\) calibrated with the EM + JES calibration scheme, the JER resolution parameters are measured to be \(N=3.33\pm 0.63~{\text {Ge}} {\text {V}}\), \(S=0.71\pm 0.07~ \sqrt{{\text {Ge}} {\text {V}}}\), and \(C=0.030\pm 0.003\), which corresponds to a relative JER of \(\sigma _{p_{\text {T}}} / p_{{\text {T}}} = (23 \pm 2)\%\) for \(p_{{\text {T}}} = 20~{\text {Ge}} {\text {V}}\) and \(\sigma _{p_{\text {T}}} / p_{{\text {T}}} = (8.4 \pm 0.6)\%\) for \(p_{{\text {T}}} = 100~{\text {Ge}} {\text {V}}\). The jet energy resolution in data is generally well reproduced by the MC simulation. In certain kinematic regions, the simulated jets have a slightly smaller resolution than jets in data. In physics analyses, the \(p_{{\text {T}}}\) of the simulated jets is corrected by random smearing to match the resolution observed in data. The required amount of smearing is of similar order of magnitude as the jet energy resolution uncertainties.