1 Introduction

The energetic proton–proton (pp) collisions produced by the Large Hadron Collider (LHC) predominantly result in a large number of charged and neutral hadrons which form collimated sprays known as jets. In the ATLAS Experiment [1] the energy of these particles is measured using the calorimeters, and a tracking system measures the momentum of the charged particles. A particle-flow algorithm [2] is used to take advantage of these two systems in the reconstruction of jets. The measurement of hadronic energy deposits is essential for the reconstruction of jets and the jet energy scale calibration relies on the accurate simulation of hadrons interacting with the calorimeter [3]. A powerful method of understanding the calorimeter response to hadrons is to consider the ratio of the energy reconstructed in the calorimeter to the momentum measured in the well-aligned tracking detector [4]. Previously, this was studied using isolated single hadrons from inclusive pp collisions [5, 6]. In this paper, the large dataset accumulated in Run 2 is exploited to select events with isolated charged pions from \(\tau \)-lepton decays so that a much higher energy regime can be probed.

The calorimeter response to electromagnetic particles such as electrons and photons is very well known due to their showers being easier to simulate accurately and precise measurements in situ using \(Z\rightarrow ee\) events. The energy scale of electrons is therefore known with an uncertainty of less than 0.1% for transverse momenta \(25<p_{\text {T}} <70\) \(\text {GeV}\) in the central region [7]. The large variety of physics processes in hadronic interactions and the non-compensating nature of the ATLAS calorimeters make the response to hadrons harder to model in simulation, motivating in situ measurements using highly energetic single pions from \(\tau \)-lepton decays.

In this paper the calorimeter response across a wide range of transverse momenta is investigated. W bosons are produced copiously at the LHC and can decay via \(\tau \)-leptons to single charged pions, providing a source of isolated pions: \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm \nu _\tau )\nu _\tau \). Using the Run 2 dataset recorded in 2015–2018, large samples of pions are selected in order to accurately measure the calorimeter response using the ratio of calorimeter energy to track momentum. The measured response is corrected for the contributions from the multiple pp interactions per bunch crossing and the residual signals remaining in the calorimeter from adjacent bunch crossings, contributions collectively referred to as pile-up. The charged-pion response is measured as a function of track pseudorapidity, \(\eta ^{\text {trk}}\), across the tracker volume and as a function of track transverse momentum, \(p_{\text {T}}^{\text {trk}}\), in the barrel and endcaps. The average particle energy in the highest \(p_{\text {T}}^{\text {trk}}\) bin in the endcaps is 680 (670) \(\text {GeV}\) in data (simulated data).

Previous measurements of the single-hadron response using this ratio were performed using 2010 and 2012 data [5, 6]. These measurements focused on special runs with low instantaneous luminosity and identified isolated tracks from minimum-bias events. As high-momentum isolated tracks are not frequently produced in pure QCD interactions due to the formation of jets which consist of multiple closely spaced particles, they probed the low-momentum phase space with \(p<30\) \(\text {GeV}\). Test-beam measurements using the Super Proton Synchrotron (SPS) probed the response to hadrons with energies, E, up to 350 \(\text {GeV}\) [8,9,10,11,12]. For the combined, electromagnetic and hadronic, barrel calorimeter, the precision of these measurements ranged from 2.8% at \(E=20\) \(\text {GeV}\) to 1.4% at \(E=350\) \(\text {GeV}\) [12]. The uncertainties were dominated by the non-uniformity of the calorimeter, imperfect knowledge of the effect of the material in front of it, and, for positive beams, the fraction of protons. In this paper the response is measured in situ with a pure \(\pi ^\pm \) sample. Additionally, the latest ATLAS detector simulation with updated geometry [13] and with the latest set of hadronic shower simulation models (called the physics list [14]) as used in all recent physics analyses, is used in the comparisons of data with simulation.

Precise knowledge of the calorimeter response at high \(p_{\text {T}}\) is essential for many ATLAS physics analyses. The uncertainty in the energy scale calibration of hadronic jets, the jet energy scale (JES) [3], is one of the primary experimental uncertainties in many searches for new physics and in measurements of Standard Model processes [15,16,17,18]. In situ techniques are used to derive corrections to the transverse momentum of jets to account for differences between data and simulation, as well as uncertainties in these corrections. For the highest-\(p_{\text {T}}\) jets in the \(\text {TeV}\) regime, where in situ methods exploiting \(p_{\text {T}}\) balance cannot be used due to a lack of data because of the low cross-section, the determination of the uncertainties in the momentum relies on measurements of the calorimeter energy scale from single hadrons. These are based on the low-momentum in situ measurements, test-beam results and conservative extrapolations, further motivating this measurement. Also, the calibration of hadronically decaying \(\tau \)-leptons relies on these measurements at high \(p_{\text {T}}\)  [19]. Additionally, measurements of the internal structure of jets rely on knowledge of the calorimeter energy scale [20]. Likewise, when tagging hadronically decaying heavy particles [21] the jet substructure variables that are used cannot be corrected easily for calorimeter scale discrepancies, so they depend on good modelling of the underlying calorimeter energy scale.

The paper is organised as follows. Section 2 introduces the ATLAS detector, and Sect. 3 describes the dataset and samples of simulated events. The selection of events for the measurement and the variable of interest are defined in Sect. 4. The energy response is evaluated in Sect. 5 and uncertainties in the measurements are evaluated in Sect. 6. The width of the distribution of the variable of interest is measured in Sect. 7 and the longitudinal segmentation of the calorimeters is exploited in Sect. 8 to investigate discrepancies between simulation and data. Finally, the conclusions are presented in Sect. 9.

2 ATLAS detector

The ATLAS experiment [1, 22, 23] at the LHC is a multipurpose particle detector with a forward–backward symmetric cylindrical geometry and a near \(4\pi \) coverage in solid angle.Footnote 1 It consists of an inner tracking detector surrounded by a thin superconducting solenoid providing a 2 T axial magnetic field, electromagnetic and hadronic calorimeters, and a muon spectrometer.

The inner tracking detector covers the pseudorapidity range \(|\eta | < 2.5\). Radially outwards, it consists of silicon pixel, silicon microstrip, and transition radiation tracking (TRT) detectors.

Lead/liquid-argon (LAr) sampling calorimeters provide electromagnetic (EM) energy measurements with high granularity. A central barrel covers \(|\eta |<1.475\) and endcaps cover \(1.375<|\eta |<3.2\). A steel/scintillator-tile (Tile) hadron calorimeter covers the central pseudorapidity range with one barrel covering \(|\eta | < 1\) and extended barrels also covering \(0.8<|\eta |<1.7\). These are longitudinally segmented in shower depth into three layers. Hadronic calorimetry in the endcaps (HEC) is provided by a copper/liquid-argon calorimeter which consists of four layers. The forward regions are instrumented with LAr calorimeters for both the EM and hadronic energy measurements up to \(|\eta | = 4.9\). Additionally pre-sampler and crack scintillators recover energy lost in non-instrumented regions.

The muon spectrometer surrounds the calorimeters and is based on three large air-core toroidal superconducting magnets with eight coils each. The field integral of the toroids ranges between 2.0 and 6.0 T m across most of the detector. The muon spectrometer includes a system of precision tracking chambers and fast detectors for triggering.

A two-level trigger system is used to select events [24]. The first-level trigger is implemented in hardware and uses a subset of the detector information to accept events at a rate below 100 kHz. This is followed by a software-based trigger that reduces the accepted event rate to 1 kHz on average, depending on the data-taking conditions.

An extensive software suite [25] is used for real and simulated data reconstruction and analysis, for operation and in the trigger and data acquisition systems of the experiment.

3 Data and samples of simulated events

The pp collision data analysed were recorded with the ATLAS detector from 2015 to 2018 at a centre-of-mass energy, \(\sqrt{s}\), of 13 \(\text {TeV}\). The average number of pp interactions per bunch crossing was 33.7Footnote 2 and it varied from \(\approx 10\) to \(\approx 65.\) Events were selected by missing transverse momentum (\(E_{\text {T}}^{\text {miss}}\)) triggers [27]. After the application of data-quality requirements [28], the data sample corresponds to an integrated luminosity of 139 fb\(^{-1}\) with an uncertainty of 1.7% [26] obtained using the LUCID-2 detector [29] for the primary luminosity measurements.

Samples of simulated events created using Monte Carlo techniques are used to model the Standard Model processes. The main signal processes are \(W(\rightarrow \tau \nu _\tau )+\)jets and top pair production (\(t\bar{t}\)), with smaller signal contributions from single top production and small background contributions from \(W(\rightarrow e\nu _e,\mu \nu _\mu )+\)jets and \(Z(\rightarrow \nu \nu )+\)jets. Other processes are cross-checked with simulated event samples and found to be negligible.

The production of \(W+\)jets and \(Z+\)jets was simulated with the Sherpa 2.2.1 [30, 31] generator using matrix elements (MEs) with next-to-leading-order (NLO) accuracy for up to two jets, and matrix elements with leading-order accuracy for up to four jets, calculated with the Comix [32] and OpenLoops [33, 34] libraries. They were matched with the Sherpa parton shower [35] using the MEPS@NLO prescription [36,37,38,39] with the set of tuned parameters (tune) developed by the Sherpa authors. All polarisation effects in \(\tau \)-lepton production were retained by using a full matrix element calculation, and were propagated to its decay in the HADRONS++ module, which implements intermediate hadronic resonances and spin-correlation effects. The NNPDF3.0nnlo set of parton distribution functions (PDFs) [40] was used and the samples are normalised to a next-to-next-to-leading-order (NNLO) prediction [41].

The production of \(t\bar{t}\) and single-top-quark events was modelled using the Powheg Box v2 [42,43,44,45] generator at NLO with the NNPDF3.0nlo [40] set of PDFs. These events were processed with Pythia 8.230 [46] using the A14 tune [47] and the NNPDF2.3lo set of PDFs [48]. The decays of bottom and charm hadrons were modelled using the EvtGen 1.6.0 [49] program. In \(t\bar{t}\) events the \(h_\mathrm {damp}\) parameterFootnote 3 was set to 1.5 \(m_\mathrm{top}\) [50]. The \(t\bar{t}\) process is normalised to the inclusive cross-section calculation at NNLO in QCD including the resummation of next-to-next-to-leading logarithmic (NNLL) soft-gluon terms from Top++2.0 [51,52,53,54,55,56,57]. The single-top-quark production processes are normalised to the inclusive cross-sections calculated at NLO in QCD with NNLL soft-gluon corrections [58, 59]. For single top quark production in the Wt-channel events, the diagram removal scheme [60] was used to remove overlap with \(t\bar{t}\) production. In all samples the top quark mass is 172.5 \(\text {GeV}\).

To study the generator dependence of the analysis an alternative sample of QCD W+jets events was simulated by MadGraph5_aMC@NLO 2.2.2 [61] using LO-accurate MEs with up to four final-state partons and the NNPDF3.0nlo set of PDFs. The events were interfaced to Pythia 8.186 [62], with the A14 tune using the NNPDF2.3lo PDF set, for the modelling of the parton shower, hadronisation, and underlying event. The overlap between matrix element and parton shower emissions was removed using the CKKW-L merging procedure [63, 64]. The decays of bottom and charm hadrons were performed by EvtGen 1.2.0 [49].

All samples were passed through a detailed simulation of the ATLAS detector [65] based on Geant4  [66]. The modelling of hadron interactions was done by the FTFP_BERT_ATL [14] physics list. In this physics list the Bertini cascade model [67] is used for hadrons with energy below 12 \(\text {GeV}\), and the Fritiof string model [68, 69] for hadrons with energy above 9 \(\text {GeV}\). The probability of using each model changes smoothly across the region of overlap.

Furthermore, simulated inclusive inelastic pp collisions were overlaid to model additional pile-up collisions in the same and neighbouring bunch crossings. These were generated with Pythia 8.210 [46] with the A3 tune [70] and NNPDF2.3lo PDF set.

4 Event selection, observables and response extraction

Events are selected using the properties of reconstructed objects to obtain a high-purity sample of \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm \nu _\tau )\nu _\tau \) decays which are used to measure the calorimeter response. The calorimeter energy associated with the track is summed and a fit to the energy divided by the track momentum is performed to extract the calorimeter response. The reconstructed objects used to form the variable of interest are described as follows:

  • Charged-pion tracks are reconstructed by an iterative track-finding algorithm seeded by measurements in the silicon layers of the inner detector [71, 72]. The precise alignment of the inner detector ensures that these tracks are accurately reconstructed with a residual sagitta bias and momentum scale bias of less than 0.1 \(\text {TeV}\) \({}^{-1}\) and \(0.9\times 10^{-3}\) respectively [4].

  • Calorimeter topoclusters are clusters of connected calorimeter cells throughout both the EM and hadronic calorimeters. They are seeded from cells with reconstructed energy significantly above the noise [73]. Their energy is the sum of the energies of the constituent cells, which are calibrated at the electromagnetic scale. This scale is defined such that the response to electromagnetic showers is correctly calibrated in all calorimeters. Due to the non-compensating nature of the ATLAS calorimeters, the response to hadronic showers is expected to be lower such that they will be under-calibrated. Single particles can form multiple topoclusters [2], so when measuring the response all clusters in both the EM and hadronic calorimeters within a region are summed.

Additionally, various other reconstructed particles and observables are used in the event selection:

  • Electrons are reconstructed from tracks and calorimeter energy deposits. They are identified through a log-likelihood discriminant based on the track properties and the shower shape in the calorimeter. Electrons are identified with the Loose working point and require \(p_{\text {T}} >10\) \(\text {GeV}\) and \(|\eta |<2.47\) [74].

  • Muons are reconstructed using combined fits of inner detector and muon spectrometer tracks and are required to meet the Medium identification criteria and have \(p_{\text {T}} >10\) \(\text {GeV}\) and \(|\eta |<2.7\) [75]. The momentum scale and resolution for muons are precisely calibrated and measured using the Z-boson and \(J/\psi \) resonances [76].

  • Jets are reconstructed for the purpose of building the missing transverse momentum, \(E_{\text {T}}^{\text {miss}}\) . Particle-flow objects [2] are clustered using the anti-\(k_t\) algorithm [77] with radius parameter \(R=0.4\) using the FastJet package [78] and are calibrated to the scale of jets built from the momentum of the stable interacting particles entering the simulation using the same algorithm. During the calibration sequence, corrections are applied to mitigate the effects of pile-up, and jets in data are corrected for residual differences between data and simulation measured in situ using various \(p_{\text {T}}\)-balance techniques [3]. Calibrated jets are required to have \(p_{\text {T}} >20\) \(\text {GeV}\). Jets originating from pile-up are rejected using inner-detector information [79].

  • \(E_{\text {T}}^{\text {miss}}\) is reconstructed from the vector sum of the momenta of the hard objects described above and a soft term formed from tracks matched to the primary vertex (PV)Footnote 4 of interest but not associated with any hard object [81].

The event selection is based on the above reconstructed particles and observables, and is designed to obtain a high-purity sample of \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm \nu _\tau )\nu _\tau \) events with the following properties:

  1. 1.

    \(E_{\text {T}}^{\text {miss}} {}>150\) \(\text {GeV}\) in addition to being selected by the missing transverse momentum trigger.

  2. 2.

    A leading track with \(p_{\text {T}} >10\) \(\text {GeV}\) and \(|\eta |<2.5\) satisfying the following criteria:

    • The sum of the \(p_{\text {T}}\) of other tracks, matched to the PV, within a cone of size \(\Delta R=0.3\) around the track is less than 2 \(\text {GeV}\).

    • The track’s longitudinal impact parameter, measured relative to the PV, is less than 1.5 mm.

    • The track’s transverse impact parameter, measured relative to the beamline, is less than 0.5 mm.

    • The track satisfies the tight criteria for hits in the silicon pixel and microstrip detectors [82].

    • The \(\chi ^2\) per degree of freedom of the track fit is less than 1.5.

    • Several TRT hits are used in the track fit if the track is within \(|\eta |<2.0\).

  3. 3.

    There are no reconstructed electrons or muons within \(\Delta R=0.4\) of the leading track.

  4. 4.

    At least 15% of the total energy of all topoclusters within \(\Delta R=0.15\) of the track is deposited in the hadronic calorimeters.

  5. 5.

    There are less than two muon spectrometer track segments [75] within \(\Delta R=0.15\) of the track.

  6. 6.

    The track does not point at any region of the Tile calorimeter that had defects during that time period.

  7. 7.

    To reduce backgrounds where the reconstructed track is not from \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm \nu _\tau )\nu _\tau \), the transverse massFootnote 5 of the leading track and the \(E_{\text {T}}^{\text {miss}}\) satisfies \(m_\text {T}<100\) \(\text {GeV}\), or \(m_\text {T}<90\) \(\text {GeV}\) if \(p_{\text {T}}^{\text {trk}} <30\) \(\text {GeV}\), or \(m_\text {T}<80\) \(\text {GeV}\) if \(p_{\text {T}}^{\text {trk}} <20\) \(\text {GeV}\).

  8. 8.

    The transverse impact parameter of the track relative to the beamline is greater than: 0.04 mm if \(p_{\text {T}}^{\text {trk}} <16\) \(\text {GeV}\), 0.03 mm if \(16<p_{\text {T}}^{\text {trk}} <20\) \(\text {GeV}\), and 0.02 mm if \(20<p_{\text {T}}^{\text {trk}} <30\) \(\text {GeV}\). No criteria is applied to tracks with \(p_{\text {T}}^{\text {trk}} >30\) \(\text {GeV}\). This exploits the lifetime of the \(\tau \)-lepton to reduce backgrounds where the track is from a QCD process.

The first criterion ensures that the selected events are in the well-understood region of the \(E_{\text {T}}^{\text {miss}}\) trigger efficiency where it is not rapidly changing with respect to \(E_{\text {T}}^{\text {miss}}\) . The tracking criteria select isolated tracks associated with the primary vertex that have a large number of hits to measure the momentum. The TRT hit requirements and the criterion for the \(\chi ^2\) per degree of freedom of the track fit ensure that the track reconstruction uses the large lever arm within the TRT volume and that the track fit is of good quality. The third, fourth and fifth criteria reject tracks that are formed from electrons, muons and converted photons. The final two criteria reject events where the track does not originate from a \(\tau \)-lepton; such backgrounds are only significant at low \(p_{\text {T}}^{\text {trk}}\). They utilise the expected displacement of the track owing to the \(\tau \)-lepton lifetime and the expected upper bound on the transverse mass at the W boson mass.

After this selection the sample has a high purity of \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm +\nu _\tau )\nu _\tau \) events with the most significant background being \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm +n\pi ^0+\nu _\tau )\nu _\tau \) events. For each track the transverse energy of the clusters within a fixed cone with \(\Delta R=0.15\) of the track, taking into account the effect of the magnetic field, is summed. The ratio of this summed transverse energy of all clusters, in all calorimeters, at the electromagnetic scale to the track \(p_{\text {T}}\) forms the variable of interest, \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\). Based on the simulation, it is not expected that the event selection will bias the response significantly and the selection criteria are varied to check that the results of the analysis are not sensitive to the specific selection criteria.

The summed calorimeter energy deposit is corrected for pile-up contributions. Following the methodology used in the calibration of jets, the measured median \(p_{\text {T}}\) density in the y\(\phi \) plane of the event, \(\rho \) [3, 83], scaled by the area of the cone within which we sum clusters, \(A=\pi \times 0.15^2\), is subtracted. After this subtraction there is a small residual pile-up dependence which is subtracted using the average number of interactions per bunch crossing, \(\mu ^{\text {PU}}\), and the gradient, \(\alpha \), determined from simulation as a function of \(|\eta |\): \(\alpha (|\eta |)\times \mu ^{\text {PU}}\). Only one residual correction is applied compared to two in the calibration of jets as the pile-up contributions are much smaller in this case due to the smaller area considered. This methodology is different from the previous single-hadron response measurement in a low pile-up environment [5, 6], where pile-up and physics backgrounds were both subtracted. In this analysis the pile-up is subtracted but the physics background is included in the functional fit.

Events are selected according to their \(p_{\text {T}}^{\text {trk}}\) and \(|\eta ^{\text {trk}}|\) values to probe the different regions of phase space. Figure 1 shows the distribution of the variable of interest, \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\), for six of the (\(p_{\text {T}}^{\text {trk}}\), \(|\eta ^{\text {trk}}|\)) bins. Due to differences in the \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) scale and resolution between simulation and data, some clear discrepancies are seen in the data-to-MC ratio.

Fig. 1
figure 1

Examples of the \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution and fit performed in the \(16<p_{\text {T}}^{\text {trk}} <20\) \(\text {GeV}\) (top), \(30<p_{\text {T}}^{\text {trk}} <50\) \(\text {GeV}\) (middle) and \(100<p_{\text {T}}^{\text {trk}} <150\) \(\text {GeV}\) (bottom) bins in the barrel (left) and endcaps (right). The data are shown by points with a stacked histogram for simulation. The component where \(\tau \)-leptons decay to multiple charged pions is small and difficult to see. The red fits correspond to simulation and the yellow fits correspond to data. Both the fits to the full distribution of a Gaussian+Landau function and just the Landau component from this fit are shown. The uncertainties shown are those from the statistics of the dataset and the limited Monte Carlo sample size. The bottom panel shows the ratio of data to simulation with statistical error bars on the data points and a yellow band illustrating the statistical uncertainty of the simulation. The simulation is normalised to the integral of the data

The primary background is from events with \(\tau \)-leptons which decay to a charged pion, \(\pi ^\pm \), and at least one neutral pion, \(\pi ^0\). In the calorimeter, more energy will be reconstructed from this process for the same track momentum due to the electromagnetic shower from the \(\pi ^0\), which predominantly decays as \(\pi ^0\rightarrow \gamma \gamma \). This results in an upper tail in the \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution. Other single-charged-particle \(\tau \)-lepton decays, such as \(\tau ^\pm \rightarrow K^\pm \nu _\tau \), also contribute at a lower level.

The aim of this analysis is to extract the mean and width of a Gaussian function fitted to the \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm \nu _\tau )\nu _\tau \) signal in the core of the \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution to probe the calorimeter response and resolution. To extract this, the distribution is fit with the sum of a Gaussian function and another function that describes the background from all processes other than \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm \nu _\tau )\nu _\tau \). The functional form of this second term is taken to be a Landau function [84]. This is chosen empirically as it is seen to accurately capture the shape of the background. The range of the fit is then taken as the mean, \(\mu _{\pi ^\pm }\), of a Gaussian function describing \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) for signal in simulation minus 1.5 times its width, \(\sigma _{\pi ^\pm }\), to 2, i.e. \([(\mu _{\pi ^\pm }-1.5\sigma _{\pi ^\pm }),2]\). The combined Gaussian+Landau function is fit to the distribution in data and simulation. In both cases the shape of the Landau function is fixed from a fit to the background processes in simulation. The three parameters of the Gaussian signal and the normalisation of the Landau function are allowed to float in the final fit. The mean of the Gaussian function gives the calorimeter response scale and the width of the Gaussian function measures the resolution of \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\), which contains components from both the calorimeter and track reconstruction. Figure 1 shows the functions fitted to data and simulation in six of the (\(p_{\text {T}}^{\text {trk}}\), \(|\eta ^{\text {trk}}|\)) bins, and the extracted mean and width of the Gaussian function are displayed.

Fig. 2
figure 2

The fitted mean of the signal \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution as a function of the track \(p_{\text {T}}\) in the central region (left) and endcap region (right). The error bars and uncertainty band show only the statistical uncertainties from the size of the dataset and simulated samples respectively

5 Calorimeter response measurement and comparison with simulation

The data are binned in both the track \(p_{\text {T}}\) and track \(|\eta |\) to probe the calorimeter response at different momenta and in different regions. The calorimeter response is expected to vary significantly across \(|\eta |\) due to the geometry and different calorimeter technologies. In the central region there is a large region which is expected to have a uniform response due to similar calorimeter structures up to \(|\eta |=0.7\). As the rejection of muons is more difficult at \(|\eta |\sim 0\) a barrel selection region is defined by \(0.1<|\eta |<0.7\). In this region there is a high-granularity liquid-argon/lead electromagnetic calorimeter and a steel/scintillator hadronic calorimeter. Another uniform detector region exists in the endcap region of \(1.8<|\eta |<2.4\), where there is liquid-argon/lead electromagnetic calorimetry and liquid-argon/copper hadronic calorimetry. Due to differences in technology and material in the two regions, differences in the modelling might be expected. To study the response across the detector, results are shown for three \(p_{\text {T}}^{\text {trk}}\) bins: \(30<p_{\text {T}}^{\text {trk}} <50\) \(\text {GeV}\), \(50<p_{\text {T}}^{\text {trk}} <70\) \(\text {GeV}\) and \(70<p_{\text {T}}^{\text {trk}} <100\) \(\text {GeV}\), as a function of \(|\eta ^{\text {trk}}|\).

Figure 2 shows the fitted response for the barrel and endcap regions defined above, as a function of the track \(p_{\text {T}}\). Clear differences are seen between the data and simulation in both regions, with a \({\sim }2\%\) overestimate of the response in simulation in the barrel and a \({\sim }4\%\) underestimate of the response in simulation in the endcaps relative to the data. The data/MC differences also follow the same trends as seen in the jet response in these two regions [3]. It is checked that the fits give a reasonable \(\chi ^2\) per degree of freedom and are stable when the fitting range and binning of the data are altered. These plots only show the uncertainties from the limited number of events in the samples, and systematic uncertainties are discussed in the next section. At the highest energies the steeply falling momentum spectrum and the increasing track resolution produces a bias in the measured track momentum compared to the true pion transverse momentum, \(p_\text {T}^\text {true}\), resulting in the distribution flattening in both data and simulation. The biases in simulation from \(\langle p_\text {T}^\text {trk}/p_\text {T}^\text {true}\rangle \) in the central (endcap) region for the highest three \(p_{\text {T}}^{\text {trk}}\) bins are [0.2%, 0.8%, 1.7%] ([1.7%, 4.7%, 10.8%]).

Figure 3 shows the fitted response across the detector in fine \(|\eta ^{\text {trk}}|\) bins for the three different \(p_{\text {T}}^{\text {trk}}\) bins which have the most events. The calorimeter structure is clearly seen and the simulation follows the data, but some differences are present. The largest disagreements are \({\sim }5\%\) in the \(1.0<|\eta ^{\text {trk}}|<1.2\) region. When the two sides of the detector are analysed independently the same trends are seen in the response. Additionally, when the analysis is performed separately for positively and negatively charged tracks consistent results are seen. In this energy regime the responses to \(\pi ^+\) and \(\pi ^-\) are expected to be very similar and this check also tests the alignment of the inner detector as alignment errors would have different effects on positive and negative tracks.

Fig. 3
figure 3

The fitted mean of the signal \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution as a function of the track \(|\eta ^{\text {trk}}|\) in three different \(p_{\text {T}}^{\text {trk}}\) bins. The error bars and uncertainty band show only the statistical uncertainties from the size of the dataset and simulated samples respectively

6 Uncertainties in the response measurement

Several different effects could bias the measurements of the calorimeter response scale. These include uncertainties in the methodology of the fitting procedure, the backgrounds entering the event selection, the corrections for the effects of pile-up, and the modelling of tracks. The methods used to assess each of the different sources are described below and are then summarised in Figs. 4 and 5. All uncertainties which are determined from a difference with respect to the nominal result are symmetrised.

Fit function (closure): A Gaussian+Landau fit to the signal and background is used to extract the calorimeter response scale. This is quantified in simulation by the difference between the fitted mean obtained from a Gaussian fit to just the signal \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm \nu _\tau )\nu _\tau \) events and the fitted mean of the Gaussian function obtained from a combined Gaussian+Landau fit to the simulated signal and background. This closure test is seen to perform very well, showing that the Landau function choice is appropriate. The difference between the two values obtained is taken as a systematic uncertainty of the measurement. It is found to be subdominant and typically at the \({\sim }0.2\%\) level.

Bias in the fitted background shape: The shape of the background is taken from a fit to simulation, which could be biased relative to the shape in data. The background is primarily \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm n\pi ^0\nu _\tau )\nu _\tau \) events. Imperfections in the simulation of the shape of this background can come from detector effects such as the energy response to the charged and neutral pions, or from the energy spectra of the charged and neutral pions as modelled by the MC generator. The energy scale and resolution for electromagnetic showers, such as those from \(\pi ^0\rightarrow \gamma \gamma \) decays, has been measured in ATLAS [74] and is known better than that for the hadronic showers probed here. The impact of detector mismodelling is tested by shifting or smearing the simulated response and evaluating the change in the fit when it is applied to data with the altered background shape. The hadronic response is shifted down (up) by 3% (5%) in the central (endcap) region, and separately smeared to increase the resolution by 10%. These values are determined from the discrepancies between data and simulation observed in this analysis. The smearing in the central region, and scale variation in the endcaps, produce significant uncertainties in the final results. To assess the modelling of the shape an alternative generator is used for the \(W+\)jets background; no significant impact on the shape is seen, so this is neglected. Therefore, the uncertainty in the shape of the fitted background is dominated by the detector modelling.

Electron and muon contamination: Contamination from electrons and muons is expected to be small: less than 0.4% and 0.7% respectively. The residual contamination is tested by tightening the criteria to reject these events: requiring the fraction of energy in the hadronic calorimeters to be \({>}20\%\), and that there are no associated muon segments. Changes in the measured response in data define the uncertainty for muons. For electrons the difference between the changes in the measured response in data and simulation defines the uncertainty because the hadronic energy fraction cut can also bias the signal. Both of these uncertainties are found to be smaller than the statistical uncertainties of the measurement.

Non-lepton backgrounds: There is a small amount of background where the track does not originate from a lepton. The \(Z(\rightarrow \nu \nu )+\)jets contamination can be reduced by a tighter selection on the transverse mass of the track and the \(E_{\text {T}}^{\text {miss}}\) . The deviation from the nominal response when the upper bound on \(m_\text {T}\) is reduced by 10 \(\text {GeV}\) is taken as an uncertainty in this background. Additionally, the deviation from the nominal response when events with \(m_\text {T}<10\) \(\text {GeV}\) are excluded is taken as an uncertainty in the potential contamination from events where the \(E_{\text {T}}^{\text {miss}}\) comes from detector effects. These are also found to be smaller than the statistical uncertainties.

Pile-up uncertainties: Two uncertainties in the subtraction of the contributions from pile-up are considered. Non-linearities in the required corrections can lead to imperfections in the removal of the effects of pile-up. The closure of the pile-up corrections is tested by taking half the difference between the responses for two large single-particle samples with different pile-up, \(\langle \mu ^{\text {PU}}\rangle \sim 20\) and \(\langle \mu ^{\text {PU}}\rangle \sim 50\), after the corrections. Additionally, the pile-up residual correction is determined from simulation where pile-up might not be well modelled. Studies of the \(\mu ^{\text {PU}}\) dependence of the median \(p_{\text {T}}\) density of the event, \(\rho \), the number of clusters, and the cell energy and noise find that the modelling of the energy flow in the simulation is within 10%–20% of the data across the detector. Therefore, 25% of the residual correction is taken as an uncertainty in the modelling of pile-up, which forms a minor uncertainty in the analysis. In data it is checked that the fitted response is stable with respect to pile-up by splitting the dataset into two independent datasets via high pile-up and low pile-up selections based on \(\mu ^{\text {PU}}\), and also into three independent datasets, and no significant differences given the statistical uncertainties are seen in the measured response.

Tracking uncertainties: Any bias in the momentum scale of the reconstructed tracks directly affects the measurement. Muons are precisely calibrated using \(Z\rightarrow \mu \mu \) and \(J/\psi \rightarrow \mu \mu \) events. The analysis selection is modified by removing the \(m_\text {T}\) and transverse impact parameter requirements and requiring that a reconstructed muon is matched to the isolated track. The \(p_{\text {T}}\) of the track and muon are obviously correlated but fits to \(p_\text {T}^\text {trk}/p_\text {T}^\mu \) probe the inner-detector track scale relative to the calibrated muons. Two components are considered, the uncertainty in the calibration of muons and the statistical uncertainties of this cross-calibration, as none of the deviations of simulation from data are significant. Additionally, differences in the track resolution can bias the measurement because of the underlying falling \(p_{\text {T}}^{\text {trk}}\) spectrum. The uncertainty in the momentum resolution of tracks is determined using the muons as described in Ref. [85]. Tracks are smeared to increase the resolution in simulation within the uncertainties and the impact on the parameter of interest is symmetrised and taken as an uncertainty.

The various systematic uncertainties are smoothed [86] across \(p_{\text {T}}\) as they are expected to vary smoothly. Across \(|\eta |\), changes in the detector technology could result in sharp changes in the size of the systematic uncertainties. However, no such variations are seen, so the systematic uncertainties are also smoothed in the measurements as a function of \(|\eta |\). The statistical components in data and simulation are not smoothed. The statistical uncertainty of the data is the largest uncertainty component in much of the phase space covered by the analysis.

Figure 4 shows the uncertainty from each source as a function of track \(p_{\text {T}}\) for the central and endcap regions. The uncertainty from imperfect knowledge of the background width (scale) in the central (endcap) region, and pile-up uncertainties in the endcap region, are the only uncertainties that are larger than the statistical uncertainty of the dataset or the simulated event samples in certain bins. The pile-up uncertainties have some importance at low \(p_{\text {T}}\), and the tracking uncertainties grow at large track momentum. Overall, in the central region the total uncertainty is \({\lesssim }1\%\) for \(20<p_{\text {T}} <200\) \(\text {GeV}\) and reaches \({<}0.6\%\) in the most precise region. The test-beam uncertainties are about \(2\%\) in the barrel region [12], demonstrating the power of this in situ technique. In the endcaps the uncertainties are slightly larger but are \({\lesssim }1\%\) for \(30<p_{\text {T}} <100\) \(\text {GeV}\). Figure 5 shows the uncertainties in the measurements as a function of \(|\eta ^{\text {trk}}|\). The uncertainties are slightly larger due to the limited number of events in the smaller bins used to make fine calorimeter structures visible, but are otherwise dominated by the same sources as in the measurements as a function of \(p_{\text {T}}\).

Fig. 4
figure 4

The various systematic uncertainties which affect the measurement of the calorimeter energy scale for charged pions as a function of \(p_{\text {T}}^{\text {trk}}\) in the central (left) and endcap (right) regions

Fig. 5
figure 5

The various systematic uncertainties which affect the measurement of the calorimeter energy scale for charged pions as a function of \(|\eta ^{\text {trk}}|\) for three \(p_{\text {T}}^{\text {trk}}\) ranges: \(30<p_{\text {T}}^{\text {trk}} <50\) \(\text {GeV}\), \(50<p_{\text {T}}^{\text {trk}} <70\) \(\text {GeV}\) and \(70<p_{\text {T}}^{\text {trk}} <100\) \(\text {GeV}\)

Having established the full uncertainties in the measurement, a calibration can be derived which can form an input to the high-\(p_{\text {T}}\) jet uncertainties [3] and also to the scale uncertainties for hadronic \(\tau \)-lepton decays [19]. Figure 6 shows the measured data to simulation ratio of the response and its uncertainty in different \(p_{\text {T}}^{\text {trk}}\) bins in the central and endcap regions. It is expected that such a calibration will be smooth across \(p_{\text {T}}\). The same procedure as used for the jet energy scale is used [3, 87] to translate the binned measurements and uncertainties into smooth uncertainty eigenvectors and a smooth calibration curve. Each of the measurements is divided into finer bins of 0.1 \(\text {GeV}\) using second-order polynomial splines. The final calibration curve is determined by smoothing the measurements with a Gaussian kernel of varying width. Each individual uncertainty component is treated as correlated across \(p_{\text {T}}\) and is then propagated through the same procedure after varying it by 1\(\sigma \). The difference between the calibration curve with the shifted systematic uncertainty input and the nominal calibration curve is taken as 1\(\sigma \) in the varied uncertainty. Each uncertainty source is treated as correlated across \(p_{\text {T}}\) and uncorrelated with all other sources of uncertainty. This smoothing procedure reduces statistical fluctuations in the central values and in each propagated uncertainty component. Therefore, the final uncertainties are slightly smaller than those for the corresponding individual measurement points. The resulting smooth calibration curve and uncertainty band are shown in Fig. 6.

Figure 7 shows the data/MC ratio as a function of \(|\eta ^{\text {trk}}|\) for the three measured \(p_{\text {T}}\) ranges along with the associated uncertainties. As the changes in detector technology can result in sharp features in the calibration as a function of \(|\eta ^{\text {trk}}|\) a smooth calibration curve is not derived for these results.

Fig. 6
figure 6

The measured data to simulation ratio of the response as a function of \(p_{\text {T}}^{\text {trk}}\) for the barrel (left) and endcaps (right). Points represent the different measurement bins with their total uncertainty from both the statistical and systematic sources. Also shown is the smooth calibration curve with its associated uncertainties

Fig. 7
figure 7

The measured data to simulation ratio of the response as a function of \(|\eta ^{\text {trk}}|\) for the three \(p_{\text {T}}^{\text {trk}}\) ranges: \(30<p_{\text {T}}^{\text {trk}} <50\) \(\text {GeV}\), \(50<p_{\text {T}}^{\text {trk}} <70\) \(\text {GeV}\) and \(70<p_{\text {T}}^{\text {trk}} <100\) \(\text {GeV}\). Inner error bars represent uncertainties from limited sample size and the outer error bars give the total uncertainty including both the statistical errors and systematic uncertainties

7 Width of the \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution

The width of the \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution probes the resolution of both the tracker and the calorimeter; however, the tracking component is relatively small outside of the highest energy bins. The width will also contain components from the modelling of pile-up noise falling into the region of the calorimeter considered in the measurement. These can be reliably corrected for in measurements of the scale, but the effect on the resolution is much harder to mitigate, so the level of agreement between simulation and data, which both include these effects, is considered. The increase in the width of \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) due to the effects of pile-up is measured by looking at the difference between the width in two simulated samples with \(\langle \mu ^{\text {PU}}\rangle \sim 20\) and \(\langle \mu ^{\text {PU}}\rangle \sim 50\). For the central (endcap) region the difference is found to be 8% (16%) for \(16<p_{\text {T}}^{\text {trk}} <20\) \(\text {GeV}\) and 3% (12%) for \(30<p_{\text {T}}^{\text {trk}} <150\) \(\text {GeV}\). The impact of mismodelling the pile-up is expected to be small for most of the \(p_{\text {T}}^{\text {trk}}\) spectrum since the simulation includes pile-up and should capture most of the effect. Figure 8 shows the relative width of the \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution as a function of track \(p_{\text {T}}\) in the central and endcap regions. In the central region the simulation shows about \(10\%\) better resolution than the data, while in the endcaps the level of agreement is generally better. The relative width across \(|\eta |\) is shown for three \(p_{\text {T}}^{\text {trk}}\) bins in Fig. 9. In both figures the resolution for tracks selected in simulation is found to contribute significantly at high momentum, but across much of the spectrum it is significantly smaller than the calorimeter contributions, indicating that the discrepancies between simulation and data are more likely due to the simulation of the calorimeter response.

In the central region where the statistical uncertainties are lower, \(30<p_{\text {T}}^{\text {trk}} <150\) \(\text {GeV}\), it is possible to check if any of the systematic effects considered in Sect. 6 could be the source of the discrepancies. The closure of the fits to obtain the width of the signal component is very good, with differences only at the level of \({\lesssim }2\%\) at \(p_{\text {T}}^{\text {trk}}\) of 30 \(\text {GeV}\) and \({\lesssim }1\%\) above 50 \(\text {GeV}\). Tightening the selections to reject electrons and muons has a minimal effect on the measured resolution, \({\lesssim }0.5\%\). Shifting the response scale of the background and smearing the background response are each seen to change the resolution in data by \({\lesssim }1\%\) in this region of phase space. Tightening the criteria to reject non-lepton backgrounds also results in small changes of \({\lesssim }0.5\%\) beyond 30 \(\text {GeV}\). Therefore, resolution discrepancies of \({\sim }10\%\) between data and simulation are significant in this \(p_{\text {T}}^{\text {trk}}\) range compared to the potential systematic effects on the extraction of the \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) width. Below 30 \(\text {GeV}\) the fit closure grows to up to 6% and the uncertainties related to the fitted background shape and the non-lepton backgrounds all increase to \({\lesssim }1.5\%\), so that at low momentum the discrepancies are less significant.

This supports the observation in the jet resolution measurement using \(p_{\text {T}}\) balance in dijet events that the resolution in simulation is slightly superior to that in data [3] at medium to high \(p_{\text {T}}\) and this is therefore an area where it is desirable to improve the simulation.

Fig. 8
figure 8

The fitted width divided by the mean of the signal \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution as a function of the track \(p_{\text {T}}\) in the central region (left) and endcap region (right). Also shown is the fitted width of the track \(p_{\text {T}}\) divided by the generator-level \(p_{\text {T}}\) for true \(\tau \rightarrow \pi ^\pm \nu _\tau \) events in simulation to give an illustration of the contribution to the total width from the resolution of reconstructed tracks. The uncertainties shown are only those from the limited number of events in the dataset and simulated samples

Fig. 9
figure 9

The fitted width divided by the mean of the signal \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution as a function of the track \(|\eta |\) in three \(p_{\text {T}}^{\text {trk}}\) ranges: \(30<p_{\text {T}}^{\text {trk}} <50\) \(\text {GeV}\), \(50<p_{\text {T}}^{\text {trk}} <70\) \(\text {GeV}\) and \(70<p_{\text {T}}^{\text {trk}} <100\) \(\text {GeV}\). Also shown is the fitted width of the track \(p_{\text {T}}\) divided by the generator-level \(p_{\text {T}}\) for true \(\tau \rightarrow \pi ^\pm \nu _\tau \) events in simulation to give an illustration of the contribution to the total width from the resolution of reconstructed tracks. The uncertainties shown are only those from the limited number of events in the dataset and simulated samples

8 Measurements of the longitudinal energy profile and late showering particles

The longitudinal segmentation of the calorimeter into layers can be used to gain insight into the cause of the discrepancies in the scale and resolution between data and simulation. Backgrounds can bias any measurements of average energies deposited in a single layer. In the electromagnetic calorimeter the main background, \(\tau ^\pm \rightarrow \pi ^\pm +n\pi ^0+\nu _\tau \), will result in significant biases as the \(\pi ^0\rightarrow \gamma \gamma \) showers will be contained within the electromagnetic calorimeter. Therefore, only the energy deposited in the hadronic calorimeter layers is considered, although these measurements are still slightly biased by this background through selection effects because the events are required to pass the hadronic energy fraction requirement. The hadronic calorimeter is less affected by pile-up since low-momentum hadrons are often contained within the electromagnetic calorimeter in addition to the pile-up particles that produce electromagnetic showers.

Figure 10 shows the average energy deposited in each layer of the hadronic calorimeter divided by the track momentum in the barrel and endcap regions as a function of the track \(p_{\text {T}}\). Events are required to have \(0.2<E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}} <1.1\) to select events across the full response and also minimise the contribution from backgrounds. An arithmetic mean of all the selected events is used because the distribution of the energy in an individual layer is not expected to have a Gaussian shape. It can be seen that in the barrel the simulation overestimates the relative energy in the hadronic calorimeter by 3-5% across the \(p_{\text {T}}\) spectrum and this is the case in all layers, although most of the energy is contained within the first two layers. In the endcap region the amount of energy in the hadronic layers is slightly underestimated overall. This is driven by an underestimation of the energy in the first layer. Both of these features are in line with the results in Sect. 5; however, the magnitude of the difference in the energy deposited does not explain the size of the effect seen in the total response, indicating that there is mismodelling either in both the electromagnetic and hadronic calorimeters or in the length of the shower, or in a combination of these or other effects. The results are seen to be largely unaffected by pile-up. These results can be used for the tuning of the detector simulation.

Fig. 10
figure 10

The average energy within each hadronic layer normalised to the track momentum as a function of track \(p_{\text {T}}\) in the barrel (left) and endcaps (right). Also shown are the ratios of the sum of energies in all layers of the hadronic calorimeter to the track momentum. Data are required to have \(0.2<E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}} <1.1\) to primarily select single-pion events. Shown in the lower panel is the ratio of data to simulation. The error bars and uncertainty band show the uncertainties from the limited number of events in the dataset and simulated samples

Fig. 11
figure 11

The fitted response as a function of track \(p_{\text {T}}\) in the barrel (left) and endcaps (right) after requiring the fraction of the associated energy that is in the hadronic calorimeters to satisfy \(f^{\text {had}}>85\%\). The error bars and uncertainty band show the uncertainties from the limited number of events in the dataset and simulated samples respectively

A second method of using the longitudinal segmentation to probe the sources of the discrepancies is to select events where the pion traverses the electromagnetic calorimeter as a minimally ionising particle before showering in the hadronic calorimeter layers. Such events are characterised by having a very large fraction of their energy in the hadronic calorimeter. Selecting events in which more than 85% of the summed cluster energy is in the hadronic calorimeters, \(f^{\text {had}}>85\%\), allows the data-to-MC response ratio to be probed in the hadronic calorimeters. Figure 11 shows the fitted response after applying this selection criterion in the barrel and endcap regions. In the barrel region the data-to-MC response ratio is seen to be similar to the inclusive response ratio; however, larger discrepancies are seen in the endcaps after this selection, with the difference between data and simulation increasing from \({\sim }4\%\) to \({\sim }8\%\). This suggests that the simulation models hadronic showers in the hadronic calorimeter less accurately in the endcaps than in the inclusive case.

9 Conclusion

The energy response of the ATLAS calorimeter has been measured across a wide range of pion momenta with high precision using an innovative technique based on isolated charged pions from the process \(W^\pm \rightarrow \tau ^\pm (\rightarrow \pi ^\pm +\nu _\tau )\nu _\tau \). The calorimeter response is observed to be overestimated by \({\sim }2\%\) in the central region and underestimated by \({\sim }4\%\) in the endcaps in the ATLAS simulation. This supports the observations in the measurement of the jet energy scale, which show a similar structure across the calorimeter [3]. The precision of the measurement of the response is \({\lesssim }1\%\) for \(15<p_{\text {T}} <185\) \(\text {GeV}\) (\(20<p_{\text {T}} <100\) \(\text {GeV}\)) in the central (endcap) regions, and is \({<}0.6\%\) in the most precise region, \(20<p_{\text {T}} <120\) \(\text {GeV}\) in the barrel.

The width of the \(E_{\text {T}}^{\text {EM}}/p_{\text {T}}^{\text {trk}}\) distribution has also been investigated. This measurement convolves the tracking and calorimeter resolutions. For this quantity the simulation is found to agree with the data to within 15%; however, the data are generally observed to have a wider distribution. Finally, the average energy in the layers of the hadronic calorimeter was probed as a method of investigating the cause of the discrepancies between the simulation and the data. These measurements will be used in the tuning of the ATLAS simulation to further improve the hadron shower modelling and detector geometry.

This powerful new method of measuring the hadronic energy response of the ATLAS calorimeter can achieve a precision of \({\lesssim }1\%\) for the response of the calorimeter to charged pions. This detailed understanding of the response of the calorimeter can be used in gaining better understanding of the jet energy scale, its uncertainty for the highest-\(p_{\text {T}}\) jets, the energy scale of hadronically decaying \(\tau \)-leptons, and in measuring the properties of jets.