1 Introduction

The proton structure at high momentum transfer, as encoded in the collinearly factorized parton distribution functions (PDFs), is not only an interesting subject in its own right, but plays a pivotal role in many applications, such as precision electroweak and Higgs physics and searches for new physics [1]. Likewise, their counterparts for nucleons bound in nuclei, the nuclear PDFs (nPDFs), are essential e.g. in studying the production of hard probes of the Quark Gluon Plasma [2]. In practice, despite the ongoing effort in lattice methods [3], the PDFs are obtained by the well-established means of global analysis using hard-process data. As such, the PDFs have uncertainties which derive from those in the available data and also from the lack of data constraints in certain phase-space regions. It is then often the case that when new data are published or a future experiment is being planned, one would like to study the impact that the measurement could have on the PDFs. A good example of such a case is the recent CMS measurement of dijet pseudorapidity spectra in proton–proton (pp) and proton–lead (pPb) collisions at 5.02 TeV [4], where, on one hand, the measured pp spectra seem to be in disagreement with next-to-leading order (NLO) perturbative QCD (pQCD) calculations using CT14 [5] and MMHT14 [6] PDFs (see the Supplemental Material of Ref. [4]), while, on the other hand, the nuclear-modification ratio of the pPb and pp spectra appears to have much smaller uncertainties than predictions with various nPDFs. One should therefore study the impact these data could have on both the free-proton PDFs and their nuclear modifications.

As producing a full global fit remains rather involved, even with publicly available tools such as xFitter [7] (built upon the former HERAFitter [8]), it is in most cases impractical for a general user to learn about the constraining power of a data set in this way. For this purpose, approximate methods have been developed, first in the formalism of Bayesian reweighting of Monte Carlo PDF ensembles [9,10,11,12,13] and later in a framework using Hessian error sets [14,15,16]. These methods have their limitations, as the new PDFs rely on all the theoretical assumptions of the original PDF analysis, such as the parametrization form, the value of \(\alpha _{\mathrm{s}}\) and the heavy-quark scheme used. There are also limitations related to how well the methods approximate the true parameter likelihood in the region constrained by the new data. In particular, the applications of Hessian PDF reweighting have resorted to a perfectly quadratic approximation of the \(\chi ^2\) goodness-of-fit function before inclusion of the new data and to linear or quadratic terms for the responses of the new observables. This applies to the implementation in the xFitter package, where the method is referred to as “Hessian profiling”, as well as to the new software package which has appeared under the name ePump [16]. It is not, however, uncommon that non-quadratic terms in the \(\chi ^2\) function are large (see e.g. Figure 6 of Ref. [17]), and thus it would be beneficial to have a way to take these into account.

The purpose of this article is twofold: first, in Sect. 2, we describe how one can include into Hessian PDF reweighting the first non-quadratic terms in the \(\chi ^2\) function consistently with the original fit, provided that the needed information is available. Second, in Sect. 3, we apply the Hessian PDF reweighting to the aforementioned CMS dijet measurements at 5.02 TeV [4]. We show that the strong disagreement between the pp measurement and next-to-leading order (NLO) calculations using CT14 NLO PDFs [5] can be brought to a much better agreement upon reweighting the CT14 PDFs, but that this requires rather strong modifications for high-x gluons. We demonstrate that such changes in the proton PDFs also have an important impact on predictions for dijet production in pPb. Finally, we reweight the EPPS16 nPDFs [18] with the nuclear modification ratio of the measured pPb and pp dijet spectra using the non-quadratic approximation developed in Sect. 2 and discuss the importance of these higher-order terms in the reweighting. Preliminary work on this topic can be found in Refs. [19, 20].

2 PDF uncertainties and reweighting in the Hessian method

In this section, we first recapitulate the uncertainty determination in the Hessian approach [21], assuming the use of a global tolerance criterion. We then describe how one can perform a reweighting on error sets determined in this way, taking into account the first non-quadratic terms in the \(\chi ^2\) function. We end the section with a discussion on the applicability of this method in the case of non-global tolerances.

2.1 Hessian uncertainties with global tolerance criterion

In PDF global analyses, the goodness-of-fit of a parameter vector \(\mathbf {a}\) is dictated by the \(\chi ^2\) function

$$\begin{aligned} \chi ^2(\mathbf {a}) = \sum _{ij}\,(y_i(\mathbf {a}) - y_i^\text {data})\,C^{-1}_{ij}\,(y_j(\mathbf {a}) - y_j^\text {data}), \end{aligned}$$
(1)

where \(y_i(\mathbf {a})\) are theory predictions for the observables included in the analysis, \(y_i^\text {data}\) the corresponding measured values and \(C^{-1}_{ij}\) the elements of the inverse covariance matrix for these data. In the Hessian method for uncertainty estimation, one takes the parameter values \(\mathbf {a}^\text {min}\) which minimize Eq. (1), \(\chi ^2(\mathbf {a}^\text {min}) \equiv \min \chi ^2(\mathbf {a}) \equiv \chi ^2_0\), as the central, best-fit values and studies the behaviour of the \(\chi ^2\) function around this minimum to determine the uncertainty in these parameters.

The leading deviations from the minimum value \(\chi ^2_0\) are given by the quadratic approximation

$$\begin{aligned} \chi ^2 \approx \chi ^2_0 + \sum _{ij}\,(a_i - a_i^\text {min})\,H_{ij}\,(a_j - a_j^\text {min}), \end{aligned}$$
(2)

where \(H_{ij} = \frac{1}{2} \partial ^2 \chi ^2 / \partial a_i \partial a_j|_{\mathbf {a}=\mathbf {a}^\text {min}}\) are the elements of the Hessian matrix. In practice, these elements need to be obtained numerically. Since the Hessian matrix is symmetric, it has a complete set of orthonormal eigenvectors \(\mathbf {v}^{(k)}\) such that

$$\begin{aligned} \sum _j\,H_{ij}\,v^{(k)}_j = \varepsilon _k\,v^{(k)}_i, \end{aligned}$$
(3)
$$\begin{aligned} \sum _i\,v^{(k)}_i\,v^{(\ell )}_i = \delta _{k\ell }, \qquad \sum _k\,v^{(k)}_i\,v^{(k)}_j = \delta _{ij}, \end{aligned}$$
(4)

where \(\varepsilon _k\) are the eigenvalues of the Hessian matrix. With this eigendecomposition we can define new parameters

$$\begin{aligned} z_k = \sum _i \sqrt{\varepsilon _k}\,v^{(k)}_i\,(a_i - a_i^\text {min}) \end{aligned}$$
(5)

such that Eq. (2) becomes

$$\begin{aligned} \chi ^2 \approx \chi ^2_0 + \sum _{k}\,z_k^2. \end{aligned}$$
(6)

Since the new parameters \(z_k\) are uncorrelated in the quadratic approximation, one can use the standard law of error propagation to translate the uncertainties in the parameters \(z_k\) to the uncertainty of any PDF-dependent quantity X as [21]

$$\begin{aligned} \varDelta X = \sqrt{\sum _k \left( \frac{\partial X}{\partial z_k} \varDelta z_k\right) ^2}. \end{aligned}$$
(7)

Given a well justified global tolerance \(\varDelta \chi ^2\) for the allowed growth of \(\chi ^2\) from its minimum, one can determine the allowed parameter variations \(\varDelta z_k\).Footnote 1 If the \(\chi ^2\) function were perfectly quadratic, the uncertainty of the parameter \(z_k\) corresponding to the tolerance \(\varDelta \chi ^2\) would be simply \(\varDelta z_k = \sqrt{\varDelta \chi ^2}\). As this is generally not true, one instead finds \(\delta z^\pm _k\), the positive and negative values of \(z_k\) corresponding to the \(\varDelta \chi ^2\) increase, and assigns \(\varDelta z_k = (\delta z^+_k - \delta z^-_k) / 2\). It is convenient to define error sets \(S^\pm _i\) corresponding to parameter values

$$\begin{aligned} z_k[S^\pm _i] = {\left\{ \begin{array}{ll} \delta z^\pm _i, & k = i \\ 0, & k \ne i \end{array}\right. }, \end{aligned}$$
(8)

along with the central set \(S_0\), where \(z_k[S_0] = 0\) for all k. Estimating

$$\begin{aligned} \frac{\partial X}{\partial z_k} = \frac{X[S^+_k] - X[S^-_k]}{2\,\varDelta z_k}, \end{aligned}$$
(9)

where \(X[S^\pm _k]\) stands for the quantity X calculated with the parameter set of Eq. (8), yields then a simple form

$$\begin{aligned} \varDelta X = \frac{1}{2}\sqrt{\sum _k \left( X[S^+_k] - X[S^-_k]\right) ^2}. \end{aligned}$$
(10)

As the response in X to the upward and downward parameter shifts can be uneven, one can alternatively specify an upward–downward asymmetric error prescription e.g. with [23]

$$\begin{aligned} \delta X^\pm = \sqrt{\sum _k \left[ \begin{array}{c} \max \\ \min \end{array} \left\{ X[S^+_k]-X[S_0],X[S^-_k]-X[S_0],0 \right\} \right] ^2}. \end{aligned}$$
(11)
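To make the bookkeeping concrete, the following sketch (ours, not taken from any PDF collaboration's code) evaluates the symmetric prescription of Eq. (10) and the asymmetric prescription of Eq. (11) from a central value \(X[S_0]\) and arrays of \(X[S^\pm _k]\); all inputs are placeholders.

```python
import numpy as np

def pdf_uncertainty(X0, X_plus, X_minus):
    """X0: central value X[S_0]; X_plus, X_minus: arrays of X[S_k^+], X[S_k^-]."""
    X_plus, X_minus = np.asarray(X_plus, float), np.asarray(X_minus, float)
    # Eq. (10): symmetric prescription
    sym = 0.5 * np.sqrt(np.sum((X_plus - X_minus) ** 2))
    # Eq. (11): asymmetric upward/downward prescription
    dplus, dminus, zero = X_plus - X0, X_minus - X0, np.zeros_like(X_plus)
    up = np.sqrt(np.sum(np.max([dplus, dminus, zero], axis=0) ** 2))
    down = np.sqrt(np.sum(np.min([dplus, dminus, zero], axis=0) ** 2))
    return sym, up, down

# toy example with three eigendirections
print(pdf_uncertainty(1.0, [1.10, 0.95, 1.02], [0.92, 1.04, 0.99]))
```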

2.2 Non-quadratic reweighting

Fig. 1

An illustration of the response of \(\chi ^2\) (top) and \(y_i\) (bottom) to a change of the parameter \(z_k\) in the quadratic–linear (red, long dashed), quadratic–quadratic (blue, short dashed) and cubic–quadratic (black, solid) approximations

In the presence of a new data set, the total \(\chi ^2\) can be written as

$$\begin{aligned} \chi ^2_\text {new}(\mathbf {z}) = \chi ^2_\text {old}(\mathbf {z}) + \sum _{ij}\,(y_i(\mathbf {z}) - y_i^\text {data})\,C^{-1}_{ij}\,(y_j(\mathbf {z}) - y_j^\text {data}), \end{aligned}$$
(12)

where \(y_i\) (\(y_i^\text {data}\)) now correspond to the new theoretical (measured) values and \(\chi ^2_\text {old}\) incorporates our knowledge of the original global analysis. Now, as we do not wish to produce a full global analysis with \(\chi ^2_\text {new}\), we need to make suitable approximations. The simplest choice is to use the quadratic approximation in Eq. (6), according to the method introduced in Ref. [15], but if the parameter variations \(\delta z^\pm _k\) and the global tolerance \(\varDelta \chi ^2\) of this fit are known (as is the case with EPPS16 nPDFs, see Table 2 in Ref. [18]), then \(\chi ^2_\text {old}\) can be approximated with a third order polynomial in each of the eigendirections,

$$\begin{aligned} \chi ^2_\text {old} \approx \chi ^2_0 + \sum _{k} (a_k z_k^2 + b_k z_k^3), \end{aligned}$$
(13)

where the coefficients are obtained with

$$\begin{aligned} a_k = \frac{\varDelta \chi ^2}{\delta z^+_k - \delta z^-_k}\left( \frac{\delta z^+_k}{(\delta z^-_k)^2} - \frac{\delta z^-_k}{(\delta z^+_k)^2}\right) , \end{aligned}$$
(14)
$$\begin{aligned} b_k = \frac{\varDelta \chi ^2}{\delta z^+_k - \delta z^-_k}\left( \frac{1}{(\delta z^+_k)^2} - \frac{1}{(\delta z^-_k)^2}\right) . \end{aligned}$$
(15)
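These coefficients follow from requiring that the polynomial of Eq. (13) reproduces the tolerance at both endpoints of the parameter interval, i.e.

$$\begin{aligned} a_k\,(\delta z^\pm _k)^2 + b_k\,(\delta z^\pm _k)^3 = \varDelta \chi ^2, \end{aligned}$$

as can be verified by direct substitution of Eqs. (14) and (15).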

This is illustrated in Fig. 1 (upper diagram), where we show an example of a situation where the \(\chi ^2\) grows asymmetrically with respect to \(z_k\). The quadratic approximation cannot capture this asymmetry, and a third-order polynomial is needed to reproduce the \(\varDelta \chi ^2\) growth at both \(\delta z_k^-\) and \(\delta z_k^+\). Similarly, as illustrated in Fig. 1 (lower diagram), the \(y_i\) can be expanded in terms of \(z_k\) as

$$\begin{aligned} y_i(\mathbf {z}) \approx y_i[S_0] + \sum _k (d_{ik} z_k + e_{ik} z_k^2), \end{aligned}$$
(16)

where

$$\begin{aligned} d_{ik} = \frac{1}{\delta z^+_k - \delta z^-_k}\bigg [-\frac{\delta z^-_k}{\delta z^+_k}\left( y_i[S_k^+] - y_i[S_0]\right) + \frac{\delta z^+_k}{\delta z^-_k}\left( y_i[S_k^-] - y_i[S_0]\right) \bigg ], \end{aligned}$$
(17)
$$\begin{aligned} e_{ik} = \frac{1}{\delta z^+_k - \delta z^-_k}\bigg [\frac{1}{\delta z^+_k}\left( y_i[S_k^+] - y_i[S_0]\right) - \frac{1}{\delta z^-_k}\left( y_i[S_k^-] - y_i[S_0]\right) \bigg ]. \end{aligned}$$
(18)

One should note that the above approximations do not yield a full Taylor expansion to cubic and quadratic order in \(\chi ^2_\text {old}\) and \(y_i(\mathbf {z})\), respectively, as we have neglected off-diagonal terms proportional to \(z_l z_k^2\) and \(z_l z_k\) for \(l \ne k\). Even so, we will refer to reweighting with these approximations as a cubic–quadratic one.

Changing variables to \(w_k = 2z_k / (\delta z^+_k - \delta z^-_k)\) and defining \(r_k = -\delta z^+_k/\delta z^-_k\), we may alternatively write

$$\begin{aligned} \begin{aligned} \chi ^2_\text {new}(\mathbf {w}) - \chi ^2_0 \approx&\sum _{k} (A_k w_k^2 + B_k w_k^3)\\&+ \sum _{ij}\,(y_i(\mathbf {w}) - y_i^\text {data})\,C^{-1}_{ij}\,(y_j(\mathbf {w}) - y_j^\text {data}), \end{aligned} \end{aligned}$$
(19)

where

$$\begin{aligned} A_k = \frac{\varDelta \chi ^2}{4}\left( \frac{1}{r_k^2} + \frac{1}{r_k} + r_k + r_k^2\right) , \end{aligned}$$
(20)
$$\begin{aligned} B_k = \frac{\varDelta \chi ^2}{8}\left( \frac{1}{r_k^2} + \frac{2}{r_k} - 2\,r_k - r_k^2\right) , \end{aligned}$$
(21)

and

$$\begin{aligned} y_i(\mathbf {w}) \approx y_i[S_0] + \sum _k (D_{ik} w_k + E_{ik} w_k^2), \end{aligned}$$
(22)
$$\begin{aligned} D_{ik} = \frac{1}{2}\bigg [\frac{1}{r_k}\left( y_i[S_k^+] - y_i[S_0]\right) - r_k\left( y_i[S_k^-] - y_i[S_0]\right) \bigg ], \end{aligned}$$
(23)
$$\begin{aligned} E_{ik} = \frac{1}{4}\bigg [\left( 1 + \frac{1}{r_k}\right) \left( y_i[S_k^+] - y_i[S_0]\right) + (1 + r_k)\left( y_i[S_k^-] - y_i[S_0]\right) \bigg ]. \end{aligned}$$
(24)
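As an illustration of how Eqs. (20)–(24) can be assembled in practice, the following sketch (with placeholder array names, not the authors' code) builds the coefficients from the tolerance, the parameter deviations \(\delta z^\pm _k\) and the predictions evaluated with the central and error sets.

```python
import numpy as np

def cubic_quadratic_coeffs(delta_chi2, dz_plus, dz_minus, y0, y_plus, y_minus):
    """dz_plus, dz_minus: shape (K,); y0: shape (N,); y_plus, y_minus: shape (N, K)."""
    r = -np.asarray(dz_plus) / np.asarray(dz_minus)        # r_k = -dz_k^+ / dz_k^-
    A = delta_chi2 / 4.0 * (1/r**2 + 1/r + r + r**2)       # Eq. (20)
    B = delta_chi2 / 8.0 * (1/r**2 + 2/r - 2*r - r**2)     # Eq. (21)
    dyp = np.asarray(y_plus) - np.asarray(y0)[:, None]     # y_i[S_k^+] - y_i[S_0]
    dym = np.asarray(y_minus) - np.asarray(y0)[:, None]    # y_i[S_k^-] - y_i[S_0]
    D = 0.5 * (dyp / r - r * dym)                          # Eq. (23)
    E = 0.25 * ((1 + 1/r) * dyp + (1 + r) * dym)           # Eq. (24)
    return r, A, B, D, E

def y_of_w(w, y0, D, E):
    """Quadratic response of the observables, Eq. (22)."""
    return np.asarray(y0) + D @ w + E @ w**2
```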

Now, it is a simple numerical task to minimize Eq. (19) with respect to \(\mathbf {w}\). We use MINUIT [24] for the practical applications in the following sections. The minimum found in this way should correspond to that of a full global fit, provided that the approximations (19) and (22) are good enough. This is not trivially true, but we can expect the approximations to work better the closer we stay to the original minimum. Thus it makes sense to define a “penalty term”

$$\begin{aligned} P = \sum _{k} (A_k (w_k^\text {min})^2 + B_k (w_k^\text {min})^3) \approx \chi ^2_\text {old}(\mathbf {w}^\text {min}) - \chi ^2_0, \end{aligned}$$
(25)

which essentially counts how much \(\chi ^2_\text {old}\) has grown from its minimum value, \(w_k^\text {min}\) being the values of \(w_k\) at the minimum of \(\chi ^2_\text {new}(\mathbf {w})\). If \(P \ll \varDelta \chi ^2\), the approximations (19) and (22) should work well and the reweighted results can be viewed as a proxy for those of a full global fit. Once P grows close to or above \(\varDelta \chi ^2\), the results of reweighting become more sensitive to the assumptions made and one should be cautious with the interpretation. Moreover, a large P signals a tension between the original fit and the new data, which might be due to incompatibilities of some data sets, but can also be caused by an inflexible PDF parametrization or by other limitations of the theory description, such as missing higher-order corrections.
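A minimal sketch of this minimization and penalty evaluation follows, using scipy.optimize as a stand-in for the MINUIT minimizer used in the paper; the coefficient arrays follow the conventions of the previous sketch and all names are illustrative.

```python
import numpy as np
from scipy.optimize import minimize

def chi2_new(w, A, B, y0, D, E, y_data, Cinv):
    chi2_old = np.sum(A * w**2 + B * w**3)         # approximate chi^2_old - chi^2_0, Eq. (19)
    res = (y0 + D @ w + E @ w**2) - y_data         # residuals to the new data, Eq. (22)
    return chi2_old + res @ Cinv @ res

def reweight(A, B, y0, D, E, y_data, Cinv, delta_chi2):
    fit = minimize(chi2_new, x0=np.zeros(len(A)),
                   args=(A, B, y0, D, E, y_data, Cinv), method="BFGS")
    w_min = fit.x
    penalty = np.sum(A * w_min**2 + B * w_min**3)  # Eq. (25)
    if penalty >= delta_chi2:
        print("warning: penalty comparable to the tolerance, interpret with care")
    return w_min, penalty
```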

The beauty of the reweighting method lies in the fact that the reweighted result for any quantity can be obtained simply by using Eq. (22). For example, the new, reweighted, PDFs are obtained by replacing \(y_i\) with \(f_i\). One should note that while this expression is quadratic in \(w_k\), the new PDFs retain a linear dependence on the old ones and thus satisfy the PDF sum rules and DGLAP evolution equations.Footnote 2 This applies also to the new error sets, which can be obtained essentially by following the same procedure as in Sect. 2.1, with the exception that the new Hessian matrix \({\hat{H}}_{kl} = \frac{1}{2} \partial ^2 \chi ^2_\text {new} / \partial w_k \partial w_l|_{\mathbf {w}=\mathbf {w}^\text {min}}\) in

$$\begin{aligned} \chi ^2_\text {new}(\mathbf {w}) \approx \chi ^2_\text {new}(\mathbf {w}^\text {min}) + \sum _{kl}\,(w_k - w_k^\text {min})\,{\hat{H}}_{kl}\,(w_l - w_l^\text {min}) \end{aligned}$$
(26)

can be put to an explicit form

$$\begin{aligned} {\hat{H}}_{kl} = (A_k + 3B_k w_k^\text {min})\delta _{kl} + \sum _{ij}(D_{ik} + 2E_{ik} w_k^\text {min})C^{-1}_{ij}(D_{jl} + 2E_{jl} w_l^\text {min}) + \sum _{ij}(2E_{ik}\delta _{kl})C^{-1}_{ij}(y_j(\mathbf {w}^\text {min}) - y_j^\text {data}), \end{aligned}$$
(27)

by taking second derivatives of Eq. (19). Diagonalizing \({\hat{H}}\) and finding the deviations in the new eigenvector directions corresponding to \(\varDelta \chi ^2\) growth from \(\chi ^2_\text {new}(\mathbf {w}^\text {min})\), one obtains the parameter values for the new error sets, using which the uncertainties of any quantity can again be obtained according to Eq. (22).
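Continuing in the same spirit, the new Hessian of Eq. (27) and the new error-set parameters could be obtained roughly as follows. This is an illustrative sketch only: the root-search bracket is a guess, and chi2_fun stands for the \(\chi ^2_\text {new}\) function of the previous sketch with the data arguments fixed.

```python
import numpy as np
from scipy.optimize import brentq

def new_error_sets(w_min, A, B, y0, D, E, y_data, Cinv, delta_chi2, chi2_fun):
    res = (y0 + D @ w_min + E @ w_min**2) - y_data
    J = D + 2.0 * E * w_min                        # dy_i/dw_k at the new minimum
    H_new = (np.diag(A + 3.0 * B * w_min)          # first term of Eq. (27)
             + J.T @ Cinv @ J                      # second term
             + np.diag(2.0 * (E.T @ Cinv @ res)))  # third term
    eigvals, V = np.linalg.eigh(H_new)             # columns of V: new eigendirections
    chi2_min = chi2_fun(w_min)
    error_sets = []
    for k in range(len(eigvals)):
        for sign in (+1.0, -1.0):
            # step along the eigendirection until chi^2_new grows by delta_chi2;
            # assumes the root lies within the (generous) bracket below
            g = lambda t: chi2_fun(w_min + sign * t * V[:, k]) - chi2_min - delta_chi2
            t = brentq(g, 0.0, 10.0 * np.sqrt(delta_chi2 / max(eigvals[k], 1e-12)))
            error_sets.append(w_min + sign * t * V[:, k])
    return H_new, error_sets
```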

The cubic–quadratic approximation considered above is not applicable to all cases, as it requires the knowledge of the \(\delta z^\pm _k\). Lower-order approximations, initially introduced in Ref. [15], can be obtained from the above results by taking appropriate limits. Taking \(r_k \rightarrow 1\) one finds \(A_k = \varDelta \chi ^2\), \(B_k = 0\), thus recovering the quadratic approximation for \(\chi ^2_\text {old}\). In this limit also the definition of the penalty term in Eq. (25) reduces to that of Ref. [15]. As \(y_i\) retains its quadratic parameter dependence in this limit, we call this a quadratic–quadratic approximation. In many cases this is the best option one can resort to, as it only requires access to the PDF error sets and the value of \(\varDelta \chi ^2\). An even simpler, quadratic–linear, approximation is obtained by additionally taking \(E_{ik} \rightarrow 0\). This version is very easy to implement, as finding the new central and error sets in this approximation involves only solving a system of linear equations [15].

2.3 Comment on non-global tolerances

The reweighting method can also be extended to non-global tolerances [17], simply by setting

$$\begin{aligned} A_k = \frac{1}{4}\left( (T_k^+)^2\left( \frac{1}{r_k^2} + \frac{1}{r_k}\right) + (T_k^-)^2\left( r_k + r_k^2\right) \right) , \end{aligned}$$
(28)
$$\begin{aligned} B_k = \frac{1}{8}\left( (T_k^+)^2\left( \frac{1}{r_k^2} + \frac{2}{r_k} + 1\right) - (T_k^-)^2\left( 1 + 2\,r_k + r_k^2\right) \right) . \end{aligned}$$
(29)

where \((T_k^\pm )^2 = \chi ^2_\text {old}(\delta z^\pm _k) - \chi ^2_0\) are the tolerances of the individual error sets, determined by requiring acceptable values of \(\chi ^2\) for each individual data set in the original analysis [17]. While the new, reweighted central PDF set can be obtained uniquely in this way, the determination of the new error sets involves additional arbitrariness. As the new eigenvector directions obtained by diagonalizing the Hessian matrix in Eq. (27) are not parallel to the original ones, it is not directly obvious how large a tolerance should be allowed in each of these new parameter directions. It was argued in Ref. [16] that if the new eigendirections are not significantly rotated away from the original ones, it would be sufficient to use the original tolerances \((T_k^\pm )^2\) also for obtaining the new error sets. While this can work in some cases, it would be advisable to have a measure of the amount of parameter rotation induced by the reweighting to test whether the limits of this assumption are met. Another possibility would be to use a global tolerance for the reweighted PDFs, e.g. by taking the average over the \((T_k^\pm )^2\), but this would also change the error definition with respect to the original one, thus reducing the comparability of the new and old uncertainties. In general, setting the new non-global tolerances reliably would require a complete refit.
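In terms of the earlier coefficient sketch, only the \(A_k\) and \(B_k\) change; an illustrative version with set-by-set tolerances (placeholder inputs) would be:

```python
import numpy as np

def nonglobal_coeffs(T2_plus, T2_minus, r):
    """T2_plus, T2_minus: arrays of (T_k^+)^2 and (T_k^-)^2; r: the ratios r_k."""
    A = 0.25 * (T2_plus * (1/r**2 + 1/r) + T2_minus * (r + r**2))             # Eq. (28)
    B = 0.125 * (T2_plus * (1/r**2 + 2/r + 1) - T2_minus * (1 + 2*r + r**2))  # Eq. (29)
    return A, B
```

With \((T_k^+)^2 = (T_k^-)^2 = \varDelta \chi ^2\) these reduce to Eqs. (20) and (21), as they should.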

3 CMS 5.02 TeV dijets and their impact on PDFs

The CMS dijet data [4] consist of distributions of dijet pseudorapidity

$$\begin{aligned} \eta _{\mathrm{dijet}} = \frac{1}{2}(\eta ^{\mathrm{leading}} + \eta ^\mathrm{subleading}) \end{aligned}$$
(30)

in bins of average transverse momentum of the jet pair

$$\begin{aligned} p_{\mathrm{T}}^{\mathrm{ave}} = \frac{1}{2}(p_{\mathrm{T}}^{\mathrm{leading}} + p_\mathrm{T}^{\mathrm{subleading}}). \end{aligned}$$
(31)

Here, \(\eta ^{\mathrm{(sub)leading}}\) and \(p_{\mathrm{T}}^{\mathrm{(sub)leading}}\) refer to the pseudorapidity and transverse momentum of the jet with the (second-to-)largest transverse momentum in the event. Jets are defined with the anti-\(k_{\mathrm{T}}\) algorithm [25] using a distance parameter \(R = 0.3\). The events used in the analysis are required to have a leading jet with transverse momentum \(p_{\mathrm{T}}^{\mathrm{leading}} > 30 \text {~GeV}\) and a subleading jet with \(p_{\mathrm{T}}^{\mathrm{subleading}} > 20 \text {~GeV}\), and the two jets are required to have an azimuthal angle separation \(\varDelta \phi > 2\pi /3\). In pPb collisions the two jets are required to lie within the pseudorapidity interval \(-3< \eta ^{\mathrm{lab}}_{\mathrm{jet}} < 3\) in the laboratory frame. Due to the unequal beam energies, \(E_{\mathrm{p}} = 4 \text {~TeV}\) for the proton beam and \(E_{\mathrm{Pb}} = \frac{82}{208}E_{\mathrm{p}}\) per nucleon for the lead beam, the nucleon–nucleon center-of-mass system is boosted in this frame. To attain corresponding coverages in the center-of-mass frames, CMS measured the pp spectra in the interval \(-3.465< \eta ^\mathrm{lab}_{\mathrm{jet}} < 2.535\). Here, as in the CMS publication, the pp data are shifted in pseudorapidity by \(+0.465\), so that the measured dijets cover a pseudorapidity range \(-3< \eta _{\mathrm{dijet}} < 3\) in both pp and pPb.
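Purely to illustrate the selection just described (this is not the CMS analysis code; the function name and the treatment of the \(+0.465\) shift are our own choices), the pp-side kinematics could be sketched as:

```python
import numpy as np

def pp_dijet_observables(pt_lead, eta_lead, phi_lead, pt_sub, eta_sub, phi_sub):
    """Return (eta_dijet, pt_ave) for an accepted pp event, or None if it fails the cuts."""
    if pt_lead <= 30.0 or pt_sub <= 20.0:                  # GeV thresholds
        return None
    dphi = abs((phi_lead - phi_sub + np.pi) % (2*np.pi) - np.pi)
    if dphi <= 2*np.pi/3:                                  # back-to-back requirement
        return None
    if not (-3.465 < eta_lead < 2.535 and -3.465 < eta_sub < 2.535):
        return None                                        # pp lab-frame acceptance
    eta_dijet = 0.5 * (eta_lead + eta_sub) + 0.465         # Eq. (30), with the +0.465 shift
    pt_ave = 0.5 * (pt_lead + pt_sub)                      # Eq. (31)
    return eta_dijet, pt_ave
```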

The CMS data are self-normalized in each bin of \(p_{\mathrm{T}}^\mathrm{ave}\), i.e. given in the form

$$\begin{aligned} \frac{1}{\mathrm {d}\sigma /\mathrm {d}p_\mathrm {T}^\text {ave}}\,\mathrm {d}^2\sigma /\mathrm {d}p_\mathrm {T}^\text {ave}\mathrm {d}\eta _\text {dijet}. \end{aligned}$$
(32)

This is advantageous due to a partial cancellation of correlated experimental (including luminosity) uncertainties and theoretical hadronization corrections.Footnote 3 Accordingly, we do not apply nonperturbative corrections to our predictions. We work at NLO as the NNLO calculations of Ref. [27] are not publicly available at the moment. Our theory calculations are performed with NLOJet++ [28] using the anti-\(k_{\mathrm{T}}\) algorithm through the FastJet package [29]. We fix the factorization and renormalization scales to be the same, \(\mu _\mathrm{F} = \mu _{\mathrm{R}} = \mu \), and use \(\mu = p_{\mathrm{T}}^{\mathrm{ave}}\) as our central scale choice to keep consistency with the CT14 and EPPS16 fits, but also study variations around this central scale choice to approximate the magnitude of missing higher-order uncertainties (MHOUs).Footnote 4 In all figures, PDF uncertainties are presented with the asymmetric prescription of Eq. (11). As the data correlations are not available, we simply add the statistical and systematic uncertainties in quadrature.
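For the figure-of-merit values quoted below, this amounts to Eq. (1) with a diagonal covariance matrix; a minimal sketch (placeholder inputs) is:

```python
import numpy as np

def chi2_uncorrelated(theory, data, stat, syst):
    """Eq. (1) with C_ij = delta_ij * (stat_i^2 + syst_i^2)."""
    theory, data = np.asarray(theory), np.asarray(data)
    sigma2 = np.asarray(stat)**2 + np.asarray(syst)**2      # quadrature combination
    chi2 = np.sum((theory - data)**2 / sigma2)
    return chi2, chi2 / len(data)                           # total chi^2 and chi^2/N_data
```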

3.1 Proton–proton dijet spectra and CT14 reweighting

Fig. 2

Upper panels: distributions of dijets in 5.02 TeV proton–proton collisions against \(\eta _{\mathrm{dijet}}\), normalized to unity in each bin of \(p_{\mathrm{T}}^{\mathrm{ave}}\). The imposed kinematic cuts are discussed in the text. Black markers show the data from the CMS measurement [4] with vertical bars showing the statistical and systematic uncertainties added in quadrature. Solid orange lines represent the results from the NLO pQCD calculation using the central set of the CT14 NLO PDFs [5] with the \(\mu = p_{\mathrm{T}}^{\mathrm{ave}}\) scale choice, light orange boxes the associated PDF uncertainties from the CT14 NLO error sets. Lower panels: difference to the central CT14 result. Dashed hollow boxes show the dependence of the NLO predictions on factor-of-two upward and downward variations of the scale choice. Dotted lines represent the results from the respective LO pQCD calculation. The results with the \(\mu = M_{\mathrm{dijet}}\) scale choice and its factor-of-two variations are indicated in green

The self-normalized pp dijet spectra measured by CMS are shown in Fig. 2 along with theory calculations using the CT14 NLO PDFs. While the predictions describe well the \(p_\mathrm{T}^{\mathrm{ave}}\) systematics of the data, we see that the predicted pseudorapidity spectra are systematically wider than the measured distributions, with the discrepancy between the data and the central CT14 prediction being much larger than the experimental uncertainties, yielding a very poor figure of merit, \(\chi ^2/N_\mathrm{data} = 7.5\). To study the possible source of this discrepancy, we show in Fig. 2 both the uncertainties from the CT14 PDFs and factor-of-two scale variations around the central scale choice \(\mu = p_{\mathrm{T}}^{\mathrm{ave}}\), as well as results from a leading order (LO) calculation at the central scale.

We see that in most bins, especially towards high \(p_{\mathrm{T}}^\mathrm{ave}\), the discrepancy between the data and theory is larger than the associated scale uncertainty. As the factor-of-two scale variations often underestimate the true size of higher-order corrections (see e.g. Ref. [27]), not much can be learned from this fact alone. However, as the LO-to-NLO corrections shown in Fig. 2 (lower panels) are of the same size as the scale uncertainties, we should not expect the NLO-to-NNLO corrections to be any larger than these. Hence the discrepancy is unlikely to be just due to missing NNLO terms, which in turn suggests that the CT14 PDFs need to be modified for a better description of the data. Towards smaller \(p_{\mathrm{T}}^{\mathrm{ave}}\) the scale variation effects become more important, leaving room for improvement with NNLO corrections. Another possible scale choice would be the invariant mass of the dijet, \(\mu = M_{\mathrm{dijet}}\), a choice which was found in Ref. [27] to yield a better perturbative convergence up to leading-color NNLO precision. We have tested this option, shown also in Fig. 2 (lower panels), and find that at the NLO level it tends to give smaller scale-uncertainty bands especially at low \(p_{\mathrm{T}}^{\mathrm{ave}}\) and that the results do not differ much from the central \(\mu = p_\mathrm{T}^{\mathrm{ave}}\) predictions. This again points towards the smallness of the NNLO corrections. With even slightly wider predictions, \(\mu = M_{\mathrm{dijet}}\) gives a worse data description than the \(\mu = p_\mathrm{T}^{\mathrm{ave}}\) scale choice, and thus we work with the latter in what follows.

To see the modifications of the CT14 PDFs that the CMS dijet data would indicate, we have performed a reweighting study with these data. As most of the data points lie outside the CT14 uncertainties, we could expect the needed modifications to be rather strong. Nominally, the CT14 uncertainties correspond to a global tolerance \(\varDelta \chi ^2 = 100\), but to enforce a \(90\%\) confidence level agreement individually with each data set used in the analysis, CT14 uses in addition so-called “Tier-2 penalties”. Hence, the parameter variations \(\delta z^\pm _k\) in CT14 do not exactly match \(\pm \sqrt{100}\), but can be somewhat smaller. As no detailed information is available on how large these deviations are, the best we can do is to assume \(\chi ^2\) to be perfectly quadratic and use \(\varDelta \chi ^2 = 100\). For this reason, we perform the CT14 reweighting in the quadratic–quadratic approximation, noting that the reweighted uncertainties might not be directly comparable with the original ones, and that the new central set underestimates the true impact on CT14, as the use of \(\varDelta \chi ^2 = 100\) overestimates the growth of \(\chi ^2_\text {old}\) when varying the PDF parameters.

Fig. 3

The impact of reweighting on CT14 NLO PDFs at \(Q^2 = 10^4~\mathrm{GeV}^2\). The original CT14 PDFs are shown in orange, with the solid line representing the central set PDFs, the ratio to which is shown in each panel. The corresponding PDFs obtained with quadratic–quadratic reweighting using \(\varDelta \chi ^2 = 100\) are shown in red and the central set of the reweighting with \(\varDelta \chi ^2 = 10\) is presented with a solid purple line

Fig. 4

Upper panels: the impact of reweighting on the CT14 predictions of the pp dijet spectra. The original predictions are shown in orange and the results obtained with quadratic–quadratic reweighting using \(\varDelta \chi ^2 = 100\) are shown in red. In both cases the solid lines correspond to the central set and the shaded boxes show the PDF uncertainty. In addition, the resulting spectra from reweighting with \(\varDelta \chi ^2 = 10\) are shown as purple lines. Lower panels show again the difference to the original central CT14 results

Fig. 5

Comparison of the NLO gluon PDFs of the original and reweighted CT14 sets with those from the MMHT14, NNPDF3.1 and 5-flavour ABMP16 analyses. The uncertainty bands of the latter have been scaled by a factor of 1.64 to nominally match the 90% confidence-level definition of the CT14 analysis

The resulting reweighted PDFs are compared with the original CT14 NLO PDFs in Fig. 3. For all quark flavours, the modifications found are modest compared to the size of the PDF uncertainties. Only at very large x do we see a clear downward bend in the central valence-quark PDFs, caused by the fit trying to adapt to the data at large rapidities, where gluon–valence-quark scattering dominates the cross sections. There is a similar, but even more pronounced, large-x depletion for the gluons. In addition, we find an enhancement for gluons at \(x \sim 0.1\), compensating for the excess in data at midrapidity. Such modifications to the gluon PDF are not totally unexpected. The MMHT14 gluon PDF [6], which closely resembles that of CT14, acquires rather similar modifications when confronted with the \(7~\mathrm{TeV}\) high-luminosity inclusive jet data [30]. Also, attributed to the inclusion of \(8~\mathrm{TeV}\) differential top-quark data, the NNPDF3.1 fit has large-x gluons suppressed compared to CT14 and MMHT14 [31]. In addition, a recent reweighting study using multiple top-production data sets found CT14 modifications very similar to those we find here [32]. Thus, we have evidence that the CT14 gluon distribution is simply too hard to be able to fully describe jet and top-quark measurements.

Fig. 6

Comparison of CMS 7 TeV inclusive jet measurements [34] and NLO predictions obtained using the CT14 NLO PDFs [5] reweighted with the 5.02 TeV dijet data [4]. The optimal systematic shifts in the correlated experimental uncertainties are applied to the data points (similarly as in Ref. [15]) and only statistical uncertainties are shown. Dashed red lines show the ratio of predictions with the original CT14 PDFs to those with the reweighted PDFs

Figure 4 shows the reweighted dijet spectra in comparison to the data and the original CT14 predictions. The reweighting clearly improves compatibility with the data, especially in the midrapidity region, where the data and theory are now in agreement within the associated uncertainties. At \(\eta _{\mathrm{dijet}} \lesssim -1\), the data still deviate from the reweighted results. This is also reflected in the figure of merit, \(\chi ^2/N_{\mathrm{data}} = 2.0\), which is still quite high, but vastly better than before the reweighting. For comparison, we have calculated the dijet spectra also using the MMHT14 [6], NNPDF3.1 [31] and 5-flavour ABMP16 [33] NLO PDFs. These yield \(\chi ^2/N_\mathrm{data}\) goodness-of-fit values of 4.7, 4.0 and 2.7, respectively, showing that less-than-perfect agreement with the data is not a problem of CT14 alone. However, the very strong disagreement between the data and CT14 before reweighting appears to be a rather extreme case. In Fig. 5 the gluon PDFs of MMHT14, NNPDF3.1 and ABMP16 are compared with CT14 before and after the reweighting. The reweighting brings the CT14 gluon distribution to a closer agreement with the other PDFs, particularly at small x with MMHT14 and NNPDF3.1 and, more importantly, at large x with NNPDF3.1 and ABMP16. Clearly, a reduction of the high-x gluons relative to CT14, similar to that in the NNPDF3.1 and ABMP16 fits, is preferred by the data.

The penalty term for the reweighted CT14 fit is rather high, with \(P/\varDelta \chi ^2 = 1.17\), clearly indicating that we are reaching the limits of applicability of the reweighting method. This can be interpreted either as a tension between the dijet data and some datasets used in the CT14 analysis, or as an inflexibility of the CT14 fit form in the high-x region which is probed by the dijets at large rapidities, where the data were not well reproduced and where the data would support an even stronger suppression in the PDFs. To test whether the CT14 parametrization could adapt to the dijet data, we have performed a reweighting also with an artificially low \(\varDelta \chi ^2 = 10\). In a global fit, this would translate to putting an additional tenfold weight on the new data. The results for the new central PDF set are shown as purple lines in Figs. 3 and 4. With stronger low- and high-x suppression and mid-x enhancement for gluons, this fit achieves a much more reasonable goodness-of-fit, \(\chi ^2/N_{\mathrm{data}} = 0.9\), for these data. For this, substantial help from the valence quarks, which get strong modifications in this case, is also needed. Still, the data at \(\eta _{\mathrm{dijet}} \lesssim -1\) are not perfectly reproduced, which might be a signal of a parametrization issue, as the relative contribution of the original fit to the total \(\chi ^2\) is decreased with the lowered \(\varDelta \chi ^2\). With \(P/\varDelta \chi ^2 = 3.61\), this fit is in clear tension with the original CT14 analysis. Of course, once the correlations in the dijet data are made available, one should study whether a shift in some of the systematic parameters could improve the fit at \(\eta _{\mathrm{dijet}} \lesssim -1\). It is also conceivable that the residual disagreement is due to the missing NNLO corrections.

A comprehensive study of possibly conflicting datasets within CT14 is outside the scope of this article, but as a cross-check we have tested the compatibility of the reweighted PDFs with the CMS 7 TeV inclusive jet measurements [34] which are included in the CT14 analysis. For these calculations we use the pre-computed fastNLO grids [35], setting the renormalization and factorization scales equal to the transverse momentum \(p_{\mathrm{T}}\) of the individual jet, as in the CT14 analysis. Figure 6 shows the data-to-theory ratio for the NLO predictions with the CT14 PDFs reweighted with the dijet data using \(\varDelta \chi ^2 = 100\). Also shown are the ratios of the original central CT14 predictions to the reweighted ones. The data-to-theory agreement happens to be even slightly better for the reweighted PDFs, with \(\chi ^2/N_{\mathrm{data}} = 1.2\), than for the original set, for which \(\chi ^2/N_{\mathrm{data}} = 1.3\). Thus we find that, in the light of reweighting, the CMS measurements of inclusive jets at 7 TeV and dijets at 5.02 TeV are mutually compatible.

3.2 Significance of proton PDF uncertainties in proton–lead dijet spectra

The pPb dijet spectra, shown in Fig. 7, exhibit data-to-theory systematics rather similar to those in the pp case. Here, we use the EPPS16 nuclear modifications along with the CT14 NLO proton PDFs in the predictions, i.e. the PDF of a flavour i in a proton bound in lead at scale \(Q^2\) is obtained with

$$\begin{aligned} f^{\mathrm{p/Pb}}_i(x,Q^2) = R^{\mathrm{Pb}}_i(x,Q^2) f^{\mathrm{p}}_i(x,Q^2), \end{aligned}$$
(33)

where \(R^{\mathrm{Pb}}_i\) is the nuclear modification from the EPPS16 analysis and \(f^{\mathrm{p}}_i\) the corresponding CT14 PDF of the free proton. The total PDF uncertainties in the cross sections are calculated with

$$\begin{aligned} \delta X^\pm _\text {total} = \sqrt{\left( \delta X^\pm _\text {EPPS16}\right) ^2 + \left( \delta X^\pm _\text {CT14}\right) ^2}, \end{aligned}$$
(34)

where \(\delta X^\pm _\text {EPPS16}\) are the upward and downward uncertainties obtained with Eq. (11) using the EPPS16 error sets and keeping the CT14 central set fixed, and \(\delta X^\pm _\text {CT14}\), respectively, the uncertainties from the CT14 error sets keeping the EPPS16 central set fixed.
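A small sketch of Eqs. (33) and (34) (array names are placeholders; the \(\delta X^\pm \) inputs are assumed to come from Eq. (11) evaluated separately for the two error-set families):

```python
import numpy as np

def bound_proton_pdf(R_Pb, f_p):
    """Eq. (33): nuclear modification times the free-proton PDF."""
    return np.asarray(R_Pb) * np.asarray(f_p)

def total_pdf_uncertainty(dX_epps, dX_ct14):
    """Eq. (34): quadrature combination; each input is an (up, down) pair."""
    up = np.sqrt(dX_epps[0]**2 + dX_ct14[0]**2)
    down = np.sqrt(dX_epps[1]**2 + dX_ct14[1]**2)
    return up, down
```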

Again, these predictions give wider distributions than seen in the CMS data, resulting in \(\chi ^2/N_{\mathrm{data}} = 6.9\). While in this case the data points are mostly within the combined nuclear and free-proton PDF uncertainty bands, we can expect that the modifications to the CT14 PDFs, which were found necessary to improve the description of the pp data, play a role also here. Indeed, in Fig. 8 we show results with the PDFs obtained by reweighting CT14 with the pp data, observing a clear improvement in the data-to-theory agreement. We obtain \(\chi ^2/N_{\mathrm{data}} = 2.8\) for the predictions with CT14 reweighted using \(\varDelta \chi ^2 = 100\) and \(\chi ^2/N_{\mathrm{data}} = 1.6\) when using \(\varDelta \chi ^2 = 10\). These numbers are somewhat higher than what we obtained in the pp case, reflecting the fact that also the EPPS16 nuclear modifications need to be adjusted for an optimal description of the data. This can also be seen by comparing the data-to-theory agreement in pPb at \(\eta _{\mathrm{dijet}} \gtrsim 2\) to that in pp: while the CT14 predictions reweighted using \(\varDelta \chi ^2 = 100\) describe well the pp data at these rapidities, the pPb data points lie systematically below the predictions, which hints at a preference for deeper nuclear shadowing – the suppression in the gluon PDF, \(R^{\mathrm{Pb}}_g < 1\), at small x – than that in the EPPS16 central set. We will verify this claim in the next section.

Fig. 7

As Fig. 2, but now with pPb data and predictions with EPPS16 nuclear modifications imposed on the CT14 NLO proton PDFs and omitting the results with \(\mu = M_\mathrm{dijet}\) for clarity. Light blue boxes show the combined uncertainty from the CT14 and EPPS16 PDFs

Fig. 8

As Fig. 4, but now with pPb data and with EPPS16 nuclear modifications imposed on the original and reweighted CT14 PDFs. Only uncertainties from the free-proton PDFs are shown

An important thing to notice here is that most of the deviations from the central theory predictions actually originate from issues with the free-proton PDFs rather than from the nuclear modifications. This large free-proton PDF bias prevents a clean extraction of the PDF nuclear modifications from the pPb spectra. The dijet spectra are certainly not the only pPb observable sensitive to such a free-proton PDF dependence, and the refined proton PDFs found here could also have an effect, for example, on the predictions for inclusive \(t{\bar{t}}\) production in 8.16 TeV pPb collisions, where calculations with CT14+EPPS16 overshoot the data but remain compatible with it [36].

3.3 Nuclear modification ratio and EPPS16 reweighting

Fig. 9

The nuclear modification ratio of normalized pPb and pp differential cross sections. Black markers show the data from the CMS measurement [4] with vertical bars showing the statistical and systematic uncertainties added in quadrature. Solid orange lines represent the NLO pQCD calculation with the \(\mu = p_{\mathrm{T}}^{\mathrm{ave}}\) scale choice using the central set of the CT14 NLO PDFs [5] with EPPS16 [18] nuclear modifications

Fig. 10

The impact of reweighting on EPPS16 predictions of the nuclear modification ratio of the dijet spectra. The original predictions are shown with solid blue lines and light blue boxes representing the central predictions and the nPDF uncertainties, respectively. The corresponding results after the reweighting are shown with solid black lines and purple boxes

Let us now consider the nuclear modification ratio of the normalized dijet spectra discussed above, defined as

$$\begin{aligned} R_{\mathrm{pPb}}^{\mathrm{norm.}} = \frac{\frac{1}{\mathrm {d}\sigma ^\mathrm{pPb}/\mathrm {d}p_\mathrm {T}^{\mathrm{ave}}}\,\mathrm {d}^2\sigma ^\mathrm{pPb}/\mathrm {d}p_\mathrm {T}^{\mathrm{ave}}\mathrm {d}\eta _\mathrm{dijet}}{\frac{1}{\mathrm {d}\sigma ^\mathrm{pp}/\mathrm {d}p_\mathrm {T}^{\mathrm{ave}}}\,\mathrm {d}^2\sigma ^\mathrm{pp}/\mathrm {d}p_\mathrm {T}^{\mathrm{ave}}\mathrm {d}\eta _{\mathrm{dijet}}}. \end{aligned}$$
(35)

As we have seen that the dijet rapidity distributions in pp and pPb have a very similar dependence on the free-proton PDFs, we can expect this dependence to efficiently cancel in the ratio. This statement is verified in Fig. 9, where we observe the uncertainty band given by the CT14 PDFs to be vanishingly small. Also the scale uncertainties, while larger than the CT14 uncertainties, are small in this observable, implying that MHOUs can be expected to be small as well. This leaves the nuclear modifications as the dominant source of theory uncertainty.

Fig. 11

The impact of reweighting the EPPS16 nPDFs with the data on the nuclear modification ratio of the dijet spectra. The original and reweighted EPPS16 nuclear modifications for the lead nucleus are presented at the parametrization scale \(Q^2 = 1.69~\mathrm{GeV}^2\). For better visibility, the s-quark modifications are presented with a different vertical axis scaling

Fig. 12

The EPPS16 gluon nuclear modifications in Pb at the scales \(Q^2 = 10~\mathrm{GeV}^2\) and \(Q^2 = 10^4~\mathrm{GeV}^2\) before and after reweighting with the dijet data

We observe that the CMS data and EPPS16 predictions are in good agreement within the uncertainties. This does not come as a surprise, as part of these data, namely the high-\(p_{\mathrm{T}}^\mathrm{ave}\) part of the pPb cross section [37], was used in the EPPS16 fit. Still, this agreement is not trivial: with the new pp baseline, and being a more differential measurement, these \(R_{\mathrm{pPb}}^{\mathrm{norm.}}\) data contain plenty of new information compared to the seven data points of forward-to-backward ratios included in the EPPS16 analysis. As was anticipated above, the data points at forward rapidities deviate from the central EPPS16 prediction, indicating a preference for a deeper shadowing in the nPDFs.

Compared to the data, the EPPS16 predictions have much larger uncertainties, which promises a good constraining power when fitting to these data. To study the impact these data would have had in the EPPS16 fit, we have performed a reweighting in the cubic–quadratic approximation introduced in Sect. 2.2, using \(\varDelta \chi ^2 = 52\) and taking the values of \(\delta z^\pm _k\) from Table 2 of Ref. [18]. The results for \(R_\mathrm{pPb}^{\mathrm{norm.}}\) are shown in Fig. 10. Most notably, there is a vast reduction in the EPPS16 uncertainties. Also, at forward rapidities the central prediction comes down a bit, as is expected from the low-lying data points in this region. In the backward direction a slight enhancement in the central prediction can be observed, but this is far less prominent than the suppression in the forward bins. In total, we obtain an improvement in the goodness of fit from \(\chi ^2/N_{\mathrm{data}} = 1.7\) to 1.4 with a penalty \(P/\varDelta \chi ^2 = 0.14\).

The corresponding effects on the EPPS16 nuclear modifications in lead at the parametrization scale \(Q^2 = 1.69~\mathrm{GeV}^2\) are presented in Fig. 11. There is a striking impact on the gluon-modification uncertainties, which are reduced across all x. In the best-constrained mid-x region, the uncertainties are reduced to less than half of their original size. As the uncertainty band lies clearly above unity in this region, we find strong evidence for gluon antishadowing in lead. At small x, the reweighted uncertainty band correspondingly lies below unity, giving evidence for gluon shadowing. These findings are in accordance with those of Ref. [38], where inclusive heavy-flavour production data from measurements at the LHC were used to study the gluon PDF modifications in nuclei. As expected from inspecting the ratio of the dijet spectra, the new central set seems to support stronger shadowing than the original EPPS16 central fit.

Even with the increased gluon shadowing, the most forward bins of \(R_{\mathrm{pPb}}^{\mathrm{norm.}}\) are not well reproduced by the reweighted results, which is also the reason why the \(\chi ^2/N_{\mathrm{data}}\) remained somewhat high even after the reweighting. To be consistent with these forward data points, a very deep shadowing for the gluons would be required. Moreover, the probed x region changes very little between the last and second-to-last \(\eta _{\mathrm{dijet}}\) data points, and thus a drop as steep as that suggested by the data is difficult to attain. This is because the DGLAP evolution efficiently smooths out even steep structures in the gluon nuclear modification, as can be seen in Fig. 12 where we show the gluon nuclear modifications evolved to higher scales. We also note that the systematic uncertainty dominates in the last \(\eta _\mathrm{dijet}\) bins, and thus taking into account the data correlations, once available, could improve the fit quality. These findings should, in the future, be contrasted also with the recent ATLAS conditional yield measurement, where a nuclear suppression of the order of 10–\(20\%\) for dijets was found in the most forward configuration [39].

Also at large x, the reweighted gluon modifications are better constrained than in the original EPPS16 analysis. The new central set has \(R_g^{\mathrm{Pb}}\) closer to unity at x around 0.7. This is partly enforced by the momentum sum rule in combination with the stiffness of the EPPS16 fit function and the deepened small-x shadowing. In any case, the uncertainty remains large, and either an enhancement or a suppression for gluons is possible in this region. On this basis, the conclusion made in Ref. [4], that the dijet data would give evidence of large-x gluon suppression, seems premature. This claim was based on a comparison of the data with the EPS09 [40] and DSSZ [41] nPDFs, where the former, with gluon suppression at large x, agreed well with the data at backward rapidities, but the latter, having the nuclear gluons unmodified, did not. However, going towards backward rapidities, and thus larger x from the Pb side, the contribution of nuclear quarks to the dijet cross section grows rapidly. Hence the difference between the predictions with EPS09 and DSSZ in this region has a large contribution from the different valence-quark modifications. As DSSZ has a much smaller large-x suppression for valence quarks than EPS09 (see e.g. Ref. [42]), this also partly explains the difference in the dijet predictions of Ref. [4].

Fig. 13

The impact on the average valence and sea quark and gluon modifications under different approximations in the reweighting

On these grounds, it might appear surprising that the dijet data are not able to constrain the valence quark modifications at all, as can be seen from the first two panels in Fig. 11. The reason for this is that, due to the smallness of isospin corrections [43], the backward dijet data mainly probe the average valence modifications,

$$\begin{aligned} R_{u_{\mathrm{V}} + d_{\mathrm{V}}}^{\mathrm{Pb}} = \frac{u^{\mathrm{p/Pb}}_\mathrm{V}+d^{\mathrm{p/Pb}}_{\mathrm{V}}}{u^{\mathrm{p}}_{\mathrm{V}}+d^{\mathrm{p}}_{\mathrm{V}}}, \end{aligned}$$
(36)

shown in Fig. 13. This combination is much better constrained than the individual flavours shown in Fig. 11 and has vastly smaller uncertainties at large x than the gluon modifications. Thus, while large-x valence quarks dominate the dijet cross section at backward rapidities, the uncertainty in the EPPS16 predictions in this region comes dominantly from the less-constrained gluons, and hence it is the gluon modifications which are constrained in the reweighting. Figure 13 shows also the average sea quark modification

$$\begin{aligned} R_{{\overline{u}}+{\overline{d}}+{\overline{s}}}^{\mathrm{Pb}} = \frac{{\overline{u}}^{\mathrm{p/Pb}}+{\overline{d}}^\mathrm{p/Pb}+{\overline{s}}^{\mathrm{p/Pb}}}{{\overline{u}}^\mathrm{p}+{\overline{d}}^{\mathrm{p}}+{\overline{s}}^{\mathrm{p}}}, \end{aligned}$$
(37)

which is the dominant quark combination constrained at forward rapidities. We observe a modest reduction in the small-x uncertainty, much smaller than that for the gluons. At the level of individual flavours, shown in Fig. 11, these constraints affect mostly the s-quark modifications, which were poorly constrained in EPPS16.

3.4 Importance of non-quadratic and non-linear terms in reweighting

We may now ask whether the inclusion of higher-order (non-quadratic and non-linear) components in the reweighting had a sizable effect on our results. Figure 13 shows the impact of the dijet data on the EPPS16 nuclear modifications in all three approximations discussed in Sect. 2.2. While, for simplicity of presentation, we show only the average valence and light-sea-quark modifications in addition to those for gluons, the conclusions below apply to individual flavours as well. We find that the cubic–quadratic and quadratic–quadratic approximations give almost identical results. This is rather easy to understand: the new data are precise enough to dominate the shape of the total \(\chi ^2\) function in the parameter directions that they constrain (mainly those related to gluon degrees of freedom), making the non-quadratic components sub-dominant in the reweighting. Moreover, as the new central set does not deviate far from the original, we are working in a region where the quadratic approximation for \(\chi ^2_\text {old}\) is rather good. Under different circumstances this might not be the case and the cubic terms could alter the reweighting results significantly.

Next, we consider the reweighting results in the quadratic–linear approximation. Here, we use the linear approximation for the cross sections, but decide to keep the quadratic dependence in the PDFs for better comparability.Footnote 5 Again, the differences to the results of the cubic–quadratic approximation are rather modest, though for the high-x gluons the quadratic–linear approximation seems to suggest slightly less stringent constraints. The similarity of results in the different approximations can also be seen as a reassuring fact: the results of reweighting do not seem to depend on minute details of our method and we seem to be able to make reliable conclusions based on rather limited information about the original global analysis, at least in this particular case. The obtained results are thus not likely to change if even higher-order contributions are added.

4 Summary and conclusions

In this work, we have presented a non-quadratic extension of the Hessian PDF reweighting introduced in Ref. [15] and applied the method in the context of CMS dijet measurements at 5.02 TeV. This improved method makes use of the knowledge of parameter variations at which the error sets of the original PDFs are defined, to solve for cubic components of the \(\chi ^2\) function before inclusion of new data. Similarly, quadratic components in the responses of observables to parameter variations were taken into account. The additional information needed in this cubic–quadratic approximation prevented us from using it when reweighting the CT14 NLO PDFs with the pp dijet distributions, where we had to resort to a simpler quadratic–quadratic approximation, but we were able to apply it to reweight the EPPS16 nPDFs, for which the needed information is available, with the nuclear modification ratio of the dijet spectra. While no large differences were found in the results of reweighting EPPS16 in the cubic–quadratic or quadratic–quadratic approximation, this observation was limited to one specific case, and under different circumstances the cubic terms could become more important. We thus encourage PDF fitters to publish the details of their analysis to a sufficient accuracy, such that the reweighting including the higher-order terms becomes possible. This can be done by publishing the numerical values of the \(\delta z^\pm _k\) parameters as defined in Sect. 2 in addition to the tolerance \(\varDelta \chi ^2\). Care must be taken in communicating which error set corresponds to each of these values, so that there is no chance of misinterpretation e.g. in what is called a “plus” and what a “minus” direction. A neat way to do this with LHAPDF [44] would be to set in each PDF grid file a custom flag such as “ParamVal” to hold the value \(\delta z^\pm _k\). These parameter values could then be retrieved by using the method info().get_entry("ParamVal") for each of the PDF error sets.
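As a sketch of how this could look for a user (assuming the set name below, that the “ParamVal” key follows the hypothetical convention proposed here, and that the LHAPDF Python bindings expose the info().get_entry accessor mentioned above):

```python
import lhapdf

set_name = "EPPS16nlo_CT14nlo_Pb208"   # example set; EPPS16 has 40 error members
for member in range(1, 41):            # members 1..40, skipping the central set 0
    pdf = lhapdf.mkPDF(set_name, member)
    dz = float(pdf.info().get_entry("ParamVal"))   # hypothetical custom key
    print(member, dz)
```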

Comparing the measured pp dijet pseudorapidity spectra with theory calculations using the CT14 NLO PDFs revealed a large discrepancy. We showed that at high \(p_{\mathrm{T}}^{\mathrm{ave}}\) this difference is larger than the associated scale uncertainties and exceeds the size of the NLO corrections, and is thus unlikely to be due to missing NNLO terms alone. This suggested the need for modifying the CT14 PDFs to reach a better agreement with the data. In reweighting CT14 with the dijet data, the gluon PDF acquired significant modifications, especially at large x, where a substantial reduction was observed. We also discussed evidence from other studies pointing in the same direction. After reweighting, a much more reasonable \(\chi ^2\) value for the dijet data was found, but this came at the price of a rather high penalty term, i.e. the new central set had moved quite far from the original minimum. The reason for this apparent discrepancy between CT14 and the dijet data remains elusive. We tested the reweighted PDFs against the CMS 7 TeV inclusive jet measurements, finding good agreement, and thus no conflict between the considered dijet and inclusive jet data. By performing a reweighting with an artificially low \(\varDelta \chi ^2\), we showed that the CT14 PDFs still had trouble in reproducing the data at \(\eta _{\mathrm{dijet}} \lesssim -1\), signaling a possible parametrization issue, although NNLO corrections and correlated systematics can also play a role here. Solving this issue is beyond the reach of the reweighting tools and should be studied in the context of a global analysis.

A similar discrepancy to that seen in the pp spectra is observed also in the pPb case. We showed that applying the same CT14 modifications as found in the reweighting with the pp data substantially improves the data-to-theory agreement also in pPb. As the pPb dijet distributions contain a substantial free-proton PDF dependence, a clean extraction of the nuclear modifications is not possible from these data directly. Taking the ratio of the pPb and pp spectra, however, leads to a very efficient cancellation of not only the free-proton uncertainties but also of the scale uncertainties, thus giving an excellent probe of the nPDFs. We showed that the measured nuclear-modification ratio of the dijet spectra is in good agreement with the NLO predictions using the EPPS16 nPDFs. Some deviation from the EPPS16 central prediction was observed at \(\eta _{\mathrm{dijet}} \gtrsim 2\), supporting a stronger shadowing for gluons than present in the EPPS16 central set. As a whole, these data give compelling evidence of small-x gluon nuclear shadowing and mid-x antishadowing, as was revealed in reweighting the EPPS16 nPDFs. We obtained significant new constraints on the EPPS16 gluon modifications in lead throughout the probed range, reducing the uncertainties to even less than half of their original size.