1 Introduction

Vector boson production is very relevant at hadron colliders and its measurement has achieved an increasing precision in the latest years [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25]. The experimental results have allowed the extraction of important Standard Model quantities like collinear parton densities and the mass of the W-boson [26,27,28,29]. The precise measurements of W cross section and similar observables need a good determination of QCD non-perturbative inputs which can be partially extracted from neutral boson mediated processes. Because we are interested in transverse momentum dependent observables it is mandatory to consider the extraction of transverse momentum dependent parton distribution functions (TMD) that have used the data of the neutral bosons at low transverse momentum [30,31,32,33,34,35].

In [36] it is pointed out that the low-transverse momentum spectrum for Z-boson production at LHC cannot be fully understood without the introduction of TMD, and further study is considered as necessary (see also [37]). The W-production is a natural test for the TMD factorization, and an explicit evaluation of the cross section within this approach is yet missing in the literature to our knowledge. On the other side there exist codes and predictions [38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62] for the W spectrum that include non-perturbative TMD effects only in a limited way.

The W-spectrum is also interesting in order to establish some properties of the TMD like their flavor dependence. In fact, out of all the extractions cited above, only [35] can be sensitive to some non-trivial flavor structure of TMD because it includes also data from semi-inclusive deep inelastic scattering (SIDIS) from Hermes [63] and Compass experiments [64], despite the fact that it analyzes only processes mediated by neutral vector bosons. Some preliminary work in this sense has been performed in [65] and later in [66] where the authors conclude that LHC data on W-boson production must be sensitive to the non-trivial flavor structure of TMD. This statement can partially be tested in the present work. We also consider of major importance to well establish the flavor dependence of the TMD before the starting of the Electron Ion Collider (EIC).

The results that we present want to provide a set of predictions for W-production which include the latest information from TMD studies. We revise the kinematics of W-boson transverse momentum distribution in order to include kinematical power suppressed terms as in [35] and to implement the fiducial cuts typical of these processes. We pay particular attention to errors in our predictions as coming from scale variations, PDF, TMD parameterization whose present knowledge is here described in detail. We recall that the TMD factorization applies to Drell–Yan (DY) type processes for values of the transverse momentum of the vector boson (\(q_T\)) much lower then its virtual mass (Q). The factorization theorem concludes that the non-perturbative parts of the \(q_T\)-differential cross-sections of boson production when \(q_T\ll Q\) are included in the transverse momentum dependent (TMD) distributions and, separately, their evolution kernel [67,68,69,70,71,72,73,74,75,76,77]. Phenomenologically it has been found that the TMD factorization at leading order works for \(q_T/ Q\lesssim 0.2\) [32, 34, 35] and we consider this range of validity also here. A theoretical estimate of this range has been provided in [78].

The theoretical perturbative calculations for TMD distributions at small-b performed in recent years is highly significative and recently it has reached the \(\hbox {N}^3\)LO precisionFootnote 1 [79,80,81,82,83,84,85,86,87]. Results of the same \(\hbox {N}^3\)LO already exist for the universal QCD anomalous dimensions [88,89,90,91,92], so that finally one has an extremely accurate perturbative input. At present the non-perturbative parts of the TMD are extracted at NNLO (which means \(\hbox {N}^3\)LL in the evolution kernel and NNLO in the matching of the TMD to PDF) so that for consistency we will use the perturbative results up to this order which are included in the code Artemide [93] and the non-perturbative parameters as extracted in [35]. The theoretical settings of this work are very similar to the ones of [35] and we make explicit use of the \(\zeta \)-prescription [77, 94]. The main difference with respect to the neutral boson case is represented by the particular kinematics of the W-production, which we study in detail in the next sections. Thanks to our explicit study of the leptonic tensor and implementation of fiducial cuts we can compare our predictions with existing results from LHC (ATLAS, CMS) and TeVatron (CDF, D). We explore explicitly new regions of the measured phase space where TMD effects can be significative and experimentally testable, especially for di-lepton masses below the W-mass peak.

In order to establish some notation we start writing the W mediated reaction

$$\begin{aligned} h_1 h_2 \rightarrow W^+(W^-) \rightarrow l^+(l^-)+\nu _l ({\bar{\nu }}_l), \end{aligned}$$
(1.1)

where \(h_{1,2}\) are hadrons (typically protons and antiprotons) and \(l^\pm =e^\pm , \mu ^\pm \) and \(\nu _l({\bar{\nu }}_l)\) are their corresponding (anti)neutrinos. We consider a cross section differential in the vector boson transverse mass (\(m_T\)) and transverse momentum (\(q_T\)). The typical TMD condition \(q_T\ll Q\) becomes in this case \(q_T\ll m_T\), while the relation among \(m_T\) and lepton momenta is highly non-trivial as outlined in the next sections. For our predictions, we consider the case of LHC experiments at \(\sqrt{s}=13\) TeV, assuming that lepton cuts and fiducial cross sections are similar to the case of \(\sqrt{s}=7\) TeV [25]. In principle this discussion can be extended to other center of mass energies. We consider several intervals of \(m_T\) and \(q_T\) which are relevant both for the TMD flavor determination and the mass of the W-boson. In this sense we discover that several intervals of these variables can be interesting for QCD studies and we investigate them in detail. We consider also the \(p_T^Z/p_T^W\) and \(p_T^{W^-}/p_T^{W^+}\) observables, in different ranges of the transverse invariant mass of the W.

The paper is prepared as in the following. In Sect. 2 we establish the notation and write the cross section of the W distributions with TMD factorization and we explain how fiducial cuts are implemented. The kinematical relations are here described in detail (and we are not aware of a similar detailed description in the literature). In Sect. 3 we list the source of errors for the various observables examined in this paper. In Sect. 4 we provide our prediction for W, \(p_T^{Z}/p_T^W\), \(p_T^{W^-}/p_T^{W^+}\) transverse momentum differential distributions. In Sect. 5 we compare the result of our code with existing theoretical and experimental results. We conclude in Sect. 6 and we provide some details of our calculations in the Appendix.

2 W-boson cross section in TMD factorization

In this section we present the cross section for charged Drell–Yan (DY) process in TMD factorization. Because the derivation of the factorized cross section is analogous to the neutral Drell–Yan case [35, 67,68,69,70,71,72,73,74,75,76,77], we omit a detailed discussion of the factorization, concentrating on the inputs relevant to us.

2.1 Charged DY cross section with transverse variables

Let us consider a charged DY process as in Eq. (1.1) in which two hadrons with momenta \(P_1\) and \(P_2\) lead to a lepton with momentum l and an (anti)neutrino with momentum \(l'\). We approximate the interaction among protons and the lepton-neutrino pair through the production of a \(W^\pm \) boson with momentum \(q=l+l'\). In this paper we do not consider masses of the initial hadrons nor masses of final-state particles, so \( P_1^2=M_1^2 \approx 0, \;P_2^2=M_2^2 \approx 0, \; l^2 =m_l^2 \approx 0, \; l'^2=m_{\nu _l}^2 \approx 0. \)

The combination of these momenta leads to the definition of the relevant kinematical variables of the DY process

$$\begin{aligned} s=(P_1+P_2)^2, \qquad q^2=Q^2, \qquad y=\frac{1}{2}\ln \left( \frac{q^0+q^z}{q^0-q^z}\right) ,\nonumber \\ \end{aligned}$$
(2.1)

where s represents the square of the center of mass energy, \(Q^2\) is the invariant mass of the lepton-neutrino pair and y is the rapidity of the produced W boson. As the neutrino is a non-detected particle the quantity \(Q^2\) is not measurable anymore and one introduces the transverse mass [21, 25, 60, 95,96,97] as a measurable quantity in processes with invisible particles in final state as

$$\begin{aligned} m_T^2=2|l_T||l_T'| (1-\cos \phi _{l\nu }), \end{aligned}$$
(2.2)

where \(l_T\) and \(l_T'\) are the transverse parts of the lepton and neutrino momenta respectively and \(\phi _{l\nu }\) is the relative angle between both particles.

The cross section of a general DY process mediated by a gauge-boson G can be written as in [35]

$$\begin{aligned} d\sigma =\frac{\alpha _{\mathrm{{em}}}^2}{2s}d^4 q \sum _{GG'} L_{GG'}^{\mu \nu } W_{\mu \nu }^{GG'} \Delta _G(q) \Delta ^*_{G'}(q), \end{aligned}$$
(2.3)

where \(\alpha _{\mathrm{{em}}}=e^2/4\pi \), being e the electron charge. This cross section is written in terms of a product of a lepton tensor \(L_{GG'}^{\mu \nu }\) and a hadron tensor \( W_{\mu \nu }^{GG'}\) defined as

$$\begin{aligned} L_{\mu \nu }^{GG'}= & {} e^{-2} \left\langle 0 | J_\mu ^G (0) |l,l' \rangle \langle l,l' | J_\nu ^{G'\dagger }(0) |0\right\rangle , \end{aligned}$$
(2.4)
$$\begin{aligned} W_{\mu \nu }^{GG'}= & {} e^{-2} \int \frac{d^4 x}{(2\pi )^4} e^{-i(x \cdot q)}\nonumber \\&\times \sum _X \langle P_1, P_2 | J_\mu ^{G\dagger } (x) |X \rangle \langle X| J_\nu ^{G'}(0) |P_1,P_2 \rangle , \end{aligned}$$
(2.5)

where \(J_\mu ^G\) is the current for the production of a gauge boson G.

On the other hand, \(\Delta _G\) is the (Feynman) propagator of the gauge boson, in this case \(G=W\) and

$$\begin{aligned} \Delta _G(q)=\frac{1}{Q^2-M_W^2+i \Gamma _W M_W} \delta _{GW}, \end{aligned}$$
(2.6)

where \(M_W\) and \(\Gamma _W\) are the mass and width of the W boson given in [98], respectively. We do not include electroweak (EW) corrections, which are however calculated in [99].

The W-production is usually expressed in terms of the transverse mass. The relation between the invariant and transverse mass in a general case is non-trivial, so we describe it here in detail. We have

$$\begin{aligned} Q^2=m_T^2+f(l,l'), \end{aligned}$$
(2.7)

and

$$\begin{aligned} f(l,l')=2 \left[ (l_T^2+l_z^2)^{1/2}(l_T'^2+l_z'^2)^{1/2}-l_zl_z'\right] -2|l_T| |l_T'|\ .\nonumber \\ \end{aligned}$$
(2.8)

In order to include this function in the cross section we introduce the identity

$$\begin{aligned} 1=\int d m_T^2 \, \delta (Q^2-m_T^2-f(l,l')), \end{aligned}$$
(2.9)

and the cross section in Eq. (2.3) can be rewritten as

$$\begin{aligned} d\sigma =\frac{\alpha _{\mathrm{{em}}}^2}{2s}d^4q \, dm_T^2 \sum _{GG'} L_{GG'}^{\mu \nu } W_{\mu \nu }^{GG'} \Delta _G(q) \Delta ^*_{G'}(q). \end{aligned}$$
(2.10)

Note that the Dirac delta in Eq. (2.9) is relocated inside the definition of the lepton tensor which simplifies the practical evaluation of the integral.

The next two sections are devoted to the factorization of the hadronic tensor and the final expression for the lepton tensor affected by transverse mass and fiducial cuts. This information will lead us to the final form of the cross section of charged DY within TMD factorization.

2.2 Factorization of the hadronic tensor

In this paper we use the unpolarized part of the hadron tensor from [35] neglecting the masses of both hadrons. Thus, for a generic gauge boson G we omit power suppressed higher-twist TMD contributions and we write

$$\begin{aligned} W_{\mu \nu }^{GG'}= & {} -\frac{g_T^{\mu \nu }}{\pi N_c} |C_V(Q^2,\mu )|^2 \sum _{ff'} z_{ff'}^{GG'}\nonumber \\&\times \int \frac{d^2 {\varvec{b}}}{4\pi } e^{i ({\varvec{q}} \cdot {\varvec{b}})} f_{1,f\leftarrow q}(x_1,{\varvec{b}},\mu ,\zeta )\nonumber \\&\times f_{1,f'\leftarrow q}(x_2,{\varvec{b}},\mu ,\zeta ), \end{aligned}$$
(2.11)

where \(C_V\) is the matching coefficient for vector current to collinear/anti-collinear vector and \(z_{ff'}^{GG'}\) are the EW factors that will be defined later. On the other hand \(f_1\) is the unpolarized TMDPDF defined as

$$\begin{aligned}&f_{1,f\leftarrow h}(x,{\varvec{b}},\mu ,\zeta )\nonumber \\&\quad =\int \frac{d\lambda }{2\pi } e^{-i x\lambda p^+} \sum _X \langle h(p)|{\bar{q}} (n\lambda + {\varvec{b}} ) W_n^\dagger (n\lambda + {\varvec{b}}) \nonumber \\&\qquad \times \frac{\gamma ^+}{2} |X \rangle \langle X | W_n(0) q(0) |h(p) \rangle , \end{aligned}$$
(2.12)

where p is the momentum of the hadron, \(W_n(x)\) is a Wilson line rooted at x and pointing along vector n to infinity. The momentum fraction x is defined in DY kinematics as

$$\begin{aligned} x_{1,2}=\frac{\sqrt{Q^2+q_T^2}}{\sqrt{s}}e^{\pm y}. \end{aligned}$$
(2.13)

In the small-b limit, the TMDPDF can be re-factorized in terms of matching coefficients (calculated up to NNLO in [82]) and integrated PDFs. In order to write a complete TMD, we consider also a function modeling non-perturbative effects that are not included into collinear PDFs

$$\begin{aligned} f_{1,f\leftarrow h}&(x,{\varvec{b}},\mu ,\zeta )= C(x,{\varvec{b}},\mu ,\zeta ) \otimes f_1(x,\mu ) f_{NP}(x,{\varvec{b}}), \end{aligned}$$
(2.14)

where the symbol \(\otimes \) represents the Mellin convolution in the x variable. We use the ansatz for \(f_{NP}\) suggested in [35]

$$\begin{aligned} f_{NP}(x,{\varvec{b}})=\exp \left( -\frac{\lambda _1(1-x)+\lambda _2 x +x(1-x) \lambda _5}{\sqrt{1+\lambda _3 x^{\lambda _4}{\varvec{b}}^2}}\right) , \end{aligned}$$
(2.15)

where the parameters \(\lambda _1, \ldots , \lambda _5\) are extracted from a combined DY+SIDIS fit to data in [35]. In the TMD parameterization of Eq. (2.14) the flavor dependence is in principle contained in the parameters \(\lambda _i\) and the PDF. The extraction of [35] has found that, given actual SIDIS data it is sufficient to count flavor dependence in PDF only. While we make predictions using the values of \(\lambda _i\) as in [35], it is possible that the W-boson measurements that we propose will be able to achieve a better control of these parameters, including a flavor dependence.

The angular part of the Fourier integral in Eq. (2.11) can be solved analytically and we can rewrite the hadron tensor

$$\begin{aligned} W_{\mu \nu }^{GG'}= & {} -\frac{g_T^{\mu \nu }}{\pi N_c} |C_V(Q^2,\mu )|^2 \sum _{ff'} z_{ff'}^{GG'} W^{ff'}_{f_1 f_1}\nonumber \\&\times (Q, q_T,x_1,x_2, \mu , \zeta ), \end{aligned}$$
(2.16)

where

$$\begin{aligned}&W^{ff'}_{f_1 f_1}(Q, q_T,x_1,x_2, \mu , \zeta )\nonumber \\&\quad =\int \frac{|{\varvec{b}}|d |{\varvec{b}}|}{2} J_0(|{\varvec{b}}| |{\varvec{q}}|) f_{1,f\leftarrow q}(x_1,{\varvec{b}},\mu ,\zeta )\nonumber \\&\quad \times f_{1,f'\leftarrow q}(x_2,{\varvec{b}},\mu ,\zeta ). \end{aligned}$$
(2.17)

2.3 Lepton tensor and fiducial cuts

The lepton tensor that enters in the cross section is integrated over the lepton and neutrino momenta, thus contracting it with the Lorentz structure that comes from the hadron tensor in Eq. (2.11) we obtain,

$$\begin{aligned}&-g_T^{\mu \nu }L_{\mu \nu }^{GG'(\mathrm{cuts})}\nonumber \\&\quad =32 z_{ll'}^{GG'}\int \frac{d^3l}{2E}\frac{d^3l'}{2E'} [ll'-(ll')_T]\theta (\mathrm{cuts}) \delta ^{(4)}(l+l'-q)\nonumber \\&\qquad \times \delta (Q^2-m_T^2-f(l,l')), \end{aligned}$$
(2.18)

where the extra label cuts represents the fiducial cuts introduced in different experiments. These cuts are implemented over the momenta of lepton and neutrino and the rapidity of the lepton

$$\begin{aligned} \eta _{\mathrm{{min}}}< \eta <\eta _{\mathrm{{max}}}, \qquad l_T^2> p_1^2, \qquad l_T'^2 >p_2^2. \end{aligned}$$
(2.19)

On the other hand note that the difference of this lepton tensor with the one for neutral DY production [32, 35] is the extra dependence on the transverse mass through the delta function introduced in Eq. (2.9). The integral in Eq. (2.18)

$$\begin{aligned}&I_W(Q^2,m_T^2,q_T)\nonumber \\&\quad =\int \frac{d^3l}{2E}\frac{d^3l'}{2E'} [ll'-(ll')_T] \theta (\mathrm{cuts}) \delta ^{(4)}(l+l'-q)\nonumber \\&\qquad \times \delta (Q^2-m_T^2-f(l,l')), \end{aligned}$$
(2.20)

cannot be solved analytically and only numerical results can be obtained. A detailed discussion about this integral can be found in Appendix A.

2.4 Final expression of the cross section

Once we have precise definitions for hadron and lepton tensor and taking into account that

$$\begin{aligned} d^4 q= \frac{\pi }{2} dQ^2 dy d q_T^2, \end{aligned}$$
(2.21)

we get the desired differential cross section as

$$\begin{aligned} \frac{d\sigma }{dm_T^2 dy d q_T^2}&=\int _0^\infty dQ^2 \frac{8}{N_c} \frac{\alpha ^2_{\mathrm{{em}}}}{s} I_W(Q^2,q_T, m_T^2)\nonumber \\&\quad \times \sum _{ff'}\sum _{GG'}z_{ll'}^{GG'} z_{ff'}^{GG'} \Delta _G (q) \Delta _{G'}^* (q) W_{f_1f_1}^{ff'} \nonumber \\&\quad \times (Q^2, q_T,x_1,x_2). \end{aligned}$$
(2.22)

where, in the case of W boson production, the product of EW factors for hadronic and leptonic part and propagators can be written as

$$\begin{aligned}&\!\!\!z_{ll'}^{GG'} z_{ff'}^{GG'}\Delta _G(q)\Delta _{G'}^*(q)\nonumber \\&\!\!\!\quad =\delta _{GW}\delta _{G'W} |V_{ff'}|^2 \frac{e_f e_{f'}}{Q^4} \frac{Q^4}{(Q^2-M_W^2)^2+\Gamma _W^2 M_W^2} z_l^{WW} z_q^{WW},\nonumber \\ \end{aligned}$$
(2.23)

where \(V_{ff'}\) are the elements of CKM matrix that mixes flavors, \(e_f\) are the quark charges in terms of the electron charge and

$$\begin{aligned} z_l^{WW} z_q^{WW}=\left( \frac{1}{4s_W^2}\right) ^2. \end{aligned}$$
(2.24)

Thus, the final expression for the desired cross section is

$$\begin{aligned} \frac{d\sigma }{dm_T^2 dy d q_T^2}= & {} \int _0^\infty \frac{dQ^2}{Q^4}\frac{8}{N_c} \frac{\alpha ^2_{\mathrm{{em}}}}{s} I_W(Q^2,q_T,m_T^2) \nonumber \\&\times \frac{1}{(4s_W^2)^2} \frac{Q^4}{(Q^2-M_W^2)^2+\Gamma _W^2 M_W^2}\nonumber \\&\times \sum _{ff'} |V_{ff'}|^2 e_f e_{f'} W_{f_1f_1}^{ff'} (Q^2, q_T,x_1,x_2).\nonumber \\ \end{aligned}$$
(2.25)

The plots of next sections are done using the PDF set of NNPDF31_nnlo_as_0118 [28]. The numerical inputs that we have used are

$$\begin{aligned} M_Z&=91.1876\text { GeV},\quad M_W=80.379\text { GeV},\quad \\ \Gamma _Z&=2.4952\text { GeV},\quad \Gamma _W=2.085\text { GeV},\quad \end{aligned}$$

and \(G_F=1.1663787\times 10^{-5}\) GeV.

3 Sources of errors

In the course of the paper we consider several sources of errors that come from the way we write and parameterize the cross section. We list them here as a reference for the following sections.

  • Scale variations In the present work we use the \(\zeta \)-prescription defined in [77] and we include the non-perturbative fixing of the \(\zeta \)-scale as defined in [35, 94]. As a result the scale variation is done changing the parameters \(c_{2,4}\in [0.5,2]\). These parameters define the uncertainty in the hard matching scale and in the matching of the TMD on the collinear PDF respectively.

  • Error from a reference PDF set using replicas The central value of our predictions is deduced using the PDF set NNPDF31_nnlo_as_0118 [28], which includes also LHC data. The error on each bin is evaluated taking the variance over 1000 replicas.

  • Predictions from different PDF sets In [35] the authors have performed fits of DY data at low and high energy with different sets of PDF and we report their results in Appendix B. We consider here the same sets of PDF and we show how the predictions change with these different sets.

  • Error from TMD parametrization In [35] the authors provide a set of replicas of their TMD parameterization and we provide the main estimation of this error using them. Alternatively, one could use a more standard error propagation coming from the estimate of the non-perturbative parameters. We have used this second method as a check, and we provide more details in Appendix C.

4 Observables in W production with TMD

In this section we concentrate on some observables that can be, in principle, sensitive to TMD effects and/or are relevant to establish some important properties of the W as its mass. We consider the W-boson differential cross section, the ratios for \(p_T^Z/p_T^W\), and \(p_T^{W^-}/p_T^{W^+}\) all of them as a function of the boson transverse momentum distribution and different intervals of the transverse mass. The fiducial cuts have been set as the ATLAS experiment [25] but for \(\sqrt{s}=13\) TeV. We have identified three interesting intervals of \(m_T\), namely [50,66] GeV, [66,99] GeV, [99,120] GeV corresponding to \(m_T\) values below, around, above the W-mass, with typical cuts on lepton momenta. The \(m_T\)-interval which is mostly studied in the literature is the one with \(m_T\in [66,99] \) GeV, however this is not the only interesting one from the perspective of studying the TMD properties. This is because non-perturbative TMD effect are expected to be more relevant for low values of \(m_T\), similarly to Drell–Yan processes. The \(q_T\) interval that we have considered, that is \(q_T/m_T\lesssim 0.2\) is consistent with the application for TMD factorization. In order to have a reference setup for PDF we have chosen NNPDF31_nnlo_as_0118 [28] as in [35] as it is one of the sets that include LHC data. Nevertheless we have also examined the results taking into account different sets. We have considered a \(q_T\) binning of 1 GeV, which much probably exceeds the possibilities of current experiments. We motivate this choice because it has been used also by other authors, see f.i. [60], and also because it is allows to better evidence the TMD effects.

4.1 Spectra of the \(W^\pm \)

Fig. 1
figure 1

(top) Unnormalized cross sections for \(W^-\) production with different intervals of lepton cuts. (bottom) Normalized cross sections for \(W^-\) production with different intervals of lepton cuts

We have considered the spectra of the \(W^\pm \) and in Fig. 1 we show the case for the \(W^-\) (similar plots are obtained for \(W^+\)). In the upper panels of Fig. 1 we have considered three possible cuts for lepton momenta as reported in the same figures. In the upper panels of Fig. 1 the cross section for \(m_T\in [99,120]\) GeV is not reported because it is much smaller than the others. In the same figure we have included error bands corresponding to scale variations. The figures in these panels show that the cuts have marginal impact for \(m_T\in [66,99]\) GeV while they can suppress the cross section in the lower interval \(m_T\in [50,66]\) GeV. The difference in the values of the cross sections however is about a factor 4. The interval \(m_T\in [99,120]\) GeV instead results to be even more suppressed an it is not shown in the figure. Given this suppression and the fact at high values of \(m_T\) all TMD effects are washed out we neglect it in the rest of the figures.

In the lower panels of Fig. 1 we normalize the cross section to its value integrated in \(q_T\) in the interval shown in the figure. The shape of the curves now changes and in the left and central panels the more peaked cross sections is the one obtained for \(m_T\in [50,66]\) GeV, which is partly due to the fact that we always select \(q_T/m_T\le 0.2\).

In order to drive a conclusion from these plots we have to recall the previous experience of the fit of [34, 35] and also the results of [36]. In these works it shown that in TMD analyses increasing the value of Q the non-perturbative QCD effects are washed out. As a a result having data below the \(W,\, Z\) boson peaks is extremely useful for this kind of research and it can provide valuable information. The plots shown in Fig. 1 actually show that within the current facility it is possible to achieve this goal. The bands shown in Fig. 1 come from scale variations.

Fig. 2
figure 2

Error for \(W^-\) cross section for \(p_T^{l,\nu }>25\) GeV, \(|\eta _l|<2.4\) and the \(m_T\) intervals [50, 66] GeV (left column), [66, 99] GeV (middle column), [99, 120] GeV (right column). In the first line we report the theoretical error from scale uncertainties as explained in the text. In the second line we have the error calculated as a variance in each bin of 100 replicas of the set NNPDF31_nnlo_as_0118 [28]. The uncertainty is referred to the average value of each bin (red line). The value of the cross section given by the central replica is represented by the green line. On the third line we represent the value of each been with different sets of PDF. On the fourth row we have the uncertainty due to non-perturbative parameters. The central value is given by the the central replica of NNPDF31_nnlo_as_0118 [28]

Fig. 3
figure 3

Error for \(W^+\) cross section for \(p_T^{l,\nu }>25\) GeV, \(|\eta _l|<2.4\) and the \(m_T\) intervals [50, 66] GeV (left column), [66, 99] GeV (middle column), [99, 120] GeV (right column). In the first line we report the theoretical error from scale uncertainties as explained in the text. In the second line we have the error calculated as a variance in each bin of 100 replicas of the set NNPDF31_nnlo_as_0118 [28]. The uncertainty is referred to the average value of each bin (red line). The value of the cross section given by the central replica is represented by the green line. On the third line we represent the value of each been with different sets of PDF. On the fourth row we have the uncertainty due to non-perturbative parameters. The central value is given by the the central replica of NNPDF31_nnlo_as_0118 [28]. The cross section for the central replica is shown in Fig. 7

In Figs. 2 and 3 we have studied the errors as listed in Sect. 3 for one particular set of fiducial cuts for \(W^-\) and \(W^+\) respectively. Similar errors are obtained also in the other cases. Going from the top row to the bottom one in Figs. 2 and 3 one finds: (1) the theoretical error from scale uncertainties; (2) the error calculated as a variance in each bin of 1000 replicas of the set NNPDF31_nnlo_as_0118 [28]. Here the bands are referred to the average value of each bin (red line). The value of the cross section given by the central replica is represented by the green line; (3) the uncertainty as coming from different sets of PDF; (4) the uncertainty due to non-perturbative parameters: in this case the central line is given by the the central replica of NNPDF31_nnlo_as_0118 [28].

There is a series of observations that one can make on these plots. As a first the smallest variations are provided by replicas and non-perturbative parameters and both of them are below \(1\%\). The scale variation is certainly the biggest source of error: it is in range 2–9\(\%\) for \(q_T\le \) 4 GeV, and about 1–2\(\%\) for \(q_T\ge \) 4 GeV. We recall that the present analysis uses \(\hbox {N}^3\)LL for TMD evolution and NNLO for all the rest. In the introduction it was pointed out that recently higher order perturbative calculations have been performed. It will be interesting to observe how the scale variation changes when including this higher order term. On the other side before doing this one should have a TMD extraction of the same order.

In exploring different PDF extractions we have considered the ones for which a TMD extraction has been provided in [35] . We recall that the HERA PDF [100] actually provided the best fit in that case. Observing Figs. 2 and 3 one sees that actually this set behaves differently with respect to the others, however the difference with NNPDF31 is below 1-2 \(\%\) an all over the range of qT that we have considered. For all the other sets, there is a major difference with NNPDF31 for \(q_T\le \)4 GeV while they agree within 1\(\%\) for greater values of \(q_T\).

The uncertainties due to non-perturbative parameters result to be almost negligible. The non-perturbative parameters have uncertainties coming from their extraction from data, that is from a \(\chi ^2\) analysis of fits. Despite the fact that some of these uncertainties may be significative on each single parameter the impact of each single variation is just a fraction of the impact of the whole non-perturbative TMD structure because their value is significantly different from zero. It is also possible that the errors on each parameter is under-estimated as suggested in [35].

4.2 Ratio \(p_T^Z/p_T^W\)

Fig. 4
figure 4

Ratio of \(Z/W^-\) spectrum for \(m_T\in [50,66]\) GeV (left column) , and \(m_T\in [66,99]\) GeV (right column). On the first row we have the \(Z/W^-\) spectrum using the central replica of NNPDF31_nnlo_as_0118 [28]. Uncorrelated and correlated theoretical uncertainties are given in second and third row respectively. In the third line we have the error calculated as a variance in each bin of 100 replicas of the set NNPDF31_nnlo_as_0118 [28]. The uncertainty is referred to the average value of each bin (red line). The value of the observable given by the central replica is represented by the green line. On the fourth line we represent the value of each been with different sets of PDF. On the fifth row we have the uncertainty due to non-perturbative parameters

Fig. 5
figure 5

Ratio of \(Z/W^+\) spectrum for \(m_T\in [50,66]\) GeV (left column) , and \(m_T\in [66,99]\) GeV (right column). On the first row we have the \(Z/W^-\) spectrum using the central replica of NNPDF31_nnlo_as_0118 [28]. Uncorrelated and correlated theoretical uncertainties are given in second and third row respectively. In the third line we have the error calculated as a variance in each bin of 100 replicas of the set NNPDF31_nnlo_as_0118 [28]. The uncertainty is referred to the average value of each bin (red line). The value of the observable given by the central replica is represented by the green line. On the fourth line we represent the value of each been with different sets of PDF. On the fifth row we have the uncertainty due to non-perturbative parameters

It has been pointed out since a long time that the observable

$$\begin{aligned} \left( \frac{1}{\Delta \sigma ^{Z}} \frac{d\sigma ^{Z}}{\, dq_T} \right) /\left( \frac{1}{\Delta \sigma ^W}\frac{d\sigma ^W}{\, dq_T} \right) \text { vs. } q_T \end{aligned}$$
(4.1)

can show a very small uncertainty because the PDF contributions tend to cancel and it can be useful for the measurements of the W mass (some recent phenomenological work can be found in [101, 102]). The region of \(q_T\) where this observable is actually measured corresponds to the region of validity of the TMD factorization theorem, so it is interesting to observe what is the impact of TMD. In formula 4.1 the numerator and the denominator are weighted by \(\Delta \sigma ^{W,Z}\) that is the cross section integrated in the \(q_T\) interval under study. This is preferable to normalizing to the total cross section \(\sigma ^{W,Z}\) which should be extracted elsewhere. The result of our prediction is shown in Figs. 4 and 5 for \(W^-\) and \(W^+\) respectively. We have considered two intervals of \(m_T\), being the lower one useful to control better TMD effects and several sources of error as in the previous section. The scale uncertainty now is treated as in [60], considering separately the correlated and uncorrelated cases.

The error from scale variations can be considered as correlated or uncorrelated as in [60], see second and third row in Fig. 4. The uncorrelated uncertainty is well below 1\(\%\) for the central interval and most of the low interval of \(m_T\). On the other side the correlated uncertainty is below 2\(\%\) only for \(q_T\ge \) 4 GeV. A similar trend is shown when different sets of PDF are used. In this case the spread of the results can be at most 2\(\%\) for \(q_T\le \) 4 GeV, and at most 1\(\%\) for \(q_T\ge \) 4 GeV. We find remarkable that the PDF error results slightly inferior for the low \(m_T\) interval, which confirms that this is an interesting case to study. Another interesting observation is that the difference between the HERA and NNPDF31 sets is always well below 1\(\%\). The other sources of error that we have considered, give uncertainties less the 1\(\%\) on all over the intervals that we have studied.

4.3 Ratio \(p_T^{W^-}/p_T^{W^+}\)

The ratio of \(W\pm \) cross sections shown in Fig. 6. We have considered again two intervals of \(m_T\) and it is nice to observe the similarity of these curves in the two intervals. The theoretical errors have been estimated and scale variations represent the biggest error. We have considered both correlated and uncorrelated scale variations and their value is very similar in this case. The scale variations keeps below 2\(\%\) for \(q_T>4\) GeV and grows below this value. All other uncertainties keep below the 1\(\%\).

Fig. 6
figure 6

Ratio of \(W^-/W^+\) spectrum for \(m_T\in [50,66]\) GeV (left column) , and \(m_T\in [66,99]\) GeV (right column). On the first row we have the \(W^-/W^+\) spectrum using the central replica of NNPDF31_nnlo_as_0118 [28]. Uncorrelated and correlated theoretical uncertainties are given in second and third row respectively. In the third line we have the error calculated as a variance in each bin of 100 replicas of the set NNPDF31_nnlo_as_0118 [28]. The uncertainty is referred to the average value of each bin (red line). The value of the observable given by the central replica is undistinguishable from the red line. On the fourth line we represent the value of each been with different sets of PDF. On the fifth row we have the uncertainty due to non-perturbative parameters

5 Comparisons with other groups and experiments

Fig. 7
figure 7

Comparison of our prediction with [60] including errors coming from scales variation. On the right the two cross sections are normalized to the central value of the prediction from Artemide

Fig. 8
figure 8

Ratio of \(W^+\) cross section without \(\lambda _i\) non-perturbative effects over the same cross section with the full model (blue band). The scale errors are shown by the bands. The orange band is ratio of the cross section in the full model over itself, and scale error band is also shown. The left, central, right panels correspond respectively to the cases of \(50\; \mathrm{GeV}< m_T<66; \mathrm{GeV}\), \(66\; \mathrm{GeV}< m_T<99; \mathrm{GeV}\), \(99\; \mathrm{GeV}< m_T<120; \mathrm{GeV}\)

The study of TMD in W-production has not been explored in its full potential in the literature, because the formulation of the TMD factorization theorem with full details is very recent. The main difficulty in establishing this kind of studies is that the TMD factorization only holds for \(q_T\ll Q\), and in this W-case \(q_T\ll m_T\). In this way we are interested in describing only a (relevant) part of the spectrum, the one which contains the peak of the distribution.

In this section we would like to compare the outcome of the results as coming from TMD factorization and recent TMD fits with theoretical and experimental results.

A theoretical prediction of the LHC case can be found in [60] and the comparison with us is done in Fig. 7. In this plot we show the cross section for the two groups in the interval 0 GeV \(<q_T <20\) GeV. The cross section is normalized to the total cross section as provided by [60], in order to have a reference value. The scale uncertainty in the two curves is similar, although slightly reduced in our case. There can be multiple reasons for this, like a different choice of scales or the usage of Monte Carlo calculation in [60]. One observes also a remarkable difference in the central values of the cross section for very low values of \(q_T\le 5\) GeV. The origin of the difference may have multiple motives starting from the parametrization of the non-perturbative effects (which is based on a broader data analysis in our case) to the set of prescription and scale fixing used by the two groups. In general this difference is expected where non-perturbative effects are significative as it is shown by the figure, while it is much less significative for \(q_T\ge 5 \) GeV. In order to explore this we have considered a modified version of the TMD where the effect of the constants \(\lambda _i\) in Eq. (2.15) is nullified. This is achieved considering \(f_{NP}(x,{\varvec{b}})=\exp (-\lambda b^2)\) with \(\lambda \simeq 10^{-3}\). The results are shown in Fig. 8 and they show to be consistent with Fig. 7 where the TMD non-perturbative part was not considered, despite the fact that the kinematical cuts considered in the two figure are slightly different. We postpone anyhow a deeper study to a different work.

We have considered also a comparison with Pythia 8.3 [103], with AZ tune as adapted by ATLAS experiment. We have found two version of this tune one as used in [60]Footnote 2 In the comparison of the cross sections Fig. 9 we observe a general shift of the Pythia result with respect to ours, whose sign depends on the value of \(q_T\), although the two estimates are mostly compatible within the errors (more for 66 GeV \(<m_T<\) 99 GeV than in the less energetic interval). In the ratio of \(W^\pm \) cross section (Fig. 10) instead the agreement is complete, both considering correlate and uncorrelated errors, providing Artemide a smaller error band. Similar conclusion come from the \(p_T^Z/p_T^W\) in Fig. 11, although with more difference for transverse momentum less than 5 GeV.

Fig. 9
figure 9

Comparison of Artemide cross section with Pythia 8.3 AZ tune as in [60] (blue band) and as in the original ATLAS release (green band) for \(W^+\) (top panels) and \(W^-\) (low panels). The Artemide error comes from scale variations, the Pythia errors are commented in the text

Fig. 10
figure 10

Comparison of Artemide \(p_T^{W^+}/p_T^{W^-}\) ratio with Pythia 8.3 AZ tune as in [60] (blue band) and as in the original ATLAS release (green band). Left panels show uncorrelated errors and right panels the correlated ones. The Artemide error comes from scale variations, the Pythia errors are commented in the text

Fig. 11
figure 11

Comparison of Artemide \(p_T^Z/p_T^{W}\) ratio with Pythia 8.3 AZ tune as in [60](blue band) and as in the original ATLAS release (green band).. Left panels show uncorrelated errors and right panels the correlated ones. The Artemide error comes from scale variations, the Pythia errors are commented in the text

Current experiments at LHC at the moment have a limited number of data points. For ATLAS we have found that for the low \(q_T\) region, one has only two points in [95], shown in Fig. 12. The cross section is normalized to its integrated value in the plotted \(q_T\) interval. The theoretical and experimental values are compatible within the errors. However it is evident that such a large binning does not allow a precise statement.

The CMS experiment has published an analysis of data for the \(s=\sqrt{8}\) TeV run in [21]. The data published by this collaboration are totally inclusive in \(m_T\), however because of the cuts on leptons, the values of \(m_T\le 20\) GeV give small contributions. In Fig. 13 we consider the cases of the W, Z/W and \(W^-/W^+\) transverse momentum spectrum comparing our results with [21]. The large bin for \(q_T>18\) GeV is the most sensitive to power corrections which however do not seem to create particular problems. The first thing that one can observe is that there is a good agreement between data and our prediction when theoretical errors are included, which suggests the fact that most of the QCD corrections in the CMS experiment are due to the TMD region.

Going to TeVatron experiments we have found data for and CDF and we show a comparison with our prediction in Fig. 14. In the central plot of this figure we do not show the experimental error, because the uncertainty of the bins are correlated through non-trivial correlation matrices. The agreement with these experiments is in general greater than with LHC, mainly due to the larger experimental error as the following considerations remark.

We have considered the cross sections \(\chi ^2\) for the different experiments with different sets of PDF and the results are shown in Table 1 (we also report the number of relevant points for each experiment). The CDF result are the ones with larger errors and TMD predictions agrees with them while for the at \(\sqrt{s}=1.8\) GeV, the agreement is worse. We have not considered the case of at \(\sqrt{s}=1.96\) GeV, because the computation of the error in this case involves the knowledge of the W-spectrum up to \(q_T=600\) GeV, which we do not have. In the case of LHC, in general we have a very limited number of points for each experiment. ATLAS has a remarkable agreement with the HERA20 PDF sets, definitely better than with other sets. The CMS case is more elaborate. For the case of electronic decay of W at CMS, the high value of \(\chi ^2\) is basically driven by just one point out of four, as can be seen Fig. 13, left panel (it is the point in the bin 7.5 GeV\(<q_T<\) 12.5 GeV). Removing this point, the \(\chi ^2\) is very similar to the ATLAS case. For the muon channel instead we cannot find a particular point which is responsible for the high \(\chi ^2\). As a final remark we recall that the CMS observable is totally inclusive on \(m_T\), which does not allow a perfect control of factorization hypothesis at the theoretical level. We have checked the impact on the \(\chi ^2\) of the theoretical errors (scale uncertainty and PDF replicas) for the case of NNPDF and HERA20 and we have reported it in Table 2. In both cases the errors on each bin are considered uncorrelated to the rest. The scale uncertainty error is dominated by the variation of \(c_4\), the parameter which associated to the scale at which TMD are matched onto PDF. We observe that including this error we have a big reduction of the \(\chi ^2\) on all experiments. In order to understand better this issue it would help to have experimental results with a definite interval of \(m_T\) and also a TMD extraction that includes the W processes, that we postpone to a future work. Concerning this last point we have compared the \(\chi ^2\) as coming from different extractions of TMD with the code artemide. The different extractions are obtained with the NNPDF31, PDF set, and using different data sets as shown in Table 3. We find a substantial agreement among all extractions, which suggests a mild flavor dependence of the TMD. In Table 2, the muon channel in CMS is still not fully agreeing with the theoretical prediction. This fact, which is put in evidence here for the first time, needs further study beyond the present work.

Fig. 12
figure 12

Comparison of our prediction with data from ATLAS in [95, table II], including errors coming from scales variation

Fig. 13
figure 13

Comparison of our prediction with data from CMS in [21]. Theoretical predictions include error coming from scales variation. The CMS data are always evaluated for \(p_T^l> 25\) GeV, \(p_T^\nu > 20\) GeV, \(|\eta _l|<2.4\).(Top left panel) W boson normalized spectrum for electron final state, (top right panel) W boson normalized spectrum for muon final state, (bottom left panel) W/Z ratio of transverse momentum normalized spectrum, (bottom right panel) \(W^+/W^-\) ratio of transverse momentum normalized spectrum

Fig. 14
figure 14

Comparison of our prediction with data from experiment at \(\sqrt{s}=1.8\) TeV [7] and\(\sqrt{s}=1.96\) TeV [12] and with CDF experiment at \(\sqrt{s}=1.8\) TeV [13]. Theoretical predictions include errors coming from scales variation

Table 1 \(\chi ^2/N\) using the extraction of TMD of [35]. For the CMS case \(W\rightarrow e \nu \) in parenthesis we also report the \(\chi ^2\) with one less point as explained in text
Table 2 \(\chi ^2/N\) using the extraction of TMD of [35] and theoretical errors (which include scale variation and PDF error coming from 1000 replicas)
Table 3 \(\chi ^2/N\) using extractions from different data sets. The PDF is NNPDF31. The first line is the same as in Table 4. For all extractions we have used the NNPDF31 PDF set

6 Summary and outlook

In this work we have set the status of the current knowledge of the W-boson spectrum within the TMD factorization formulation of its cross section and using the latest extraction of these distributions in [35]. We have considered the W-production differential cross section, the \(p_T^Z/p_T^W\) and the \(p_T^{W^+}/p_T^{W^-}\) distributions as functions of the boson transverse momentum \(q_T\) in different intervals of \(m_T\). The kinematic of the processes has been deeply studied, providing details that were not treated in the literature (to our knowledge), and showing explicitly how fiducial cuts are implemented together with kinematical power corrections. The kinematical description so achieved has put us in the condition to use the TMD extraction made in [35] with the same level of precision. The perturbative inputs are \(\hbox {N}^3\)LL for the evolution and NNLO for all the rest. Because of our experience with Z-boson production we have given particular attention to values of \(m_T\) below ([50,66] GeV) and around the W-mass ([66,99] GeV). In fact one expects to better control the non-perturbative QCD effects once the details of the observables below the W-mass are also known. It is remarkable that observables at low values for \(m_T\) have been poorly studied despite the fact that there is no particular suppression of the cross section and all errors look very similar.

An important chapter is the error analysis. We have considered as sources of error, scale variations, replicas uncertainty within the NNPDF31 PDF set, uncertainties due to different sets of PDF and the ones due to TMD non-perturbative parameterization. This analysis is interesting because it gives a first step in our understanding the errors. We consider it an important starting point that can certainly be improved with future work within TMD studies. For instance scale uncertainties can be checked including one more perturbative order (including very recent results at \(\hbox {N}^3\)LO, [83,84,85,86,87]), which however require a new TMD extraction. Then, the sets of PDF that we have checked are the ones analyzed also in [35], which is still limited. More PDF sets should be used and included in the TMD extraction. The actual impact of each source of error depends on the observable, and we have described it in Sect. 4.

At present the available data provide already some information on the observables that we have studied. We have considered the cases of TeVatron and LHC. The agreement in the cross section is reasonable for CDF, D at \(\sqrt{s}=1.8 \) TeV, ATLAS, while it is not completely satisfactory CMS. For all these cross sections we have done a \(\chi ^2\) analysis. The message is that inferring and estimating QCD non-perturbative contributions to W processes from neutral boson mediated ones has some effectiveness and that a more global treatment of data is also worth studying. This raises also the question of whether a more sophisticated TMD flavor dependence can improve the agreement with data. Other effects like QED contributions [104, 105] will also be addressed in future studies.

In order to perform this study we have proposed to consider several intervals of \(m_T\), for which we do not observe a particular suppression of the cross section. Comparing experiments, we find a striking difference between LHC and TeVatron data in the \(q_T\) binning of the final result. Definitely the 2 GeV binning of TeVatron allows a much better understanding of the QCD effects, despite the LHC precision. A similar binning would be also desirable for LHC. This problem has already been discussed also in the case of Drell–Yan in [36].