1 Introduction

A fundamental theory of Nature, such as string theory, should be able to describe at the same time particle physics and cosmology, which are phenomena that involve very different scales from the microscopic four-dimensional (4d) quantum gravity length of \(10^{-33}\) cm to large macroscopic distances of the size of the observable Universe \({\sim } 10^{28}\) cm, spanned a region of about 60 orders of magnitude. In particular, besides the 4d Planck mass, there are three very different scales with very different physics corresponding to the electroweak, dark energy and inflation. These scales might be related via the scale of the underlying fundamental theory, such as string theory, or they might be independent in the sense that their origin could be based on different and independent dynamics.

In this work, we make an attempt in this direction by connecting the scale of inflation with the electroweak and supersymmetry breaking scales within the same effective field theory, which at the same time allows the existence of an infinitesimally small (tuneable) positive cosmological constant describing the present dark energy of the Universe. To this end, we use a simple model of supersymmetry breaking in a tuneable metastable de Sitter vacuum, independent of the scale of supersymmetry breaking that can be in the TeV region [1,2,3,4]. The model is based on a shift symmetry of a string modulus S (which we identify with the string dilaton) along its imaginary (axionic) component, which is gauged by a U(1) vector multiplet. The latter can be for instance a linear combination of the ordinary baryon and lepton number, containing the matter parity which guarantees a dark matter candidate [5].

The superpotential W is completely fixed to a simple exponential (\(W=ae^{bS}\), in a Kähler basis where the shift symmetry is an R-symmetry) depending on two parameters, while a third parameter arises from the U(1) gauge coupling. On the other hand, the Kähler potential \(\mathcal K\) is an arbitrary function of \(S+{\bar{S}}\). However, since one is interested in a vacuum where the string coupling is weak (and therefore S large), in order to study the supersymmetry phenomenology, one can restrict \(\mathcal K\) to its tree-level logarithmic form, \(\mathcal{K}=-p\log (S+{\bar{S}})\) with \(p=1\) or 2, depending on whether S is associated to only the D9 or to the D9 and D5 gauge couplings in type I string theory [6].

This model has the necessary ingredients to be obtained as a remnant of moduli stabilisation within the framework of internal magnetic fluxes in type I string theory, turned on along the compact directions for several abelian factors of the gauge group. All geometric moduli can in principle be fixed in a supersymmetric way, while the shift symmetry is associated to the 4d axion and its gauging is a consequence of anomaly cancellation [7,8,9].

The resulting scalar potential has a metastable minimum with a tuneable positive vacuum energy that can be made infinitesimally small and one is left with one free parameter which fixes the scale of supersymmetry breaking. This is due to a tuning between the D- and the F-term contributions to supersymmetry breaking that can have opposite signs in supergravity. Coupling this model to the observable sector (MSSM) is straightforward but requires the introduction of an additional parameter in order to address the problem of anomaly cancellation or of tachyonic scalar masses for U(1) neutral matter fields [3, 4].

The main question we address in this work is whether the same scalar potential can provide inflation with the dilaton playing also the role of the inflaton at an earlier stage of the evolution of the Universe. We show that this is possible if one modifies the Kähler potential by a correction that plays no role around the minimum, but creates an appropriate plateau around the maximum. In general, the Kähler potential receives perturbative and non-perturbative corrections that vanish in the weak coupling limit. After analysing all such corrections, we find that only those that have the form of (Neveu–Schwarz) NS5-brane instantons can lead to an inflationary period compatible with cosmological observations. The scale of inflation turns out then to be of the order of low energy supersymmetry breaking, in the TeV region. On the other hand, the predicted tensor-to-scalar ratio is too small to be observed.

An interesting property of this model is that the inflaton is a component of the goldstino superpartner and shares some of the features of the models proposed in Refs. [10, 11],Footnote 1 although it does not belong to the same class, since there is a D-term contribution to supersymmetry breaking. As a result, the gravitino mass does not satisfy the property to be much lower than the supersymmetry breaking scale.

The outline of the paper is the following. In Sect. 2, we give a brief review of the model and compute the slow-roll parameters of the scalar potential to show that without modifying the Kähler potential, it does not give rise to inflation (Sect. 2.1). We then present its general extension, parametrising appropriately the quantum corrections to the Kähler potential (Sect. 2.2). In Sect. 3, we perform the analysis for a correction that has the form of NS5-brane instantons (depending on two additional parameters) for both \(p=2\) (Sect. 3.1) and \(p=1\) (Sect. 3.2) cases. In particular, we compute the corresponding slow-roll parameters and fix the two parameters of the correction, so that our model reproduces the spectral index and amplitude of density fluctuations in agreement with the cosmological data of Planck ’15. We also extract the predictions for the inflation scale, the number of e-foldings and the tensor-to-scalar ratio. In Sect. 4, we study the impact of the correction to the Kähler potential for the low energy superparticle spectrum that we find to be of order of 10%. Finally, Sect. 5 contains our concluding remarks.

2 The model

2.1 Revision of the model

In [1, 2] a \(\mathcal N=1\) supergravity model was proposed based on the gauged shift symmetry of a single chiral multiplet. For certain values of the parameters, the model allows for a tuneably small and positive value for the cosmological constant. Its anomaly cancellation conditions are discussed in [3], while the resulting low energy spectrum is discussed in [4, 5]. In this section we recall the main properties of the model. We use the conventions of [15].

The model consists of one chiral multiplet S, whose scalar component s is invariant under a gauged shift symmetryFootnote 2

$$\begin{aligned} s \longrightarrow s - ic \theta , \end{aligned}$$
(1)

where \(\theta \) is the gauge parameter, and c is a constant. The Kähler potential, superpotential and gauge kinetic function are given by (in appropriate Kähler coordinates where the superpotential is constant)

$$\begin{aligned}&\mathcal K(s, \bar{s}) = - \kappa ^{-2} p \log (s + \bar{s}) + \kappa ^{-2} b (s + \bar{s}),\nonumber \\&W(s) = \kappa ^{-3} a, \nonumber \\&f(s) = \gamma + \beta s, \end{aligned}$$
(2)

where \(\kappa ^{-1} = m_p = 2.4 \times 10^{15}\) TeV is the inverse of the (reduced) Planck mass. The scalar potential is given by

$$\begin{aligned}&\mathcal V = \mathcal V_F + \mathcal V_D, \nonumber \\&\mathcal V_F = e^{\kappa ^2 \mathcal K} \big ( - 3 \kappa ^2 W \bar{W} + \nabla _\alpha W g^{\alpha \bar{\beta }} \bar{\nabla }_{\bar{\beta }} \bar{W} \big ), \nonumber \\&\mathcal V_D = \frac{1}{2} \left( \text {Re}f \right) ^{-1 \ AB} \mathcal P_A \mathcal P_B, \end{aligned}$$
(3)

where the Greek indices \(\alpha , \beta \) label the chiral multiplets in the theory, and the capital Roman letters AB label the different gauge groups. In Eq. (3), the Kähler covariant derivative of the superpotential is

$$\begin{aligned} \nabla _\alpha W = \partial _\alpha W(z) + \kappa ^2 (\partial _\alpha \mathcal K) W(z), \end{aligned}$$
(4)

and the moment maps \(\mathcal P_A\) are given by

$$\begin{aligned} \mathcal P_A = i(k_A^\alpha \partial _\alpha \mathcal K - r_A), \end{aligned}$$
(5)

where \(k_A^\alpha \) are the Killing vectors, and \(r_A\) is the Fayet–Iliopoulos contribution satisfying \(W_\alpha k_A^\alpha = - \kappa ^2 r_A W\). In the above model \(k^s = -ic\) is the Killing vector associated with the shift symmetry Eq. (1). As a result, the scalar potential is given by (with \(\phi = s + \bar{s}\))

$$\begin{aligned}&\mathcal V = \mathcal V_F + \mathcal V_D, \nonumber \\&\mathcal V_F = \frac{ \kappa ^{-4} |a|^2 }{\phi ^p} e^{b\phi } \left( -3 + \frac{1}{p} \left( b\phi - p \right) ^2 \right) \nonumber \\&\mathcal V_D = \frac{\kappa ^{-4} c^2}{\beta \phi + 2\gamma } \left( b - \frac{p}{\phi } \right) ^2. \end{aligned}$$
(6)

For \(b > 0\), the potential always admits a supersymmetric AdS (anti-de Sitter) vacuum at \(\langle \phi \rangle = b/p\), while for \(b=0\) supersymmetry is broken in AdS space. We therefore focus on \(b<0\). In this case, the potential admits a supersymmetry breaking dS (de Sitter) vacuum for \(p < 3\). For example, for \(p=2\), \(\beta =1\) and \(\gamma = 0\),Footnote 3 the scalar potential Eq. (6) reduces to

$$\begin{aligned} \mathcal V&= \frac{ \kappa ^{-4} |a|^2 }{\phi ^2} e^{b\phi } \left( -3 + \frac{1}{2} \left( b\phi - 2 \right) ^2 \right) \nonumber \\&\quad + \frac{\kappa ^{-4} c^2}{\phi } \left( b - \frac{2}{\phi } \right) ^2. \end{aligned}$$
(7)

A vanishing cosmological constant can be found by tuning the parameters of the model. Solving the equations \(\mathcal V(\phi _{ \mathrm{{min}} }) = 0\) and \(\mathrm{d} \mathcal V( \phi _\mathrm{{min} } ) / \mathrm{d}\phi = 0\) gives

$$\begin{aligned} b \phi _\mathrm{{min}}&= l_0 \approx -0.183268, \end{aligned}$$
(8)

where \(l_0\) is the root of the polynomial \(-x^5 + 7 x^4 - 10 x^3 -22 x^2 +40 x + 8\) close to \(-0.18\), and

$$\begin{aligned} \frac{a^2}{b c} = \mathcal A(l_0) \approx -50.66. \end{aligned}$$
(9)

In Eq. (9), \(\mathcal A(l_0)\) is given by

$$\begin{aligned} \mathcal A(l_0) = \frac{e^{-l_0} }{l_0} \left( \frac{- 4 + 4 l_0 - l_0^2}{\frac{l_0}{2} - 2 l_0 -1} \right) . \end{aligned}$$
(10)

A non-zero cosmological constant \(\Lambda \) can be found if

$$\begin{aligned} \frac{a^2}{bc^2} = \mathcal A (l_0)+ \frac{\kappa ^4 \Lambda }{b^3 c^2} \left( \frac{l_0^2 e^{-l_0}}{\frac{l_0^2}{2} - 2l_0 -1} \right) . \end{aligned}$$
(11)

The gravitino mass parameter is given by

$$\begin{aligned} m_{3/2} = \kappa ^2 e^{\kappa ^2 \mathcal K/2} W = \frac{ \kappa ^{-1} a b}{l_0} e^{l_0/2}. \end{aligned}$$
(12)

Equation (8) fixes the VEV (Vacuum Expectation Value) of \(\phi \) as a function of the parameter b. The field \(\phi \) can be interpreted as the dilatonFootnote 4 related to the string coupling constant \(g_s\) by \(\phi _\mathrm{{min}} = 2/g_s\). A string coupling constant in the perturbative region, for example \(\phi _\mathrm{{min}} = 10\), can therefore be obtained for example by the choice \(b = -0.01820\). Since the parameters a and c are related by Eq. (9), this leaves only one free parameter, which can be tuned to obtain an \(O(10\,\text { TeV})\) gravitino mass. For example, the parameter choice \(c=0.61 \times 10^{-13}\) results in \(m_{3/2} = 12.83\,\text { TeV}\). However, we will show below that this model does not allow slow-roll inflation to occur.

The kinetic terms in the Lagrangian for the scalar \(\phi \) are given by

$$\begin{aligned} \mathcal L_s / e = -g_{s \bar{s}} \partial _\mu s \partial ^\mu \bar{s} = -\frac{p\kappa ^{-2} }{4} \frac{1}{\phi ^2} \partial _\mu \phi \partial ^\mu \phi . \end{aligned}$$
(13)

The canonically normalised field \(\chi \) therefore satisfies \(\chi = \kappa ^{-1} \sqrt{\frac{p}{2}} \log \phi \).

Fig. 1
figure 1

A plot of \( -\frac{\kappa ^4 \mathcal V (\rho )}{b^3 c^2}\) as a function of \(\rho = - b \phi \) (left), and a plot of the slow-roll parameter \(\eta \) as a function of \(\rho \) (right). The slow-roll condition \(|\eta | \ll 1\) is not satisfied for any value of the parameters abc

The slow-roll parameters are given by

$$\begin{aligned} \epsilon&= \frac{1}{2\kappa ^2} \left( \frac{\mathrm{d}V/\mathrm{d}\chi }{V}\right) ^2 = \frac{1}{2\kappa ^2}\left[ \frac{1}{V} \frac{\mathrm{d}V}{\mathrm{d}\phi } \left( \frac{\mathrm{d}\chi }{\mathrm{d}\phi } \right) ^{-1} \right] ^2, \nonumber \\ \eta&= \frac{1}{\kappa ^2} \frac{V''(\chi )}{V} = \frac{1}{\kappa ^2} \frac{1}{V} \left[ \frac{\mathrm{d}^2V}{\mathrm{d}\phi ^2} \left( \frac{\mathrm{d}\chi }{\mathrm{d}\phi }\right) ^{-2} - \frac{\mathrm{d}V}{\mathrm{d}\phi } \frac{\mathrm{d}^2 \chi }{\mathrm{d}\phi ^2} \left( \frac{\mathrm{d} \chi }{\mathrm{d}\phi } \right) ^{-3} \right] . \end{aligned}$$
(14)

It can be shown that, when the conditions (8) and (9) are satisfied, the slow-roll parameters depend only on \(\rho = - b \phi \),

$$\begin{aligned} \epsilon&= \frac{(\rho +2)^2 \left( \mathcal A(l_0) \rho \left( \rho ^2+2 \rho -2\right) -2 e^{\rho } (\rho +6)\right) ^2}{2 \left( \mathcal A(l_0) \rho \left( \rho ^2+4 \rho -2\right) -2 e^{\rho } (\rho +2)^2\right) ^2}, \nonumber \\ \eta&= \frac{2 e^{\rho } \left( 3 \rho ^2+32 \rho +60\right) - \mathcal A(l_0) \rho \left( \rho ^4+5 \rho ^3+10 \rho ^2+2 \rho -16\right) }{2 e^{\rho } (\rho +2)^2-\mathcal A(l_0) \rho \left( \rho ^2+4 \rho -2\right) }. \end{aligned}$$
(15)

Moreover, the scalar potential is given in terms of \(\rho \)

$$\begin{aligned} \frac{\kappa ^4 \mathcal V (\rho )}{b^3 c^2} = \frac{e^{-\rho } \left( \mathcal A(l_0) \rho \left( \rho ^2+4 \rho -2\right) -2 e^{\rho } (\rho +2)^2\right) }{2 \rho ^3}, \end{aligned}$$
(16)

where \(\mathcal A(l_0) \approx -50.66\) as in Eq. (9). In Fig. 1, a plot is shown of \( \frac{\kappa ^4 \mathcal V (\rho )}{|b|^3 c^2}\) as a function of \(\rho \). The minimum of the potential is at \(\rho _\mathrm{{min}} \approx 0.1832\) (see Eq. (8)), while the potential has a local maximum at \(\rho _\mathrm{{max}} \approx 0.4551\). A plot of the slow-roll parameter \(\eta \) (also in Fig. 1) shows that \(|\eta | \ll 1\) is not satisfied. This result holds for any parameters abc satisfying Eqs. (8) and (9).

A similar analysis to the one above can be performed for \(p=1\). In contrast with the case \(p=2\), vacua with a vanishing cosmological constant can also be found when \(\beta = 0\) and \(\gamma \ne 0\). Note, however, that for \(\beta \ne 0\) the Lagrangian contains a Green–Schwarz term

$$\begin{aligned} \mathcal L_\mathrm{GS} = \frac{\beta \ \text {Im}\left( s \right) }{8} \ \epsilon ^{\mu \nu \rho \sigma } F_{\mu \nu } F_{\rho \sigma }, \end{aligned}$$
(17)

which is not gauge invariant. In principle, such a term can be present if the model contains additional fields that are charged under the U(1) gauge symmetry (see Eq. (1)), in order to cancel their contribution to a possible cubic \(U(1)^3\) anomaly via a Green–Schwarz mechanism. However, it turns out (see for example [5]) that even in this case \(\beta \) is very small. We therefore take \(\beta = 0\) for simplicity.Footnote 5 Moreover, as long as the scalar potential is concerned, \(\gamma \) can be absorbed in other parameters of the model, and we therefore take \(\gamma = 1 \).

For \(\gamma =1\) and \(\beta =0\), vacua with a vanishing cosmological constant can be found if \(b\phi _\mathrm{{min}} =l_0 \approx -0.233153\), and \(\frac{a^2}{b c^2} = \mathcal A(l_0) \approx -2.783259\). However, a similar analysis as the one above shows that also in this case the slow-roll condition \(\eta \ll 1\) cannot be satisfied.

2.2 Extensions of the model that satisfy the slow-roll conditions

In the previous section we showed that the slow-roll conditions can not be satisfied in the minimal versions of the model. In this section we modify the above model by modifying the Kähler potential. While the superpotential is uniquely fixed (up to a Kähler transformationFootnote 6), the Kähler potential admits corrections that can always be put in the form

$$\begin{aligned} \mathcal K= -p \kappa ^{-2} \log \left( s + \bar{s} + \frac{\xi }{b} F(s + \bar{s}) \right) + \kappa ^{-2} b (s + \bar{s}), \end{aligned}$$
(18)

while the superpotential, the gauge kinetic function and moment map are given by

$$\begin{aligned}&W= \kappa ^{-3} a, \nonumber \\&f(s) = \gamma + \beta s, \nonumber \\&\mathcal P = \kappa ^{-2} c \left( b - p \frac{ 1 + \frac{\xi }{b} F_s }{s + \bar{s} + \frac{\xi }{b} F } \right) , \end{aligned}$$
(19)

where \(F_s = \partial _s F(s + \bar{s})\). The scalar potential is given by (\(\phi = s + \bar{s}\))

$$\begin{aligned} \mathcal V&= \mathcal V_F + \mathcal V_D, \nonumber \\ \mathcal V_F&= \kappa ^{-4} \frac{|a|^2 e^{b\phi } }{( \phi + \frac{\xi }{b} F)^p}\nonumber \\&\quad \times \left[ -3 - \frac{1}{p} \frac{ \left( b \left( b \phi + \xi F \right) - p (b + \xi F_\phi ) \right) ^2}{ \xi F_{\phi \phi } ( b \phi + \xi F) - (b + \xi F_\phi )^2} \right] , \nonumber \\ \mathcal V_D&= \kappa ^{-4} \frac{b^2 c^2}{2 \gamma + \beta \phi } \left[ 1 - p \frac{ 1 + \frac{\xi }{b} F_\phi }{b \phi + \xi F} \right] ^2. \end{aligned}$$
(20)

As was discussed above, we take \(\gamma = 1, \beta = 0\) for \(p=1\) and \(\gamma =0, \beta =1\) for \(p=2\).

Identifying \(\text {Re}(s)\) with the inverse string coupling, the function F may contain perturbative contributions that can be expressed as power series of \(1/(s+\bar{s})\), as well as non-perturbative corrections which are exponentially suppressed in the weak coupling limit. The later can be either of the form \(e^{-\delta (s+\bar{s})}\) for \(\delta >0\) in the case of D-brane instantons, or of the form \(e^{-\delta (s+\bar{s})^2}\) in the case of (Neveu–Schwarz) NS5-brane instantons (since the closed string coupling is the square of the open string coupling). We have considered a generic contribution of these three different types of corrections and we found that only the last type of contributions can lead to an inflationary plateau providing sufficient inflation. The other corrections can be present but do not modify the main properties of the model (as long as weak coupling description holds). In the following section, we analyse in detail a function F describing a generic NS5-brane instanton correction to the Kähler potential.

3 Slow-roll inflation

3.1 \(p=2\) case

We now consider the case with

$$\begin{aligned} F(\phi ) = \exp ( \alpha b^2 \phi ^2), \end{aligned}$$
(21)

where \(b<0\) and \(\alpha <0\). \(F(\phi )\) vanishes asymptotically at large \(\phi \). In this case, we obtain

$$\begin{aligned} \mathcal V_D = \frac{\kappa ^{-4}b^3 c^2}{b\phi }\left[ \frac{b \phi - 2 + \xi e^{\alpha b^2 \phi ^2}(1-4 \alpha b \phi )}{b \phi + \xi e^{\alpha b^2 \phi ^2}}\right] ^2 \end{aligned}$$
(22)

and

$$\begin{aligned} \mathcal V_F= & {} -\frac{\kappa ^{-4}|a|^2 b^2 e^{b \phi }}{2 \left( \xi e^{\alpha b^2 \phi ^2}+b\phi \right) ^2} \nonumber \\&\times \left[ \frac{\left( b\phi +\xi e^{\alpha b^2 \phi ^2 }(1-4 \alpha b \phi ) -2\right) ^2}{2 \alpha \xi e^{\alpha b^2 \phi ^2} \left( 2 \alpha b^3 \phi ^3+\xi e^{\alpha b^2 \phi ^2}-b\phi \right) -1}+6\right] .\nonumber \\ \end{aligned}$$
(23)

There are four parameters in this model namely \(\alpha \), \(\xi \), b and c. The first two parameters \(\alpha \) and \(\xi \) control the shape of the potential. There are some regions in the parameter space of \(\alpha \) and \(\xi \) that the potential satisfies the slow-roll conditions i.e. \(\epsilon \ll 1\) and \(|\eta | \ll 1\). In order to obtain the potential with flat plateau shape which is suitable for inflation and in agreement with Planck ’15 data, we chooseFootnote 7

$$\begin{aligned}&\alpha = -4.841115384560439 \quad \text { and }\nonumber \\&\quad \xi = 0.025350051999999998. \end{aligned}$$
(24)

Note that in the case of \(\xi =0\) and \(b < 0\), we can find the Minkowski minimum by solving the equations \(\mathcal V(\phi _\mathrm{min})=0\) and \(d \mathcal V(\phi _\mathrm{min})/\mathrm{d}\phi =0\), where \(\phi _\mathrm{min} = s_\mathrm{min}+\bar{s}_\mathrm{min}\) is the value of \(\phi \) at the minimum of the potential. In the case of \(\xi \ne 0\), we cannot solve the equations analytically and the relations (8), (9) are not valid. We can always assume that they are modified into

$$\begin{aligned}&b \phi _\mathrm{min} = l(\xi ,\alpha )\quad \text { and }\nonumber \\&\quad \frac{a^2}{bc^2} = -50.66016761885055 \times \lambda (\xi ,\alpha ,\Lambda )^2, \end{aligned}$$
(25)

where \(\lambda \) takes positive values and satisfies \(|\lambda - 1| \ll 1\). For any given value of parameters \(\xi \), \(\alpha \) and the cosmological constant \(\Lambda \), one can numerically fix the value of l and \(\lambda \). By fine-tuning the cosmological constant \(\Lambda \) to be very close to zero, we can numerically solve the equations \(\mathcal V=0\) and \(\mathrm{d}\mathcal V/\mathrm{d}\phi = 0\) for the value of l and \(\lambda \) in (25) as

$$\begin{aligned} l\approx & {} -0.180386,\end{aligned}$$
(26)
$$\begin{aligned} \lambda= & {} 1.0172553241157374. \end{aligned}$$
(27)
Fig. 2
figure 2

A plot of the scalar potential for \(p=2\), with \(b=-0.020\), \(b=-0.015\) and \(b = -0.012\). Note that we choose the parameters \(\alpha \) and \(\xi \) as in Eq. (24) with \(c = 0.06\)

Fig. 3
figure 3

A plot of the scalar potential for \(p=2\), with \(c=0.04\), \(c=0.05\) and \(c = 0.06\). Note that we choose the parameters \(\alpha \) and \(\xi \) as in Eq. (24) with \(b = -0.0182\)

Fig. 4
figure 4

A plot of \(\epsilon \) and \(\eta \) versus \(\phi \) for \(p=2\), with \(b=-0.0182\), \(c=0.61 \times 10^{-13}\), \(\xi = -0.1392\), where the parameters \(\alpha \) and \(\xi \) are given in Eq. (24). The vertical (red) line indicates the initial value of \(\phi \)

From Eq. (25), we can see that the third parameter, b, controls the vacuum expectation value \(\phi _\mathrm{min}\). This can be shown in Fig. 2 where we compare the scalar potential for different values of b. Motivated by string theory, we have the identification \(\phi \sim 1/g_s\). We can choose the value of the parameter b such that \(\phi _\mathrm{min}\) is of the order of 10 to make sure that we are in the perturbative regime in \(g_s\). The last parameter, c, controls the overall scale of the potential but does not change its minimum and its shape. This can be shown in Fig. 3. In the following, we will fix b and c by using the cosmological data.

In order to compare the predictions of our models with Planck ’15 data, we choose the following boundary conditions:

$$\begin{aligned} \phi _\mathrm{int} = 27.3204582176, \end{aligned}$$
(28)
$$\begin{aligned} \phi _\mathrm{end} = 22.6843813287. \end{aligned}$$
(29)

The initial conditions are chosen very near the maximum on the (left) side, so that the field rolls down towards the electroweak minimum. Any initial condition on the right of the maximum may produce also inflation, but the field will roll towards the SUSY vacuum at infinity. The results are therefore very sensitive to the initial conditions (Eqs. (28) and (29)) of the inflaton field.

The slow-roll parameters are given in Eq. (14). The total number of e-folds N can be determined by

$$\begin{aligned} N= \kappa ^2 \int _{\chi _\mathrm{end}}^{\chi _\mathrm{int}} \frac{\mathcal V}{\partial _{\chi } \mathcal V} \mathrm{d}\chi = \kappa ^2 \int _{\phi _\mathrm{end}}^{\phi _\mathrm{int}} \frac{\mathcal V}{\partial _\phi \mathcal V}\left( \frac{\mathrm{d}\chi }{\mathrm{d}\phi } \right) ^2 \mathrm{d}\phi . \end{aligned}$$
(30)

Note that we choose \(|\eta (\chi _\mathrm{end})| = 1\). We can compare the theoretical predictions of our model to the experimental results via the power spectrum of scalar perturbations of the CMB, namely the amplitude \(A_s\) and tilt \(n_s\), and the relative strength of tensor perturbations, i.e. the tensor-to-scalar ratio r. In terms of slow-roll parameters, these are given by

$$\begin{aligned}&A_s = \frac{\kappa ^4 \mathcal V_*}{24\pi ^2 \epsilon _*},\end{aligned}$$
(31)
$$\begin{aligned}&n_s = 1 + 2\eta _* - 6\epsilon _*,\end{aligned}$$
(32)
$$\begin{aligned}&r = 16 \epsilon _*, \end{aligned}$$
(33)

where all parameters are evaluated at the field value \(\chi _\mathrm{int}\).

In order to satisfy the Planck ’15 data, we choose the parameters \(b = -0.0182 \), \(c=0.61 \times 10^{-13}\). The slow-roll parameters \(\epsilon \) and \(\eta \) during the inflation are shown in Fig. 4. The value of the slow-roll parameters at the beginning of inflation are

$$\begin{aligned} \epsilon (\phi _\mathrm{int}) \simeq 1.86 \times 10^{-24}\quad \text { and }\quad \eta (\phi _\mathrm{int}) \simeq -1.74 \times 10^{-2}. \end{aligned}$$
(34)

The total number of e-folds N, the scalar power spectrum amplitude \(A_s\), the spectral index of curvature perturbation \(n_s\) and the tensor-to-scalar ratio r are calculated and summarised in Table 1, in agreement with Planck ’15 data [18]. Figure 5 shows that our predictions for \(n_s\) and r are within 1\(\sigma \) C.L. of Planck ’15 contours with the total number of e-folds \(N \approx 1075\). Note that N is the total number of e-folds from \(\phi _\mathrm{int}\) to \(\phi _\mathrm{end}\). However, the number of e-folds associated with the CMB observation corresponds to a period between the time of horizon crossing and the end of inflation, which is much smaller than 1075. According to the general formula in [18], the number of e-folds between the horizon crossing and the end of inflation is roughly estimated to be around 50–60.

Fig. 5
figure 5

We plot the theoretical predictions for the case \(p=2\), shown in Table 1, in the \(n_s \)r plane together with the Planck ’15 results for TT, TE, EE, \(+\) lowP and assuming \(\Lambda \mathrm{CDM} + \mathrm{r}\) [18]

Table 1 The theoretical predictions for \(p=2\), with \(b =-0.0182\) and \(c = 0.61\times 10^{-13}\), where \(\alpha \) and \(\xi \) are given in Eq. (24)

We would like to remark that the parameter c also controls the gravitino mass at the minimum of the potential around O(10) TeV. Indeed, the gravitino mass is written as

$$\begin{aligned} m_{3/2} = \kappa ^2 e^{\kappa ^2\mathcal {K}/2 } W = \frac{1}{\kappa }\left( \frac{a b e^{b \phi /2}}{b \phi + \xi F(\phi )} \right) . \end{aligned}$$
(35)

For \(b=-0.0182\), we get \(\phi _{\min } \approx 9.91134\) and for the gravitino mass at the minimum of the potential

$$\begin{aligned} \langle m_{3/2}\rangle \approx 14.98\,\text {TeV}. \end{aligned}$$
(36)

The Hubble parameter during inflation (evaluated at \(\phi _* = \phi _\mathrm{int}\)) is

$$\begin{aligned} H_* = \kappa \sqrt{\mathcal V_* / 3} = 1.38\,\text {TeV}. \end{aligned}$$
(37)

This shows that our predicted scale for inflation is of the order of TeV. The mass of gravitino during the inflation \(m^*_{3/2} = 4.15\) TeV is higher than the inflation scale, and the gauge boson mass is \(M_{A_\mu }^* = 3.12\) TeV.Footnote 8 In fact, the gauge boson acquires a mass due to a Stueckelberg mechanism by eating the imaginary component of s, where its mass at the minimum of the potential is given by

$$\begin{aligned} \langle M_{A_\mu } \rangle = 15.48\,\text {TeV}. \end{aligned}$$
(38)

As a result, the model essentially contains only one scalar field \(\text {Re}(s)\), which is the inflaton. This is in contrast with other supersymmetric models of inflation, which usually contain at least two real scalars [23].Footnote 9

3.2 \(p=1\) case

In this case, we obtain

$$\begin{aligned} \mathcal V_D = \frac{\kappa ^{-4}b^2 c^2}{2}\left[ \frac{b \phi - 1 + \xi e^{\alpha b^2 \phi ^2}(1-2 \alpha b \phi )}{b \phi + \xi e^{\alpha b^2 \phi ^2}}\right] ^2 \end{aligned}$$
(39)

and

$$\begin{aligned} \mathcal V_F= & {} -\frac{\kappa ^{-4}|a|^2 b e^{b \phi }}{ \xi e^{\alpha b^2 \phi ^2}+b\phi } \nonumber \\&\times \left[ \frac{\left( b\phi +\xi e^{\alpha b^2 \phi ^2 }(1-2 \alpha b \phi ) -1\right) ^2}{2 \alpha \xi e^{\alpha b^2 \phi ^2} \left( 2 \alpha b^3 \phi ^3+\xi e^{\alpha b^2 \phi ^2}-b\phi \right) -1}+3\right] .\nonumber \\ \end{aligned}$$
(40)

The potential has similar properties to the \(p=2\) case, although it may give different phenomenological results at low energy. For \(\xi =0\) and \(b < 0\), the Minkowski minimum satisfies the following relations [2]:

$$\begin{aligned}&b \phi _\mathrm{min} = l_0 \approx -0.233153\quad \text { and }\nonumber \\&\frac{bc^2}{a^2} = \mathcal {A}(l_0) \approx - 0.35929085159984514. \end{aligned}$$
(41)

However, similar to the previous case, the above relations are not valid when \(\xi \ne 0\) and we assume that they are modified into

$$\begin{aligned}&b \phi _\mathrm{min} = l(\xi ,\alpha )\quad \text { and }\nonumber \\&\quad \frac{bc^2}{a^2} = - 0.35929085159984514 \times \lambda (\xi ,\alpha ,\Lambda )^{-2}. \end{aligned}$$
(42)

By choosing \(\alpha = -0.781\) and \(\xi = 0.3023\) and tuning the cosmological constant \(\Lambda \) to be very close to zero, we can numerically fix \(l = -0.562536\) and \(\lambda = 1.2937645\) for this case. The gravitino mass for \(p=1\) case can be written as

$$\begin{aligned} m_{3/2} = \kappa ^2 e^{\kappa ^2\mathcal {K}/2 } W = \frac{1}{\kappa }\left( \frac{a \sqrt{b} e^{b \phi /2}}{\sqrt{b \phi + \xi F(\phi )}} \right) . \end{aligned}$$
(43)

By choosing the parameters \(b=-0.0234\), \(c=1 \times 10^{-13}\), the gravitino mass at the minimum of the potential is

$$\begin{aligned} \langle m_{3/2} \rangle = 18.36\,\text {TeV}. \end{aligned}$$
(44)

with \(\phi _{\min } \approx 21.53\), and

$$\begin{aligned} \langle M_{A_\mu } \rangle = 36.18\,\text {TeV}. \end{aligned}$$
(45)

By choosing appropriate boundary conditions, we find

$$\begin{aligned}&\phi _\mathrm{int} = 64.5324301784,\end{aligned}$$
(46)
$$\begin{aligned}&\phi _\mathrm{end} = 50.9915000000. \end{aligned}$$
(47)

As summarised in Table 2, the predictions for the \(p=1\) case are similar to those of \(p=2\), in agreement with Planck ’15 data with the total number of e-folds \(N\approx 888\). In this case, the Hubble parameter during inflation is

$$\begin{aligned} H_* = \kappa \sqrt{\mathcal V_* / 3} = 5.09\,\text {TeV}. \end{aligned}$$
(48)

Note that, for the \(p=1\) case, the mass of the gauge boson is \(M_{A_\mu }^* = 6.78\,\text {TeV}\), and the mass of the gravitino during inflation is \(m^*_{3/2} = 4.72\) TeV, a bit smaller than but comparable with the Hubble scale. It would be interesting to investigate how much this would affect the power spectrum and bispectrum, inspired by a class of inflation models called quasi-single field inflation [19,20,21,22,23,24,25,26,27], which may contain also a light inflaton as well as massive fields of Hubble scale mass, which can amplify non-Gaussianities. Moreover, the U(1) mass is close to the Hubble scale (although slightly above it) so that it may produce observable effects in the E-mode polarisation spectrum.

Table 2 The theoretical predictions for \(p=1\) case with \(b=-0.0234\), \(c=1 \times 10^{-13}\), \(\alpha = -0.781\) and \(\xi = 0.3023\)

4 SUGRA spectrum

The above model can be coupled to MSSM-like fields \(\varphi \). In this case the multiplet containing the inflaton \(\text {Re}(s)\) is considered to be a “hidden sector” field which is responsible for breaking supersymmetry by both F- and D-terms, as described above. The supersymmetry breaking is then communicated to the visible sector (MSSM) via gravity mediation. We consider the following Kähler potential and superpotential:

$$\begin{aligned}&\mathcal K = \mathcal K(s + \bar{s} ) + \sum \varphi \bar{\varphi }, \nonumber \\&W = W_h(s) + W_\mathrm{{MSSM}}, \end{aligned}$$
(49)

where \(\mathcal K(s + \bar{s})\) is given by Eq. (18), the hidden sector superpotential \(W_h(s)\) is given by Eq. (19), and \(W_\mathrm{{MSSM}}\) is the MSSM superpotential, which only depends on the MSSM fields \(\varphi \). The soft supersymmetry breaking terms can be calculated as follows:

$$\begin{aligned}&m_0^2 = e^{\kappa ^2 \mathcal K} \big ( -2 \kappa ^4 W_h(s) \bar{W}_h(s) + \kappa ^2 g^{s \bar{s}} \left| \nabla _s W_h \right| ^2 \big ), \nonumber \\&A_0 = \kappa ^2 e^{\kappa ^2 \mathcal K /2 } g^{s \bar{s}} K_s \big ( \bar{W}_{\bar{s}} + \kappa ^2 K_s \bar{W} \big ), \nonumber \\&B_0 = \kappa ^2 e^{\kappa ^2 \mathcal K /2 } \big ( g^{s \bar{s}} K_s \big ( \bar{W}_{\bar{s}} + \kappa ^2 K_s \bar{W} \big ) - \bar{W} \big ). \end{aligned}$$
(50)

Here, \(m_0^2\) is the scalar soft mass squared. All trilinear couplings are the same and given by \(A_0 \hat{y}_i\), where \(\hat{y}_i\) are the Yukawa couplings of the rescaled MSSM superpotential \(\hat{W}_\mathrm{{MSSM}} = e^{\kappa ^2 \mathcal K /2} W_\mathrm{{MSSM}}\). The \(B\mu \)-term parameter is given by \(B_0 \hat{\mu }\), where \(\hat{\mu }= e^{\mathcal K/2} \mu \).

For \(p=2\) the Lagrangian contains a Green–Schwarz term Eq. (17), and the theory is not gauge invariant (without the inclusion of extra fields that are charged under the U(1)). We therefore focus on \(p=1\). The soft terms can be written in terms of the gravitino mass (see Eq. (35))

$$\begin{aligned}&m_0^2 = m_{3/2}^2 \left[ - 2 + \mathcal C \right] , \nonumber \\&A_0 = m_{3/2} \ \mathcal C, \nonumber \\&B_0 = A_0 - m_{3/2}, \end{aligned}$$
(51)

where

$$\begin{aligned} \mathcal C&= \left. - \frac{\big ( - \xi e^{\alpha b^2 \phi ^2} +b \phi \big (4 \alpha \xi e^{\alpha b^2 \phi ^2}-1\big )+2\big )^2}{4 \alpha \xi ^2 e^{2 \alpha b^2 \phi ^2}-4 \alpha b \xi \phi e^{\alpha b^2 \phi ^2}+8 \alpha ^2 b^3 \xi \phi ^3 e^{\alpha b^2 \phi ^2}-2} \right| _{\phi = \phi _\mathrm{{min}} }. \end{aligned}$$
(52)

Using the parameters presented in Sect. 3.2, we find \(m_{3/2} = 18.36\) TeV and \(\mathcal C = 1.53\). For \(\xi = 0\) the model reduces to the one analysed in [4], where one has \(\mathcal C = 1.52\) and \(m_{3/2} = 17.27\) TeV (with \(\phi _\mathrm{{min}} = 9.96\)). As in [4], the scalar soft mass is tachyonic. This can be solved either by introducing an extra Polonyi-like field, or by allowing a non-canonical Kähler potential for the MSSM-like fields \(\varphi \). The resulting low energy spectrum is expected to be similar to the one described in [4]. We do not perform this analysis, but we only summarise their results.

Since the tree-level contribution to the gaugino masses vanishes, their mass is generated at one-loop by the so-called ‘Anomaly Mediation’ contribution [28,29,30]. As a result, the spectrum consists of very light neutralinos (\(O(10^2)\) GeV), of which the lightest (a mostly Bino-like neutralino) is the LSP dark matter candidate, slightly heavier charginos and a gluino in the 1–3 TeV range. The squarks are of the order of the gravitino mass (\({\sim }10\) TeV), with the exception of the stop squark, which can be as light as 2 TeV.

5 Conclusions

In this paper, we have studied the cosmology of a simple model of supersymmetry breaking, based on a single chiral multiplet (the string dilaton) with a gauged shift symmetry (R-symmetry in an appropriate field basis) leading to a de Sitter vacuum with a tuneable cosmological constant [1, 2]. When coupled to the MSSM, the model have been shown to yield an interesting low energy pattern of supersymmetry breaking, depending on one parameter, distinct from other proposals [4, 5]. By modifying the dilaton Kähler potential with a non-perturbative correction that could arise from NS5-brane instantons (depending on two extra parameters), we show that the scalar potential can acquire an inflationary plateau producing sufficient inflation, consistent with cosmological observations, without altering the low energy particle spectrum around the minimum.

The predicted tensor-to-scalar ratio is unfortunately too small to be detected. On the other hand, our model may lead to measurable non-gaussianities and/or E-mode polarisation effects that deserve further study.

Our model provides therefore an interesting example of connecting the inflation sector with the ‘hidden’ sector of supersymmetry breaking by identifying the inflaton with the dilaton which is also the scalar partner of the goldstino (sgoldstino). It does not belong, however, to the class of models studied in the past [10, 11], since there is also a D-term component in the order parameter of supersymmetry breaking. It would be interesting to analyse in detail the generalised class of such models.

Another interesting question is the possible implementation of this model in string theory. As mentioned in the introduction, the framework of moduli stabilisation by internal magnetic fields for several abelian factors of the gauge group along the compact directions of the compactified manifold, combined with non-criticality, seems to be a promising direction to explore.