Abstract
We consider the classical vacua of the Weinberg–Salam (WS) model of electroweak forces. These are no-particle, static solutions to the WS equations minimizing the WS energy locally. We study the WS vacuum solutions exhibiting a non-vanishing average magnetic field of strength b and prove that (i) there is a magnetic field threshold \(b_*\) such that for \(b<b_*\), the vacua are translationally invariant (and the magnetic field is constant), while, for \(b>b_*\), they are not, (ii) for \(b>b_*\), there are non-translationally invariant solutions with lower energy per unit volume and with the discrete translational symmetry of a 2D lattice in the plane transversal to b, and (iii) the lattice minimizing the energy per unit volume approaches the hexagonal one as the magnetic field strength approaches the threshold \(b_*\). In the absence of particles, the Weinberg–Salam model reduces to the Yang–Mills–Higgs (YMH) equations for the gauge group U(2). Thus, our results can be rephrased as the corresponding statements about the U(2)-YMH equations.
Similar content being viewed by others
1 Introduction
The Weinberg–Salam (WS) model of electroweak interactions [22, 35, 47] was the first triumph of the program to unify the four fundamental forces of nature. It is a key part of the standard model of elementary particles. It unifies electromagnetic and weak interactions, two of the three forces dealt with in the standard model. It involves particle, gauge and the Higgs fields.
While the gauge fields describe the electroweak interactions, the role of the Higgs field is to convert the original massless fields (zero masses are required by the relativistic invariance) to massive ones. This phenomenon is called the Higgs mechanism. This mechanism, together with the Goldstone theorem, leads to all gauge particles but one acquiring mass, resulting in two massive bosons—denoted W and Z—and a massless one—the photon. The W and Z particles where discovered experimentally 16 years after their theoretical prediction.
In this paper, we consider the vacuum solutions of the classical WS model with a non-vanishing average magnetic field \(\langle \vec b\rangle \). These are static, no-particle solutions minimizing the WS energy locally for a fixed \(\vec b\). They are also no-particle solutions of the entire standard model.Footnote 1
We prove that (i) there is a magnetic field threshold \(b_*\) such that for \(|\vec b|<b_*\), the vacua are translationally invariant, while, for \(|\vec b|>b_*\), they are not, (ii) for \(|\vec b|>b_*\), there are non-translationally invariant solutions with lower energy per unit volume and with the discrete translational symmetry of a 2D lattice in the plane transversal to the magnetic field, and (iii) the lattice minimizing the energy of the latter solutions per unit volume approaches the hexagonal one as the magnetic field strength approaches the threshold \(b_*\). We expect that these solutions are stable under field fluctuations and, in fact, minimize the energy locally.
The phenomenon above was investigated extensively in the physics literature (see, e.g. [5,6,7,8,9,10,11,12, 19, 26,27,28,29, 32, 41, 42] and the references therein). It is similar to the one occurring in superconductivity and the solutions whose existence we establish are analogous to the superconducting Abrikosov vortex lattices ([1], see, e.g. [37], for a review). It is estimated in [26] that the spontaneous symmetry breaking takes place at the critical average magnetic field of approximately \(10^{24}\) Gauss \(=10^{20}\) Tesla. By comparison, the strongest magnetic field produced on Earth is \(10^{14} \) Tesla.
Note that, in the absence of particles, the WS system reduces to the Yang–Mills–Higgs (YMH) one with the gauge group U(2). So ultimately, these are the equations we deal with.
The only rigorous result [43, 44] on the classical WS model deals with the vortices in the self-dual regime, where the WS (or corresponding YMH) equations are equivalent to the first-order equations, and it uses this equivalence in an essential way. (The self-dual regime in this context was discovered in [6,7,8], see also [41, 42].)
Open problems and further directions:
(a) Stability of the emerging solutions.
(b) Existence of vortex lattices at \(|\vec b|\gg b_*\).
(c) Quantum corrections to the values of the classical critical magnetic field \(b_*\) and the optimal lattice shape parameter \(\tau _*\).
For the stability and existence problems, (a) and (b), see, e.g. [38, 40] and [39], respectively. The last problem brings up the regime of ‘sparse’ vortex lattices as opposite to the case of \(|\vec b|\) close to (and >) \(b_*\) resulting in densely packed vortices: the lattice step \(\rightarrow 0\) as \(|\vec b|\rightarrow b_*\) and \(\rightarrow \infty \) as \(|\vec b|\rightarrow \infty \). Hence, the existence of vortex lattices at \(|\vec b|\gg b_*\) is closely related to the problem of existence of vortices (elementary excitations).
For existence of Abrikosov lattices in the Ginzburg–Landau and Chern–Simons equations, see [33, 37].
For the quantum corrections, problem (c), it would be natural to start with a BCS-type, or quasi-free, version of the WS model and a Bogoliubov-type expansion of a regularized (say, lattice) WS model around it, see, e.g. [14, 15].
The paper is organized as follows: In Sect. 2, we formulate the problem and describe results. In Sects. 3–4, we fix the gauge and pass from the original Yang–Mills fields to the W and Z (massive boson) and A (photon) fields and rescale the resulting equations. The proofs of the main results are given in Sect. 5 (Theorem 2.1), Sects. 6–10 (Theorem 2.2) and Sect. 11 (Theorem 2.3). In “Appendix A”, we discuss various covariant derivatives used in the main text, and in “Appendix B”, we review the time-dependent YMH equations and derive the expression for the conserved energy as well as the YMH equations used in the main text. Furthermore, there we write the YMH equations in coordinate form and derive a convenient expression for the energy functional. In “Appendices D.1–D.2”, we derive the WS equations in 3D and 2D, respectively, in terms of the fields W, Z, A and \(\varphi \). In the remaining appendices, we carry out technical computations.
Throughout the paper, we use the Einstein convention of summing over repeated indices.
2 No-particle and Vacuum Sectors of the Weinberg–Salam Model
The no-particle sector of the Weinberg–Salam (WS) model involves the interacting Higgs and SU(2) and U(1) gauge fields, \(\Phi \) and V and X, while the particle fields are set to zero. The field \(\Phi \) is a vector function defined on the Minkowski space-time \(\mathbb {R}^{3+1}\) with values in \(\mathbb {C}^2\), and the fields V and X are one-forms on \(\mathbb {R}^{3+1}\) with values in the algebras \(\mathfrak {su}(2)\) and \(\mathfrak {u}(1)\), respectively. We write
where g and \(g'\) are coupling constants, which is a one-form with values in \(\mathfrak {u}(2)\). We consider SU(2) as a matrix group and U(1) as multiples of the identity matrix \({\textbf{1}}\) acting on \(\mathbb {C}^2\).
These fields satisfy the WS equations, which are the Euler–Lagrange equations for the action functional
where M is a bounded domain in spacetime \(\mathbb {R}^{3+1}\) equipped with the Minkowski metric \(\eta \) of signature \((-,+,+,+)\), \(\lambda \) and \(\varphi _0\) are positive parameters, and the remaining symbols are defined as follows:
\(\nabla _{Q}\) is the covariant derivative mapping \(\mathbb {C}^2\)-valued functions (sections) into \(\mathbb {C}^2\)-valued one-forms defined as
with d, the exterior derivative; \(F_{Q}\) is the curvature 2-form of the connection one-form Q, given by
where [A, B] is defined in local coordinates \(\{x^i\}\) as
with \(A= A_i dx^i\) and \(B=B_i dx^i\);
\(\Omega ^p_U\equiv U\otimes \Omega ^p\) denotes the space of U-valued p-forms with the Minkowski, indefinite inner product,
where \(A=A_{\alpha }(x) dx^{\alpha }\) and \(B=B_{\alpha }(x) dx^{\alpha }\) are U-valued p-forms, \(\alpha \) is a p-form index and \(\langle \cdot , \cdot \rangle _U\) is the standard, positive definite inner product on U with the indices raised and lowered with help of the Minkowski metric \(\eta \) on M. For instance, for \(U = \mathfrak {s u}(2)\), the inner product is given by
Solutions of the no-particle WS equations solve also the full WS system as well as that for the standard model of the particle physics.
The vacuum sector of the Weinberg–Salam (WS) model consists of static, no-particle solutions.
The static Higgs and SU(2) and U(1) gauge fields \(\Phi \), V and X are now defined on the physical space \(\mathbb {R}^3\) with the same respective values as in the time-dependent case. Geometrically, V, X and Q can be thought of as connection one-forms on the trivial bundles \(\mathbb {R}^3\times SU(2), \mathbb {R}^3\times U(1)\) and \(\mathbb {R}^3\times U(2)\).
The fields \(\Phi \), V and X satisfy the static no-particle WS equations, which are the Euler–Lagrange equations for the static WS energy functional originating in (2.1)Footnote 2
where N is a bounded domain in \(\mathbb {R}^3\) with appropriate boundary conditions (specified in (2.17) below) and \(\Vert \cdot \Vert _{\Omega ^p_{U}}\) is the standard norm on the space \(\Omega ^p_{U}:= U \otimes \Omega ^p\) of U-valued p-forms at \(x\in N\) (e.g. for \(B=B_i(x)dx^i\in \Omega ^1_{U}\), we have \(\Vert B\Vert _{\Omega ^1_{U}}:=(\sum _i\Vert B_i(x)\Vert ^2_U)^{1/2}\) with the usual Euclidean metric and with the indices running through 1, 2, 3), while now, (2.5) (and (2.6)) become the usual inner products. The symbols \(\nabla _{Q}\) and \(F_{Q}\) are as defined above but without the time component.
Since \(Q=gV+g'X\) and X has the values in the centre, u(1), of the algebra u(2), we have \(F_{Q} = g F_{V} + g' F_{X}\), where
are the curvatures of the connections V and XFootnote 3 and \(\Vert F_{Q}\Vert ^2_{\Omega ^2_{\mathfrak {u}(2)}}=\Vert F_{V}\Vert ^2_{\Omega ^2_{\mathfrak {u}(2)}} +\Vert F_{X}\Vert ^2_{\Omega ^2_{\mathfrak {u}(1)}}\).
We introduce the covariant derivative \(d_{Q}\) mapping \(\mathfrak {u}(2)\)-valued k-forms into \(\mathfrak {u}(2)\)-valued \((k+1)\)-forms, \(k\ge 1\), as
This formula originates in the equation \((\delta _Q F_Q)( B)=d_Q B\), where \(\delta _Q\) is the Gâteaux derivative with respect to Q. For 0-forms, we set \(d_Q=\nabla _Q\).
The Euler–Lagrange equations for energy functional (2.7) are given by (see “Appendix B”Footnote 4)
where \(\nabla _{Q}^*\) is the adjoint of \(\nabla _{Q}\) and maps \(\mathbb {C}^2\)-valued one-forms into \(\mathbb {C}^2\)-valued functions, \(d_{Q}^*\) is the adjoint of \(d_{Q}\) and maps \(\mathfrak {u}(2)\)-valued two-forms into \(\mathfrak {u}(2)\)-valued one-forms, and \(J(Q, \Phi )\) is the electroweak current, which is the \(\mathfrak {u}(2)\)-valued one-form given by
where summing over repeated indices is understood, \(\tau _0:= \textbf{1}\) and \(\tau _a, a=1, 2, 3,\) are the Pauli matrices,
(The Pauli matrices, multiplied by \(-i/2\), form an orthonormal basis in \(\mathfrak {su}(2)\) with the inner product \(\langle g,h\rangle _{\mathfrak {su}(2)}:= 2{\text {Tr}}(g^* h) = -2{\text {Tr}}(gh)\).) We call system (2.10)–(2.11) the (static) WS equations.
The energy functional (2.7) and Euler–Lagrange equations (2.10)–(2.11) are invariant under the group of rigid motions and the gauge transformations (gauge symmetry)
where \(\gamma =\gamma (x)=h_1(x)h_2(x)\), with \( \,h_1(x)\in SU(2),\ h_2(x)\in U(1),\) and
The physical quantities here are (a) the Higgs field density \(\Vert \Phi \Vert \), (b) the magnetic field \({\text {Tr}}F_{Q}\) and (c) the YM current \(J(Q, \Phi )\). It is easy to check that these quantities are gauge invariant. We say that a solution \((Q, \Phi )\) to (2.10)–(2.11) is homogeneous if \(\Vert \Phi \Vert \), \({\text {Tr}}F_{Q}\) and \(J(Q, \Phi )\) are independent of x. (We say that \({\text {Tr}}F_{Q}\) is independent of x, if it is a multiple of a constant 2-form, see (2.16).) Otherwise, we say that \((Q, \Phi )\) is inhomogeneous.
Furthermore, we say that a solution \((Q, \Phi )\) is gauge-translation invariant if it is invariant under translations up to gauge transformations.
Clearly, a solution \((Q, \Phi )\) which is gauge-translation invariant is also homogeneous. The converse in general might not be true.
We are interested in the vacuum solutions of the WS equations with a non-vanishing average magnetic field,
i.e. solutions minimizing the WS energy locally under the constraint above. In physical field theories, one expects the vacua to have the maximal available symmetry. Consequently, we first consider gauge-translation invariant solutions with a fixed (constant) magnetic field.
For \(\vec b=(b_1, b_2, b_3)\ne 0\), Eqs. (2.10)–(2.11) have the gauge-translation invariant solution given (up to a gauge symmetry) by
where \(\Phi ^{\vec b}\) is a constant field and \(Q^{\vec b}\) is a connection with a constant magnetic field
with the sum taken over all cyclic permutations of (1, 2, 3), and \(e:= \frac{g g'}{\sqrt{g^2+{g'}^2}}.\) (e turns out to be the electron charge.) We specify this solution at the end of this section in Eqs. (2.23) and (2.24). (For it, \(Q^{\vec b}\) solves the YM equation \(d_{Q}^* F_Q = 0\).)
Fixing the average magnetic field breaks the full special Euclidean symmetry (i.e. translations and rotations but not reflections) but maintains the special Euclidean symmetry in the plane orthogonal to \(\vec b\) and the translational symmetry along \(\vec b\). Looking for the simplest non-trivial solutions, we consider solutions which do not depend on the coordinate along \(\vec b\) and look for solutions spontaneously breaking the transversal translational symmetry.
With the notation \(b=|\vec b|\), we show that for appropriate perturbations:
-
(i)
(2.15) is linearly stable for \(b<b_*\) and unstable for \(b>b_*\), where \(b_*:= g^2\varphi _0^2/2e\);
-
(ii)
At \(b=b_*\), a new inhomogeneous solution (breaking the gauge-translational invariance) bifurcates, and this solution has the discrete translational symmetry of a lattice in the plane orthogonal to \(\vec b\) and has lower energy per unit area;
-
(iii)
The lattice shape minimizing the energy per unit area approaches the hexagonal lattice as b approaches \(b_*\).
To formulate these results precisely, we introduce some definitions. Since we consider solutions which do not depend on the coordinate along \(\vec b\), we can restrict our analysis to the plane \(\perp \vec b\). We choose the \(x^3\)-axis along \(\vec b\) and identify the plane \(\perp \vec b\) with \(\mathbb {R}^2\).
We fix a lattice \(\mathcal {L}\) in \(\mathbb {R}^2\) and say a triple \(( \Phi (x), V(x), X(x))\) is \(\mathcal {L}\)-gauge-periodic, or, \(\mathcal {L}\)-equivariant, if and only if it satisfies the equation
for some \(\gamma _s \in C^1(\mathbb {R}^2, SU(2)\times U(1))\). Here \(T^\textrm{gauge}_{\gamma }\) is given by (2.14) and \(T^\textrm{trans}_s\) is the group of translations, \(T^\textrm{trans}_s f(x)=f(x+s)\). (When \(\mathcal {L}\) is clear, we omit it from the definition above.)
We denote by \(\mathcal {H}^s_{\mathcal {L}},\ s\in \mathbb {N},\) the Sobolev space of \(\mathcal {L}\)-equivariant triples \(U\equiv (V, X, \Phi )\) on \(\mathbb {R}^2\), with the norm
where \(\Omega \) is an arbitrary fundamental domain of \(\mathcal {L}\), \(d_Q^k\) is the k-th iterate of the covariant derivative \(d_Q\) and \( \Vert \cdot \Vert \) is the (fibre) norm in the space \(\Omega ^{k+1}_{\mathfrak {s u}(2)}\times \Omega ^{k+1}_{\mathfrak {u}(1)}\times \Omega ^k_{\mathbb {C}^2}\), see (2.5) (and (2.6)), and with corresponding the inner product. Note that \(L^2_{\mathcal {L}}=\mathcal {H}^0_{\mathcal {L}}\).
The resulting Sobolev spaces \(\mathcal {H}_{\mathcal {L}}^{s}\) are independent (up to isomorphism) of the choice of the fundamental domain, \(\Omega \). All Sobolev embedding theorems are valid for \(\mathcal {H}_{\mathcal {L}}^s\). They can be proven by passing to a vector bundle over the torus \(\mathbb {R}^2/\mathcal {L}\) and then to the local charts and then using standard Sobolev embedding theorems. By the Sobolev embedding \(\mathcal {H}_{\mathcal {L}}^{1}\subset L_{\mathcal {L}}^{p},\ p<\infty \), and the definitions 2.7 and 2.18,
(recall that \(\Omega \subset \mathbb {R}^2\)) and is independent of a choice of \(\Omega \).
We say a solution \(U_*:= (V_*, X_*, \Phi _*)\) of the WS system (2.10)–(2.11) is energetically stable if and only if it is a local minimum of the WS energy \(E_{N} \), in the sense that the spectrum of the \(L^2\)-Hessian of \(E_{N} \) at \(U_*\) on \(L^2_{\mathcal {L}}\) (which is real) is non-negative. \(U_*\) is said to be unstable if it is a saddle point of \(E_{N} \) (so that the spectrum of its Hessian has a negative part).
For an \(\mathcal {L}\)-equivariant triple U and a fundamental domain \(\Omega \) of \(\mathcal {L}\), we define the energy per fundamental cell by
where \(|\Omega |\) denotes the area of \(\Omega \). This energy is independent of the choice of \(\Omega \).
In what follows, \(\Omega \) denotes an arbitrary (but fixed throughout) fundamental domain of \(\mathcal {L}\), and \(|\mathcal {L}|\), the area of a fundamental cell of \(\mathcal {L}\), which is independent of the choice of the cell \(\Omega \) and is called the covolume of \(\mathcal {L}\).
Let \(M_W:= \frac{1}{\sqrt{2}} g\varphi _0\), \(M_Z:= \frac{1}{\sqrt{2}\cos \theta } g\varphi _0\) and \(M_H:= \sqrt{2}\lambda \varphi _0\), where \(\theta \) is the Weinberg angle defined by \( \cos \theta = \frac{g }{\sqrt{g^2+{g'}^2}}\). These are the masses of the W, Z and Higgs bosons, respectively (this nomenclature will be explained in the discussion following Eq. (3.10)). Finally, let
With the above definitions, we will prove the following:
Theorem 2.1
The gauge-translational invariant solution (2.15) is energetically stable for \(b<b_*\) and unstable for \(b>b_*\).
Theorem 2.2
Let \(\mathcal {L}\) be a lattice satisfying \(0 < 1-\frac{M_W^2}{2\pi }|\mathcal {L}| \ll 1\) and assume that \(M_Z < M_H\).Footnote 5 Then there exist \(\delta >0\) such that the following holds:
-
(a)
Equations (2.10)–(2.11) have an inhomogeneous solution \(U_{\mathcal {L}}\in \mathcal {H}^2_{\mathcal {L}}\) in the \(\delta \)-ball \(B_{\mathcal {H}_{\mathcal {L}}^2}( U_*^{\vec {b}}; \delta )\) in \(\mathcal {H}_{\mathcal {L}}^2\) around the homogeneous solution (2.15);
-
(b)
\(U_{\mathcal {L}}\) is the unique, up to gauge symmetry transformation, inhomogeneous solution in the \(\delta \)-ball \(B_{\mathcal {H}_{\mathcal {L}}^2}( U_*^{\vec {b}}; \delta )\);
-
(c)
\(U_{\mathcal {L}}\) has energy per unit area less than vacuum solution (2.15): \(E^{\mathcal {L}}(U_{\mathcal {L}}) < E^{\mathcal {L}}(U^b_*)\).
The solutions described in this theorem can be reinterpreted geometrically as representing sections (\( \Phi (x)\)) and connections ((V(x), X(x))) on a U(2) vector bundle over a torus (cf. [20]). However, a vector bundle over a torus is topologically equivalent to a direct sum of line bundles. In our case, this equivalence follows from Eqs. (3.5)–(3.7).
For the next result, we use the topology on the space of (normalized) lattices induced by the standard parameterization of lattices defined as follows. Identifying \(\mathbb {R}^2\) with \(\mathbb {C}\) via \((x_1,x_2) \leftrightarrow x_1 + i x_2\) and viewing a lattice \(\mathcal {L}\subset \mathbb {R}^2\) as a subset of \(\mathbb {C}\) and using a translation and a rotation, any lattice \(\mathcal {L}\subset \mathbb {C}\) can be reduced to the form \(\mathcal {L}=r\mathcal {L}_\tau \), where \(r>0\), \(\mathcal {L}_\tau :=\mathbb {Z}+\tau \mathbb {Z}\) and \(\tau \in \mathbb {H}:=\{\tau \in \mathbb {C}: {\text {Im}}\tau > 0 \}\). Furthermore, any two \(\tau \)’s produce the same lattice iff they are related by an element the modular group \(SL(2, \mathbb {Z})\) acting on the Poincaré half-plane \(\mathbb {H}\) (see, e.g. [4]). Hence, it suffices to restrict \(\tau \) to the fundamental domain of \(SL(2, \mathbb {Z})\),
Theorem 2.3
For \(M_Z < M_H\), the lattice \(\mathcal {L}_*\) minimizing the average energy, \(E^{\mathcal {L}}(U_{\mathcal {L}})\), approaches the hexagonal lattice \(\mathcal {L}_{\textrm{hex}}\) as \(b \rightarrow b_*\) in the sense that the shape parameter \(\tau _*\) of the lattice \(\mathcal {L}_*\) approaches \(\tau _{\textrm{hex}} = e^{i\pi /3}\) in \(\mathbb {C}\).
Now, we construct explicitly the solution (2.15). We define
where \(A^{\vec b}(x)\) be a (U(1)-) magnetic potential of the constant magnetic field \(d A^{\vec b}=\omega _{\vec b}\) and \(\theta \) is Weinberg’s angle, given by \(\tan \theta =g'/g\). We have
Lemma 2.4
The pair \((Q^{\vec b}, \Phi _0)\) satisfies (2.10)–(2.11). Moreover, the connection \(Q^{\vec b}\) has the constant curvature
Proof
(2.24) follows easily from \(d A^{\vec b}=\omega _{\vec b}\). To check that \((Q^{\vec b}, \Phi _0)\) satisfies (2.10)–(2.11), we observe that \(d_{Q^{\vec b}}\Phi _0 = (g V^{\vec b}+ g' X^{\vec b})\Phi _0 = ( g A^{\vec b} \sin \theta \tau _{3}+ g' A^{\vec b}\cos \theta \tau _{0})\Phi _0 = eA^{\vec b} (\tau _{3}+ \tau _{0})\Phi _0\). Since \( (\tau _3 + \tau _0) \Phi _0 = 0\), this implies \(\nabla _{Q}\Phi _0=0\). This gives (2.10) and reduces (2.11) to \(d_{Q^{\vec b}}^* F_{Q^{\vec b}} = 0\), which follows easily from (2.24). \(\square \)
Our approach is based on a careful examination of the linearization of the WS equations on the homogeneous vacuum. The spectrum of the linearized problem determines the domains of the linear, or energetic, stability and the transition threshold. In the instability domain, we apply an equivariant bifurcation theory. This gives Theorem 2.2(a) and (b). For Theorems 2.2(c) and 2.3, we carefully study the asymptotic behaviour of the energy functions for small values of the bifurcation parameter.
3 Gauge Fixing and W and Z Bosons
In this section, we choose a particular gauge and pass from the fields (one-forms) V and X to more suitable gauge fields. We eliminate a part of the gauge freedom by assuming that the Higgs field \(\Phi \) is of the form
with \(\varphi \) real. (This can be done using only the SU(2) part of the gauge group.) Then
where, recall, \(\tau _a,\ a=1, 2, 3,\) are the Pauli matrices generating the Lie algebra su(2), and \(\tau _0=\textbf{1}\). However, there is one linear combination of \(\tau _a\)’s (unique up to a scalar multiple) which annihilates \(\Phi \):
Thus, for the gauge \(\Phi =(0, \varphi )\) the symmetries generated by \(\tau _1, \tau _2, \tau _3 - \tau _0\) are broken and the U(1) symmetry generated by \(\tau _3 + \tau _0\) remains unbroken. The unbroken gauge symmetry is given by transformations (2.14) with
where \(\gamma \in C^1(\mathbb {R}^3, \mathbb {R})\).
Continuing in the gauge \(\Phi =(0, \varphi )\) and writing \(V= -\frac{i}{2} \tau _a V^a\)Footnote 6 and \(X = -\frac{i}{2} \tau _0 X^0\), where \(X^0\) and \(V^a, a=1, 2, 3,\) are real fields (since V takes values in su(2) and therefore \(V^*=-V\)), we pass to the new fields corresponding to the broken and unbroken generators, \( \tau _3 - \tau _0\) and \(\tau _3 + \tau _0\), respectively:
where recall, \(\theta \) is Weinberg’s angle, defined by \(\tan \theta =g'/g\). Note that Z and A are real fields. Moreover, it is convenient to pass from the remaining two components, \(V^1, V^2\), of V to a single complex field
The gauge invariance of the original field equations with the unbroken gauge symmetry given by transformations (2.14) with (3.4) leads to the invariance under following gauge transformations:
for \( \gamma \in C^1(\mathbb {R}^3, \mathbb {R})\), where \(e^{i \gamma } W=\sum e^{i \gamma } W_i d x^i\) for \(W=\sum W_i d x^i\), e is the electron charge. Here, we replaced \(\Phi := (0, \varphi )\) by \(\varphi \).
The WS energy in terms of W, Z, A and \(\varphi \) fields in 3D is given in (D.1), “Appendix D”. The WS equations in terms of W, Z, A and \(\varphi \) in 3D can be found by taking variational derivatives of this energy w.r.t different fields.
In terms of W, A, Z and \(\varphi \) fields, the vacua (2.15) of the Weinberg–Salam model become (up to a gauge symmetry):
where recall, \(A^b(x)\) is a magnetic potential for the constant magnetic field of strength b in the \(x^3\)-direction, \(d A^b(x)=b dx_1\wedge dx_2\), and \(\varphi _0\) is a positive constant from (2.7). We choose the gauge so that \(A^b(x)\) is of the form
We will show that for a large magnetic field b, these homogeneous vacua become unstable and new, inhomogeneous vacua emerge from them. This is a bifurcation problem from the branch of gauge-translationally invariant (homogeneous) solutions, (3.8).
Since we consider the WS system with the fields independent of the third dimension \(x^3\), i.e. in \(\mathbb {R}^2\), we can choose the gauge with \(V_3 = X_3=0\) (and hence \(W_3 = A_3=Z_3=0\)).
Also, we will work in a fixed coordinate system, \(\{x^i\}_{i=1}^2\), and write the fields as \(W = W_{i} dx^i,\) \( Z = Z_{i} dx^i\) and \(A = A_{i} dx^i\). For ease of comparing our arguments with earlier results, and given that we use the standard Euclidean metric in \(\mathbb {R}^2\), we identify (complex) one-forms W, Z and A with the (complex) vector fields \((W_1, W_2), (Z_1, Z_2)\) and \((A_1, A_2)\). With this, we show in “Appendix D.2” that in this case, WS energy functional (2.7) can be written as
where \(\kappa := \frac{g^2}{2\cos ^2\theta }\), \({\text {curl}}_{U} W:= (\nabla _U)_1 W_2 - (\nabla _U)_2 W_1\), \((\nabla _U)_i:= \partial _i - i U_i \), \(\partial _i \equiv \partial _{x^i}\) (for a \(\mathfrak {u}(1)-\)valued vector field U), \(\xi \times \eta :=\xi _1 \eta _2- \xi _2 \eta _1\) and \({\text {curl}}V^3:= \partial _1 V^3_2 - \partial _2 V^3_1\). It follows from (3.5) that \(V^3= Z \cos \theta + A \sin \theta \).
Expanding (3.10) in \(\varphi \) around \(\varphi _0\), we see that the W, Z and \(\phi \) (Higgs) fields have the masses \(M_W:= \frac{1}{\sqrt{2}} g \varphi _0\), \(M_Z:= \frac{1}{\sqrt{2}\cos \theta }g \varphi _0\) and \(M_H = \sqrt{2}\lambda \varphi _0\), respectively.
Using the relation \(\xi \times \eta = J\xi \cdot \eta \), where \(\cdot \) denotes the Euclidean scalar product in \(\mathbb {R}^2\) and J is the symplectic matrix,
we find the Euler–Lagrange equations for (3.10), which give the WS system (2.10)–(2.11) in 2D in terms of the fields W, A, Z and \(\varphi \)
where, recall, \(\kappa = \frac{g^2}{2\cos ^2\theta }\), \(V^3= Z \cos \theta + A \sin \theta \) and \(\Delta \) is the standard Laplacian. (For a derivation of (3.12)–(3.15) from (3.10), see “Appendix D.2” and also [26, 43].) Of course, (3.12)–(3.15) can also be derived directly from WS system (2.10)–(2.11).
In terms of the \((W, A, Z, \varphi )\) fields, the lattice gauge—periodicity (2.17) is expressed as:
for all \(s\in \mathcal {L},\) where \(\gamma _s \in C^1(\mathbb {R}^2,\mathbb {R})\) for all \(s\in \mathcal {L}\), \( {\tilde{T}}^\textrm{gauge}_\gamma \) given in (3.7) and \(T^\textrm{trans}_s\) is the group of translations, \(T^\textrm{trans}_s f(x)=f(x+s)\). We say that \((W,A, Z, \varphi )\) satisfying (3.16) is an \(\mathcal {L}\)-equivariant state. By evaluating the effect of translation by \(s+t\) in two different ways, we see that the family of functions \(\gamma _s\) has the co-cycle propertyFootnote 7
Since \(T^\textrm{trans}_s\) is an Abelian group, the co-cycle condition (3.17) implies that, for any basis \(\{j_1, j_2\}\) in \(\mathcal {L}\), the quantity
is independent of x and of the choice of the basis \(\{j_1, j_2\}\), and is an integer. This topological invariant is equal to the degree of the corresponding line bundle.
Using Stokes’ theorem, one can show, for any A satisfying (3.16)–(3.18), that the magnetic flux through any fundamental domain \(\Omega \) of the lattice \(\mathcal {L}\) is quantized:
where e is defined after (3.7) and \(n=c(\gamma _s)\in \mathbb {Z}\) defined in (3.18). The left-hand side of (3.19) is called the Chern number of the line bundle corresponding to \(\gamma _s\). (We note that n is independent of the choice of \(\Omega \).)
The vacuum state (3.8) is \(\mathcal {L}\)-equivariant if and only if the magnetic field b is given by the relation
where, by definition, \(|\mathcal {L}|= |\Omega |\) for any fundamental cell \(\Omega \). In particular, b is quantized. For such b, the vector field \(\frac{1}{e} A^b\) satisfies (3.19).
Furthermore, due to the reflection symmetry of the problem, we may assume that \(b \ge 0\). Clearly, we have:
Lemma 3.1
Equations (2.10)–(2.11) for \(\mathcal {L}\)-equivariant fields (2.17) in the gauge \(\Phi =(0, \varphi )\) are equivalent to Equations (3.12)–(3.15) for \(\mathcal {L}\)-equivariant fields (3.16), with the equivalence realized by the transformation (3.5)–(3.6).
Finally, we use the invariance of (3.12)–(3.15) under the gauge transformation (3.7) to choose a convenient gauge for the fields W(x) and A(x). We say that the fields \((W,A, Z, \varphi )\) and \((W',A', Z', \varphi ')\) are gauge-equivalent if there is \(\gamma \in C^1(\mathbb {R}^2, \mathbb {R})\) such that
Clearly, if \((W, A, Z, \varphi )\) and \((W',A', Z', \varphi ')\) are gauge-equivalent, then \((W, A, Z, \varphi )\) solves (3.12)–(3.15) if and only if \((W',A',Z',\varphi ')\) solves (3.12)–(3.15). The following proposition was first used in [31] and proven in [45] (an alternate proof is given in “Appendix A” of [46]):
Proposition 3.2
Let \((W',A', Z', \varphi ')\) be an \(\mathcal {L}\)-equivariant state and let b be given by (3.20). Then, there is a \(\mathcal {L}\)-equivariant state \((W,A, Z, \varphi )\), gauge-equivalent to \((W',A', Z', \varphi ')\), which satisfies (3.16), with \(\chi _s(x)=\frac{eb}{2}\,s\wedge x + k_s\), i.e. such that, \(\forall s\in \mathcal {L}\),
Here \(k_s\) satisfies the condition \(k_{s+t}-k_s-k_t-\frac{eb}{2}\,s\wedge t\in 2\pi \mathbb {Z}\), for all \(s,t\in \mathcal {L}\), the matrix J is given in (3.11).
Note that with the gauge (3.23), the homogeneous vacua (3.8) satisfy (3.21)–(3.24).
Our goal is to prove the instability of the vacuum state (3.8) and the existence of \(\mathcal {L}-\)equivariant (in the sense of (3.16)) solutions to transformed WS system (3.12)–(3.15) having the properties described in Theorems 2.2 and 2.3.
4 Rescaling
In this section, we rescale transformed WS system (3.12)–(3.15) to keep the lattice size fixed. Specifically, we define the rescaled fields \((w,a,z,\phi )\) to be
where in the second equality (4.2), we used (3.20). Clearly, \((W(x), A(x), Z(x),\varphi (x))\) is \(\mathcal {L}\)-equivariant if and only if \((w(x),a(x), z(x),\phi (x))\) is \(\mathcal {L}'\)-equivariant, where
Now, the rescaled lattice \(\mathcal {L}'\) is independent of b and the size of a fundamental domain, \(\Omega '\), of \(\mathcal {L}'\) is fixed as \(|\Omega '|=2\pi \).
Plugging the rescaled fields into (3.12)–(3.15) gives the rescaled Weinberg–Salem equations:
where \(\xi := r\varphi _0\) (with r given in (4.2)), \(\nu := g(a\sin \theta + z\cos \theta )\) and, recall, \({\text {curl}}_{q} w = \nabla _1 w_2 - \nabla _2 w_1\), \(\nabla _i:= \partial _i - iq_i \), \(\partial _i \equiv \partial _{x^i}\) (for a \(\mathfrak {u}(1)-\)valued vector-field iq) and, recall, \(\overline{w} \times w:=\overline{w}_1 w_2- \overline{w}_2 w_1\). We define the rescaled energy by:
with \((W, A, Z,\varphi )\) related to \((w,a, z,\phi )\) by (4.1) and \(E_{\Omega } (W, A, Z, \varphi )\) given in (3.10). Explicitly, we have
We note that after rescaling, the average magnetic flux per fundamental domain becomes n/e and the vacuum solution (3.8),
where \(a^n(x)\equiv A^n(x) =\frac{n}{2} J x\),. Furthermore, (3.16) and Proposition 3.2 imply that \((w, a, z, \phi )\) satisfy
where \(c_s\) satisfies the condition \(c_{s+t}-c_s-c_t-\frac{n}{2} s\times t\in 2\pi \mathbb {Z}\), for all \(s,t\in \mathcal {L}'\).
Finally, the Sobolev spaces here, denoted again by \(\mathcal {H}_{\mathcal {L}'}^{s}\), can be obtained by rescaling the Sobolev spaces defined above or defined directly, again as above, see (2.18) and the text around it. Similarly to (2.19), by a Sobolev embedding theorem, the rescaled energy is finite,
and is independent of a choice of \(\Omega '\).
5 The Linearized Problem
In this section, we prove Theorem 2.1, describing the stability properties of the vacuum (3.8). Equivalently, we will investigate the energetic stability of the rescaled vacuum solution (4.9) of the rescaled WS equations (4.3)–(4.6).
Let \( m:= (w, a, z,\phi )\) and denote by \(G(b, m) \equiv G(m)\) the map \(G: \mathcal {H}_{\mathcal {L}'}^2 \rightarrow \mathbb {C}^7\) given by the left-hand side of (4.3)–(4.6), written explicitly as
where, recall, J is the symplectic matrix given in (3.11), \(\xi := r\varphi _0\) (with r given in (4.2)), \(\nu := g(a\sin \theta + z\cos \theta )\), \(\Delta \) is the standard Laplacian and the parameter b enters through periodicity conditions (4.10)–(4.13). Now, the WS system can be written as:
Recall the definition of stability given above Eq. (2.20). To apply it to the rescaled WS Eqs. (4.3)–(4.6), we observe that the map G is the \(L^2\)-gradient, \({\text {grad}}_{L^2}\mathcal {E}_{\Omega '}\), of the energy \(\mathcal {E}_{\Omega '}\), see (4.8), considered as a functional of \(u=(w, a, z, \phi )\). Namely, \(\langle G(m), \xi \rangle _{L^2}=\delta \mathcal {E}_{\Omega '}(m) \xi \), where \(\delta \mathcal {E}_{\Omega '}(m)\) is the Gâteau derivative
of \(E_{\Omega '}\) at m, defined on the space of variations \(\mathcal {Y}\) tangent to the space of \(L^2_{\textrm{loc}}\) functions of the form \((w, a, z, \phi )\) satisfying the gauge—periodicity conditions (4.10)–(4.13):
Here \(L_n^2, L^2_0\) and \(L^2\) are given by
Since \(G(m)={\text {grad}}_{L^2}\mathcal {E}_{\Omega '}(m)\), the \(L^2\)-Hessian for \(\mathcal {E}_{\Omega '}\) and m is the formally symmetric operator
Denote the \(L^2\)-Hessian at the vacuum solution \(m^{n, r}\) (see (4.9)) by
As seen from its explicit form given below, the operator \(L_{n, \mu }\), acting on the space \(\mathcal {Y}\), is self-adjoint, and therefore, its spectrum is real.
Thus, applied to the rescaled WS equations (4.3)–(4.6), the definition of stability can be rephrased as:
the vacuum solution \(m^{n, r}\) is energetically stable (respectively, unstable) if and only if \(\inf {\text {spec}}(L_{n, \mu })\ge 0\) (respectively, \(\inf {\text {spec}}(L_{n, \mu })< 0\)).
We consider the operator \(L_{n, \mu }\) on the space \(\mathcal {Y}\), with the domain
where \(\mathcal {H}_n^s\), \(\mathcal {H}_0^s\) and \(\mathcal {H}^s\) are the respective Sobolev spaces for the \(L^2\)-spaces (5.9)–(5.11), with inner products given (for \(s\in \mathbb {Z}_{\ge 0}\)) by
where \(w^\#=(w_1^\#, w_2^\#), a^\#=(a_1^\#, a_2^\#)\), \(\Omega '\) is an arbitrary fundamental domain of the lattice \(\mathcal {L}'\) and \(\gamma \) is a multi-index. The \(\mathcal {L}'\)-equivariance of the above functions implies that these inner products do not depend on the choice of fundamental domain \(\Omega '\).
We compute the linear operator \(L_{n, \mu }\) explicitly. In what follows we use the notation \(\oplus _{j} A_j\) for diagonal operator-matrices with the operators \(A_j\) on the diagonal.
Passing from the parameter \(\xi = r\varphi _0\), or r, to the parameter \( \mu := g^2 \xi ^2/2\) and using that \(\nu \big |_{a=a^n/e, z =0}=\frac{1}{e} a^n g\sin \theta = a^n \), we find
where, recall, \({\text {curl}}_{q} w = (\nabla _q)_1 w_2 - (\nabla _q)_2 w_1\), \((\nabla _q)_i:= \partial _i - iq_i\), \(\partial _i \equiv \partial _{x^i}\). (Note that the matrix iJ is self-adjoint.)
The gauge invariance of Eq. (5.6) and the partial symmetry breaking of vacuum solution (4.9) imply that \(L_{n,\mu =n}\) has the gauge zero mode:
For a null vector \(\Gamma _f\) defined in (5.21) to be in \(\mathcal {X}\), f must satisfy \({\text {div}}(\nabla f) = -\Delta f = 0\). This implies that f is a linear function, \(f(x) = c\cdot x + d\) for some \(c\in \mathbb {R}^2\) and \(d\in \mathbb {R}\), and so
In this section, we shall prove the following result implying Theorem 2.1:
Theorem 5.1
The operator \(L_{n, \mu }\) on the space \(\mathcal {X}\) has purely discrete spectrum. For \(\mu \ne n\), \(L_{\mu ,n}\) has the multiplicity 2 eigenvalue 0 with the eigenfuctions \((0, e_i,0,0)\), \(i=1, 2, e_1=(1, 0), e_2=(0, 1)\) (see (5.22)).
Furthermore, the smallest non-zero eigenvalue given by \(\mu - n\), having multiplicity n. For \(\mu =n\), the eigenvalue 0 has the multiplicity \(n+2\).
Theorem 5.1 follows from Propositions 5.2 and 5.3 given below. \(\square \)
Proposition 5.2
The operators \(H_2(\mu )\), \(H_3(\mu )\) and \(H_4(\mu )\) have purely discrete spectra. Furthermore, \(H_3(\mu )\) and \(H_4(\mu )\) are strictly positive and \(H_2(\mu )\) is non-negative and has the null space \(\{(0, c, 0, 0): c\in \mathbb {R}^2\}\) of dimension 2.
Proof
The strict positivity of \(H_3(\mu )\) and \(H_4(\mu )\) and the non-negativity of \(H_2(\mu )\) are obvious. The discreteness of the spectra and the form of the null space of \(H_2(\mu )\) follow from the discreteness of the spectrum of the Laplacian on compact domains and the identity \({\text {curl}}^*{\text {curl}}v = -\Delta v\) when \({\text {div}}(v)=0\). To compute the null space of \(H_2(\mu )\), we observe that the solutions of the equations \(\Delta v=0\) and \({\text {div}}(v)=0\) are constant vectors in \(\mathbb {R}^2\). \(\square \)
Let \(\nabla _{q}:= \nabla - iq = ((\nabla _{ q})_1, (\nabla _{ q})_2)\), \((\nabla _{ q})_j:= \partial _j - iq_j\), and \(\Delta _q:=\nabla _{q}^2=-\nabla _{q}^*\nabla _{q}\). We also introduce the complexified covariant derivative \({\bar{\partial }}_{ q}:= (\nabla _{ q})_1 + i(\nabla _{ q})_2.\)
We have
Proposition 5.3
(i) \(H_1(\mu )\) is a self-adjoint operator on \(\mathcal {H}^2_n\) and its spectrum is given by
where \(n:= eb|\mathcal {L}|/2\pi \).
(ii) The eigenspace of the eigenvalue \(-n+ \mu \) is n-dimensional and is spanned by functions of the formFootnote 8
and the eigenspace of the eigenvalue \(\mu \) is of the form
In the proof of this proposition, we use the following standard result whose proof, for reader’s convenience, is given in “Appendix G”:
Proposition 5.4
The operator \(-\Delta _{ a^n}\) is self-adjoint on its natural domain and its spectrum is given by:
with each eigenvalue is of the multiplicity n. Moreover,
In more detail, with \(z= (x^1+i x^2)/ \sqrt{\frac{2\pi }{{\text {Im}}\tau } }\) and \(\tau \) coming from \(\mathcal {L}'=\mathbb {Z}+\tau \mathbb {Z}\), we have
where \(V_n\) is spanned by functions of the form
Remark 5.5
Functions of the form (5.29) are determined entirely by the values of \(c_0,\ldots ,c_{n-1}\) and therefore form an n-dimensional vector space..
Proof of Proposition 5.3
First, we will show that \(\mathcal {H}^2_n = \mathcal {Y}\oplus \mathcal {Z}\) (the Hodge decomposition), where
with \({\text {div}}_{ a^n}w:= (\nabla _{ a^n})_1 w_1 + (\nabla _{ a^n})_2 w_2=-\nabla _{ a^n}^*\). We write any \(w\in \mathcal {H}^2_n\) as \(w = w_0 + \nabla _{ a^n}f\), where f solves the equation \(\Delta _{ a^n} f = {\text {div}}_{ a^n} w\) and \(w_0\) is defined by this relation. By Proposition 5.4, 0 is not in the spectrum of \(\Delta _{ a^n}\) and therefore the equation \(\Delta _{ a^n} f = {\text {div}}_{ a^n} w\) has the unique solution \(f\in \mathcal {H}^3_n\). Then, since \(\Delta _{ a^n}:={\text {div}}_{ a^n} \nabla _{ a^n}\), we have \({\text {div}}_{ a^n}w_0=0\). This proves \(\mathcal {H}^2_n = \mathcal {Y}\oplus \mathcal {Z}\).
Now, recall that the operator \(H_1(\mu )\) acts on complex vectors \(w=(w_1, w_2)\). The definition \(H_1(\mu ):= {\text {curl}}_{ a^n}^* {\text {curl}}_{ a^n} - n iJ + \mu \) and the relations \({\text {curl}}_{ a^n}^* = -J\nabla _{ a^n}\) and
yield that \((H_1(\mu )-\mu )\nabla _{ a^n}f = 0\), which proves that the \(\mu \)-eigenspace of \(H_1(\mu )\) is of the form (5.25) giving the second part of (ii).
By the above the subspace \(\mathcal {Y}\) is invariant under \(H_1(\mu )\). To compute the spectrum of the operator \(H_1(\mu )\) on the subspace \(\mathcal {Y}\), we use the definitions of \({\text {curl}}_{ a^n}\) and \({\text {curl}}_{ a^n}^*\) and recall the relation \( [(\nabla _{a^n})_1, (\nabla _{a^n})_2] = -in\) to compute
By above, we have \(H_1(\mu ) w_0 =(-\Delta _{ a^n} - 2n iJ -\mu )w_0\), for any \(w_0\in \mathcal {Y}\).
(We check using \({\text {div}}_{a^n} (-\Delta _{ a^n} - 2n iJ) w_0 = (-\Delta _{ a^n}) {\text {div}}_{ a^n} w_0 =0\), that \(H_1(\mu )\) sends \(\mathcal {Y}\) to \(\mathcal {Y}\) and hence, \(\mathcal {Y}\) is invariant under \(H_1(\mu )\).) Thus, we conclude that
Identifying one-forms with vector fields, we compute
which gives
By Proposition 5.4, we know that
and so the spectrum of \(H_1(\mu )\) on \(\mathcal {Y}\) is given by the first set on the r.h.s. of (5.23). Hence, by \(\mathcal {H}^2_n = \mathcal {Y}\oplus \mathcal {Z}\), (5.23) follows, giving (i).
Furthermore, by (5.35) and (5.36), any eigenvector \(\chi \) of \(h_{1}\) corresponding to the eigenvalue \(-n\) must be of the form
where \(\beta \) satisfies
This relation, together with the equation \({\text {Null}}(-\Delta _{ a^n} - n) = {\text {Null}}{\bar{\partial }}_{ a^n}\) (see (5.27)), implies \(\overline{\partial }_{ a^n}\beta = 0\). Since \({\text {curl}}_{ a^n}\chi = i\overline{\partial }_{ a^n}\beta \), this gives
Furthermore, by Proposition 5.4, the space of such functions is n-dimensional. Thus (after rescaling \(\omega \) by a factor of \(\sqrt{2}\)) \(\chi \) is of the form (5.24). This gives also the first part of (ii) completing the proof of the proposition. \(\square \)
We see that the operator \(H_1(\mu )\) is non-negative for the magnetic fields satisfying \(b < b_*:= g^2 \varphi _0^2/2e = M_W^2/e\) and acquires a negative eigenvalue \( \mu -n = (b_*/b - 1)\,n\) of multiplicity n as the magnetic field increases to \(b > b_*\). Theorem 2.1 follows by undoing the rescaling (4.1)–(4.2).
6 Setup of the Bifurcation Problem
We substitute \(a=\frac{1}{e} a^n+\alpha \) (with \({\text {div}}(\alpha )=0\)), \(\phi =\xi +\psi \), \(\nu = a^n+{\tilde{\nu }}\) and \(\xi = \sqrt{2\mu }/g\) into (4.3)–(4.6) and relabel the unknowns \(w, \alpha ,z,\psi \) as \(u_1, u_2, u_3, u_4\) to obtain the system
where \(u= (u_1, u_2, u_3, u_4)\equiv (w, \alpha ,z,\psi )\), the operators \(H_i\) on the left-hand side are defined in (5.17) - (5.20), and
with \( {\tilde{\nu }}:= g(\alpha \sin \theta +z\cos \theta )\), \(\xi \times \eta := \xi _1 \eta _2 - \xi _2 \eta _1\), recall, \({\text {curl}}_{q} w = \nabla _1 w_2 - \nabla _2 w_1\), \(\nabla _i:= \partial _i - iq_i\) and, recalling that \(w:\mathbb {R}^2\rightarrow \mathbb {C}^2\),
with \(M_{ij}:= i {\tilde{\nu }}_i (\nabla _{a^n})_j + i {\tilde{\nu }}_j (\nabla _{ a^n})_i + i \partial _i{\tilde{\nu }}_j + {\tilde{\nu }}_i\nu _j\).
Note that system (6.1) can be also written as \(G(m^{n, r}+u)|_{\xi = \sqrt{2\mu }/g}=0\), where G is defined in (5.1) and \(m^{n, r}:= (0,\frac{1}{e} a^n,0,\xi )\).
Applying \({\text {div}}\) to the second equation in (6.1), we find that a solution \((\mu , u)\) should satisfy \({\text {div}}J_2 (\mu , u)=0\). To prove that a solution \((\mu , u)\) satisfies this constraint, we consider the following auxiliary problem
where \(P'=\textbf{1}\otimes P_0\otimes \textbf{1}\otimes \textbf{1}\), with \(P_0\) the orthogonal projection onto the divergence-free vector fields (\(P_0=\frac{1}{-\Delta }{\text {curl}}^*{\text {curl}}\)), and, recall, \(L_{n, \mu }=\oplus H_i\) and \(J(\mu ,u)\) given in (5.16) and
We consider \(F(\mu ,u)\) as a map from the space \(\mathbb {R}_{>0} \times \mathcal {X}\), where \(\mathcal {X}:= \mathcal {H}_n^2 \oplus \mathcal {H}^2_0 \oplus \mathcal {H}^2_0 \oplus \mathcal {H}^2\), to the space \(\mathcal {Y}:= L_n^2 \oplus L_0^2 \oplus L_0^2 \oplus L^2\), and let \(F = (F_1, \dots , F_4)\), where
In what follows, we denote the partial (real) Gâteaux derivatives with respect to \(\#\) by \(\delta _\#\).
Proposition 6.1
Assume \((\mu ,u)\) is a solution of the system (6.7) satisfying the gauge—periodicity conditions (4.10)–(4.13). Then \({\text {div}}J (\mu ,u)=0\) and therefore \((\mu ,u)\) solves the original system (6.1).
Proof
We follow [46]. Assume \(\chi \in H^1_{\textrm{loc}}\) and is \(\mathcal {L}-\)periodic (we say, \(\chi \in H^1_{\textrm{per}} \)). The gauge invariance implies that
where \(E_{\Omega '}(w, a, z, \phi )\) is given in (4.8). Differentiating this equation with respect to s at \(s=0\) gives \(\delta _w E_{\Omega '}(w, a, z, \phi )(i\chi w)+\delta _a E_{\Omega '}(w, a, z, \phi )(\nabla \chi ) =0\). Now, we use the fact that the partial Gâteaux derivative with respect to w vanishes, \(\delta _w E_{\Omega '}(w, a, z, \phi )=0\), and that \({\text {curl}}\nabla \chi =0\), and integrate by parts, to obtain
(Due to conditions (4.10)–(4.13) and the \(\mathcal {L}-\)periodicity of \(\chi \), there are no boundary terms.) Since the last equation holds for any \(\chi \in H^1_{\textrm{per}} \), we conclude that \({\text {div}}J (\mu ,u)=0\). \(\square \)
In Sects. 7–8, we solve Eq. (6.7), subject to conditions (4.10)–(4.13).
In conclusion of this section, we investigate properties of the map \(F(\mu ,u)\). For \(f=(f_1, f_2, f_3, f_4)\) and \(\delta \in \mathbb {R}\), define the global transformation
Proposition 6.2
The map \(F(\mu ,u)\) defined in (6.7) has the following properties:
-
(i)
\(F:\mathbb {R}_{>0}\times \mathcal {X}\rightarrow \mathcal {Y}\) is continuously Gâteau differentiable of all orders;
-
(ii)
\(F(\mu ,0)=0\) for all \(\mu \in \mathbb {R}_{>0}\);
-
(iii)
\(\delta _u F(\mu ,0)=L_{n, \mu }\) for all \(\mu \in \mathbb {R}_{>0}\);
-
(iv)
\(F(\mu ,T_{\delta }u)=T_{\delta } F(\mu ,u)\) for all \(\delta \in \mathbb {R}\);
-
(v)
\(\langle u,F(\mu ,u)\rangle _{\mathcal {Y}} \in \mathbb {R}\) (respectively \(\langle w,F_1(\mu ,u)\rangle _{L_n^2}\in \mathbb {R}\)) for all \(u\in \mathcal {X}\) (respectively \(w\in \mathcal {H}_n^2\)).
Proof
(i) follows because F is a polynomial in the components of u and their first- and second-order (covariant) derivatives. (ii), (iii) and (iv) follow from an easy calculation (in fact, u and \(L_{n, \mu }\) were defined so that (ii) and (iii) hold). For (v), it suffices to show that \(\langle w,F_1(\mu ,u)\rangle _{L_n^2}\in \mathbb {R}\). To simplify notation, we return to the coordinates \((w,a,z,\phi ) = (w,\frac{1}{e} a^n+\alpha ,z,\frac{\sqrt{2\mu }}{g} +\psi )\). Then
The first, second and fourth terms are clearly real, while the third term is real because \(\nu \) is real and \(\overline{w}\times w\) is imaginary. \(\square \)
7 Reduction to a Finite-Dimensional Problem
In this section, we shall reduce solving Eq. (6.7), i.e. \(F(\mu , u)=0\), with \(u= (u_1, u_2, u_3, u_4)\) \( \equiv (w, \alpha , z, \psi )\) and \(F: \mathbb {R}_{>0} \times \mathcal {X}\rightarrow \mathcal {Y}\) defined in (6.7)–(6.8)), to a finite-dimensional problem. To this end, we use the Lyapunov–Schmidt reduction.
Recall that \(L_{n,\mu }\) is defined in (5.16). Let P be the orthogonal projection onto \(\mathcal {K}:= {\text {Null}}(L_{n,\mu =n})\), which can be written explicitly as
where \(H_1(n)\) is defined in (5.17), \(\gamma _n\) is any simple closed curve in \(\mathbb {C}\) containing the eigenvalue 0 and no other eigenvalues of \(H_1(n)\) (see Proposition 5.3), and \(\langle \alpha \rangle \) is the mean value of \(\alpha \) in \(\Omega '\), \(\langle \alpha \rangle := \frac{1}{|\Omega '|} \int _{\Omega '} \alpha \). \(P_1\) is a projection onto \({\text {Null}}(H_1(n))\) (spanned by vectors of the form (5.24)). Since \(H_1(n)\) is self-adjoint, \(P_1\) is an orthogonal projection (relative to the inner product of \(L^2_n\)). By Theorem 5.1, \(\mathcal {K}:= {\text {Null}}(L_{n,\mu =n})\) is \((n+1)\)-dimensional.
Let \(P^{\perp }=1-P\) be the projection onto the orthogonal complement of \(\mathcal {K}\). Applying P and \(P^{\perp }\) to the equation \(F(\mu ,u)=0\) (see (6.7)), we split it into two equations for two unknowns as
where \(v:= P u, \ u':= P^{\perp }u\).
Our next goal is to solve (7.5) for \(u'\) in terms of \(\mu \) and v. For \(n=1\), \(\mathcal {K}\) is two-dimensional and we write \(v = (v_1,v_2,v_3,v_4)\equiv (v_1,v_2, 0, 0)\in \mathcal {K}\). Let \(\mathcal {X}^{\perp }:= P^{\perp }\mathcal {X}=\mathcal {X}\ominus \mathcal {K}\) and \(\mathcal {Y}^{\perp }:= P^{\perp }\mathcal {Y}=\mathcal {Y}\ominus \mathcal {K}\), and let \(\partial _i \equiv \partial _{x_i}\).
Proposition 7.1
There is a neighbourhood \(U \subset \mathbb {R}_{>0} \times \mathcal {K}\) of (n, 0) such that for every \((\mu ,v)\in U\), Eq. (7.5) for \(u'\) has a unique solution \(u' = u'(\mu ,v)= (u'_1, u'_2, u'_3, u'_4)\). Furthermore, this solution has the following properties:
where \(i=1,... 4,\; m=0,1,\; j=1,2, \; k=2,3,4\), \(v = (v_1,v_2,v_3,v_4)\), \(v^{i} \equiv v|_{v_i=0}\) and \(\mathcal {H}^2_k=\mathcal {H}^2_0\), \(\mathcal {H}^2_0\), \(\mathcal {H}^2\) for \(k=2, 3, 4\).
Proof
Define \(F^{\perp }:\mathbb {R}_{>0}\times \mathcal {K}\times \mathcal {X}^{\perp } \rightarrow \mathcal {Y}^{\perp }\) by
By Proposition 6.2 (i) and (ii), \(F^{\perp }\) is continuously differentiable of all orders as a map between Banach spaces and \(F^{\perp }(\mu ,0,0)=0\) for all \(\mu \in \mathbb {R}_{>0}\). Furthermore,
which is invertible for \(\mu =n\) because \(P^\perp \) is the projection onto the orthogonal complement of \(\mathcal {K}= {\text {Null}}(L_{n,\mu =n})\). By the Implicit Function Theorem (see, e.g. [16]), there exists a function \(u'(\mu ,v)\) with continuous derivatives of all orders such that for \((\mu ,v)\) in a sufficiently small neighbourhood \(U\subset \mathbb {R}_{>0}\times \mathcal {K}\) of (n, 0), \((\mu ,v,u')\) solves (7.5) if and only if \(u' = u'(\mu ,v)\). This proves the first statement and property (7.6).
We define the operator
Then by (6.7) and (7.13), we can write equation (7.5) as \(L_{n, \mu }^{\perp } u' = - P^{\perp } P' J(\mu ,u)\). By Theorem 5.1 and the relation \(\mathcal {K}:= {\text {Null}}(L_{n,\mu =n})={\text {Null}}(L_{n,\mu }-\mu +n)\), for \(\mu \) in a neighbourhood of n, the operator \(L_{n, \mu }^{\perp }\) has a uniformly bounded inverse \((L_{n, \mu }^{\perp })^{-1}:\mathcal {Y}^\perp \rightarrow \mathcal {X}^\perp \). Hence, equation \(L_{n, \mu }^{\perp } u' = - P^{\perp } P' J(\mu ,u)\), with \((\mu , v)\in U\) (replacing U with a smaller neighbourhood if necessary), is equivalent to
hence
uniformly in \(\mu \). Recall that \(\mathcal {X}= \mathcal {H}^2_n\oplus \mathcal {H}^2_0\oplus \mathcal {H}^2_0\oplus \mathcal {H}^2\) and \(\mathcal {Y}= L^2_n\oplus L^2_0\oplus L^2_0\oplus L^2\). \(J(\mu ,u)\) is a polynomial in the components of u and their first-order (covariant) derivatives consisting of terms of degree at least 2, so the left-hand side of (7.16) can be bounded above by a sum of products of one \(\mathcal {L}^2\)-norm and at least one \(\mathcal {L}^{\infty }\)-norm of these terms. \(\mathcal {H}^1\) is trivially continuously embedded in \(\mathcal {L}^2\), and by the Sobolev embedding theorem, \(\mathcal {H}^1\) is continuously embedded in \(\mathcal {L}^{\infty }\). Therefore,
Recalling that \(u=v+u'\), this proves (7.7) and (7.8) when \(m=0\). The other case is proven similarly.
For \(v = (v_1, \dots , v_4)\), we let \(v_{{\widehat{i}}} \equiv v |_{v_{i} = 0}\), \(i=1, \dots , 4\). By the Taylor theorem for Banach spaces (see, e.g. [16]), we have
Let \((\mu ,v) \in U\) with \(\Vert v_{{\widehat{i}}}\Vert =\Vert v_{ i}\Vert =1\), and let \(\epsilon >0\). Then
with the norm taken in the appropriate space for \(v_{ i}\). Taking the supremum over all \(v_{ i}\) with \(||v_{ i}||=1\) gives
proving (7.9)–(7.10) for \(m=0\). The other cases are proven in exactly the same way.
Again by Taylor’s theorem,
By Eqs. (7.8) and (7.9)–(7.10) with \(m=0\), we have \(u'(\mu ,0) = 0\) and \(\delta _{v} u'(\mu ,0) = 0\), so
proving (7.11). \(\square \)
We plug the solution \(u' = u'(\mu ,v)\) into Eq. (7.4) to get the bifurcation equation
Corollary 7.2
In a neighbourhood of (n, 0) in \(\mathbb {R}_{>0}\times \mathcal {X}\), the pair \((\mu ,u)\) solves (6.7) if and only if \((\mu ,v)\) solves the finite-dimensional Eq. (7.27). Moreover, a solution of (6.7) can be constructed from a solution \((\mu ,v)\) of (7.27) by setting \(u = v + u'(\mu ,v)\), where \(u'(\mu ,v)\) is given by Proposition 7.1.
Since \(F:\mathbb {R}_{>0}\times \mathcal {X}\rightarrow \mathcal {Y}\) and \(u':\mathbb {R}_{>0}\times \mathcal {K}\rightarrow \mathcal {Y}^\perp \) have been shown to be continuously differentiable of all orders, we conclude:
Corollary 7.3
\(\gamma :\mathbb {R}\times \mathcal {K}\rightarrow \mathcal {K}\) is continuously Gâteau differentiable of all orders.
Furthermore, \(\gamma (\mu ,v)\) inherits the following symmetry of \(F(\mu ,u)\), which we will use to find a solution of (7.27):
Lemma 7.4
Let \(T_{\delta }\) be given by (6.12). For every \(\delta \in \mathbb {R}\) and \((\mu , v)\) in a neighbourhood of (n, 0), we have
Proof
For Eq. (7.28), we note that by Proposition 6.2 (iv)
(Here we used \(P^{\perp }T_{\delta } = T_{\delta }P^{\perp }\), which follows because \(T_{\delta } = e^{i\delta }\oplus 1\oplus 1\oplus 1\) and \(P^{\perp } = 1-P\) where P is defined in (7.1).) Since \(u' = u'(\mu ,T_{\delta }v)\) is the unique solution to \(P^{\perp }F(\mu , T_{\delta }v + u')=0\) for \((\mu ,v)\) in a neighbourhood \(U\subset \mathbb {R}\times \mathcal {K}\) of (n, 0), we conlcude that \(u'(\mu ,T_{\delta }v) = T_{\delta } u'(\mu ,v)\).
For Eq. (7.29), we note that by (7.28) and Proposition 6.2 (iv),
(where again we used \(PT_{\delta }=T_{\delta }P\)). \(\square \)
8 The Bifurcation Result When \(n=1\)
Theorem 8.1
Assume that \(n=1\) and \(|1-b_*/b|\ll 1\), \(b_*:= M_W^2/e\). Then there exists \(\epsilon >0\) and a branch \((\mu _s,u_s):= (\mu _s, w_s, \alpha _s,z_s, \psi _s)\), with \(s\in [0,\sqrt{\epsilon })\), of non-trivial solutions of Eq. (6.1), unique modulo a gauge symmetry in a sufficiently small neighbourhood of the rescaled vacuum solution (4.9) in \(\mathbb {R}_{>0}\times \mathcal {X}\), such that
where \(\chi \) solves the eigenvalue problem \(H_1(n) \chi =0\) (it is defined in (5.24), see Proposition 5.3), \(\mu := g^2 \xi ^2/2= g^2 r^2\varphi _0^2/2\), \(g_1:[0,\epsilon )\rightarrow \mathcal {H}_n^2\) and is orthogonal to \({\text {Null}}(H_1(n))\), \(g_2:[0,\epsilon )\rightarrow \mathcal {H}^2_0\), \(g_3:[0,\epsilon )\rightarrow \mathcal {H}^2_0\), \(g_4:[0,\epsilon )\rightarrow \mathcal {H}^2\), \(g_5:[0,\epsilon )\rightarrow \mathbb {R}_{>0}\), and \(g_j\) for \(j=1,\cdots , 5\) are functions, continuously differentiable of all orders in s, such that \(g_j(0)=0\).
Proof of Theorem 8.1
For the proof below, recall that we denote the partial (real) Gâteaux derivatives with respect to \(\#\) by \(\delta _\#\), and let \(\partial _i \equiv \partial _{x_i}\).
By Proposition 6.1, solving Eq. (6.1) is equivalent to solving (6.7). By Corollary 7.2, solving (6.7) is equivalent to solving the bifurcation equation (7.27). Hence, we address the latter equation.
Recall that P is the projection onto \(\mathcal {K}= {\text {Null}}L_{n,\mu =n}={\text {Null}}(H_1(n)) \times \{\text {constants}\}\times \{0\}\times \{0\}\). The projection onto constant vector fields in \(\mathcal {H}_0^2\) can be written as the mean value \(\langle \alpha \rangle := \frac{1}{|\Omega '|}\int _{\Omega '}\alpha \). Since \(\text {dim} {\text {Null}}(H_1(n)) =1\) for \(n=1\), we may choose \(\chi \in {\text {Null}}(H_1(n))\) such that
and \(\chi \) satisfies \(\Vert \chi \Vert _{L_n^2}^2=\langle |\chi |^2 \rangle =1\) (see (5.13)), where, recall, \(\chi \) is described in (5.24). Hence, we may write the \(\gamma \) from the bifurcation equation (7.27) as \(\gamma = (\tilde{\gamma }_1 \chi , \tilde{\gamma }_2,0,0)\), where \(\tilde{\gamma }_1,\tilde{\gamma }_2:\mathbb {R}_{>0} \times \mathbb {C}\times \mathbb {R}^2 \rightarrow \mathbb {C}\) are given by
where, recall, \(F_j\), \(j=1, \dots , 4\) are defined by (6.9), \(s\in \mathbb {C},\ c\in \mathbb {R}^2\) and (see (8.2))
Note that \({\tilde{\gamma }}_1\) and \({\tilde{\gamma }}_2\) are continuously differentiable of all orders in \(\mu \), s and c by Corollary 7.3. (\({\tilde{\gamma }}_2\) is independent of \(\mu \).) The bifurcation equation (7.27) is then equivalent to the equations
Lemma 8.2
There exists a neighbourhood \(U\subset \mathbb {R}_{>0}\times \mathbb {R}_{>0}\) of (n, 0) and a unique function \(c:U\rightarrow \mathbb {R}^2\) with continuous derivatives of all orders such that
and
Proof
Recall that \(F_2(\mu ,u) = H_2(\mu ) \alpha + P_0 J_2(\mu , u)\) (see Equation (6.7)), with \(P_0\) the projection onto the divergence-free vector fields and
where \(v=v(s,c)\) and \(u'=u'(\mu , v)\) solves (7.5). By definition, \((\textbf{1}- P_0)f=\Delta ^{-1}\nabla {\text {div}}f\) and therefore \(\langle (\textbf{1}- P_0)f \rangle =0\). Hence \(\langle P_0 f \rangle =\langle f \rangle \). This and the relation \(\langle H_2(\mu )\alpha \rangle = \frac{1}{|\Omega '|} \int _{\Omega '}{\text {curl}}^*{\text {curl}}\alpha = 0\) give
Using (6.3), \(\nu =a^n+{\tilde{\nu }}\), \({\text {curl}}_{a^n}w={\text {curl}}_{a^n}w- i{\tilde{\nu }} \times w\) and that the final term in (6.3) vanishes after taking the mean, we find
Recall \(u'=( w', \alpha ',z',\psi ')\). Then (8.6) and (8.11) give \(w=s\chi +w'\) and (using that \(e=g \sin \theta \)) \( {\tilde{\nu }} =e c+\nu '\). Using these relations and \({\text {curl}}_{ a^n} \chi = 0\) (by (5.24)) and (8.12) and (8.13), we find for \(\overline{\gamma }_2(\mu ,s,c):= (2e)^{-1} |s|^{-2}\tilde{\gamma }_2(\mu ,s,c)\)
Note that we expect (8.14) \(=\mathcal {O}(|s|^2)\) and (8.15) \(=\mathcal {O}(|s|^4)\). We now simplify (8.14). For the first term on the right-hand side, we use (5.24) and the condition \(\langle |\chi |^2 \rangle =1\) to compute
For the second term on the right-hand side of (8.14), we use \(\langle f J \overline{\chi }\rangle =\langle f (i\overline{\eta }, \overline{\eta })\rangle = \langle f{\bar{\eta }}\rangle (i, 1)=\langle \eta , f\rangle (i,1)\) and integrate by parts to compute
Abusing notation, we write in what follows \(w(\mu ,s,c)\equiv w(\mu , v(s,c))\). Then (8.14) becomes
Now, Equation (7.7), with \(m=0\), implies that
Furthermore, we show below the following estimate on the remainder:
Hence \(\overline{\gamma }_2(\mu , 0, 0)=0\).
To apply the implicit function theorem to solve for c as a function of \(\mu \) and s, we have to estimate the derivative
at (n, s, 0). At the first step, we use the following
Lemma 8.3
Using Dirac’s bra-ket notation, we have
Proof of Lemma 8.3
By definition (7.2), \( P^\perp _1\) projects onto the orthogonal complement of the eigenspace of \(H_1(n)\) corresponding to the eigenvalue 0 and therefore the operator \(H_1^\perp (n)\) is invertible on \({\text {Ran}}P^\perp _1\).
Hence (6.1) with \(i=1\) can be rewritten as \( w' = - (H_1^\perp (n))^{-1}P^\perp _1 J_1(n, u)\) (which is the first component of (7.15)), which gives
where \(u\equiv u(s,c):=v(s,c) + u'(\mu ,v(s,c))\). By (6.2) and (6.6), we have
Using \(w=s\chi +w',\) \(\nu = a^n+ec+\nu '\) and \({\text {curl}}_{\nu }={\text {curl}}_{a^n} + i J (ec+\nu ') \cdot \), \({\text {curl}}_{\nu }^*={\text {curl}}_{a^n }^* - i J (ec+\nu ')\) and that \(\nu '=\mathcal {O}(|s|^2)\), we compute
Since \({\text {curl}}_{a^n}\chi =\nabla _1i\beta - \nabla _2\beta =i{\bar{\partial }}_{a^n}\beta = 0\) and \( Jc' \cdot \chi = (-c_2', c_1') \cdot (\beta , i \beta )=-c_2'\beta + c_1' i \beta =i (c'_1+ic_2')\beta \) and therefore \({\text {curl}}_{a^n}^* Jc' \cdot \chi =i{\text {curl}}_{a^n}^* \beta (c'_1+ic_2')\), this yields
By Proposition 5.3(ii), \({\text {Null}}{(H_1(\mu )-\mu +n)}=\{\chi =(\beta , i\beta ): {\text {curl}}_{ a^n} \chi =i{\bar{\partial }}_{ a^n} \eta = 0\}\). The relation \({\text {curl}}_{a^n}\chi = 0\) implies also \(\langle \chi , {\text {curl}}_{ a^n}^* \chi \rangle =\langle {\text {curl}}_{ a^n} \chi , \chi \rangle =0\), which, for \(n=1\), gives that \(P^\perp _1 \partial _c J_1(n, u) c' = \partial _c J_1(n, u) c' \) and therefore
By (5.24), we have \({\text {curl}}_{a^n}^*\eta = i\nabla _{a^n}\beta \), and by (5.25), we have \(H_1(n)\nabla _{ a^n}\beta = n\nabla _{ a^n}\beta \); hence, \((H_1^\perp (n))^{-1}{\text {curl}}_{ a^n}^* \beta =n^{-1}{\text {curl}}_{ a^n}^* \beta \). This relation, together with (8.26) and (8.31), yields
which gives (8.25). \(\square \)
Using Eq. (8.25), we calculate the second term on the right-hand side of (8.24) at (n, s, 0):
The inner product term is real. Integrating it by parts and using that, by Equation (5.38), \(\beta \) satisfies \({\text {curl}}_{ a^n} {\text {curl}}_{a^n}^* \beta =-\Delta _{ a^n}\beta = n\beta \) and using that \(\Vert \beta \Vert _{L^2_n}^2 = \frac{1}{2} \Vert \chi \Vert _{L^2_n}^2 = \frac{1}{2}\), gives
The last two equations and the relation \( {\text {Im}}(c'_1+ic_2') (i,1)={\text {Im}}\bigg (\begin{array}{cc} i &{} \quad - 1 \\ 1 &{} \quad i \end{array} \bigg )c'=\textbf{1}c'\) imply
This, together with (8.24), gives
Therefore, (8.36) and (8.23) (with \(l=1\)) imply
proving that \(\partial _c\overline{\gamma }_2(n,0,0)\) is invertible, as required.
Recall that, by (8.21), (8.22) and (8.23) (with \(l=0\)), we have
Since \(\partial _c\overline{\gamma }_2(n,0,0)\) is invertible, by the implicit function theorem there exists a unique function \({\tilde{c}}:\mathbb {R}_{>0}\times \mathbb {C}\rightarrow \mathbb {R}^2\) with continuous derivatives of all orders such that \(\overline{\gamma }_2(\mu ,s,{\tilde{c}}(\mu ,s))=0\) for \((\mu ,s)\) in a sufficiently small neighbourhood of (n, 0). Furthermore, the symmetry (7.29) implies that \(\overline{\gamma }_2(\mu ,|s|,{\tilde{c}}(\mu , s)) = \overline{\gamma }_2(\mu ,e^{i\arg s}|s|,{\tilde{c}}(\mu , s)) = \overline{\gamma }_2(\mu ,s,{\tilde{c}}(\mu , s))=0\), so by the uniqueness of the branch \({\tilde{c}}(\mu , s)\) we have
In particular, \(\partial _{\mu }^l{\tilde{c}}(\mu , s)\), \(l=0,1\), restricted to \(s\in \mathbb {R}\) are even functions with continuous derivatives of all orders; thus \(\partial _s \partial _{\mu }^l{\tilde{c}}(\mu , 0) = 0\) and hence \(\partial _{\mu }^l {\tilde{c}}(\mu , s) = \mathcal {O}(|s|^2)\), since the first two terms of the Taylor expansion are 0. We define \(c:\mathbb {R}_{>0}\times \mathbb {R}_{>0}\rightarrow \mathbb {R}^2\) by \(c(\mu , s):= {\tilde{c}}(\mu , \sqrt{s})\), which is a function with continuous derivatives of all orders satisfying \(||\partial _{\mu }^l c(\mu , s^2)||_{\mathbb {R}^2} = \mathcal {O}(|s|^2)\), \(l=0,1\), and \(\tilde{\gamma }_2(\mu ,s,c(\mu , s^2)) = |s|^2\overline{\gamma }_2(\mu , s, c(\mu , s^2)) = 0\), as required. \(\square \)
Lemma 8.4
For \(\epsilon >0\) sufficiently small, there exists a unique function \(\mu :[0,\epsilon )\rightarrow \mathbb {R}_{>0}\) with continuous derivatives of all orders such that
Proof
To simplify notation for this lemma, we set \(u=v_s + u'_s\), with \(v\equiv v_s \equiv (s\chi , c(\mu ,s^2),0,0), \ u'\equiv u'_s \equiv u'(\mu , v_s),\ c \equiv c(\mu ,s^2)\).
We first show that \(\tilde{\gamma }_1(\mu ,s,c)\in \mathbb {R}\) for \(s\in \mathbb {R}\). Since \(u'\) by definition solves \(P_1^\perp F_1(\mu , v+ u') = 0\), where \(P_1^\perp w' = w'\) and \(P_1^\perp \) is self-adjoint, we have
Therefore, for \(s\ne 0\), we find
which is real by Proposition 6.2 (v). Furthermore, by equations (7.29) and (8.39), we have \(\tilde{\gamma }_1(\mu ,s,c(\mu , s^2)) = e^{i\text {arg}(s)}\tilde{\gamma }_1(\mu ,|s|,c(\mu ,|s|^2))\), so we may restrict s to be real.
Next, we show that
Indeed,
Recall that \(H_1(n)\chi =0\), so that
By the definition \(v\equiv v_s \equiv (s\chi , c(\mu ,s^2),0,0)\) and equation (8.10), \(\Vert v\Vert _{\mathcal {X}} = \mathcal {O}(|s|)\); hence, by Proposition 7.1,
Furthermore, by equation (7.17) and recalling that \(H_1(n) \chi = 0\),
This proves that \( \tilde{\gamma }_1(n,s,c(n,s^2))\) is \(\mathcal {O}(|s|^2)\), as required.
In light of equation (8.43), we can define a function \(\overline{\gamma }_1: \mathbb {R}_{>0}\times \mathbb {R}_{>0} \rightarrow \mathbb {R}\) with continuous derivatives of all orders by
We now find a non-trivial branch of solutions \((\mu ,s) = (\tilde{\mu }(s),s)\) by applying the implicit function theorem to \(\overline{\gamma }_1\). First, we prove the following proposition to bound the polynomials of functions appearing below:
Lemma 8.5
Let X be one of the spaces \(\mathcal {H}^2_n\), \(\mathcal {H}_0\) or \(\mathcal {H}^2\) defined before equation (5.9). Let \(p(x_1,...,x_n)\) be a polynomial with positive coefficients and let \(f_1,...,f_n\in X\). Then \(\Vert p(f_1,...,f_n)\Vert _X \lesssim p(\Vert f_1\Vert _X,...,\Vert f_n\Vert _X)\).
Proof
Write \(p(x_1,...,x_n) = \sum _{|\alpha |\le N} p_{\alpha } x^{\alpha }\), where \(\alpha = (\alpha _1,...,\alpha _n)\) is a multi-index, \(x^\alpha = \prod _{i=1}^n x_i^{\alpha _i}\) and \(p_{\alpha } \ge 0\). Since by the Sobolev Embedding Theorem (see, e.g. [2]), X is a Banach algebra, we have
which implies the desired result. \(\square \)
Lemma 8.6
There exists \(\epsilon >0\) and a unique function \(\tilde{\mu }:(-\sqrt{\epsilon },\sqrt{\epsilon })\rightarrow \mathbb {R}_{>0}\) with continuous derivatives of all orders such that \(\tilde{\mu }(0)=n\) and \(\mu =\tilde{\mu }(s)\) solves \(\overline{\gamma }_1(\mu ,s)=0\) for \(s\in (-\sqrt{\epsilon },\sqrt{\epsilon })\). Moreover, \(\tilde{\mu }\) is an even function: \(\tilde{\mu }(s) = \tilde{\mu }(-s)\).
Proof
Recall that \(F_1(\mu , u) = H_1(\mu ) w + J_1(\mu , u)\) (where \(H_1(\mu )\) and \(J_1(\mu ,u)\) are defined in (5.17) and (6.2)). Using that \(\partial _{\mu }F_1(\mu ,u) = (1+\frac{g}{2\sqrt{2\mu }}\psi )w\) and setting \(u=v_s + u'_s\), with \(v\equiv v_s \equiv (s\chi , c(\mu ,s^2),0,0), \ u'\equiv u'_s \equiv u'(\mu , v_s),\ c = c(\mu ,s^2)\), we compute
By Lemma 8.2, \(\Vert \partial _{\mu }^l c\Vert _{\mathbb {R}^2} = \mathcal {O}(|s|^2)\), \(l=0,1\). Since \(||v||_{\mathcal {X}}\) is \(\mathcal {O}(|s|)\), by Proposition 7.1 the terms \(\Vert \partial _\mu ^l u'_i\Vert \) (\(l=0,1\), \(i=1, \dots , 4\), with the norms taken in the appropriate spaces), are \(\mathcal {O}(|s|^2)\). By Lemma 8.5, this implies that all terms in (8.49) containing \(c, w',\alpha ',z',\psi '\) or their \(\mu \)-derivatives vanish at \((\mu ,s) = (n,0)\). Therefore,
and hence
Since \(\overline{\gamma }_1(\mu ,s)\) is continuously differentiable of all orders in \(\mu \) and s, by the implicit function theorem, we obtain the first statement of the lemma.
By the symmetry \(\overline{\gamma }_1(\mu ,-s)=-\overline{\gamma }_1(\mu ,s)\) of \(\overline{\gamma }_1\) and the uniqueness of the branch \(\tilde{\mu }(s)\), we have \(\tilde{\mu }(s) = \tilde{\mu }(-s)\), which gives the second statement. \(\square \)
We define \(\mu (s) \equiv \tilde{\mu }(\sqrt{s})\), which is a function with continuous derivatives of all orders for \(s\in [0,\epsilon )\) for the same reasons that \(c(\mu ,s):= {\tilde{c}}(\mu , \sqrt{s})\) was shown to be continuously differentiable of all orders in Lemma 8.2. Furthermore, \(\mu \) satisfies \(\tilde{\gamma }_1(\mu (s^2),s,c(\mu (s^2), s^2)) = s\overline{\gamma }_1(\mu (s^2),s,c(\mu (s^2), s^2)) = 0\), as required. \(\square \)
We will now use the branch of solutions to (8.7)–(8.8), provided by Lemmas 8.2 and 8.4, and Corollary 7.2 to obtain the corresponding unique branch, \((\mu _s, u_s)\), of solutions to (6.7), with
(8.52)–(8.54) have continuous s-derivatives of all orders because each component function has continuous derivatives of all orders. Symmetry (7.28) with \(\delta = \pi \) and the relation \(T_{\pi }(f_1,f_2,f_3,f_4) = (-f_1,f_2,f_3,f_4)\) imply that \((u_s')_1\) is an odd function of s and \((u_s')_2\), \((u_s')_3\) and \((u_s')_4\) are even functions of s. Arguing as in the case of Lemma 8.2 shows that the functions:
are well-defined for \(s\ge 0\) and have continuous derivatives of all orders. By Proposition 7.1, these functions have the properties listed in Theorem 8.1. The above definitions and equations (8.52)–(8.54) imply \(u_s = (s\chi ,\frac{1}{e} a^n,0,0)+(g_1(s), \dots , g_4(s))\). Hence, this solution is of the form (8.1). Now, by Proposition 6.1, this also solves system (4.3)–(4.6), completing the proof. \(\square \)
9 Proof of Theorem 2.2(a), (b)
Recall that \(M_W\), \(M_Z\), \(M_H\) are the masses of the W, Z and Higgs bosons, respectively, and that \(\tau \) is the shape parameter of the lattice \(\mathcal {L}\) (see the paragraph before Theorem 2.3 of Sect. 2). We introduce the notation
the average of f over fundamental domain \(\Omega '=\sqrt{\frac{2\pi }{|\Omega |}}\Omega \). Furthermore, we introduce the function (cf. [26])
with, recall, \(m_w:= \sqrt{n}\), \(m_z:= \frac{\sqrt{n} }{\cos \theta }\) and \(m_h:= \frac{\sqrt{4\lambda n}}{g}\) the masses of the rescaled W, Z and Higgs boson fields, w, z and \(\phi \), respectively, and
Here \(\chi \) is defined in (5.24) and \(G_{m,m'}\) is the operator-family on the space (5.11) given by
Note that \(G_{m, m'}>0\) for \(m'<m\). Recall \(M_W:= \frac{1}{\sqrt{2}} g\varphi _0\), \(M_Z:= \frac{1}{\sqrt{2}\cos \theta } g\varphi _0\) and \(M_H:= \sqrt{2}\lambda \varphi _0\).
Proposition 9.1
If \(M_Z < M_H\), the parameter s of the branch (8.1) is related to the magnetic field strength by
where \(R_s(\omega )\) is a real, smooth function of \(\omega \) satisfying
Before proving Proposition 9.1, we shall see how it implies statements (a) and (b) of Theorem 2.2.
Proof of Theorem 2.2(a), (b)
Since the operator \(G_{m_z,m_h}\) is positivity preserving, the function \(G_{m_z,m_h}(|\chi |^2)\) is positive for \(M_Z < M_H\), and hence \(\alpha _{m_z,m_h}(\tau )\) and \(\eta _{m_z,m_h}(\tau )\) are positive. Furthermore, when the right-hand side of (9.5) is positive, we solve (9.5) for s as a function of b, \(s=s(b)\), having continuous derivatives of all orders. When \(|1-\frac{M_W^2}{eb}| \ll 1\), the right-hand side of (9.5) is positive if and only if \(1-\frac{M_W^2}{eb} > 0\).Footnote 9 Plugging \(s = s(b)\) into (8.1) (i.e. passing from the bifurcation parameter s to the physical parameter b), undoing the rescaling (4.1), and recalling that \(b_* = \frac{M_W^2}{e}\), we arrive at the branch, \(U_{\mathcal {L}} \equiv (W_b, A_b, Z_b, \varphi _b)\), of solutions of (3.12)–(3.15), which has the properties listed in statements (a) and (b) of Theorem 2.2. \(\square \)
The following statement follows from the proof above:
Lemma 9.2
\(U_{\mathcal {L}}\) is continuously differentiable of all orders in b for b in an open right half-interval of \(b_*\)
Proof of Proposition 9.1
Consider the solution branch \((\mu _s,w_s,a_s, z_s)\) given in equation (8.1) and described in Theorem 8.1. Using Taylor’s theorem for Banach space-valued finctions (see, e.g. [16]) and recalling the relation \(\xi = \sqrt{2\mu }/g\), we may expand this branch in s as follows:
where \(w',a',z',\psi ',\xi '\) and \(a''\) are the coefficients of \(s^2\) and \(s^4\), respectively, in the Taylor expansion of \(g_j(s^2)\), \(j=0,...,5\), in (8.1), and \(\chi \) is defined in (5.24). Here \(\mathcal {O}(|s|^p)\) stand for various error terms which, together with their (covariant) derivatives, have norms of order \(\mathcal {O}(|s|^p)\) when taken in the appropriate spaces.
To rewrite the asymptotics in terms of the parameter b, we analyse how s depends on b. For this, we use the definitions \(\xi _{s} = \sqrt{2\mu _s}/g\) and \(\mu := \frac{1}{2} (g \xi )^2=\frac{1}{2} (g r \varphi _0)^2\), with \(r:= \sqrt{\frac{n}{eb}}\) (see (4.2)) to find the following equation for \(s^2\):
To solve this equation for \(s^2\), we use the Implicit Function Theorem. By (9.7), we can write \(\xi _{s}=\sqrt{2n}/g+ g_{\xi }(s^2)\), where recall, \(g_\xi (0) = 0\) and \(g'_{\xi }(0) = \xi '\). Hence, we have to show that \(\xi ' \ne 0\).
Lemma 9.3
We have \(\xi ' \ne 0\).
Proof
We find relations between \(\psi '\), \(a'\) and \(z'\) entering (9.7). Plugging (9.7) into Equations (4.4)–(4.6), we obtain at order \(s^4\)
We solve these equations, using that \({\text {curl}}^* |\chi |^2={\text {curl}}^* (|\chi |^2-\langle |\chi |^2\rangle ')\) and \(|\chi |^2-\langle |\chi |^2\rangle '\in {\text {Ran}}(\Delta )\), for the first one, to findFootnote 10
where \(G_m:=(-\Delta + m^2)^{-1}\) acting on the space (5.11) (cf. (9.4)), and \(m_z:= \frac{\sqrt{n} }{\cos \theta }\) and \(m_h:= \frac{\sqrt{4\lambda n}}{g}\) are the masses of the rescaled Z and Higgs boson (\(\Phi \)) fields, z and \(\phi \), respectively. Next, we use the following relation proven in “Appendix E”:
where, recall, \(\nu ':= g(a'\sin \theta + z'\cos \theta )\). First, we evaluate \({\text {curl}}\nu '\). The relations \((-\Delta + m^2) G_m = \textbf{1}\) and \({\text {curl}}{\text {curl}}^* =-\Delta \) imply \({\text {curl}}a'= e (|\chi |^2-\langle |\chi |^2\rangle )\). Next, the second relation in (9.10) and the relation \({\text {curl}}{\text {curl}}^* =-\Delta \) yield \({\text {curl}}z'=g\cos \theta (-\Delta )(-\Delta + m_z^2)^{-1} |\chi |^2\), which, together with \(m_z:= \frac{\sqrt{n} }{\cos \theta }\), gives \({\text {curl}}z' = g \cos \theta |\chi |^2- g\frac{ n }{\cos \theta }G_{m_z}|\chi |^2\). Finally, using that \(e:=g\sin \theta \), we conclude that
Plugging the last relation and equation (9.10) into the relation (9.11), gives
where \(m_w:= \sqrt{n}\) is the mass of the rescaled W boson field w and the operator-family \(G_{m,m'}\) is defined by (9.4). We solve for \(\xi '\) and write the solution as
where \(\eta \equiv \eta _{m_z,m_h}(\tau )\) is defined in (9.2)–(9.3). The operator \(G_{m_z,m_h}\) in (9.3) is positivity preserving and therefore the function \(\alpha _{m_z,m_h}(\tau )\) (and hence \(\eta _{m_z,m_h}(\tau )\)) is positive, if and only if \(m_z < m_h\) (equivalently, \(M_Z < M_H\)), in which case \(\xi ' <0\). \(\square \)
We now derive the estimate (9.5)–(9.6) for \(s^2\). Equations (9.7) and (9.8) give \(\xi _s\) as a function of s and b respectively, yielding
which can be rearranged to give
where, recall, \(\omega =1-\frac{M_W^2}{eb}\), with \(M_W = \frac{1}{\sqrt{2}} g\varphi _0\). Recall that \(g_\xi (0) = 0\) and \(g_\xi '(0) = \xi '\). We have
Since \(\xi '\ne 0\) and \(g_{\xi }(s^2)\) is continuously differentiable of all orders (see Theorem 8.1), by the implicit function theorem, we may solve (9.16) for \(s^2\), with the solution, \(s^2 = s^2(\omega )\), with continuous derivatives of all orders in \(\omega \). Explicitly, (9.16)–(9.17) give
Plugging (9.14) into (9.18) gives
which is (9.5), with \(R_s(\omega )\) satisfying \(R_s(\omega ) = \mathcal {O}(|\omega |^2)\). Furthermore, since the solution \(s^2 = s^2(\omega )\) is continuously differentiable of all orders in \(\omega \), so is the remainder term \(R_s(\omega )\). \(\square \)
10 Asymptotics of the Weinberg–Salam Energy Near \(b=M_W^2/e\)
Recall \(\omega =1-\frac{M_W^2}{eb}\), with \(M_W = \frac{1}{\sqrt{2}} g\varphi _0\), and \(\eta _{m_z, m_h}( \tau )\) is defined in (9.2). The main result of this section is the following:
Theorem 10.1
If \(M_Z < M_H\), then the WS energy (3.10) of the branch of solutions (8.1) has the following expansion:
where \(R_E(\omega )\) is a real function with continuous derivatives of all orders satisfying
Before proving Theorem 10.1, we derive from it Theorem 2.2 (c).
Proof of Theorem 2.2 (c)
Since \(\eta _{m_z, m_h}(\tau )\) is positive,Footnote 11 the second term in Equation (10.1) is negative, and so for \( 0< 1-\frac{M_W^2}{eb} \ll 1\), \(E_{\Omega }^{WS}\) is less than the vacuum energy \(\frac{1}{2}b^2|\Omega |\). This proves Theorem 2.2 (c). \(\square \)
Proof of Theorem 10.1
Let \(\mathcal {E}' (w_s, a_s, z_s, \psi _s+\xi _{s}; r):= \frac{1}{|\Omega '|}\mathcal {E}_{\Omega '} (w_s, a_s, z_s, \psi _s+\xi _{s}; r)\), where \(\mathcal {E}_{\Omega '}\) is the rescaled WS energy given in (4.8). In “Appendix F”, we derive the following expansion (to order \(s^4\)) of \(\mathcal {E}'\) evaluated at family (9.7) of solutions:
where \(R_\varepsilon (s) = \mathcal {O}(|s|^6)\) and has continuous derivatives of all orders, \(\nu ':= g(a'\sin \theta + z'\cos \theta )\) and, recall, \(\xi _{ s}= \sqrt{2\mu _s}/g\).
To simplify notation, in what follows, we shall suppress the argument \((w_s, a_s, z_s, \psi _s+\xi _{ s}; r)\) of \(\mathcal {E}'\).
We claim the following relation:
Proof of (10.4)
We simplify the integral at order \(s^4\) in (10.3) by applying eq. (9.9) for \(a'\), \(z'\) and \(\psi '\) to convenient groupings of terms.
First, we address the \(z'\) terms in (10.3). Integrating by parts and factoring out \(z'\) gives
Applying (9.9) for \(z'\) gives
Integrating by parts again gives
Next, we address the \(a'\) term in (10.3). Integrating by parts gives
Inserting into this expression (9.9) for \(a'\) gives
Integrating by parts again gives
Next, we address the \(\psi '\) terms. Integrating by parts and factoring out \(\psi '\) gives
Inserting into this expression (9.9) for \(\psi '\) gives
For the \(\xi '\) term in (10.3), we have by (9.11) and (9.14),
where, recall, \(\nu ':= g(a'\sin \theta + z'\cos \theta )\). Finally, there are two remaining terms of the integral at order \(s^4\) in (10.3),
which we will not presently simplify.
Adding Eqs. (10.7), (10.10), (10.12), (10.13) and (10.14) and remembering (10.3) gives Eq. (10.4), as required. \(\square \)
Plugging (9.19) into (10.4) gives
where \({\tilde{R}}_\varepsilon (\omega )\) has continuous derivatives of all orders and satisfies \({\tilde{R}}_\varepsilon (\omega ) = \mathcal {O}(|\omega |^3)\).
For the WS energy (3.10), evaluated at \((W_b, A_b, Z_b, \varphi _b)\), we recall that \(E_{\Omega }^{WS} = \frac{1}{r^2} \mathcal {E}_{\Omega '}= \frac{1}{r^2}|\Omega '| \mathcal {E}'\), which implies
Eq. (10.1) follows by plugging (10.15) into (10.16). Since the remainder term \({\tilde{R}}_\varepsilon \) of (10.15) has continuous derivatives of all orders, so does the remainder term \(R_E\) of (10.1). \(\square \)
11 Shape of Lattice Solutions
In this section, we shall prove Theorem 2.3. Recall the shape parameter \(\tau \) described in the paragraph preceding (2.22). We return briefly to working with the rescaled fields to prove that \(\mathcal {E}_{\Omega '}(u; r),\ u=(w, \alpha , z, \psi ),\) given in (4.8) (and hence \(E_{\Omega } (U)\)) is continuously Gâteau differentiable of all orders in the shape parameter \(\tau \) (restricted to domain (2.22)), which enters through \(\Omega '\) (and \(\Omega \)), as well as the spaces containing u (and U). Below, we write
to emphasize the dependence of the family of solutions (9.7), the corresponding energy (4.8) (respectively (3.10)) and the space (5.12) containing these solutions on the shape parameter \(\tau \), the magnetic field strength b and the position in space \(x\in \mathbb {R}^2\). Also, recall the notation \(r:=\sqrt{n/eb}\).
To get rid of the dependency of the space \(\mathcal {X}_{\tau }\) containing \(u_{\tau ,b}\), on the shape parameter \(\tau \), we make the change of coordinates
mapping \(\Omega '\) into a square of area \(2\pi \). This allows us to define the functions \(G':\mathbb {C}\times \mathbb {R}\times \mathcal {X}_1\rightarrow \mathbb {C}\times \mathbb {R}\times \mathcal {Y}_1\) and \(\varepsilon ': \mathbb {C}\times \mathbb {R}\times \mathcal {X}_1\rightarrow \mathbb {C}\times \mathbb {R}\times \mathcal {Y}_1\) on the fixed space \(\mathcal {X}_1\):
where, recall, G(b, v) is the map given by the left-hand side of (4.3)–(4.6), given explicitly in (5.1), and \(\varepsilon _{WS} (b,u):= \varepsilon (u; r)\) is the rescaled energy density given by the integrand in (4.8), see (11.2) (\(\varepsilon \) depends on the magnetic field strength b but does not directly depend on the shape parameter \(\tau \)).
Lemma 11.1
\(G'(\tau ,b,v)\) and \(\varepsilon '(\tau ,b,v)\) are continuously Gâteau differentiable of all orders in \({\text {Re}}(\tau )\), \({\text {Im}}(\tau )\), b and v.
Proof
Since G(b, u) and \(\varepsilon (b,u)\) have continuous b and u derivatives of all orders, and \(M_{\tau }\) is a linear map independent of b and v, it follows that \(G'(\tau ,b,v)\) and \(\varepsilon '(\tau ,b,v)\) have continuous b- and v-derivatives of all orders.
For the \(\tau \)-derivatives, note that
are continuously differentiable of all orders in \({\text {Re}}(\tau )\) and \({\text {Im}}(\tau )\). Since G(b, u) and \(\varepsilon _{WS}(b,u)\) are polynomials in the components of u and their (covariant) derivatives, \(G'\) and \(\varSigma \) are simply G and \(\varepsilon _{WS}\) with the coefficients of the derivative-containing terms multiplied by smooth functions of \({\text {Re}}(\tau )\) and \({\text {Im}}(\tau )\). Therefore \(G'(\tau ,b,v)\) and \(\varSigma (\tau ,b,v)\) have continuous \({\text {Re}}(\tau )\)- and \({\text {Im}}(\tau )\)-derivatives of all orders. \(\square \)
Lemma 11.2
\(v_{\tau ,b}:= M_{\tau }u_{\tau ,b}\) is continuously differentiable of all orders in \(\text {Re}(\tau )\) and \(\text {Im}(\tau )\).
Proof
Let \(\tau _0\) be an arbitrary shape parameter, and recall that \(\delta _\#\) denotes the partial (real) Gâteaux derivative with respect to \(\#\). Then \(G'(\tau _0,b,v_{\tau _0,b}) = M_{\tau _0}G(b,u_{\tau _0,b}) =0\), \(\delta _v G(\tau _0,b,v_{\tau _0,b}) = M_{\tau _0} \circ \delta _u G(b,u_{\tau _0,b}) \circ M_{\tau _0}^{-1}\) is invertible, and by Lemma 11.1, \(G'\) is continuously Gâteau differentiable of all orders in \(\tau \), b and v. Therefore, by the Implicit Function Theorem, the unique solution \(v_{\tau ,b}\) to the equation \(G(\tau ,b,v) = 0\) is continuously differentiable of all orders in \(\text {Re}(\tau )\) and \(\text {Im}(\tau )\) near \((\text {Re}(\tau ), \text {Im}(\tau )) = (\text {Re}(\tau _0), \text {Im}(\tau _0))\). Since \(\tau _0\) was arbitrary, this proves the result. \(\square \)
Proposition 11.3
\(E (\tau ,b,U_{\tau ,b})\) is continuously differentiable of all orders in \(\text {Re}(\tau )\) and \(\text {Im}(\tau )\).
Proof
To get rid of the dependency of \(\mathcal {E}(\tau ,b,u_{\tau ,b})\) on the domain of integration \(\Omega '\), we again make the change of coordinates \(y = m_{\tau }^{-1}x\). Then
By Lemma 11.2, \(v_{\tau ,b}\) has continuous \({\text {Re}}(\tau )\)- and \({\text {Im}}(\tau )\)-derivatives of all orders, and by Lemma 11.1, \(\varSigma \) has continuous derivatives of all orders mapping \(\mathbb {C}\times \mathbb {R}\times \mathcal {X}_1\) to \(\mathbb {C}\times \mathbb {R}\times \mathcal {Y}_1\). In particular, the \({\text {Re}}(\tau )\)- and \({\text {Im}}(\tau )\)-derivatives of \(\varepsilon '(\tau ,b,v_{\tau ,b})\) remain integrable, so we conclude that \(\mathcal {E}(\tau ,b,u_{\tau ,b})\) (and hence \(E (\tau ,b,U_{\tau ,b})\)) is continuously differentiable of all orders in \(\text {Re}(\tau )\) and \(\text {Im}(\tau )\).
\(\square \)
Theorem 11.4
When \(M_Z < M_H\), the minimizers \(\tau _b\) of \(E (\tau , b, U_{\tau ,b})\) are related to the maximizers \(\tau _*\) of \(\eta _{m_z, m_h}(\tau )\) as \(\tau _b-\tau _* = \mathcal {O}(|1-\frac{M_W^2}{eb}|^{\frac{1}{2}})\). In particular, \(\tau _b \rightarrow \tau _*\) as \(b \rightarrow b_* = M_W^2/e\).
Proof
The minimizers (\(\tau _b\)) of \(E (\tau ,b,U_{\tau ,b})\) are equivalent to the minimizers of the energy functional \(\tilde{E} (\tau ,U_{\tau ,b}):= \omega ^{-2}(E (\tau ,b,U_{\tau ,b})-\frac{1}{2} b^2)\). By Theorem 10.1, we have
where, recall, \(\omega =1-\frac{M_W^2}{eb}\). Since \(\partial _{\tau }\tilde{E} (\tau ,U_{\tau ,b})|_{\tau =\tau _b}=0\), we have the expansion
For both expansions to hold, we must have \(\tau _b-\tau _* = \mathcal {O}(|\omega |^{\frac{1}{2}})\), as required. \(\square \)
The maximizer of \(\eta _{m_z, m_h}(\tau )\), defined in (9.2), was found numerically in [26] with some analytical results in [27]:
Theorem 11.5
([26]) For \(M_Z < M_H\), the function \(\eta _{m_z, m_h}(\tau )\) has a maximum at \(\tau _* =e^{i\pi /3}\).
Theorem 2.3 follows from Theorems 11.4 and 11.5.
Remark 11.6
Using symmetries of \(\eta _{m_z, m_h}(\tau )\), one might be able to prove that its only critical points are \(\tau =e^{i\pi /3}\) and \(\tau =e^{i\pi /2}\), i.e. the hexagonal and square lattices, cf. [3, 30].
Notes
The no-particle sector of the standard model splits into the U(2)-YMH (electroweak) and U(3)-YM (strong, or QCD) parts. Correspondingly, the vacuum of the standard model is the product of the electroweak and strong vacua and the vacuum energy is the sum of the corresponding energies.)
For more discussion of covariant derivatives and their curvatures, see “Appendices A” for the general case, or “Appendix C”, for the case of the gauge group \(G = U(2)\).
Note that the lower indices i, j, k, as in \(A=A_{i} dx^{i}\), refer to vectorial components and run through 1, 2, while the upper indices a, b, c, as in \(V= -\frac{i}{2} \tau _a V^a\), refer to U(2)-algebra components.
A function \(\gamma _s: \mathcal {L}\times \mathbb {R}^2\rightarrow G\) satisfying the co-cycle property (3.17) is called the automorphy exponent and \(e^{i \gamma _s}\), the automorphy factor.
\(\beta \) can be expressed in terms of the Jacobi theta function, see Proposition 5.4 and “Appendix G”.
The condition \(0 < 1-\frac{M_W^2}{eb} \ll 1\) is equivalent to the condition \(0 < 1-\frac{M_W^2}{2\pi }|\mathcal {L}| \ll 1\) of Theorem 2.2.
To check the solutions, one may use that \({\text {curl}}{\text {curl}}^* =-\Delta \).
See the discussion following Proposition 9.1 for details.
More generally, if A is a \(\mathfrak {g}-\)valued \(p-\)form and B is a \(\mathfrak {g}-\)valued \(q-\)form, written as \(A= A^a \otimes \gamma _a\) and \(B= B^b \otimes \gamma _b\), where \(A^a\) and \(B^b\) are \(p-\) and \(q-\)forms and \(\{\gamma _a\}\) is a basis in \(\mathfrak {g}\), then
$$\begin{aligned} {[}A, B]:= (A^a\wedge B^b) \otimes [\gamma _a, \gamma _b] =(-1)^{pq+1}[B, A]. \end{aligned}$$(A.6)Compared with the notation of “Appendix A”, to simplify the notation we omit the tilde over \(\rho _{g}\) in action of the Lie algebra \(\mathfrak {g}\) on V.
References
Abrikosov, A.A.: On the magnetic properties of superconductors of the second group. J. Explt. Theoret. Phys. (USSR) 32, 1147–1182 (1957)
Adams, R.A., Fournier, J.J.F.: Sobolev Spaces. Elsevier, New York (2003)
Aftalion, A., Blanc, X., Nier, F.: Lowest Landau level functional and Bargmann spaces for Bose Einsein condensates. J. Fun. Anal. 241, 661–702 (2006)
Ahlfors, L.: Complex Analysis. McGraw-Hill, New York (1966)
Ambjorn, J., Olesen, P.: Anti-screening of large magnetic fields by vector bosons. Phys. Lett. B 214, 565–569 (1988)
Ambjorn, J., Olesen, P.: A magnetic condensate solution of the classical electroweak theory. Phys. Lett. B218, 67 (1989), Erratum: Phys. Lett. B220 (1989) 659
Ambjorn, J., Olesen, P.: On electroweak magnetism. Nucl. Phys. B 315, 606–614 (1989)
Ambjorn, J., Olesen, P.: A condensate solution of the electroweak theory which interpolates between the broken and the symmetric phase. Nucl. Phys. B 330, 193–204 (1990)
Ambjorn, J., Olesen, P.: Electroweak magnetism: theory and application. Int. J. Mod. Phys. A 5, 4525–4558 (1990)
Ambjorn, J., Olesen, P.: W condensate formation in high-energy collisions. Phys. Lett. B 257, 201–206 (1991)
Ambjorn, J., Olesen, P.: Electroweak magnetism, W condensation and antiscreening, arXiv preprint hep-ph/9304220
Andersen, J.O., et al.: Phase diagram of QCD in a magnetic field: a review. Rev. Mod. Phys. 88, 025001 (2016)
ATLAS, CMS Collaborations: Combined measurement of the Higgs boson mass in \(pp\) collisions at \(s=\sqrt{7}\) and \(8 TeV\) with the ATLAS and CMS experiments. Phys. Rev. Lett. 114, 191803 (2015)
Bach, V., Breteaux, S., Chen, T., Fröhlich, J., Sigal, I.M.: The time-dependent Hartree–Fock–Bogoliubov equations for bosons. arXiv:1602.05171v2
Benedikter, N., Porta, M., Schlein, B.: Effective Evolution Equations from Quantum Dynamics. Springer, New York (2016)
Berger, M.S.: Nonlinearity & Functional Analysis: Lectures on Nonlinear Problems in Mathematical Analysis. Academic Press, Inc., Cambridge (1977)
Beringer, J., et al.: (Particle Data Group), Review of particle physics. Phys. Rev. D 86, 010001 (2012)
Chenn, I., Smyrnelis, P., Sigal, I.M.: On Abrikosov lattice solutions of the Ginzburg–Landau equations. Math. Phys. Anal. Geom. 21, 7 (2018)
Chernodub, M.N., Van Doorsselaere, Jos, Verschelde, Henri: Magnetic-field-induced superconductivity and superfluidity of W and Z bosons: in tandem transport and kaleidoscopic vortex states. Phys. Rev. D 88, 065006 (2013)
Chouchkov, D., Ercolani, N.M., Rayan, S., Sigal, I.M.: Ginzburg–Landau equations on Riemann surfaces of higher genus. Ann. Inst. Henri Poincaré C Anal. non linéaire 37(1), 79–103 (2020)
Dubrovin, B.A., Fomenko, A.T., Novikov, S.P.: Modern Geometry—Methods and Applications: Part I: The Geometry of Surfaces. Springer, Transformation Groups and Fields (1984)
Glashow, S.L.: Partial symmetries of weak Interactions. Nucl. Phys. 22, 579–588 (1961)
Gustafson, S.J., Sigal, I.M.: Mathematical Concepts of Quantum Mechanics, 3rd edn. Springer, New York (2020)
Jaffe, A., Taubes, C.: Vortices and Monopoles: Structure of Static Gauge Theories. Progress in Physics 2. Birkhäuser, Boston, Basel, Stuttgart (1980)
Jost, J.: Riemannian Geometry and Geometric Analysis, 2nd edn. Springer, New York (2011)
MacDowell, S.W., Törnkvist, O.: Structure of the ground state of the electroweak gauge theory in a strong magnetic field. Phys. Rev D 45(10), 3833 (1992)
MacDowell, S.W.: Existence of lattice structures in a class of magnetic phase transitions. Nucl. Phys. B 398, 516–530 (1993)
Miransky, V.A., et al.: Quantum field theory in a magnetic field: from quantum chromodynamics to graphene and Dirac semimetals. Phys. Rep. 576, 1–209 (2015). arXiv:1503.00732
Nielsen, N.K., Olesen, P.: An unstable Yang–Mills field mode. Nucl. Phys. B 144, 376–396 (1978)
Nonnenmacher, S., Voros, A.: Chaotic eigenfunctions in phase space. J. Stat. Phys. 92, 431–518 (1998)
Odeh, F.: Existence and bifurcation theorems for the Ginzburg–Landau equations. J. Math. Phys. 8, 2351–2356 (1967)
Olesen, P.: Anti-screening ferromagnetic superconductivity, arXiv preprint arXiv:1311.4519
Rajaratnam, K., Sigal, I.M.: Vortex lattice solutions of the ZHK Chern–Simons equations. Nonlinearity 33, 5246 (2020). arXiv:1910.09689v1
Rubakov, V.: Classical Theory of Gauge Fields. Princeton University Press, Princeton (2002)
Salam, A.: In: Svartholm, N. (ed.) Elementary Particle Theory, pp. 367–377. Almqvist and Wiskell, Stockholm (1968)
Schwarz, A.S.: Quantum Field Theory and Topology. Springer, New York (2010)
Sigal, I.M.: Magnetic vortices, Abrikosov lattices and automorphic functions. In: Mathematical and Computational Modelling (With Applications in Natural and Social Sciences, Engineering, and Arts). Wiley, New York (2014)
Sigal, I.M., Tzaneteas, T.: Stability of Abrikosov lattices under gauge-periodic perturbations. Nonlinearity 25, 1–24 (2012)
Sigal, I.M., Tzaneteas, T.: Abrikosov lattices at weak magnetic fields. J. Funct. Anal. 263(3), 675–702 (2012)
Sigal, I.M., Tzaneteas, T.: On stability of Abrikosov lattices. Adv. Math. 326, 108–199 (2016)
Skalozub, V.V.: Abrikosov lattice in the theory of electroweak interactions. Sov. J. Nucl. Phys. 43, 665–669 (1986)
Skalozub, V.V.: The structure of the vacuum in the Weinberg Salam theory. Sov. J. Nucl. Phys. 45, 1058–1064 (1987)
Spruck, J., Yang, Y.: On multivortices in the electroweak theory I: existence of periodic solutions. Commun. Math. Phys. 144, 1–16 (1992)
Spruck, J., Yang, Y.: On multivortices in the electroweak theory II: existence of Bogomol’nyi solutions in \(\mathbb{R}^2\). Commun. Math. Phys. 144, 215–234 (1992)
Takác̆, P.: Bifurcations and vortex formations in the Ginzburg–Landau equations. Z. Angew. Math. Mech. 81, 523–539 (2001)
Tzaneteas, T., Sigal, I.M.: Abrikosov lattice solutions of the Ginzburg–Landau equations. Contemp. Math. 535, 195–213 (2011)
Weinberg, S.: A Model of Leptons. Phys. Rev. Lett. 19(21), 1264–66 (1967)
Acknowledgements
The second author is grateful to Nicholas Ercolani, Jürg Fröhlich, Gian Michele Graf and Stephan Teufel for many instructive and stimulating discussions of the YMH equations. Both authors thank the anonymous referee for many constructive remarks.
Funding
The research on this paper is supported in part by NSERC Grant No. 2017-06588.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Communicated by Claude-Alain Pillet
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
To Nicholas Ercolani and Gian Michele Graf, scientists and friends.
Appendices
Appendix A: Covariant Derivatives and Curvature
In this appendix, we briefly review some basic definitions from gauge theory. For some geometrical background, see [21, 25, 34]. Recall that we use the Einstein convention of summing over repeated indices.
Let V be an inner product vector space, G a Lie group acting transitively on V via a unitary representation \(\rho : g \mapsto \rho _g\), and let \(\mathfrak {g}\) be the Lie algebra of G acting on V via the representation \({\tilde{\rho }}: A \mapsto {\tilde{\rho }}_A\) induced by \(\rho \).
To simplify notation below, we take \(V=\mathbb {C}^m\) and G a matrix group, acting on V by matrix rules (and similarly for \(\mathfrak {g}\)) and write \(\rho _g\Psi = g \Psi \) and \({\tilde{\rho }}_A\Psi = A \Psi \). Moreover, we assume that G is either U(m) or a Lie subgroup of U(m).
Let M be an open subset in a finite-dimensional vector space, with a metric h and local coordinates \(\{x^i\}\), and let \(\partial _i \equiv \partial _{x^i}\).
For a \({\mathfrak {g}}\)-valued connection (one-form) \(A \equiv A_i dx^i\) on M, we define the covariant derivatives:
- \(\nabla _{A}\), mapping functions (sections), \(\Psi : M\rightarrow V\), into \(\mathfrak {g}\)-valued one-forms, as
- \(d_{A}\), mapping \(\mathfrak {g}\)-valued functions (0-forms) f into \(\mathfrak {g}\)-valued one-forms
- \(d_{A}\), mapping \(\mathfrak {g}\)-valued one-forms into \(\mathfrak {g}\)-valued two-forms
with [A, B] defined in local coordinates \(\{x^i\}\) as
for \(A= A_i dx^i\) and \(B=B_j dx^j\).Footnote 12
The curvature form of the connection A is the \(\mathfrak {g}\)-valued two-form given by the formula
It is related to the curvature operator (denoted by the same symbol) \(F_{A}:= d_A \circ d_A\). As a simple computation shows, this operator is a matrix-multiplication operator given by the matrix-valued 2-form (A.6).
Let U be a vector space (V or \(\mathfrak {g}\) in our case) and let \(\Omega ^p_U\equiv U\otimes \Omega ^p\) denote the space of U-valued p-forms. On \(\Omega ^p_U\), one defined the inner product, \(\langle \cdot , \cdot \rangle _{\Omega ^p_U}\equiv \langle \cdot , \cdot \rangle _{\Omega ^p_U}^h\) as
where \(A=A_{\alpha } dx^{\alpha }\) and \(B=B_{\alpha } dx^{\alpha }\) are U -valued p-forms, \(\alpha \) is a p-form index and \(\langle \cdot , \cdot \rangle _U\) is the inner product on U. Here the indices are raised and lowered with help of the inner product h on M.
Above, we did not display the coupling constants. Doing so would change the covariant derivative to \(d_{A} \Psi =(d + g A) \Psi \), if G is simple. If G is not simple, then each simple component of G gets its own coupling constant, as was done in the main text for \(G=SU(2)\times U(1)\) (see also (C.2)–(C.6) below).
Appendix B: The Time-Dependent Yang–Mills–Higgs System
In this appendix, we briefly review the Yang–Mills–Higgs theory, including the derivation of the energy functional (2.7). In what follows, we use the convention of raising or lowering an index by contracting a tensor T with the metric tensor:
where \(\eta \) is the Minkowski metric of signature \((+,-,...,-)\) on \(M\subset \mathbb {R}^{d+1}\) and \(\alpha , \beta \) are multi-indices. The same equations could be reinterpreted as stationary equations by taking the Euclidean metric \(\delta _{i j}\), instead of \( \eta _{ij}\), and letting the indices range over \(1, \dots , d\), rather than \(1, \dots , d+1\). In this case, \(T^{\alpha }_{i,\beta } = T^{i,\alpha }_{\beta }\).
Lagrangian. Let \(\Omega \) be a bounded domain in \(\mathbb {R}^{d}\) and \(M=\Omega \times [0, T]\subset \mathbb {R}^{d+1}\) be spacetime equipped with the Minkowski metric \(\eta \) of signature \((+,-,...,-)\) and V and G be as in “Appendix A”. The theory involves a Higgs field \(\Psi : M\rightarrow V\) interacting with the gauge field A, a connection (one-form) on M with values in the algebra \({\mathfrak {g}}\). The dynamics are given by the Lagrangian
with corresponding action \(\mathcal {S}:= \int _0^T \mathcal {L}(\Psi , A) dt\), \(T>0\), given explicitly by
where \(U: V\rightarrow \mathbb {R}^+\) is a self-interaction potential, which is assumed to be gauge invariant: \( U(\rho _g \Psi )= U(\Psi )\). Typical examples of G, V and \(U(\Psi )\) are \(U(m), \mathbb {C}^m\) and \(U(\Psi )=\frac{1}{2} \lambda (1 - \Vert \Psi \Vert ^2_V)^2\).
Euler–Lagrange equations.The Euler–Lagrange equations (called Yang–Mills–Higgs equations) for the fields \(\Psi \) and A are
where \(\nabla _{A}^{*_\eta }\) and \(d_{A}^{*_\eta }\) are the adjoints of \(\nabla _{A}\) and \(d_{A}\) in the appropriate inner products involving the metric \(\eta \) and \(J(\Psi , A)\) is the YMH current given by
where \(\gamma _a\) is an orthonormal basis of \({\mathfrak {g}}\) and \(\nabla _i:= \partial _i + {A_i}\), with \(\partial _i\equiv \partial _{x^i}\), so that \(\nabla _{A} \Psi =\nabla _{i} \Psi dx^i \). (B.5) is the Yang–Mills equation.
Proof of (B.4) - (B.5)
For convenience, we assume periodic or Dirichlet boundary conditions and that \(\Psi \) and A are T -periodic in t and calculate the Gâteaux derivatives formally.
Recall that \(\delta _\#\) denotes the partial (real) Gâteaux derivative with respect to \(\#\). First we calculate the (complex) Gâteaux derivative of (B.3) in the \(\Psi \)-direction. Define \(\partial _z \equiv \frac{1}{2} (\partial _{{\text {Re}}z} - i\partial _{{\text {Im}}z})\) and \(\delta _{\Psi } \equiv \frac{1}{2} (\delta _{{\text {Re}}\Psi } - i \delta _{{\text {Im}}\Psi })\). Then \(\delta _{\Psi }\mathcal {S}(\Psi ,A)\Psi ' = \partial _z\mathcal {S}(\Psi _z,A)|_{z=0}\), where \(\Psi _z = \Psi + z\Psi '\), \(z\in \mathbb {C}\). Using this, we find
Integrating the first term by parts and factoring out \(\Psi '\) gives
For this derivative to be zero for every variation \(\Psi '\), (B.4) must hold.
Next we calculate the Gâteaux derivative of (B.3) in the A-direction. Using the definition \(\delta _A f(A)B = \partial _s f(A_s)|_{s=0}\), where \(A_s = A + s A'\), \(s\in \mathbb {R}\), we find
Writing \(B = B^a\gamma _a = B_i^a dx^i \otimes \gamma _a\) (with \(B_i^a\) real) and \(\nabla _{A} \Psi =\nabla _{i} \Psi dx^i \), so that
and using that \(B^a C^a=-{\text {Tr}}[(B^c \gamma _c) (C^a \gamma _a)]\) (since \({\text {Tr}}(\gamma _c^*\gamma _a)=-{\text {Tr}}(\gamma _c\gamma _a)=\delta _{c a}\)), gives
which gives \(I= \int _{M} \langle B, J(\Psi ,A) \rangle _{\Omega ^1_\mathfrak {g}}\). For the second term on the r.h.s. of (B.10), integrating by parts yields \(II= \int _{M} \langle B, d_A^* F_A \rangle _{\Omega ^1_\mathfrak {g}}\). Collecting the last two equations gives
For this derivative to be zero for every variation B, (B.5) must hold. \(\square \)
Conserved energy. Again, the Gâteaux derivative calculations in the following subsection are formal. Recall that \(M:=\Omega \times [0, T]\subset \mathbb {R}^{d+1}\).
To find the expression for the energy, we use, as in classical mechanics, the (partial, i.e. without passing to the momentum fields) Legendre transform of (B.2) is given by
Proposition B.1
The (partial) Legendre transform (B.15) of Lagrangian (B.2) yields the conserved energy
where the norms are taken using the Euclidean metric on \(\mathbb {R}^{d+1}\) (rather than the Minkowski metric).
Note that for static (time-independent) fields, \(E(\Psi , A) = -\mathcal {L}(\Psi , A)\).
Proof
Let \(\partial _\#\) denote the partial derivative with respect to the symbol \(\#\), and recall that \(\delta _\#\) denotes the partial (real) Gâteaux derivative with respect to \(\#\). We calculate
and
(B.16) results.
It remains to show that (B.16) is conserved by the YMH equations (B.4)–(B.5). This can be done by using the (partial) Legendre transform (B.15) as in classical mechanics, or by a direct computation. We proceed in the second way. Applying the chain rule gives
where, recall, \(\partial _i\equiv \partial _{x^i}\). We now calculate the first term using (B.4).
Integrating the second term by parts gives
By (B.4), we have
so (B.21) becomes
Here \(\nabla _0^* = -\partial _0 + A_0^\dagger = -\partial _0 - A_0\), where the second equality follows because the representation of \(\mathfrak {g}\) is unitary. Therefore,
Similarly,
and so
where \(J_0(\Psi ,A)\) is the time component of the YMH current (B.6).
One may show using (B.5) that
Hence, by (B.19) we have \(\frac{d}{dt} E(\Psi ,A) = 0\), as required. \(\square \)
Gauge symmetries. We define the local action, \( \rho _{g} A\),Footnote 13 of the group G on A, by the equation \(d_{ \rho _{g} A}=g d_{A}g^{-1}\), for all \(g \in C^1(N, G)\), where N is either M or \(\Omega \). We compute
Proposition B.2
The Lagrangian (B.2) is invariant under the Poincaré group and the gauge transformations
Proof
The invariance under the Poincaré group follows from the definition of this group and the choice of the Minkowski metric on \(M\subset \mathbb {R}^{d+1}\).
For the gauge invariance, recall that \(U(\Psi )\) is \(\mathfrak {g}\)-invariant, and that the representations \(g \mapsto \rho _g\) (on V) and the adjoint representation \(g\mapsto \text {ad}_g\) (on \(\mathfrak {g}\)) are unitary. Therefore, to prove invariance under the gauge transformation (B.29), it suffices to show that
We shall use the equation
which follows from \(d(hh^{-1}) = 0\). For (B.30) we compute
Since \( g {dg^{-1}g}=- g {g^{-1}dg} = -dg\), this gives \(\nabla _{ \rho _g A} g\Psi = g \nabla _A \Psi \).
For (B.31), computing in coordinates \(\{x^i\}\) and writting \(F_{\rho _g A}:= (F_{\rho _g A})_{ij} dx^i\wedge dx^j\) and \(F_A:= (F_A)_{ij} dx^i\wedge dx^j\), we find
where, recall, \(\partial _i\equiv \partial _{x^i}\). Expanding the partial derivative and commutators gives
Expanding the product on the second line gives
Cancelling terms symmetrical in i and j and simplifying gives
as required. \(\square \)
Specifying (B.16) to the WS model gives (2.7).
The YMH equations in coordinate form. In coordinate form, the differential form (gauge field) entering the YMH Lagrangian (B.2) is written as \(A =A_{i} dx^i \). The local coordinate expression for the curvature is \(F_A = F_{ij} dx^i \wedge dx^j\), where \(F_{i j}:= \frac{1}{2}(\partial _i A_j- \partial _j A_i) +\frac{1}{2}[A_i, A_j]\). Furthermore, for the covariant derivatives \(\nabla _A\) and \(d_A\), we have \(\nabla _{A} \Psi =\nabla _{i} \Psi dx^i \) and \(d_{A}^* F_A=-\nabla ^{i} F_{i j} dx^j \), where \(\nabla _{i} \Psi := (\partial _i + A_{i}) \Psi \) and \(\nabla ^{i} F_{i j}:= \partial ^i F_{i j} + [A^i, F_{i j}]\).
For an arbitrary \(\mathfrak {g}\)-valued one-form \(B = B_i dx^i\), we have \(d_{A} B =\nabla _{i} B_j dx^i \wedge dx^j\) and \(d_{A}^* B =-\nabla ^{i} B_i\), where
We write \(F_{ij} = F_{ij}^a \gamma _a\) for an orthonormal basis \(\gamma _a\) of \(\mathfrak {g}\) and the lower case roman indices run over the spatial components \(1, 2, \dots , d\). Note that \(F_{i j}=[\nabla _{i}, \nabla _{j}]\), but \(F_{ij} \ne \frac{1}{2} (\nabla _i A_j - \nabla _j A_i)\).
Let \(\Omega \) be either a bounded domain in \(\mathbb {R}^d\) or \(\mathbb {R}^{d+1}\). In the former case, we assume either periodic or Dirichlet boundary conditions.
Proposition B.3
The Lagrangian and energy for the YMH model are given in coordinates by
(with different ranges of indices as mentioned above). The YMH equations are given in coordinates by:
Proof
Equations (B.41) and (B.42) follow from the coordinate expressions \(d_{A} \Psi =\nabla _k \Psi dx^k\) and \(F_A = F_{ij}^a \gamma _a\otimes dx^i \wedge dx^j\), together with the fact that \(dx^k\) and \(\gamma _a\otimes dx^i\wedge dx^j\) form orthonormal bases for \(\Omega ^1\) and \(\Omega ^2_\mathfrak {g}\), respectively.
Equations (B.43)–(B.44) follow from Eqs. (B.4)–(B.6) and the coordinate expressions for \(d_A\) and \(d_A^*\) above. \(\square \)
Appendix C: The WS Equations in Coordinate Form
For the gauge group \(G=U(2)=SU(2)\times U(1)\), we choose the standard inner product
on \(\mathfrak {u}(2)\), for which \(-\frac{i}{2}\tau _a\), \(a=0,1,2,3\) (where \(\tau _a\), \(a=1,2,3\), are the Pauli matrices together with \(\tau _0:= \textbf{1}\)) form an orthonormal basis. It is customary to factor out the coefficient of \(-\frac{i}{2}\). In coordinates, we write
with \(Q_i(x)\), \( Q_{ij}(x)\in i\mathfrak {u}(2)\). Using Eq. (2.3), we compute \(Q_{ij} = \frac{1}{2} (\partial _i Q_j - \partial _j Q_i) - \frac{i}{4} [Q_i,Q_j]\). Furthermore, we write \(Q=V+X\) and
with \(V_i(x)\in i\mathfrak {su}(2)\) and \(X_i(x)\in i\mathfrak {u}(1)\). Then \(Q_{i j}=V_{i j} + X_{i j} \) and
We specify Eqs. (B.41)–(B.44) for to the Weinberg–Salam (WS) model, which has the gauge group \(G=U(2)=SU(2)\times U(1)\). As was mentioned in “Appendix A”, in this case, there is a slight discrepancy in the definition of the covariant derivative due to the fact that U(2) is not simple, but a (semi-)direct product of the simple group SU(2) and U(1), with each component having a coupling constant, see (C.2)–(C.6).
Using Eqs (C.2)–(C.6), we express the Lagrangian and the energy in coordinates as
(with indices ranging from 0 to d and 1 to d, respectively, as mentioned above), and the Euler–Lagrange equations are written in coordinates as
Equations (C.8)–(C.10) can be expressed in terms of the W, Z, Higgs and electromagnetic fields resulting in 2D Eqs. (3.10)–(3.15), see “Appendix D.2”.
Appendix D: The Weinberg–Salam Energy in Terms of the Fields W, A, Z and \(\varphi \)
1.1 Appendix D.1: Dimension 3
We work in a fixed coordinate system, \(\{x^i\}_{i=1}^3\) and write the fields as \(W = W_{i} dx^i,\) \( Z = -\frac{i}{2} Z_{i} dx^i\) and \(A = -\frac{i}{2} A_{i} dx^i\). We show
Proposition D.1
Energy (2.7), written in terms of the fields W, A, Z and \(\varphi \) and coordinates \(\{x^i\}_{i=1}^3\), is given by (see also [43]):
where \(W_{i j}:= \nabla _{i} W_{j} -\nabla _{j}W_{i}\), with \(\nabla _{k}:= \partial _k-i g V^3_k, \partial _k\equiv \partial _{x^k}\), \(Z_{i j}:= \partial _{i} Z_{j} -\partial _{j}Z_{i}\), \(A_{i j}:= \partial _{i} A_{j} -\partial _{j}A_{i}\) and T(W, A, Z) is the sum of super-quadratic terms,
where \(V^3:= Z\cos \theta + A\sin \theta \) and \(V^3_{ij}:= \partial _i V_j - \partial _j V_i\), with the important property that T(W, A, Z) is invariant under the gauge transformation (3.7).
Proof of (D.1)
We proceed by rewriting the terms in the coordinate expression of the WS energy (C.8),
in terms of the fields \(W = W_{i} dx^i\), \(Z = -\frac{i}{2} Z_{i} dx^i\), \(A = -\frac{i}{2} A_{i} dx^i\) and \(\varphi \).
For the first term, first we calculate \(\nabla _i\Phi \). Recall the definition \(\nabla _i\Phi := (\partial _i - \frac{ig}{2}V_i - \frac{ig'}{2} X_i)\Phi \). We simplify the matrix representing the connection’s action on \(\Phi \):
In terms of the fields Z, A and W (see Eqs. (3.5)–(3.6) for the definitions of these fields), (D.3) becomes
Hence, for \(\Phi = (0, \varphi )\),
Therefore, the first term of (C.8), written in terms of the fields W, A, Z and \(\varphi \), becomes
The second term of (C.8) becomes
For the third term of (C.8), we will use the fact that \({\text {Tr}}Q_{i j} Q^{i j} = {\text {Tr}}V_{i j} V^{i j} + {\text {Tr}}X_{i j} X^{i j}\), where \(V_{i j}\) and \(X_{i j}\) are defined in (C.5) and (C.6). Furthermore, we have
We recall \(V_{ij}^3 = \partial _i V_j^3 - \partial _j V_i^3\) and \(W_{ij}^0 = \partial _i W_j - \partial _j W_i\) and calculate
and, with \(K_{ij}:= V^3_i W_i - V^3_j W_i\),
Adding (D.9) and (D.10), using that \(W_{ij}=W_{ij}^0+K_{ij}\) and denoting \(L_{ij}:=V^3_{ij} - ig (W_i{\overline{W}}_j - {\overline{W}}_i W_j)\) gives
Since \(V_{ij}\) and \(X_{ij}\) are Hermitian, \({\text {Tr}}V_{ij}V^{ij}\) and \({\text {Tr}}X_{ij}X^{ij}\) are the sum of the squared absolute values of the matrix coefficients of \(V_{ij}\) and \(X_{ij}\), respectively. Thus
Using \(L_{ij} = V^3_{ij} - ig (W_i{\overline{W}}_j - {\overline{W}}_i W_j)\) and expanding the first term gives
Recall that \(A_{ij} = V^3_{ij}\sin \theta + X_{ij}\cos \theta \) and \(Z_{ij} = V^3_{ij}\cos \theta - X_{ij}\sin \theta \). Writing the first line of (D.13) in terms of these fields gives
Expanding the first term of the second line, and using \(V_{ij}^3 = -V^3_{ij}\) in the second term, (D.14) becomes
Recalling the definition (D.2) of T(W, A, Z) gives
Adding (D.6), (D.7) and (D.16) gives (D.1). \(\square \)
1.2 Appendix D.2: Dimension 2: Proof of (3.10)
Proof of (3.10)
Now, we consider the Weinberg–Salam (WS) model in \(\mathbb {R}^2\) with fields independent of the third dimension \(x_3\), and correspondingly choose the gauge with \(V_3 = X_3=0\) (and hence \(W_3 = A_3=Z_3=0\)). In this case the summation in (D.1) contains only two terms, \((ij)=(12)\) and \((ij)=(21)\), and we use this to simplify (D.1).
We proceed by simplifying the terms of (D.2) and the first line of (D.1); the remaining terms are unchanged.
Replacing corresponding terms in (D.1)–(D.2) with (D.17)–(D.19) proves (3.10).
\(\square \)
Proof of (3.12) - (3.15)
We proceed by calculating the (complex) Gâteaux derivatives of (3.10).
Let \(\delta _\#\) denote the partial (real) Gâteaux derivative with respect to \(\#\). Let \(W_z = W + z W'\), \(z\in \mathbb {C}\), and define \(\partial _{{\overline{z}}}\equiv \frac{1}{2}(\partial _{{\text {Re}}z} + i\partial _{{\text {Im}}z})\) and \(\delta _{{\overline{W}}}\equiv \frac{1}{2}(\delta _{{\text {Re}}W} + i\delta _{{\text {Im}}W})\). Then
Integrating the first term by parts and factoring out W and \({\overline{W}}'\) gives
For the derivative to be zero for every variation \(W'\), (3.12) must hold.
Let \(A_s = A + s A'\), \(s\in \mathbb {R}\). Then
Using \(A'\times W = -JW\cdot A'\) in the first two terms, and integrating the last two terms by parts, gives
which simplifies to
For the derivative to be zero for every variation \(A'\), (3.13) must hold.
The proof of (3.14) is essentially the same as the proof of (3.13), so we omit it.
Let \(\varphi _s = \varphi + s \varphi '\), \(s\in \mathbb {R}\). Then
Integrating the third term by parts and factoring out \(2\varphi '\) gives
For the derivative to be zero for every variation \(\varphi '\), (3.15) must hold. \(\square \)
Appendix E: Proof of (9.11)
In the proof below, we will use the following result:
Lemma E.1
Let \(L^2_{per}\) denote any of the spaces (5.9)–(5.11), and let \(\mathcal {H}^2_{per}\) denote the corresponding Sobolev space. Suppose that \(f_s,g_s:\mathbb {R}\rightarrow \mathcal {H}^2_{per}\) satisfy \(||f_s||_{\mathcal {H}^2_{per}} = \mathcal {O}(|s|^k)\) and \(||g_s||_{\mathcal {H}^2_{per}} = \mathcal {O}(|s|^l)\) for some \(k,l\in \mathbb {Z}\). Then for \(i,j=1,2\) and \(p,q=0,1\),
Furthermore, if \(f_s\) and \(g_s\) have continuous derivatives of all orders in s, then so does the above integral.
Proof
Equation (E.1) follows from the following chain of inequalities:
If \(f_s\) and \(g_s\) have continuous derivatives of all orders in s, then their s-derivatives of all orders are in \(\mathcal {H}^2_{per}\). In particular, this means that \(\partial _s^k (f_s g_s)\), \(k\in \mathbb {Z}_{\ge 0}\), remains integrable, so the \(s-\)derivatives of the above integral (obtained by differentiation under the integral sign) are well-defined. \(\square \)
Proof of (9.11)
To prove (9.11), we use the w-field Eq. (4.3), and \(\nu _s:= g(a_s\sin \theta + z_s\cos \theta )\), to get
We shall calculate each term of the integral (E.3) up to order \(s^3\) using Lemma E.1 and the Taylor expansions (9.7).
Integrating the first term of (E.3) by parts gives
Plugging in the Taylor expansions (9.7) gives
where, recall, \(\nu ':= g(a'\sin \theta + z'\cos \theta )\). Recall from Equation (5.24) that \({\text {curl}}_{a^n}\chi = 0\). Therefore, applying Lemma E.1 gives
Plugging the Taylor expansions (9.7) into the second term of (E.3) gives
Expanding this product and applying Lemma E.1 gives
Recall that \(\chi \in {\text {Null}}(H_1(n))\) and that \(w'\) is orthogonal to \(\text {Null}\ H_1(n)\). Therefore the third term vanishes:
Plugging the Taylor expansions (9.7) into the third term of (E.3) gives
Recall from Eq. (5.24) that \(\chi \) is of the form \(\chi = (\omega , i\omega )^T\), so
\({\overline{\chi }}\cdot J\chi = -i|\chi |^2\) and \({\overline{\chi }}\cdot J w' = -i{\overline{\chi }}\cdot w'\). Therefore (E.10) simplifies to
Expanding this product and applying Lemma E.1 gives
Recall that \(\chi \in {\text {Null}}(H_1(n))\) and that \(w'\) is orthogonal to \(\text {Null}\ H_1(n)\). Therefore the third term vanishes:
Using \({\overline{\chi }}\cdot Jw_s = -{\overline{\chi }}\times w_s\), the fourth term of (E.3) becomes
Plugging in the Taylor expansions (9.7) gives
Recall from Eq. (5.24) that \(\chi \) is of the form \(\chi = (\omega , i\omega )\), so \({\overline{\chi }}\times \chi = i|\chi |^2\). This fact and Lemma E.1 gives
The \(s^3\) terms of (E.6), (E.9), (E.13) and (E.16) must sum to 0, and so (9.11) results. \(\square \)
Appendix F: Proof of (10.3)
Proof of (10.3)
We shall calculate each term in the integral (4.8) up to order \(s^6\) using Lemma E.1 and the Taylor expansions (9.7).
Plugging the Taylor expansions (9.7) into the first term of (4.8) gives
Recall from Eq. (5.24) that \({\text {curl}}_{a^n}\chi = 0\). Therefore, applying Lemma E.1 gives
Plugging the Taylor expansions (9.7) into the second term of (4.8) gives
Expanding the square and applying Lemma E.1 gives
Plugging the Taylor expansions (9.7) into the third term of (4.8) gives
Recall that \({\text {curl}}a^n = n\). Expanding the square gives
The second and third terms vanish because \(a'\) and \(a''\) are \(\mathcal {L}'\)-periodic. Therefore, applying Lemma E.1 gives
Plugging the Taylor expansions (9.7) into the fourth term of (4.8) gives
Expanding the square terms gives
Expanding this product and applying Lemma E.1 gives
Recall that \(\chi \in {\text {Null}}(H_1(n))\) and that \( w'\) is orthogonal to \({\text {Null}}(H_1(n))\). Therefore the third term vanishes:
Plugging the Taylor expansions (9.7) into the fifth term of (4.8) and expanding the square terms gives
Expanding this product and applying Lemma E.1 gives
Plugging the Taylor expansions (9.7) into the sixth term of (4.8) gives
Recall from Eq. (5.24) that \(\chi \) is of the form \(\chi = (\omega ,i\omega )\), so \({\overline{\chi }}\times \chi = i|\chi |^2\). Therefore, applying Lemma E.1 gives
Plugging the Taylor expansions (9.7) into the seventh term of (4.8) gives
where, recall, \(\nu ':= g(a'\sin \theta + z'\cos \theta )\). Recall that \({\text {curl}}a^n = n\) and \(e = g\sin \theta \). Expanding the wedge product of the second and third terms gives
Recall from Eq. (5.24) that \(\chi \) is of the form \(\chi = (\omega ,i\omega )\), so \({\overline{\chi }}\times \chi = i|\chi |^2\) and \({\overline{\chi }}\times w' = i{\overline{\chi }}\cdot w'\). Therefore
Expanding this product and using Lemma E.1 gives
Recall that \(\chi \in {\text {Null}}(H_1(n))\) and \( w'\) is orthogonal to \({\text {Null}}(H_1(n))\). Therefore the second term vanishes:
Plugging the Taylor expansions (9.7) into the eigth term of (4.8) gives
Expanding the square and using Lemma E.1 gives
Plugging the Taylor expansions (9.7) into the ninth term of (4.8) and expanding the inner squares gives
Expanding the outer square gives and using Lemma E.1 gives
Adding (F.2)–(F.24) and dividing by \(|\Omega '|\) gives (10.3), where \(R_\varepsilon \) collects the \(\mathcal {O}(|s|^6)\) remainder terms. \(R_\varepsilon \) has continuous derivatives of all orders because it is a sum of integrals of the form (E.1) with \(f_s\) and \(g_s\) coming from the continuously differentiable remainder terms \(\mathcal {O}(|s|^p)\) of (9.7). \(\square \)
Appendix G: Spectral Analysis of the Operator \(-\Delta _{a^n}\)
Recall from the main text, but in vector notation, that \(a^n:= \frac{n}{2}x^\perp \), where \((x^1, x^2)^\perp = (-x^2, x^1)\), \(\nabla _{q}:= \nabla - iq = (\nabla _1, \nabla _2)\), \(\nabla _j:= \partial _j - iq_j, \partial _j\equiv \partial _{x^j}\), and \(\Delta _q:=\nabla _{q}^2=-\nabla _{q}^*\nabla _{q}\). The next proof follows Section 5 of [18].
Proof of Proposition 5.4
The self-adjointness of the operator \(-\Delta _{ a^n}\) is well-known. To find its spectrum, we introduce the complexified covariant derivatives (harmonic oscillator annihilation and creation operators), \({\bar{\partial }}_{ a^n} \) and \({\bar{\partial }}_{ a^n}^*=- \partial _{ a^n}\), with
One can redily verify that these operators satisfy the following relations:
As for the harmonic oscillator (see, e.g. [23]), this gives explicit information about the spectrum of \(-\Delta _{ a^n}\), namely (5.26), with each eigenvalue is of the same multiplicity. Furthermore, the above properties imply (5.27).
We find \({\text {Null}}{\bar{\partial }}_{ a^n}\). A simple calculation gives the following operator equation
(The transformation on the left-hand side is highly non-unique.) This immediately proves that
if and only if \(\theta = e^{-\frac{ n}{2}(i x^1 x^2-(x^2)^2)}\psi \) satisfies \((\partial _{x^1} + i\partial _{x^2})\theta = 0\). We now identify \(x \in \mathbb {R}^2\) with \(z = x^1 + ix^2 \in \mathbb {C}\) and see that this means that \(\theta \) is analytic and
where we display the dependence of \(\theta \) on \(\tau \). The quasiperiodicity of \(\psi \) transfers to \(\theta \) as follows:
The first relation ensures that \(\theta \) have a absolutely convergent Fourier expansion of the form \(\theta (z, \tau ) = \sum _{m=-\infty }^{\infty } c_m e^{2\pi m iz}.\) The second relation, on the other hand, leads to relation for the coefficients of the expansion: \( c_{m + n} = e^{-in\pi z} e^{i2m\pi \tau } c_m\), which together with the previous statement implies (5.29).
\(\square \)
Next, we claim that the solution (G.5) satisfies
By (G.5), it suffices to show that \(\theta (z) = \theta (- z)\). We show this for \(n=1\). Denote the corresponding \(\theta \) by \(\theta (z, \tau )\). Iterating the recursive relation for the coefficients in (5.29), we obtain the following standard representation for the theta function
We observe that \(\theta (- z, \tau )=\theta (z, \tau )\) and therefore \(\psi _0 ( - x)=\psi _0 ( x)\). Indeed, using the expression (G.7), we find, after changing m to \(-m'\), we find
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gardner, A., Sigal, I.M. Instability of Electroweak Homogeneous Vacua in Strong Magnetic Fields. Ann. Henri Poincaré (2024). https://doi.org/10.1007/s00023-024-01430-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00023-024-01430-5