1 Introduction

There has been great success proving boundedness results in characteristic zero using the techniques and results of the LMMP. Beyond dimension 2, however, there has not been much progress in positive characteristic. This is perhaps a consequence of the relative newness of the LMMP results in this setting, but it also tells of the existence of difficulties unique to characteristic p.

In this direction, we prove the following.

Theorem 1.1

Fix \(0< \delta , \epsilon <1\). Let \(S_{\delta ,\epsilon }\) be the set of threefolds satisfying the following conditions

  • X is a projective variety over an algebraically closed field of characteristic \(p >7, \frac{2}{\delta }\);

  • X is terminal, rationally chain connected and F-split;

  • \((X,\Delta )\) is \(\epsilon \)-klt and log Calabi-Yau for some boundary \(\Delta \); and

  • The coefficients of \(\Delta \) are greater than \(\delta \).

Then there is a set \(S'_{\delta ,\epsilon }\), bounded over \(\text {Spec}(\mathbb {Z})\) such that any \(X\in S_{\delta ,\epsilon }\) is either birational to a member of \(S'_{\delta ,\epsilon }\) or to some \(X'\in S_{\delta ,\epsilon }\), Fano with Picard number 1.

The constraints on the characteristic of the field are required to control the singularities arising in terminal Mori fibrations. In particular the \(p>7\) requirement ensures that terminal del Pezzo fibrations have generically smooth fibres and the \(p> \frac{2}{\delta }\) is needed to control the singularities appearing in the base of a conic bundle. This in turn allows for lower dimensional boundedness results to be applied.

The condition that X be terminal is to allow us to reduce to the case that X is a terminal Mori fibre space. While we might normally achieve this by taking a terminalisation \(\tilde{X} \rightarrow X\), we cannot do so while also ensuring that the coefficients of \(\tilde{\Delta }\) are still bounded below. In fact while bounding the coefficients below is used to prove a canonical bundle formula for Mori fibre spaces of relative dimension 1 it is in many ways the relative dimension 2 case that forces the assumption X is terminal.

If \((X,\Delta ) \rightarrow S\) is a klt Mori fibre space with coefficients bounded below by \(\frac{2}{p}\) then we may freely take a terminalisation and run an MMP to obtain a tame conic bundle, which is what we require for our boundedness proof. If however the relative dimension is 2 then after taking a terminalisation and running an MMP we may end with a Mori fibration of relative dimension 1, where we cannot easily control the singularities of the base. This happens whenever X is singular along a curve C which maps inseparably onto the base and we expect this is the only way it might happen.

The main motivation for this result comes from [9] where a similar result is proven in the characteristic zero setting. More generally we have the following generalisation of BAB, which essentially appeared in [24] and remains unsolved even in characteristic 0.

Conjecture 1.2

Fix \(\kappa \), an algebraically closed field of characteristic 0, let d be a natural number and take \(\epsilon \) a positive real number. Then the projective varieties X over \(\kappa \) such that

  • X has dimension d;

  • (XB) is \(\epsilon \)-klt for some boundary B;

  • \(-(K_{X}+B)\) is nef; and

  • X is rationally connected;

are bounded.

With the LMMP for klt pairs known in dimension 3 and characteristic \(p>5\), it is natural to turn our attention to results and conjectures of this type in positive and mixed characteristic. There are several major problems one would face in the pursuit of such a result, even in the weaker case of birational boundedness in dimension 3, which do not arise in characteristic zero. Perhaps the most immediate is that X rationally connected no longer removes the possibility that \(K_{X} \not \equiv 0\). For example, in positive characteristic there are families of K3 surfaces which are rationally connected. It is not clear then, even in dimension 2, that such a result would hold.

It is also very difficult to control the singularities of the base, and indeed the fibres, of a Mori fibre space, which makes proofs of an inductive nature very challenging. The failure of Kawamata-Viehweg vanishing presents a similar difficulty.

Unique to positive characteristic, we have singularities characterised by properties of the Frobenius morphism. In particular there are notions of globally F-split and globally F-regular which can be thought of as positive characteristic analogues of lc log Calabi-Yau varieties and klt log Fano varieties. While the exact nature of this analogy is the subject of a variety of results and conjectures, it is expected, and often known, that these varieties should behave similarly to their characteristic zero counterparts.

Most notably, in this context, the F-split and globally F-regular conditions are preserved under the steps of the LMMP including Mori fibrations. In fact the conditions are also preserved under taking a general fibre of a fibration. They also come naturally equipped with vanishing theorems, with globally F-regular pairs satisfying full Kawamata-Viehweg vanishing.

We also have some relevant characterisations of uniruled F-split varieties. If X is smooth it cannot be simultaneously F-split, Calabi-Yau and uniruled. In particular, an F-split, canonical surface cannot be uniruled and have pseudo-effective canonical divisor.

In many ways then, global F-singularities begin to resolve the most obvious difficulties in proving positive characteristic boundedness results. They present their own problems however, there is no satisfactory notion of “\(\epsilon \)-F-split” or “\(\epsilon \)-globally F-regular” which makes it difficult to work solely with these notions in the context of boundedness.

That said, while the F-split and globally F-regular conditions fit naturally into the study of log pairs, we may also choose to consider them as properties of the underlying base varieties. In such a way we may formulate the following questions, though in practice even the most optimistic might expect further conditions on the characteristic. One could also reasonably ask that the \(\epsilon \)-klt pair (XB) is itself F-split, or globally F-regular, in place of the base variety.

Question 1.3

Fix d a natural number and \(\epsilon \) a positive real number. Then is the set, S, (resp. \(S'\)) of projective varieties X such that \((1)-(4)\) (resp. \((1),(2),(3'),(4')\)) hold bounded over \(\mathbb {Z}\)?

  1. (1)

    X has dimension d over some closed field \(\kappa \).

  2. (2)

    (XB) is \(\epsilon \)-klt for some boundary B.

  3. (3)

    \(-(K_{X}+B)\) is big and nef.

  4. (3’)

    \(K_{X}+B\equiv 0\).

  5. (4)

    If \(\kappa \) has characteristic \(p>0\), then X is globally F-regular.

  6. (4’)

    If \(\kappa \) has characteristic \(p>0\), then X is F-split and rationally chain connected.

Remark 1.4

Here rationally chain connected is chosen over rationally connected in light of [16] which shows that globally F-regular threefolds are rationally chain connected in characteristic \(p > 7\). Further in characteristic zero, under mild assumptions on the singularities (X admits a boundary \(\Delta \) with \((X,\Delta )\) dlt), rational chain connectedness coincides with rational connectedness so this is still a natural generalisation. In any case, in dimension 3 the globally F-regular condition is strictly stronger than F-split and rationally chain connected whenever the characteristic is greater than 7.

In fact other than the case of Fano varieties of Picard number 1, Gongyo et al are able to show separable rational connectedness. This might, therefore, also be a natural condition to impose instead, especially since the classical proof of the boundedness of characteristic zero prime Fano threefolds so heavily relies on the existence of a free curve.

Given Question 1.3, it is natural to ask what can be gleaned from Theorem 1.1 about globally F-regular varieties of the type described in Question 1.3. Unfortunately the answer is very little, while every globally F-regular variety is F-split and if X is of \(\epsilon \)-log Fano type it is also of \(\epsilon \)-LCY type, we cannot sensibly ensure that the resulting \(\epsilon \)-LCY pair \((X,\Delta )\) has coefficients bounded below, even if we require it for the pair \(\epsilon \)-log Fano pair \((X,\Delta ')\).

As part of this work we prove the following weak BAB result in Theorems 6.1 and 6.4. This draws heavily on the arguments of Jiang in [19].

Theorem 1.5

Fix \(0< \delta , \epsilon <1\) and let \(T_{\delta ,\epsilon }\) be the set of threefold pairs \((X,\Delta )\) satisfying the following conditions

  • X is projective over a closed field of characteristic \(p >7,\frac{2}{\delta }\);

  • X is terminal, rationally chain connected and F-split;

  • \((X,\Delta )\) is \(\epsilon \)-klt and LCY;

  • The coefficients of \(\Delta \) are greater than \(\delta \); and

  • X admits a Mori fibre space structure \(X \rightarrow Z\) where Z is not a point.

Then the set \(\{Vol (-K_{X}) :\exists \Delta \text { with } (X,\Delta ) \in T_{\delta ,\epsilon }\}\) is bounded above.

Remark 1.6

Together with the observation that taking a terminalisation and running a \(K_{X}\)-MMP can only increase the anti-canonical volume, we reduced weak BAB for varieties in \(S_{\Delta ,\epsilon }\) to the case of prime Fano varieties of \(\epsilon \)-LCY type. Over a fixed field, however, this is essentially superseded by the result of [12], which gives weak BAB for varieties X with \(K_{X}+\Delta \equiv 0\) for some boundary \(\Delta \) taking coefficients in a DCC set and making \((X,\Delta )\) klt.

Results similar to Theorem 1.1 and Theorem 1.5 are proven in [36, Theorem 1.7, Theorem 1.8] for Fano threefolds satisfying certain conditions on the Seshadri constant at a smooth closed point. Further these conditions are closely related to global F-regularity by [36, Theorem 1.3].

We begin by collecting some relevant definitions and results for later usage. Then Sect. 4 establishes key results about the behaviour of conic bundles in sufficiently high characteristic. Next Sect. 5 contains the key boundedness arguments, with weak BAB deferred to Sect. 6. Finally Theorem 1.1 is proved in Sect. 7.

2 Definitions

2.1 MMP Singularities

Here \(\mathbb {K}\) will be taken to mean either \(\mathbb {R}\) or \(\mathbb {Q}\). If no field is specified, it is taken to be \(\mathbb {R}\). We outline the key notions of singularity arising in the MMP.

Definition 2.1

Let X be a normal variety. A \(\mathbb {K}\)-boundary is an effective \(\mathbb {K}\)-divisor \(\Delta \) where \(K_{X}+\Delta \) is \(\mathbb {K}\)-Cartier and the coefficients of \(\Delta \) are at most 1.

A \(\mathbb {K}\) pair is a couple (XB) where X is normal and B is a \(\mathbb {K}\)-boundary.

If B is not effective but (XB) would otherwise be a \(\mathbb {K}\) pair we call it a \(\mathbb {K}\) sub pair.

Since \(K_{X}+\Delta \) is \(\mathbb {R}\)-Cartier, we may pull it back along any morphism \(\pi :Y \rightarrow X\). If \(\pi \) is birational then there is a unique choice of \(\Delta _{Y}=\sum -a(Y,E,X,\Delta )E\) which agrees with \(\Delta \) away from the exceptional locus of \(\pi \) such that \(\pi ^{*}(K_{X}+\Delta )\sim _{\mathbb {R}}K_{Y}+\Delta _{Y}\). In a slight abuse of notation we write \(f^{*}(K_{X}+\Delta )=(K_{Y}+ \Delta _{Y})\).

Suppose that \(f:Y \rightarrow X\) is birational morphism of normal varieties and there is a some normal variety Z with \(g:Z\rightarrow Y\). If E is a divisor on Y with strict transform \(E'\) on Z then \(a(Z,E',X,\Delta )=a(Z,E',Y,\Delta _{Y})=a(Y,E,X,\Delta )\). We may view then the coefficients \(a(Y,E,X,\Delta )\) as being independent of Y and write \(a(E,X,\Delta )\) instead.

Definition 2.2

Given a sub pair \((X,\Delta )\) we define the discrepancy

$$\begin{aligned} \text {Disc}(X,\Delta ):=\inf \{a(E,X,\Delta ) \text { such that } E \text { is exceptional and has non-empty center on } X\} \end{aligned}$$

and the total discrepancy

$$\begin{aligned} \text {TDisc}(X,\Delta ):=\inf \{a(E,X,\Delta ) \text{ such } \text{ that } E \text { has non-empty center on } X\} \end{aligned}$$

We then use these to define a suite of singularities.

Definition 2.3

Let \((X,\Delta )\) be a (sub) pair then we say that \((X,\Delta )\) is

  • (Sub) terminal if \(\text {Disc}(X,\Delta ) > 0\).

  • (Sub) canonical if \(\text {Disc}(X,\Delta )\ge 0\).

  • (Sub) plt if \(\text {Disc}(X,\Delta ) > -1\).

  • (Sub) \(\epsilon \)-klt if \(\text {TDisc}(X,\Delta ) > \epsilon -1\).

  • (Sub) \(\epsilon \)-lc if \(\text {TDisc}(X,\Delta ) \ge \epsilon -1\).

For \(\epsilon =0\) we say klt, lc respectively.

When we have resolution of singularities there is another, more practical version.

Definition 2.4

Let \((X,\Delta )\) be a (sub) pair and \(\pi :Y\rightarrow X\) a log resolution of \((X,\Delta )\). Let

$$\begin{aligned} t=\min \{a(E,X,\Delta ) \text { such that } E \text { is a divisor on } Y\} \end{aligned}$$

and

$$\begin{aligned} d=\min \{a(E,X,\Delta ) \text { such that } E \text { is an exceptional divisor of } \pi :Y\rightarrow X\}. \end{aligned}$$

Then \((X,\Delta )\) is

  • (Sub) \(\epsilon \)-klt if \(t > \epsilon -1\);

  • (Sub) \(\epsilon \)-lc if \(t\ge \epsilon -1\).

If \(\Delta =0\) then X is

  • terminal if \(d > 0\);

  • canonical if \(d\ge 0\);

This also gives rise to an additional notion of singularity, which is dependent on the choice of resolution and can be thought of as the limit of a klt pair.

Definition 2.5

A pair \((X,\Delta )\) is called dlt if there is a log resolution \(\pi :Y \rightarrow X\) of \((X,\Delta )\) with \(K_{Y}+\Delta _{Y}=\pi ^{*}(K_{X}+\Delta )\) such that \(\text {Coeff}_{E}(\Delta _{Y}) < 1\) for every E exceptional.

If \((X,\Delta )\) is sub klt or sub lc etc and \(\pi :Y \rightarrow X\) is a birational morphism from a normal variety and \(K_{Y}+\Delta _{Y}=\pi ^{*}(K_{X}+\Delta )\) then \((Y,\Delta _{Y})\) has the same singularities. Conversely we have the following.

Lemma 2.6

[17, Lemma 3.38] Suppose \((X,\Delta ),(X',\Delta ')\) are pairs equipped with proper birational morphisms \(f:X \rightarrow Y\) and \(f':X'\rightarrow Y\) with \(f_{*}\Delta =f'_{*}\Delta '\).

Suppose further that \(-(K_{X}+\Delta )\) is f nef and \((K_{X'}+\Delta ')\) is \(f'\) nef. Then \(a(E,X,\Delta ) \le a(E,X',\Delta ')\) for any E with non-trivial center on Y.

In particular, these notions of singularity are preserved under a \((K_{X}+\Delta )\) MMP.

Definition 2.7

A (sub) \(\epsilon \)-klt pair \((X,\Delta )\) where \(K_{X}+\Delta \equiv 0\) is said to be (sub) \(\epsilon \)-log Calabi-Yau, or just (sub) \(\epsilon \)-LCY.

If instead \(-(K_{X}+\Delta )\) is big and nef, it is said to be (sub) \(\epsilon \)-log Fano.

Again for \(\epsilon =0\) we just say LCY and log Fano, equally if \(\Delta =0\) we drop the log.

Of particular interest is the class of prime Fano varieties which we may think of as Mori fibre spaces over a point.

Definition 2.8

A terminal Fano variety is said to be prime if it has Picard rank 1.

Corollary 2.9

Suppose that \((X,\Delta )\) is (sub) \(\epsilon \)-LCY and \(f:X\dashrightarrow X'\) is either a flip or a divisorial contraction then \((X',f_{*}\Delta )\) is (sub) \(\epsilon \)-LCY.

Proof

Both \((K_{X}+\Delta )\) and \((K_{X'}+\Delta ')\) are numerically trivial so it suffices to show that \((K_{X'}+\Delta ')\) is \(\mathbb {R}\)-Cartier by Lemma 2.6.

If \(g:X \rightarrow Y\) is the contraction of an extremal ray and \(D\equiv _{g} 0\) is Cartier, there is some L Cartier on Y with \(g^{*}L=D\). Suppose first that f is a divisorial contraction. Then \(K_{X}+\Delta =f^{*}L\), say, and so \(K_{X'}+\Delta '=L\) by the projection formula.

Otherwise f is a flip and there is \(g:X \rightarrow Y\) a flipping contraction together with \(g':X' \rightarrow Y\) such that \(f=g'^{-1}\circ g\). Hence writing \(K_{X}+\Delta =g^{*}L\) again gives \(K_{X'}+\Delta '=g'^{*}L\).

In either case, \((K_{X'}+\Delta ')\) is \(\mathbb {R}\)-Cartier. \(\square \)

We will be interested in LCY varieties in which general points can be connected by rational curves in the following senses.

Definition 2.10

Let X be a variety over a field \(\kappa \). Then X is said to be:

  • Uniruled if there is a proper family of connected curves \(f:U \rightarrow Y\) where the generic fibres have only rational components together with a dominant morphism \(U \rightarrow X\) which does not factor through Y.

  • Rationally chain connected (RCC) if there is \(f:U \rightarrow Y\) as above such that \(u^{2}:U \times _{Y} U \rightarrow X \times _{k} X\) is dominant.

  • Rationally connected if there is \(f:U \rightarrow Y\) as above witnessing rational chain connectedness such that the general fibres are irreducible.

  • Separably rationally connected if f as above is separable.

If \(X \rightarrow X'\) is a dominant morphism from X uniruled/RCC/rationally connected then we may compose \(U \rightarrow X \rightarrow X'\) to see that \(X'\) is uniruled/RCC/rationally connected.

2.2 F-Singularities of pairs

We now introduce Frobenius singularities, unique to positive characteristic. We focus on the F-pure and F-split conditions, as F-regularity will not be needed.

Definition 2.11

Given a \(\kappa \) algebra R in positive characteristic we denote the Frobenius morphism by \(F:R\rightarrow R\) sending \(x \rightarrow x^{p}\). Any R module M then has an induced module structure, denoted \(F_{*}M\) where R acts as \(r.x=F(r)x=r^{p}x\). Finally R is said to be F-finite if \(F_{*}R\) is a finite R module. These definitions naturally extend to schemes over \(\kappa \).

Note that all perfect fields are F-finite, and so is every variety over an F-finite field. In this context we can view the Frobenius morphism as a map of R modules \(F:R \rightarrow F_{*}R\). We will also write \(F^{e}:R \rightarrow F_{*}^{e}R\) for the \(e^{th}\) iterated Frobenius.

Definition 2.12

Let X be a variety over an F-finite field. We say X is:

  • F-pure if the Frobenius morphism \(\mathcal {O}_{X}\rightarrow F_{*}\mathcal {O}_{X}\) is pure, or equivalently locally split.

  • (Globally) F-split if the Frobenius morphism \(\mathcal {O}_{X}\rightarrow F_{*}\mathcal {O}_{X}\) is split.

Being F-split is a particularly strong condition, giving the following vanishing result almost immediately.

Lemma 2.13

Let X be an F-split variety and A an ample \(\mathbb {Q}\)-Cartier divisor. Then \(H^{i}(X,A)=0\) for all \(i>0\).

Proof

By assumption \(\mathcal {O}_{X}\rightarrow F^{e}_{*}\mathcal {O}_{X}\) splits, and hence so does \(A=A\otimes \mathcal {O}_{X}\rightarrow A \otimes F^{e}_{*}\mathcal {O}_{X}=F^{e}_{*}A^{p^{e}}\). That is we have \(id:A \rightarrow F^{e}_{*}A^{p^{e}} \rightarrow A\), and taking cohomology we see that \(H^{i}(X,A)\) injects into \(H^{i}(X,F^{e}_{*}A^{p^{e}})=H^{i}(X,A^{p^{e}})\) which vanishes for \(e>>0\). \(\square \)

Take X a normal variety. To mirror the notion of a boundary we introduce pairs \((\mathcal {L}, \phi )\) where \(\mathcal {L}\) is a line bundle and \(\phi :F^{e}_{*}\mathcal {L} \rightarrow \mathcal {O}_{X}\). By applying duality on the smooth locus, which contains all the codimension 1 points we observe that \(Hom _{\mathcal {O}_{X}}(F^{e}_{*}\mathcal {L},\mathcal {O}_{X})=H^{0}(X,\mathcal {L}^{-1}((1-p^{e})K_{X}))\). Therefore such a pair corresponds to a divisor \(\Delta _{\phi } \ge 0\) with \((p^{e}-1)(K_{X}+\Delta _{\phi }) \sim \mathcal {L}\).

Reversing this procedure is slightly more involved. If \({(p^{e}-1)(K_{X}+\Delta ) \sim \mathcal {L}}\) (we write this \(K_{X}+\Delta \sim _{\mathbb {Z}_{(p)}} \mathcal {L}\)) we may obtain \(\phi _{\Delta }:F^{e}_{*}\mathcal {L} \rightarrow \mathcal {O}_{X}\), however we could also write say \((p^{2e}-1)(K_{X}+\Delta ) \sim \mathcal {L'}\) where \(\mathcal {L'} \not \sim \mathcal {L}\). We introduce, therefore, the following notion of equivalence.

First, we say that two such pairs, \((\mathcal {L}, \phi )\) and \((\mathcal {L}', \phi ')\) are equivalent if:

  • There is an isomorphism \(\psi : \mathcal {L} \rightarrow \mathcal {L'}\) such that following diagram commutes; or

  • \(\mathcal {L}=\mathcal {L}^{p^{e'}+1}\) and \(\phi ':F^{e+e'}_{*}\mathcal {L}^{p^{e'}+1}\rightarrow \mathcal {O}_{X}\) is the precisely the map given by

    $$\begin{aligned} F^{e+e'}_{*}(\mathcal {L}\otimes \mathcal {L}^{p^{e'}}) \xrightarrow {F^{e}_{*}\phi } F^{e}_{*}\mathcal {L} \xrightarrow {\phi } \mathcal {O}_{X}. \end{aligned}$$

We then expand the notion of equivalence to allow any finite combination of the above equivalences, more precisely we take the transitive closure of our initial relation. This gives a bijection between equivalence classes of pairs \((\mathcal {L}, \phi )\) and \(\Delta \ge 0\) with \((K_{X}+\Delta )\) \(\mathbb {Z}_{(p)}\)-Cartier. Full details on such pairs can be found in Chapter 16 of Schwede’s notes on F-singularities [29].

To extend this framework to allow for sub pairs we can instead work with morphisms \(F^{e}_{*}\mathcal {L} \rightarrow K(X)\) where we view K(X) as a constant sheaf on X. Given such a morphism \(\phi \), we can always find \(E \ge 0\) Cartier such that when we twist by E we obtain \(\phi ':=F^{e}_{*}(\mathcal {L}((1-p^{e})E)) \rightarrow \mathcal {O}_{X}\) and thus associate a divisor \(\Delta _{\phi '}\) with \((1-p^{e})(K_{X}+\Delta _{\phi '})\sim \mathcal {L}((1-p^{e})E\) and then take \(\Delta _{\phi }=\Delta _{\phi '}-E\).

Lemma 2.14

[14, Lemma 2.3] With the notation as above, \(\Delta _{\phi }\) does not depend on the choice of E.

Definition 2.15

A sub \(\mathbb {Z}_{(p)}\) pair is a couple (XB) where \((K_{X}+B)\) is \(\mathbb {Z}_{(p)}\)-Cartier and the coefficients of B are at most 1. We write \(\phi {e}_{B}:F_{*}^{e,B}\mathcal {L}_{e,B} \rightarrow K(X)\) for the associated morphism dropping the dependence on B when it remains clear. If B is effective (XB) is called a \(\mathbb {Z}_{(p)}\) pair and we view \(\phi \) as being a morphism to \(\mathcal {O}_{X}\).

Let (XB) be a (sub) \(\mathbb {Z}_{(p)}\) pair, then (XB) is

  • (sub) F-pure if \(\mathcal {O}_{X}\subseteq \text {Im}(\phi ^{e})\) for some e.

  • (sub) F-split if \(1\in \text {Im}(H^{0}(X,\phi ^{e}))\) for some e.

Being F-split is also sometimes called globally F-split to distinguish to from F-pure, which can be thought of as being locally split.

Locally to a point of codimension 1 these definitions are particularly well-behaved.

Lemma 2.16

[14, Lemma 2.14] Let R be a regular DVR with parameter t, then a sub \(\mathbb {Z}_{(p)}\) pair \((R,\lambda t)\) is sub F-pure iff \(\lambda \le 1\) and sub F-regular iff \(\lambda < 1\).

In particular we see that the coefficient of \(\Delta _{\phi }\) at E depends only on \(\phi \) near E.

Corollary 2.17

Suppose \(\phi :F^{e}_{*}\mathcal {L} \rightarrow K(X)\) has associated divisor \(\Delta \) then \(1-\text {Coeff}_{E}(\Delta )=\inf \{t:(X,\Delta +tE) \text { is } F \text {-pure at the generic point of } E\}\).

While these definitions do not pullback along birational morphisms as obviously as the usual MMP singularities, it is still possible.

Lemma 2.18

[7, Lemma 7.2.1] Suppose that \(f:X \rightarrow Y\) is a birational morphism with X normal and \((Y,\Delta )\) a sub F-split pair. Then there is \(\Delta '\) on X making \((X,\Delta ')\) a sub F-split pair such that \((K_{X}+\Delta ')=f^{*}(K_{Y}+\Delta )\).

Proof

Take the corresponding map \(\phi :F^{e}_{*}\mathcal {L} \rightarrow K(Y)\). Then we may freely view \(\mathcal {L}\) as a subsheaf of K(X) and so extend \(\phi \) to a map \(\phi :F^{e}_{*}K(Y) \rightarrow K(Y)\). Taking the inverse image gives \(f^{-1}(\phi ):f^{-1}F^{e}_{*}K(Y) \rightarrow f^{-1}K(Y)\) and \(f^{-1}F^{e}_{*}\mathcal {L} \rightarrow f^{-1}K(Y)\). Since \(\pi \) is birational we obtain an isomorphism \(f^{-1}K(Y) \rightarrow K(X)\). We then have the following situation.

Note however that \(f^{-1}F^{e}_{*}(\mathcal {L}) \otimes _{f^{-1}F^{e}_{*}\mathcal {O}_{Y}}\mathcal {O}_{X}= F^{e}_{*}f^{*}\mathcal {L}\) and hence we obtain the desired map \(\tilde{\phi }:F^{e}_{*}f^{*}\mathcal {L} \rightarrow K(X)\). This induces a divisor \(\Delta '\) on X with

$$\begin{aligned} (p^{e}-1)(K_{X}+\Delta ') \sim f^{*}\mathcal {L} \sim (p^{e}-1)f^{*}(K_{Y}+\Delta ). \end{aligned}$$

The coefficient of \(\Delta '\) at a codimension one point can be recovered from \(\tilde{\phi }\) by working locally around that point. In particular, wherever f is an isomorphism, \(\phi \) and \(\tilde{\phi }\) agree. Therefore the coefficients of \(\Delta \) and \(\Delta '\) agree on this locus also, so we have \(f^{*}(K_{Y}+\Delta )=(K_{X}+\Delta ')\) as required. Moreover commutativity of the earlier diagram gives that whenever \(1 \in \text {Im}(H^{0}(Y,\phi ))\) then it is also in the image of \(H^{0}(X,\tilde{\phi })\), and hence \((X,\Delta )\) is sub F-split. \(\square \)

In general the local forms of these singularities cannot be pushed forward, however the global ones often can be, even along morphisms which are not birational.

Lemma 2.19

[14, Theorem 5.2] Suppose that \((X,\Delta )\) is sub F-split and there is a map \(f:X \rightarrow Y\) with \(f_{*}\mathcal {O}_{X}=\mathcal {O}_{Y}\) and \((K_{X}+\Delta ) \sim _{\mathbb {Z}_{(p)}} f^{*}\mathcal {L}\) for some line bundle \(\mathcal {L}\) on Y. If every component of \(\Delta \) which dominates Y is effective then there is \(\Delta _{Y}\) with \((Y,\Delta _{Y})\) sub F-split and \(\mathcal {L}\sim _{\mathbb {Z}_{(p)}} (K_{Y}+\Delta _{Y})\).

If \(f:X \rightarrow Y\) is birational then the conditions are automatically satisfied and the induced \(\Delta _{Y}\) is just the pushforward \(f_{*}\Delta \) by Corollary 2.17. Therefore if X is sub F-split so is every \(X'\) birational to X. Further if X is F-split and \(X'\) is obtained by taking a terminalisation or running a \(K_{X}+B\) MMP for any B then \(X'\) is F-split.

2.3 Boundedness

Finally we introduce the relevant notions of boundedness.

Definition 2.20

We say that a set \(\mathfrak {X}\) of varieties is birationally bounded over a base S if there is a flat, projective family \(Z \rightarrow T\), where T is a reduced quasi-projective scheme over S, such that every \(X\in \mathfrak {X}\) is birational to some geometric fibre of \(Z \rightarrow T\). If the base is clear from context, say if every \(X \in \mathfrak {X}\) has the same base, we omit dependence on S.

If for each \(X \in \mathfrak {X}\) the map to a geometric fibre is an isomorphism we say that \(\mathfrak {X}\) is bounded over S.

If \(S=\text {Spec }{R}\) we often just say (birationally) bounded over R. In practice we characterise boundedness over \(\mathbb {Z}\) via the following result, coming from existence of the Hilbert and Chow schemes.

Lemma 2.21

[32, Proposition 5.3] Fix integers d and r. Then there is a flat projective family \(Z \rightarrow T\) where T is a reduced quasi-projective scheme over \(\mathbb {Z}\) satisfying the following property. If

  1. (1)

    \(\kappa \) is a field;

  2. (2)

    X is a geometrically integral projective scheme of dimension r over \(\kappa \); and

  3. (3)

    there is a closed immersion \(j:X \rightarrow \mathbb {P}^{m}_{\kappa }\) for some \(m\in \mathbb {Z}\) such that \(j^{*}(\mathcal {O}(1))^{r} \le d\).

Then X is realised as a geometric fibre of \(Z \rightarrow T\)

Corollary 2.22

Suppose \(\mathfrak {X}\) is a set of varieties over closed fields and there are positive real numbers dV such that for every \(X \in \mathfrak {X}\),

  • X has dimension at most d; and

  • There is M on X with \(\phi _{|M|}\) birational and \(Vol (M)\le V\).

Then \(\mathfrak {X}\) is birationally bounded over \(\mathbb {Z}\). If in fact each M is very ample then \(\mathfrak {X}\) is bounded.

Conversely, if S is Noetherian then we may always choose H relatively very ample on \(Z \rightarrow T\) with trivial higher direct images. The restriction of H to any geometric fibre is therefore very ample, and of bounded degree.

3 Preliminary results

In this section we gather necessary results for later usage. We begin with some results on surfaces, followed by some MMP results and their applications. We also collect some Bertini type theorems at the end of the section.

Theorem 3.1

[1, Theorem 6.9] Fix \(\epsilon >0\) and an algebraically closed field of arbitrary characteristic. Let S be the set of all projective surfaces X which admit a \(\Delta \) such that:

  • \((X,\Delta )\) is \(\epsilon \)-klt;

  • \(-(K_{X}+\Delta )\) is nef; and

  • Any of the following holds \(K_{X} \not \equiv 0\), \(\Delta \ne 0\), X has worse than Du Val singularities.

Then S is bounded.

Alexeev shows boundedness over a fixed field, however it is not immediately clear if such varieties are collectively bounded over \(\mathbb {Z}\). We briefly show that his methods can be extended, via the arguments of [34] to give a boundedness result in mixed characteristic.

Theorem 3.2

Fix \(\epsilon \) a positive real number. Let S be the set of projective surfaces X such that following conditions hold:

  • X is a variety over some closed field \(\kappa \);

  • (XB) is \(\epsilon \)-klt for some boundary B;

  • \(-(K_{X}+B)\) is nef; and

  • X is rationally chain connected and F-split (if \(\kappa \) has characteristic p).

Then S is bounded.

Proof

We consider first \(\hat{S}:=\{X \in S:K_{X} \not \equiv 0\}\). Take any such \(X \in \hat{S}\), then by Alexeev [1, Chapter 6] we have the following:

  • The minimal resolution \(\tilde{X}\rightarrow X\) has \(\rho (X) < A \), for some constant A, depending only on \(\epsilon \) and admits a birational morphism to \(\mathbb {P}^{2}\) or \(\mathbb {F}_{n}\) for \(n < \frac{2}{\epsilon }\). In particular there is a set \(T_{\epsilon }\) bounded over \(\mathbb {Z}\) such that every \(\tilde{X}\) is a blowup of some \(Y \in T_{\epsilon }\) along a finite length subscheme of dimension 0. That is the set of minimal desingularisations is bounded over \(\mathbb {Z}\).

  • We may run a \(K_{X}\)-MMP to obtain \(X'\) a Mori fibre space.

  • There is an N, independent of the field of definition, such that \(NK_{X'}\) is Cartier for any Mori fibre space \(X'\) obtained as above.

  • \(Vol (-K_{X'})\) is bounded independently of the base field.

  • If \(X'\) is such a Mori fibre space \(X' \rightarrow \mathbb {P}^{1}\) and F a general fibre then \(-K_{X} +(\frac{2}{\epsilon }-1)F\) is ample.

It is sufficient then to show \(S'=\{X' \text { an } \epsilon -\text {LCY type, Mori fibre space }\}\) is bounded in mixed characteristic, then \(\hat{S}\) is bounded by sandwiching as in Alexeev’s original proof and the full result follows. In turn by Corollary 2.22 it is enough to find V such that every \(X' \in S'\) has a very ample divisor, H, satisfying \(H^{2}\le V\). We do this first for positive characteristic varieties.

Fix, then, \(m > \frac{2}{\epsilon }-1\) and suppose \(X'\rightarrow \mathbb {P}^{1}\) is a Mori fibre space in positive characteristic. Then \(A=-K_{X'} +mF\) is ample and NA is Cartier. Further we have that \(A'=7NK_{X'}+27N^{2}A=(7N-27N^{2})K_{X'}+27N^{2}mF\) is very ample by [34, Theorem 4.1]. Since F is base point free, we may add further multiples of F and consider the very ample Cartier divisor \(\hat{A}=(27N^{2}-7N)(-K_{X'}+2mF)\). Then

$$\begin{aligned} \hat{A}^{2}=Vol (X',\hat{A})\le (27N^{2}-7N^{2})(Vol (X',-K_{X'})+2mVol (F,-K_{F})) \end{aligned}$$

which is bounded above, since \(Vol (X',-K_{X'})\) is bounded and \(Vol (F,-K_{F})=2\).

Similarly if \(X'\) has \(\rho (X')=1\) and \(-K_{X'}\) ample then \(-nK_{X'}\) is a very ample Cartier divisor with vanishing higher cohomology for some n fixed independently of \(X'\). Then \((-nK_{X'})^{2}=n^{2}Vol (X,-K_{X'})\) is bounded and the result follows similarly.

Suppose then that \(X \in S\) with \(K_{X} \equiv 0\), then it must have worse than canonical singularities by Corollary 3.5. Let \(\pi :Y \rightarrow X\) be a minimal resolution, with \(K_{Y}+B=\pi ^{*}K_{X} \equiv 0\) and \(B >0\), then Y is still \(\epsilon \)-klt, so \(Y \in \hat{S}\). Consequently X has \(\mathbb {Q}\)-Cartier Index dividing N also. Moreover, there is H on Y very ample with \(H^{2}\) bounded above. Let \(H'=\pi _{*}H\), so that \(NH'\) is ample and Cartier on X. Applying [34, Theorem 4.1] again we see that \(A\equiv 27N^{2}H\) is very ample, since \(K_{X}\equiv 0\), with \(A^{2}\) bounded above.

The arguments in characteristic 0 are essentially the same, making use of Kollár’s effective base-point freeness result [20, Theorem 1.1, Lemma 1.2] instead of Witaszek’s result, and the existence of very free rational curves on smooth rationally connected surfaces instead of Corollary 3.5. \(\square \)

Remark 3.3

In particular we have an affirmative answer to Question 1 in dimension 2.

Theorem 3.4

[25, Theorem 1.2] Let X be a normal, Cohen Macaulay variety with \(W\mathcal {O}\)-rational singularities over a perfect field of positive characteristic. Then X cannot simultaneously satisfy all the following conditions.

  1. (1)

    X is uniruled.

  2. (2)

    X is F-split.

  3. (3)

    X has trivial canonical bundle.

If in fact X is smooth then we may replace \(K_{X}\sim 0\) with \(K_{X} \equiv 0\).

We refer to [25, Definition 3.8] for a definition of \(W\mathcal {O}\)-rational singularities. It suffices to know that regular varieties have \(W\mathcal {O}\)-rational singularities, from which we obtain the following.

Corollary 3.5

Let X be a uniruled, F-split surface over a perfect field of positive characteristic. If \(K_{X} \equiv 0\) then X has worse than canonical singularities.

Proof

Suppose for contradiction that X has canonical singularities. Then we can replace X with its minimal resolution and suppose that X is smooth. In particular it is Cohen-Macaulay and has \(W\mathcal {O}\)-rational singularities. We then apply Theorem 3.4 to obtain the result.

\(\square \)

Lemma 3.6

[18, Lemma 2.5] Suppose X is projective and normal, D is an \(\mathbb {R}\)-Cartier divisor and S is a basepoint free normal and prime divisor. Then for any \(q >0\),

$$\begin{aligned}Vol (X,D+qS) \le Vol (X,D) + q\dim (X)Vol (S,D|_{S}+qS|_{S}).\end{aligned}$$

We now collect the necessary results from the positive characteristic MMP and consider a few applications.

Theorem 3.7

[6, Theorem 1.7], [5, Theorem 1.2] Let k be an algebraically closed field of characteristic \(p>5\). Let \((X, \Delta )\) be a three-dimensional klt pair over k, together with a projective morphism \(X \rightarrow Z\) a quasi-projective k scheme, then there exists a \((K_X+\Delta )\)-MMP over Z that terminates.

In particular, if X is \(\mathbb {Q}\)-factorial, then there is a sequence of birational maps of three-dimensional normal and \(\mathbb {Q}\)-factorial varieties:

$$\begin{aligned} X=:X_0 \overset{\varphi _0}{\dashrightarrow }\ X_1 \overset{\varphi _1}{\dashrightarrow }\ \cdots \overset{\varphi _{\ell -1}}{\dashrightarrow } X_{\ell } \end{aligned}$$

such that if \(\Delta _i\) denotes the strict transform of \(\Delta \) on \(X_i\), then the following properties hold:

  1. (1)

    For any \(i \in \{0, \ldots , \ell \}\), \((X_i, \Delta _i)\) is klt and projective over Z.

  2. (2)

    For any \(i \in \{0, \ldots , \ell -1\}\), \(\varphi _i:X_i \dashrightarrow X_{i+1}\) is either a \((K_{X_i}+\Delta _i)\)-divisorial contraction over Z or a \((K_{X_i}+\Delta _i)\)-flip over Z.

  3. (3)

    If \(K_X+\Delta \) is pseudo-effective over Z, then \(K_{X_{\ell }}+\Delta _{\ell }\) is nef over Z.

  4. (4)

    If \(K_X+\Delta \) is not pseudo-effective over Z, then there exists a \((K_{X_{\ell }}+\Delta _{\ell })\)-Mori fibre space \(X_{\ell } \rightarrow Y\) over Z.

Theorem 3.8

[15, Theorem 10.4] Let X be a normal quasi-projective variety of any dimension and characteristic for which the log MMP holds. Let B be an effective divisor with \(K_{X}+B\) \(\mathbb {R}\)-Cartier then there is a birational morphism \(f:Y \rightarrow X\), called a dlt modification, such that the following holds:

  • Y is \(\mathbb {Q}\)-factorial;

  • \(a(E,X,B) \le -1\) for every f exceptional divisor E;

  • If \(B_{Y}=f^{-1}_{*}B' + \sum _{E \text { exceptional}} E\) then \((Y,B_{Y})\) is dlt; and

  • \(K_{Y}+B_{Y}+F=f^{*}(K_{X}+B)\) where \(F= \sum _{E:a(E,X,B)<-1} -(a(E,X,B)+1)E\).

where \(B'\) has coefficient \(\min \{\text {Coeff}_{E}(B),1\}\) at each E. Further if (XB) is a log pair then F is exceptional.

Theorem 3.9

(Nlc Cone Theorem) Let \((X,\Delta )\) be a threefold \(\mathbb {Q}\)-pair, over a closed field of characteristic \(p > 5\). Then write \(\overline{NE}(X/T)_{nlc}\) for the cone spanned by curves contained in the non log canonical locus of X. Then we have the following decomposition

$$\begin{aligned}\overline{NE(X)}(X)=\overline{NE(X)}_{K_{X}+B \ge 0}+ \overline{NE(X)}_{nlc}+R_{i}\end{aligned}$$

where \(R_{i}\) are extremal rays with \(R_{i} \cap \overline{NE(X)}_{nlc}=\{0\}\), generated by curves \(C_{i}\) such that \(0> (K_{X}+B).C_{i} \ge -6\).

Proof

If \((X,\Delta )\) is dlt this is part of the usual Cone Theorem [6, Theorem 1.1].

Suppose next that \(\Delta =B+F\) where (XB) is dlt and F has support contained in \(\lfloor B \rfloor \). Note that if C is an irreducible curve with \(F.C <0\) then \(C \subseteq F\). Therefore any effective curve C can be written \(C=C_{0} +C_{F}\) where \(F.C_{0}\ge 0\) and \(C_{F} \subseteq F\). Thus by compactness of the unit ball in a finite dimensional vector space, any \([\gamma ] \in \overline{NE}(X/T)\) can be written \([\gamma ] = [\gamma _{0}] + [\gamma _{F}]\) with \(F.\gamma _{0} \ge 0\) and \([\gamma _{F}] \in \overline{NE}(F/T)\) in the same fashion.

Take any \(K_{X}+\Delta \) negative extremal ray L. Take a non-zero \([\gamma ] \in L\), then as L is extremal we have \([\gamma _{F}],[\gamma _{0}] \in L\). If \([\gamma _{F}] \ne 0\) then \(L \subseteq \overline{NE}(F/T)\). Otherwise if \([\gamma _{F}]=0\) then L is \(K_{X}+B\) negative. Hence we can conclude the result from the Cone Theorem for dlt pairs.

Suppose finally that X is not dlt. Let \(\pi :Y \rightarrow X\) be a dlt modification of (XB) with \((Y,B_{Y})\) dlt and \(K_{Y}+B_{Y}+F=\pi ^{*}(K_{X}+B)\). Take any \(K_{X}+B\) negative extremal ray, L, such that \(L \cap \overline{NE(X)}_{nlc}=\{0\}\). Take any class \(\gamma \) with \([\gamma ] \in L\setminus \{0\}\) and choose \([\gamma '] \in \overline{NE}(Y/T)\) with \(f_{*}[\gamma ']=[\gamma ]\). Then by the projection formula we have that \((K_{Y}+B_{Y}+F).\gamma '=(K_{X}+B).f_{*}\gamma '=(K_{X}+B).\gamma < 0\).

From above, we can write \(\gamma '=C_{0}+C_{F}+ \sum \lambda _{i}C_{i}\) where \(\lambda _{i} >0\), \((K_{Y}+B_{Y}+F).C_{0} \ge 0\) \(C_{F} \in \overline{NE}(F/T)\) and the \(C_{i}\) each generate \((K_{Y}+B_{Y}+F)\) negative extremal rays with \(-(K_{Y}+B_{Y}+F).C_{i} \le 6\). From our choice of R we must have \(f_{*}C_{0}=f_{*}C_{F}=0\) and hence it follows that \([f_{*}C_{k}] \in R\setminus \{0\}\) for some k. Thus \((K_{X}+B).f_{*}C_{k}=(K_{Y}+B_{Y}+F).C_{k} \ge -6\).

Since each R is the pushforward of a \((K_{Y}+B_{Y})\) negative extremal ray, there are only countably many generating curves \(C_{i}\) and they cannot accumulate in \((K_{X}+\Delta )_{< 0}\) else they would accumulate on Y also. \(\square \)

Lemma 3.10

Let X be a normal curve over any field and \(\Delta \ge 0 \) be a divisor with \(-(K_{X}+\Delta )\) big and nef. Then the non-klt locus of \(\Delta \) is either empty or geometrically connected.

Proof

If \(-(K_{X}+\Delta )\) is big and nef then so is \(-K_{X}\). After base changing to \(H^{0}(X,\mathcal {O}_{X})\) if necessary we have \(\deg K_{X} = -2\) by [31, Corollary 2.8] giving that \( \deg \Delta <2\). The non-klt locus of \((X,\Delta )\) is precisely the support of \(\lfloor \Delta \rfloor \) and hence can contain at most one point. \(\square \)

Theorem 3.11

[31, Theorem 5.2] Let \((X,\Delta )\) be a surface log pair over any field \(\kappa \). Let \(\pi :X \rightarrow S\) be a morphism of \(\kappa \) schemes with \(\pi _{*}\mathcal {O}_{X}=\mathcal {O}_{S}\). Suppose that \(-(K_{X}+\Delta )\) is \(\pi \)-nef and \(\pi \)-big, then for any \(s \in S\) \(X_{S}\cap Nklt (X,\Delta )\) is either empty or geometrically connected.

Theorem 3.12

[Weak Connectedness Lemma] Let X be a threefold over any closed field \(\kappa \) of characteristic \(p>5\) together with \(\Delta \ge 0\) on X such that \(K_{X}+\Delta \) is \(\mathbb {R}\)-Cartier. Suppose that \(-(K_{X}+\Delta )\) is ample, then \(Nklt (X,\Delta )\) is either empty or connected.

Proof

If \((X,\Delta )\) is klt the result is trivially true, so suppose otherwise.

Let \((Y,\Delta _{Y}) \rightarrow (X,\Delta )\) be a dlt modification. Then \(-L:=K_{Y}+\Delta _{Y}+F=f^{*}(K_{X}+\Delta )\) with \((Y,\Delta _{Y})\) dlt and L nef and big. We may further write \(L=A+E\) with A ample and E effective and exceptional over X. In particular E has support contained inside \(S_{Y}=\lfloor \Delta _{Y} \rfloor \). Note that \(S_{Y}\) maps surjectively onto \(Nklt (X,\Delta )\) so it is sufficient to show that \(S_{Y}\) is connected.

Take a general \(G_{Y} \sim \epsilon A +(1-\epsilon ) L-\delta S_{Y}\), then for small \(\delta \) we may assume \(G_{Y}\) is ample, and hence further that \((X,\Delta _{Y}+G_{Y})\) is dlt. Write \(K_{Y}+\Delta _{Y}+G_{Y}\sim - P_{Y}=-(\epsilon E + F + \delta S_{Y})\) and note \(\text {Supp}(P_{Y})=S_{Y}\). In particular \(K_{Y}+\Delta _{Y}+G_{Y}\) is not pseudo-effective and hence we may run a \((Y,\Delta _{Y}+G_{Y})\) LMMP which terminates in a Mori fibre spaces \(Y' \rightarrow Z\). By the arguments of [5, Theorem 9.3] on the induced pair \((Y',\Delta _{Y'})\), \(Nklt (Y',\Delta _{Y'})=\text {Supp}(\lfloor \Delta _{Y'} \rfloor )=\text {Supp}(P_{Y'})\) has the same number of connected components as \(Nklt (X,\Delta )\), so it suffices to prove the result here.

Suppose first that \(\dim Z=0\). Then \(\rho (Y')=1\). In particular if \(D,D'\) are effective and H ample, then \(H.D.D' >0\), so certainly \(D.D'>0\). Thus \(P_{Y'}\) cannot have disconnected support.

Suppose next that \(\dim Z > 0 \). Let T be the generic fibre. We must have \(P_{Y'}|_{T}> 0\) since \(Y' \rightarrow Z\) is a \(P_{Y'}\sim -(K_{Y'}+\Delta _{Y'}+G_{Y'})\) positive contraction. However \(P_{Y'}\) has the same support as \(\lfloor \Delta _{Y'} \rfloor \) so at least one connected component must dominate Z. Suppose then, for contradiction, there is a second connected component. Clearly it must also dominate Z, else it could not possibly be disjoint from the first. Consider then \((T,\Delta _{T}=\Delta _{Y'}|_{T})\). Since \(T \rightarrow Y'\) is flat, the pullback of \(\Delta _{Y'}\) is just the inverse image, and in particular \(\lfloor \Delta _{T} \rfloor \) contains the pullback of both connected components. Suppose R is the extremal ray whose contraction induces the Mori fibration. Then we have \(-(K_{Y'}+\Delta _{Y'}+G_{Y'}).R >0\), but since R is spanned by a nef curve, as contracting it defines a fibration, and \(G_{Y'}\) is effective, we must have \(G_{Y'}.R \ge 0\). Hence in fact \(-(K_{Y'}+\Delta _{Y'}).R >0\) also, and so \(-K_{T}+\Delta _{T}\) is ample. Then, however, the non-klt locus of \((T,\Delta _{T})\) must be connected, a contradiction. \(\square \)

Lemma 3.13

[23, Proposition 4.37] Suppose that (SB) is a klt surface and \((K_{S}+B+D) \sim 0\) for D effective, integral and disconnected, then D has exactly two connected components.

Finally we collect some needed Bertini type theorems.

Theorem 3.14

[30, Theorem 1] Let \((X,\Delta )\) be a log canonical (resp. klt) pair over an algebraically closed field where \(\Delta \) is an effective \(\mathbb {Q}\)-divisor. Suppose D is a semiample divisor on X then there is an effective divisor \(D'\sim D\) with \((X,\Delta +D')\) log canonical (resp. klt).

Corollary 3.15

Suppose that \((X,\Delta )\) is a sub klt pair over an algebraically closed field together with D a divisor on X and \(\pi :(X',\Delta ') \rightarrow X\) a log resolution of \((X,\Delta )\). Further assume that there is some \(D'\) on \(X'\) with \(\pi _{*}D'=D\), \(-(K_{X'}+\Delta '+D')\) \(\pi \)-nef, \((X,\Delta ')\) sub klt and \(D'\) semiample. Then there is \(E \sim D\) on X effective with \((X,\Delta +E)\) sub klt. If in fact \((X,\Delta )\) is \(\epsilon \)-klt then we may choose E such that \((X,\Delta +E)\) is also.

Proof

We may write \(\Delta '=\Delta _{p}-\Delta _{n}\) as the difference of two effective divisors. Since \((X',\Delta ')\) is log smooth we must have that \((X',\Delta _{p})\) is klt. Thus by the proceeding theorem we have that there is some \(E' \sim D'\) with \((X',\Delta _{p}+E')\) klt. Then we must also have that \((X',\Delta '+E')\) is sub klt Write \(E=\pi _{*}E'\), then \(R=\pi ^{*}(K_{X}+\Delta +E)- (K_{X'}+\Delta '+E')\equiv _{f}-(K_{X'}+\Delta '+D')\) is \(\pi \)-nef and exceptional. Hence by the negativity lemma we have that \(-R\) is effective, and \(\pi ^{*}(K_{X}+\Delta +E) \le (K_{X'}+\Delta '+E')\) giving that \((X,\Delta +E)\) is klt.

If \((X',\Delta )\) is \(\epsilon \)-klt then so is \((X',\Delta _{p})\). Let \(\delta =\min (1-\epsilon -c_{i})\) where \(c_{i}\) are the coefficients of \(\Delta _{p}\) and take \(m \in \mathbb {N}\) such that \(\frac{1}{m} < \delta \). Applying the previous theorem to \(mD'\) instead of \(D'\), yields \(E'' \sim mD\) with \((X',\Delta '+E'')\) klt. Taking \(E'=\frac{1}{m}E\) then continuing as above gives the required divisor. \(\square \)

Theorem 3.16

[26, Corollary 1.6] Let \(f:X \rightarrow Z\) be a projective fibration of relative dimension 2 from a terminal variety with \(f_{*}\mathcal {O}_{X}=\mathcal {O}_{Z}\) over a perfect field of positive characteristic \(p > 7\), such that \(-K_{X}\) is ample over Z. Then a general fibre of f is smooth.

Theorem 3.17

[Bertini for residually separated morphisms] [10, Theorem 1] Let \(f:X \rightarrow \mathbb {P}^{n}\) a residually separated morphism of finite type from a smooth scheme over a closed field. Then the pullback of a general hyperplane H on \(\mathbb {P}^{n}\) is smooth.

Here residually separated means that the induced map on residue fields \(\mathcal {O}_{X,x} \rightarrow \mathcal {O}_{\mathbb {P}^{n}, f(x)}\) is a separable extension.

4 Conic bundles

In this section the ground field will always be algebraically closed of characteristic \(p> 0\). In some results we put additional restrictions on the characteristic, most often that \(p \ne 2\). We start with some useful results on finite morphisms and klt singularities.

Definition 4.1

Take a finite, separable and dominant morphism of normal varieties \(f:X \rightarrow Y\).

If D is a divisor on Y then f is said to be tamely ramified over D if for every prime divisor \(D'\) lying over D the ramification index is not divisible by p and the induced residue field extension is separable.

Moreover f is said to be divisorially tamely ramified if for any proper birational morphism of normal varieties \(Y' \rightarrow Y\) we have the following. If \(X' \rightarrow X\) is the normalisation of the base change \(X\times _{Y}Y'\), and \(f':X'\rightarrow Y'\) the induced map, then \(f'\) is tamely ramified over every prime divisor in \(Y'\).

If instead f is generically finite, we say it is divisorially tamely ramified if the finite part of its Stein factorisation is so. Equally if either of X or Y is not normal, \(f:X \rightarrow Y\) is said to be divisorially tamely ramified if the induced morphism on their normalisations is.

If f is generically finite of degree \(d <p\) then it is always divisorially tamely ramified. If \(D'\) lies over D then both the ramification index, \(r_{D'}\) and the inertial degree, \(e_{D'}\) are bounded by d, in fact \(d= \sum _{f(D')=D} r_{D'}e_{D'}\) by multiplicativity of the norm. This remains the case on any higher birational model.

Lemma 4.2

Let \(f:Y \rightarrow X\) be a dominant, separable, finite morphism of normal varieties. Suppose that \(K_{X}\) is \(\mathbb {Q}\)-Cartier then \(K_{Y}=f^{*}K_{X}+\Delta \) where \(\Delta \ge 0\). Further if f is divisorially tamely ramified, then for \(Q \in Y\) a codimension 1 point lying over \(P \in X\) we have \(\text {Coeff}_{Q}(\Delta )=r_{Q}-1\) where \(r_{Q}\) is the degree of \(f|_{Q}:Q \rightarrow P\).

Proof

By localising at the codimension 1 points of X we reduce to the case of Riemann-Hurwitz-Hasse to see that \(\Delta \) exists as required and \(\text {Coeff}_{Q}(\Delta )=\delta _{Q}\) where \(\delta _{Q} \ge r_{Q}-1\) with equality when \(p\not \mid r_{q}\). In particular when f is divisorially tamely ramified, we ensure \(\delta _{Q}=r_{Q}-1\). \(\square \)

The singularities of the domain and image of a finite divisorially tame ramified morphism are closely connected, as the following lemma shows.

Lemma 4.3

[21, Proposition 3.16] Let \(f:X' \rightarrow X\) be a dominant, divisorially tamely ramified, finite morphism of normal varieties of degree d. Fix \(\Delta \) on X with \(K_{X}+\Delta \) \(\mathbb {Q}\)-Cartier. Write \(K_{X'}+\Delta '=f^{*}(K_{X}+\Delta )\) then the following hold:

  1. (1)

    \(1+\text {TDisc}(X,\Delta ) \le 1+\text {TDisc}(X',\Delta ') \le d(1+\text {TDisc}(X,\Delta ))\).

  2. (2)

    \((X,\Delta )\) is sub klt (resp. sub lc) iff \((Y,\Delta ')\) is sub klt (resp. sub lc).

Proof

By restricting to the smooth locus of X, which contains all the codimension 1 points of X, we may suppose that \(K_{X}\) is Cartier and apply the previous lemma. Hence we get \(\Delta '=f^{*}(K_{X}+\Delta )-K_{X'}\) where for \(Q\in X'\) lying over \(P\in X\) we have \(\text {Coeff}_{Q}(\Delta ')=r_{Q}(\text {Coeff}_{P}(\Delta ))-(r_{Q}-1)\).

Suppose that we have proper birational morphisms \(\pi :Y \rightarrow X\) and we write \(Y'\) for the normalisation of \(Y\times _{X} X'\) so that we have the following diagram.

Let \(E'\) be a divisor on \(Y'\) exceptional over \(X'\) and E the corresponding divisor on Y.

At \(E'\) we can write

$$\begin{aligned} K_{Y'}= \pi '^{*}(K_{X'}+\Delta ')+a(E',X',\Delta ')E'=g^{*}\pi ^{*}(K_{X}+\Delta )+a(E',X',\Delta ')E' \end{aligned}$$

essentially by definition. Conversely however we have \(K_{Y'}=g^{*}K_{Y}+\delta _{E'}E'\) which may be rewritten as

$$\begin{aligned} K_{Y'}=g^{*}(\pi ^{*}(K_{X}+\Delta )+a(E,X,\Delta )E)+\delta _{E'}E'. \end{aligned}$$

In particular equating the two descriptions, as \(\delta _{E'}=r_{E'}-1\) by Lemma 4.3, we have that

$$\begin{aligned}r_{E'}a(E,X,\Delta )+(r_{E'}-1)=a(E',X',\Delta ')\end{aligned}$$

and thus \(a(E,X,\Delta )+1=\frac{1}{r_{E'}}(a(E',X',\Delta ')+1)\) with \(1 \le r_{E'} \le d\).

Since, by a theorem of Zariski [22, Theorem VI.1.3], every valuation with center on \(X'\) is realised by some birational \(Y' \rightarrow X'\) occurring as a pullback of a birational morphism \(Y \rightarrow X\), this is sufficient to show that \(1+\text {TDisc}(X,\Delta ) \le 1+\text {TDisc}(X',\Delta ') \le d(1+\text {TDisc}(X,\Delta ))\). The second part then follows. \(\square \)

We will be interested in conic bundles satisfying certain tameness criteria. This in turn will allow us to control the singularities arising on the base of the fibration. This is done in Theorem 4.8.

Definition 4.4

A conic bundle is a threefold sub pair \((X,\Delta )\) equipped with a morphism \(f:X \rightarrow Z\) where Z is a normal surface, \(f_{*}\mathcal {O}_{X}=\mathcal {O}_{Z}\), the generic fibre is a smooth rational curve and \((K_{X}+\Delta )=f^{*}D\) for some \(\mathbb {Q}\)-Cartier divisor on X. We will call it regular if X and Z are smooth and f is flat; and terminal if X is terminal and f has relative Picard rank 1. Further we call it (sub) \(\epsilon \)-klt or log canonical if \((X,\Delta )\) is.

If each horizontal component of \(\Delta \) is effective and divisorially tamely ramified over Z then the conic bundle is said to be tame.

For P a codimension 1 point of Z we define

$$\begin{aligned} d_{P}=\max \{t:(X,\Delta +tf^{*}P) \text { is lc over the generic point of } P\}. \end{aligned}$$

The discriminant divisor of \(f:X \rightarrow Z\) is \(D_{Z}=\sum _{P \in X}(1-d_{P})P\). The moduli part \(M_{Z}\) is then given by \(D-D_{Z}-K_{Z}\).

In positive characteristic the discriminant divisor is not always well defined for a general fibration, it may be that \(d_{P} \ne 1\) for infinitely many P. This can be caused by either a failure of generic smoothness or inseparability of the horizontal components of \(\Delta \) over the base.

Suppose, however, that \((X,\Delta ) \rightarrow Z\) is a tame conic bundle. We may take a log resolution \(X' \rightarrow X\) as this does not change \(d_{P}\) and is still a tame conic bundle by Lemma 4.5. Thus we may suppose that \(\Delta \) is an SNC divisor and hence near P, \(\Delta +f^{*}P\) is also SNC for all but finitely many P, by generic smoothness of the fibres and as the horizontal components are divisorially tamely ramified over Z. Hence in fact \(D_{Z}\) is well defined in this case.

Lemma 4.5

Let \(f:(X,\Delta ) \rightarrow Z\) be a tame conic bundle, and \(X' \rightarrow X\) either a birational morphism from a normal variety or the base change by a divisorially tamely ramified morphism from a normal variety \(g:Z' \rightarrow Z\). Then there is \(\Delta '\) with \((X',\Delta ')\) a tame conic bundle over Z or \(Z'\) as appropriate. Moreover in this case \(X' \rightarrow X\) is also divisorially tamely ramified.

Proof

If \(\pi :X' \rightarrow X\) is a birational morphism with \(K_{X'}+\Delta '=\pi ^{*}(K_{X}+\Delta )\) then the only horizontal components of \(\Delta '\) are the strict transforms of horizontal components of \(\Delta \). Take such a component \(D'\) then, normalising if necessary, it factors \(D' \rightarrow D \rightarrow Z\) with \(D \rightarrow Z\) divisorially tamely ramified but then it must itself be divisorially tamely ramified.

Suppose then \(g:Z' \rightarrow Z\) is generically finite. From above, and by Stein factorisation we may freely suppose that g is finite. Then the base change morphism \(g':X' \rightarrow X\) is a finite morphism of normal varieties and we may induce \(\Delta '\) with \(g'^{*}(K_{X}+\Delta )=K_{X'}+\Delta '\). Again the horizontal components of \(\Delta '\) are precisely the base changes of the horizontal components of \(\Delta \).

It suffices to show then that if D is a horizontal divisor on X such that \(D \rightarrow Z\) is divisorially tamely ramified then \(D' \rightarrow Z'\), the base change, is also divisorially tamely ramified. Certainly \(D' \rightarrow Z'\) is still separable. Suppose C is any curve on Z and \(C'\) a curve on \(Z'\) lying over it. In turn take any \(C_{D'}\) lying over \(C'\) on \(D'\). Then \(C_{D'}\) is the base change of some \(C_{D}\). Since \(C_{D} \rightarrow C\) is separable, so too is \(C_{D'} \rightarrow C'\). Equally as the ramification indices of \(C', C_{D}\) are not divisible by p, neither can the ramification index of \(C_{D'}\) over \(C_{D}\) be. This same argument holds after base change by any higher birational model of Z, and by [22, Theorem VI.1.3] every valuation with centre on \(Z'\) is can be realised on the pullback of some such model. Thus \(D' \rightarrow Z'\) is divisorially tamely ramified and hence \((X',\Delta ') \rightarrow Z'\) is tame.

It is enough to show that \(X' \rightarrow X\) is divisorially tamely ramified after base changing by a higher birational model of Z. In particular, after taking a flatification we may assume \(f:X \rightarrow Z\) is flat. Now suppose D is a divisor on X, lying over some curve C on Z. We have \(f^{*}C=\sum E_{i}\) with \(E_{0}=D\). Let \(C_{j}\) be the curves lying over C in \(Z'\), then if \(E_{i,j}\) are the divisors lying over \(E_{i}\), for some fixed i, they are in one-to-one correspondence with the \(C_{j}\). We have \(g'^{*}f^{*}C=\sum r_{i,j}E_{i,j}=\sum _{j} r_{i}\sum _{i}E_{j}\) and thus none of the \(r_{i,j}\), in particular the \(r_{0,j}\) are divisible by p. Moreover the \(E_{0,j} \rightarrow E_{0}\) must be separable since the \(C_{j} \rightarrow C\) are.

The same holds after taking a higher birational model of X, and thus \(X' \rightarrow X\) is divisorially tamely ramified as claimed.

\(\square \)

In practice we deal exclusively with tame conic bundles arising in the following fashion.

Lemma 4.6

Suppose that \((X,\Delta )\) is klt and LCY, equipped with a Mori fibre space structure over a surface Z and the horizontal components of \(\Delta \) have coefficients bounded below by \(\delta \). Then if X is defined over a field of characteristic \(p > \frac{2}{\delta }\), \(f:(X,\Delta ) \rightarrow Z\) is a tame conic bundle.

Proof

Since \(\delta <1\), the characteristic is larger than 2 and the general fibre is necessarily a smooth rational curve, in particular X is a conic bundle. Let G be the generic fibre, so that \((G,\Delta _{G})\) is klt and G is also smooth rational curve. Then if D is some horizontal component of \(\Delta \) the degree of \(f:D \rightarrow Z\) is precisely the degree of \(D|_{G}\). However \(\deg \delta D|_{G} < \deg \Delta |_{G} =-2\) and thus \(\deg D < p\). Replacing D by its normalisation, \(D'\) does not change the degree, so \(D'\rightarrow Z\) has degree \(<p\) and thus is divisorially tamely ramified. \(\square \)

Remark 4.7

One might be tempted to ask if this bound could be further improved for \(\epsilon \)-klt pairs, \((X,\Delta )\). In this case we have \((G,\Delta _{G})\) is \(\epsilon \)-klt and so one might attempt to use a bound of the form \(p > \frac{1-\epsilon }{\delta }\) to prevent any component of \(\Delta \) mapping inseparably onto the base. It does not seem however that such a bound would ensure that every component is divisorially tamely ramified and there may be wild ramification away from the general fibre.

Theorem 4.8

Let \(f:(X,\Delta ) \rightarrow Z\) be a sub \(\epsilon \)-klt, tame conic bundle. Then for some choice of \(M\sim _{\mathbb {Q}} M_{Z}\) we have \((Z,D_{Z}+M)\) sub \(\epsilon \)-klt. If in fact \(\Delta \ge 0\), we may take \(D_{Z},M\) to be effective also.

Remark 4.9

The implicit condition that \((X,\Delta )\) is a threefold pair is necessary only in that it assures the existence of log resolutions. This result holds in dimension d so long as the existence of log resolutions of singularities holds in dimensions \(d,d-1\).

We will prove this in several steps. First we consider the case that \(\Delta ^{h}\), the horizontal part of \(\Delta \), is a union of sections of f. In this setting we have an even stronger result. After moving to a higher birational model, we have that \((Z,D_{Z})\) is klt and \(M_{Z}\) is semiample.

Lemma 4.10

Suppose that \(f:(X,\Delta ) \rightarrow Z\) is a sub \(\epsilon \)-klt conic bundle with \(\Delta ^{h}\) effective and with support that is generically a union of sections of f, then there is \(\pi :Z'\rightarrow Z\) a birational morphism with \((Z',D_{Z'})\) sub \(\epsilon \)-klt and \(M_{Z'}\) semiample. In particular for some choice of \(M\sim M_{Z'}\) we have \((Z,D_{Z}+\pi _{*}M)\) sub \(\epsilon \)-klt.

Proof

This result is well known and essentially comes from [27]. Details specific to positive characteristic can be found in [13, Section 4], [35, Lemma 3.1] and [8, Lemma 6.7]

We sketch, some key points of the proof.

Since generically \(X \rightarrow Z\) is a \(\mathbb {P}^{1}\) bundle and the horizontal part of \(\Delta \) is a union of sections, we induce a rational map \(\phi :Z \dashrightarrow \overline{\mathcal {M}}_{0, n}\), the moduli space of n-pointed stable curves of genus 0. By taking appropriate resolutions we may suppose that \((X,\Delta )\) is log smooth, Z is smooth and \(\phi \) is defined everywhere on Z. Blowing down certain divisors on the universal family over \(\overline{\mathcal {M}}_{0, n}\) and pulling back to Z we may further assume that \(X\rightarrow Z\) factors through a \(\mathbb {P}^{1}\) bundle over Z via a birational morphism.

Then working locally over each point of codimension 1 and applying 2 dimensional inversion of adjunction, we see that in fact \(D_{Z}\) is determined by the vertical part of \(\Delta \), indeed \(\Delta ^{V}=f^{*}D_{Z}\), and that \(M_{Z}\) is the pullback of an ample divisor on \(\overline{\mathcal {M}}_{0, n}\) by \(\phi \). In particular \(M_{Z}\) is semiample and \(D_{Z}\) takes coefficients in the same set as \(\Delta ^{V}\), therefore they are bounded above by \(1-\epsilon \).

From the following lemma, we see that in fact we may further suppose that \((Z,D_{Z})\) is log smooth. Since if \(\pi :(Z',\Delta ')\rightarrow Z\) is a log resolution of \((Z,D_{Z})\) we have \(K_{Z'}+\Delta '=\pi ^{*}(K_{Z}+D_{Z})\), \(\pi ^{*}M_{Z}=M_{Z'}\) and \(K_{Z'}+D_{Z'}+M_{Z'}=\pi ^{*}(K_{Z}+D_{Z}+M_{Z})=K_{Z'}+\Delta '+M_{Z'}\), giving \(D_{Z'}=\Delta '\) as required. In particular then Corollary 3.15 gives that \((Z,D_{Z}+M_{Z})\) is sub \(\epsilon \)-klt. \(\square \)

Lemma 4.11

Suppose that Z is as given above and \(Z'\rightarrow Z\) is the birational model found in the proof with \(M_{Z'}\) semiample. Suppose further that Y is a normal variety admitting a birational morphism \(\pi :Y \rightarrow Z'\). If \(M_{Y}\) is the moduli part coming from the induced conic bundle \(X_{Y} \rightarrow Y\) then \(\pi ^{*}M_{Z'}=M_{Y}\).

Proof

Let \(\phi :Z' \rightarrow \overline{\mathcal {M}}_{0, n}\) and \(\chi :Y \dashrightarrow \overline{\mathcal {M}}_{0, n}\) be the rational maps induced by the base changes of \(X\rightarrow Z\). By assumption \(\phi \) is a morphism.

Although \(\chi \) is a priori defined only on some open set, it must factor through \(\phi \) whenever it is defined, and hence extends to a full morphism \(\chi =\phi \circ \pi \).

Write then that \(M_{Z'}=\phi ^{*}A\) and \(M_{Y}=\chi ^{*}A'\). A more careful study of the proof of the previous result would give \(A=A'\) and the result follows. However for simplicity one can also note that \(M_{Z'}=\pi _{*}M_{Y}=\pi _{*}\chi ^{*}A'=\phi ^{*}A'\), so that \(M_{Y}=\pi ^{*}\phi ^{*}A'=\pi ^{*}M_{Z'}\). \(\square \)

We now reduce from the general case of Theorem 4.8 to the special case of Lemma 4.10 to prove the theorem. This requires the following lemma, due essentially to Ambro.

Lemma 4.12

[2, Theorem 3.2] Suppose that \(f:(X,\Delta ) \rightarrow Z\) is a tame conic bundle. Let \(g:Z' \rightarrow Z\) be a finite, divisorially tamely ramified morphism of normal varieties and \((X',\Delta ') \rightarrow Z'\) the induced fibration. Then \((X',\Delta ') \rightarrow Z\) is tame and \(g^{*}(K_{Z}+D_{Z})=K_{Z'}+D_{Z'}\) for \(D_{Z'}\) the induced discriminant divisor of \((X',\Delta ') \rightarrow Z'\).

Proof

By Lemma 4.5, \((X',\Delta ') \rightarrow Z'\) is tame and hence \(D_{Z'}\) is well defined by the discussion proceeding Lemma 4.5.

It remains to show that \(g^{*}(K_{Z}+D_{Z})=K_{Z'}+D_{Z'}\). To see this fix Q a prime of \(Z'\) and write \(r_{Q}\) for the degree of the induced map onto some P a prime of Z.

From the proof of Lemma 4.3 we see that if \(K_{Z'}+B=g^{*}(K_{Z}+D_{Z})\) then \(1-\text {Coeff}_{Q}(B)=r_{Q}(\text {Coeff}_{P}(D_{Z})-1)\). In particular then it suffices to show that \(d_{Q}=r_{Q}d_{P}\). We consider two cases.

Suppose that \(c \le d_{P}\). Then we have \((X,\Delta +cf^{*}P)\) log canonical over P. Hence \((X',\Delta '+g'^{*}f^{*}P=\Delta +cf'^{*}g^{*}P)\) is also log canonical by the Lemma 4.3. But \(f'^{*}g^{*}P \ge f'^{*}r_{Q}Q\) so it must be that \(d_{Q} \ge r_{Q}c\). Hence in fact \(d_{Q} \ge r_{Q}d_{P}\).

Conversely if \(c \ge d_{P}\) then,\((X,\Delta +cf^{*}P)\) is not log canonical over P. In particular replacing X with a suitable birational model \(X'' \rightarrow X\) we suppose that there is some prime E of X with \(f_{E}=P\) and \(\text {Coeff}_{E}(\Delta +cf^{*}P) < -1\). Similarly there is \(E'\) on \(X'\) with \(g'(E')=E\) and \(f'(E')=Q\) which also has \(\text {Coeff}_{E}(\Delta '+cg'^{*}f^{*}P) < -1\) but \(\text {Coeff}_{E}(cg'^{*}f^{*}P)=\text {Coeff}_{E}(cf^{*}r_{Q}P)\) and hence \(c \ge rd_{Q}\). Thus we have the equality \(d_{Q}=r_{Q}d_{Q}\). \(\square \)

Note that in the setup above \(g^{*}(K_{Z}+D_{Z}+M_{Z})=K_{Z'}+D_{Z'}+M_{Z'}\) so we must have that \(M_{Z'}=g^{*}M_{Z}\).

Lemma 4.13

Suppose that \(f:X \rightarrow Z\) is a tame conic bundle. Then there is a finite, divisorially tamely ramified morphism \(g:Z' \rightarrow Z\) with \(g^{*}(K_{Z}+D_{Z}+M_{Z})=K_{Z'}+D_{Z'}+M_{Z'}\) and a birational morphism \(h:Z'' \rightarrow Z'\) such that \(M_{Z''}\) is semiample.

Proof

Let D be any horizontal component of \(\Delta \) which is not a section of f then f restricts to a divisorially tamely ramified morphism \(D \rightarrow Z\). After replacing D with its normalisation and Stein factorising, we may suppose that \(D\rightarrow Z\) is finite with D normal. Taking the fibre product of \(X \rightarrow Z\) with the normalisation \(\tilde{D}\) of D we find \(X' \rightarrow \tilde{D}\) satisfying the initial conditions but with the one component of \(\Delta \) is now generically a section.

In this fashion, we eventually get to \(Z' \rightarrow Z\) with \(g^{*}(K_{Z}+D_{Z}+M_{Z})=K_{Z'}+D_{Z'}+M_{Z'}\) and all the horizontal components of \(\Delta \) being generically sections. Hence we may apply Lemma 4.10 to give the result. \(\square \)

Proof of Theorem 4.8

Take \(f:(X,\Delta ) \rightarrow Z\) as given. Then we have \(g:Z' \rightarrow Z\) and \(h:Z''\rightarrow Z'\) as above. Write d for the degree of g. Fix \(B_{Z''}\sim M_{Z''}\) making \((Z'',D_{Z''}+B_{Z''})\) sub klt. Write \(B_{Z}=\frac{1}{d}g_{*}h_{*}B_{Z''}\). It is sufficient to show that \((Z,D_{Z}+B_{Z})\) is sub \(\epsilon \)-klt since \(B_{Z} \sim M_{Z}\) is always effective and \(D_{Z} \ge 0\) whenever \(\Delta \) is.

Let \(Y \rightarrow Z\) be a log resolution of \((Z,D_{Z}+B_{Z})\) and take \(Y',Y''\) appropriate fibre products to form the following diagram.

We have that \(M_{Y''}=\pi ''^{*}M_{Z''}\), so write \(B_{Y''}=\pi ''^{*}B_{Z''}\) and \(\frac{1}{d}g'_{*}h'_{*}B_{Y''}=B_{Y}\). Then we must have that \(\pi _{*}B_{Y}=B_{Z}\) and \(K_{Y}+D_{Y}+B_{Y}\sim \pi ^{*}(K_{Z}+D_{Z}+B_{Z})\). Note further that \(\pi ^{*}B_{Z}\) and \(B_{Y}\) differ only over the exceptional locus, hence \(B_{Y}\) has SNC support. Indeed \(D_{Y}+B_{Y}\) has SNC support. Further since \((Y'',D_{Y''}+B_{Y''})\) is sub \(\epsilon \)-klt and \(g'_{*}h'_{*}(D_{Y''}+B_{Y''})=d(D_{Y}+B_{Y})\) it must be that \(D_{Y}+B_{Y}\) have coefficients strictly less than \(1-\epsilon \), thus \((Y,D_{Y}+B_{Y})\) is sub \(\epsilon \)-klt and therefore so is \((Z,D_{Z}+B_{Z})\). \(\square \)

4.1 Generic smoothness

We will also need to consider the pullbacks of very ample divisors on the base of a suitably smooth conic bundle. This is done to obtain an adjunction result which is required in the next section. We work here under the assumption the ground field is closed of positive characteristic \(p>2\). This requirement on the characteristic is due entirely to the following lemma.

Lemma 4.14

Let \((X,\Delta ) \rightarrow Z\) be a regular conic bundle. Then there is some, possibly reducible, curve C on Z such that for any \(P \in Z\) the fibre, \(F_{P}\), over P is determined as follows:

  1. (1)

    If \(P \in Z\setminus C\) then \(F_{p}\) is a smooth rational curve.

  2. (2)

    If \(P \in C\setminus Sing (C)\) then \(F_{p}\) is a the union of two rational curves meeting transversally.

  3. (3)

    If \(P \in Sing (C)\) then \(F_{p}\) is a non-reduced rational curve.

Further if H is a smooth curve meeting C transversely away from \(Sing (C)\) then \(f^{*}H\) is smooth.

Proof

This is essentially [28, Proposition 1.8]. We sketch the proof as our statement is slightly different.

Since X is smooth \(-K_{X}\) is relatively ample and defines an embedding into a \(\mathbb {P}^{2}\) bundle over Z. Fix any point P in X then in some neighbourhood U around P, \(X_{U}\) is given inside \(\mathbb {P}^{2} \times U\) by the vanishing of \(x^{t}Qx\). Here Q is a diagonalisable \(3\times 3\) matrix taking coefficients in \(\kappa [U]\), unique up to invertible linear transformation, so we may take C to be the divisor on which the rank of Q is less than 3. That Q has rank 3 on some open set follows from smoothness of the generic fibre.

Then the singular points of C are precisely the locus on which Q has rank less than 2. By taking a diagonalisation of Q we may write \(X_{U}\) as the vanishing of \(\sum A_{i}x_{i}^{2}\) for some \(A_{i} \in \kappa [U]\) and we obtain the classification of fibres by consideration of the rank.

Suppose then H is a smooth curve as given. Away from C, \(f^{*}H\) is clearly smooth, so it suffices to consider the intersection with C, however we can see it is smooth here by computing the Jacobian using the local description of X given above. \(\square \)

Theorem 4.15

[Embedded resolution of surface singularities] [11, Theorem 1.2] Suppose that V is a non-singular threefold, S a reduced surface in V and E a simple normal crossings divisor on V then there is a sequence of blowups \(\pi :V_{n} \rightarrow V_{n-1} \rightarrow ... V\) such that the strict transform \(S_{n}\) of S to \(V_{n}\) is smooth. Further each blowup is the blowup of a non-singular curve or a point and the blown up subvariety is contained in the locus of \(V_{i}\) on which the preimage of \(S+E\) is not log smooth.

Corollary 4.16

Suppose \((X,\Delta ) \rightarrow Z\) is a regular, tame conic bundle and we fix a very ample linear system |A| on Z. Then there is a log resolution \((X',\Delta ') \rightarrow (X,\Delta )\) such that for any sufficiently general element \(H\in |A|\), its pullback \(G'\) to \(X'\) has \((X',G'+E)\) log smooth for E the reduced exceptional divisor of \(\pi \).

Proof

By the previous theorem we may find birational morphism \(\pi :X' \rightarrow X\) which is a log resolution of \((X,\Delta )\) factoring as blowups \(X'=X_{n} \rightarrow X_{n-1} \rightarrow .... X_{0}=X\) of smooth subvarieties contained in the non-log smooth locus of each step.

We show first a general \(G'\) is smooth. At each stage we blow-up smooth curves \(V_{i}\) in the non-log smooth locus. Let \(G_{i}\) be the pullback of H to \(X_{i}\), suppose for induction it is smooth. That \(G_{0}\) is smooth is the content of Lemma 4.14 and so the base case of the induction argument holds.

We may assume that \(f_{i,*}V_{i}=V_{Z,i}\) is a curve for \(f_{i}:X_{i} \rightarrow X \rightarrow Z\) else a general H avoids it and so a general \(G_{i+1}\) is smooth also. Note that each vertical component of \(\Delta \) is log smooth near the generic point of their image, since X is a regular conic bundle, so \(V_{i}\) must be contained in the strict transform of some horizontal component of \(\Delta \). Since \(V_{i}\) is not contracted, it follows that \(V_{i} \rightarrow V_{Z,i}\) is separable as \((X,\Delta ,Z)\) is tame. Thus as a general H meets \(V_{Z,i}\) transversely, a general \(G_{i}\) meets \(V_{i}\) transversely and hence a general \(G_{i+1}\) is smooth. By induction then \(G'=G_{n}\) is smooth.

Suppose that V is a curve contained in the locus on which \(\pi ^{-1}\) is not an isomorphism that is not contracted by f. Then for a general point P of V, we claim that the fibre over P is log smooth. As before we argue by induction, the the base case trivially true. Suppose then that we blowup a curve \(V_{i}\) lying over V on X and \(V_{Z}\) on Z. Then \(V_{i}\) must meet the fibre over P transversally. Indeed \(V_{i} \rightarrow V \rightarrow V_{Z}\) is separable, as above, forcing \(V_{i} \rightarrow V\) to be separable also. But then \(V_{i}\) meets a general fibre transversally as claimed.

Suppose now that E is an integral exceptional divisor of \(X'\rightarrow X\). Let \(V=\pi _{*}E\), then as before general G meets V transversely if V is a curve, or not at all otherwise. Suppose V is a curve, then for a general point P of V, the fibre over P is a system of log smooth curves. Finally then the intersection of a general \(G'\) and E is a scheme of pure dimension 1 contained in the disjoint union of such systems of log smooth curves, in particular it is log smooth.

Suppose then we fix two exceptional divisors \(E_{1},E_{2}\) meeting at a curve V. Again we suppose that V is not contracted by \(f'=f \circ \pi \). Write \(\pi _{*}V=V_{X}\) and \(f'_{*}V=V_{Z}\). Then \(V_{X} \rightarrow V_{Z}\) is separable as before and for a general \(G'\) meeting V transversely, the intersection of G with \(\pi ^{*}V'\) is a log smooth system of rational curves, and then \(G.V \subseteq G.\pi ^{*}V_{X}\) is log smooth, or equally it is finitely many points with multiplicity 1. \(\square \)

Theorem 4.17

Let \((X,\Delta ) \rightarrow Z\) be a regular, tame conic bundle and |A| a very ample linear system on Z. Then there is a log resolution \((X',\Delta ') \rightarrow (X,\Delta )\) such that for a general \(H \in |A|\), the pullback \(G'\) to \(X'\) is smooth with \((X',\Delta '+G')\) log smooth.

Proof

Write E for the reduced exceptional divisor. For a general \(H \in |A|\) we let \(G=f^{*}H\) be the pullback to X. We then take \(X'\) as in Corollary 4.16.

Clearly a general \(G'\) avoids the intersection of any 3 components of \(\text {Supp}(\Delta ')+E\), and from above \((X',G'+E)\) is log smooth. Suppose D is a vertical component of \(\Delta \). Then either G can be assumed to avoid it, or to meet it at a smooth fibre. By the usual arguments, since the only non-contracted curves we blow up map separably onto their image, \(G'\) meets \(D'\) the strict transform of D on \(X'\) along a log smooth locus. Further this locus meets any exceptional divisor either transversally or not at all. Now suppose \(D_{2}\) is any other component of \(\text {Supp}(\Delta ')+E\) which does not dominate Z. Then if either \(D_{2}.D'\) has dimension less than 1 or is contracted over Z then a general \(G'\) avoids it, so suppose otherwise. In which case \(D_{2}\) must be exceptional over X with image \(V\subseteq D\) on X. However \(D_{2}.D'\) is just the strict transform of V inside \(D'\) and, for a general \(G'\), \(G'.D_{2}.D\) is log smooth as required.

It remains then to consider the horizontal components of \(\Delta \). Let D be any such component and \(D'\) its strict transform. Since \((X,\Delta ,Z)\) is tame, so is \((X',\Delta ',Z)\). In particular then \(D' \rightarrow Z\) is divisorially tamely ramified and so residually separated over Z away from finitely many points of Z. Hence by Bertini’s Theorem, Theorem 3.17, the pullback of a general H, which is just the intersection of a general \(G'\) with \(D'\) is smooth. Further as \(D' \rightarrow Z\) is divisorially tamely ramified, if V is any curve on \(D'\) not contracted over Z a general \(G'|_{D'}\) meets it transversally. Hence for any other component \(D_{2}\) of \(\text {Supp}(\Delta ')+E\) we have \((X',D'+D_{2}+G')\) log smooth for a general \(G'\) and the result follows. \(\square \)

Corollary 4.18

Suppose \((X,\Delta ,Z)\) is a terminal, sub \(\epsilon \)-klt, tame conic bundle. Take a general very ample H on Z, with \(G=f^{*}H\), then \((G,\Delta |_{G}=\Delta _{G})\) is sub \(\epsilon \)-klt.

Proof

Throwing away finitely many points of Z we may freely suppose that the conic bundle is regular.

By the previous theorem there is a log resolution \(\pi :(X',\Delta ') \rightarrow (X,\Delta )\) with \((X',\Delta '+G')\) smooth. Write \(\pi _{G}:G' \rightarrow G\) for the restricted map. Then \((K_{X'}+\Delta '+G')|_{G'}=\pi _{G}^{*}(K_{G}+\Delta _{G})=K_{G'}+\Delta '|_{G}\). However \(\Delta '|_{G}\) is log smooth with coefficients less than \(1-\epsilon \) by construction, and hence \((G,\Delta _{G})\) is \(\epsilon \)-klt by assumption. \(\square \)

5 F-split mori fibre spaces

The aim of this section is to prove the following theorem.

Theorem 5.1

For a field \(\kappa \) of positive characteristic we let \(S_{\kappa }\) be the set of \((X,\Delta )\), \(\epsilon \)-LCY threefold pairs with X terminal, globally F-split and rationally chain connected over \(\kappa \). We further require that \((X,\Delta )\) admits a \(K_{X}\) Mori fibration \(f:(X,\Delta ) \rightarrow Z\) where either

  1. (1)

    Z is a smooth rational curve, there is H on Z very ample of degree 1 and a general fibre G of \(X \rightarrow Z\) is smooth.

    $$\begin{aligned} \text {or} \end{aligned}$$
  2. (2)

    \(p>2\) and \((X,\Delta ) \rightarrow Z\) is a tame, terminal conic bundle such that there is a very ample linear system |A| on Z with \(A^{2} \le c\). In which case G the pullback of a sufficiently general \(H \in |A|\) is smooth with \((G,\Delta _{G}=\Delta |_{G})\) \(\epsilon \)-klt by Corollary 4.18.

Then the set of base varieties

$$\begin{aligned} S'=\{X \text { such that } \exists \Delta \text { with } (X,\Delta ) \in S_{\kappa } \text { for algebraically closed } \kappa \} \end{aligned}$$

is birationally bounded over \(\mathbb {Z}\).

Remark 5.2

In practice this will be applied to pairs over fields of characteristic \(p > 7,\frac{2}{\delta }\) with boundary coefficients bounded below by \(\delta \). The constraints on p come from Theorem 3.16 and Theorem 4.8, via Lemma 4.6.

This chapter is devoted to the proof, but the outline is as follows. We fix a general, very ample divisor H on the base and write \(G=f^{*}H\). Then argue that \(A=-mK_{X}+nG\) is ample, for mn not depending on \(X,\Delta \) or G. This is done by bounding the intersection of \(K_{X}\) with curves not contracted by f and generating an extremal ray in the cone of curves. We then show that in fact we may choose these mn such that A defines a birational map, by lifting sections from G using appropriate boundedness results in lower dimensions. The F-split assumption is used to lift sections from G with Lemma 2.13, it will also be needed to apply Theorem 5.1 by ensuring that the bases Z are suitably bounded.

If, for some \(t>0\), the non-klt locus of \((X,(1+t)\Delta )\) is contracted then since \((K_{X}+(1+t)\Delta ) \sim -tK_{X}\) it follows that every \(-K_{X}\) negative extremal ray is generated by a curve \(\gamma \) with \(K_{X}.\gamma \le \frac{6}{t}\). In particular as we have \(G.C \ge 1\) for any \(-K_{X}\) negative curve C it must be that \(-K_{X}+\frac{7}{t}G\) is ample. Clearly for any \((X,\Delta ) \rightarrow Z\) there is such a t, however we wish to find one independent of the pair. For this we may use a result due to Jiang, the original proof is a-priori for characteristic 0, but the proof is arithmetic in nature and holds in arbitrary characteristic.

Theorem 5.3

[18, Theorem 5.1] Fix a positive integer m and \(\epsilon >0\) a real number. Then there is some \(\lambda \) depending only on \(m,\epsilon \) satisfying the following property.

Take (TB) any smooth, projective \(\epsilon \)-klt surface. Write \(B=\sum b_{i}B_{i}\) and suppose \(K_{T}+B \equiv N-A\) for N nef and A ample. If \(B.N,\sum b_{i}, B^{2} \le m\) then \((T,(1+\lambda )B)\) is klt.

First we show that results of this form lift to characterisations of the non-klt locus of \((X,(1+t)\Delta )\), then show how the result above may be applied here.

Lemma 5.4

We use the notation of Theorem 5.1. Suppose Z is a surface and there is t such that \((G,(1+t)\Delta _{G})\) is klt. Then every curve in the non-klt locus of \((X,(1+t)\Delta )\) is contracted by f.

Proof

Let \(\pi :X' \rightarrow X\) be a log resolution of \((X,\Delta +G)\) with \(K_{X'}+\Delta '=\pi ^{*}(K_{X}+\Delta )\), then \((X',\Delta '+G')\) is log smooth and \(\Delta '\) and G have no common components, where \(G'\) is the pullback of G. Now \(X' \rightarrow X\) must also be a log resolution of \((X,(1+t)\Delta )\), and hence if we write \(K_{X'}+B=\pi ^{*}(K_{X}+(1+t)\Delta )\) then it is also true that \((X',B+G')\) is log smooth and that B and \(G'\) have no common components. Hence \((G',B|_{G'})\) is sub klt by assumption and in particular it has coefficients strictly less than 1.

Suppose Z is a non-klt center of \((X,(1+t)\Delta )\) and E is a prime divisor lying over Z inside \(X'\). Then E has coefficients strictly larger than 1 in B. Since \((X',B+G')\) is log smooth, it must be that \(E|_{G'}\) is an integral divisor and it is trivial if and only if E and \(G'\) do not meet. But then \(E|_{G'} =\lfloor E|_{G'} \rfloor =0\) and so E does not meet \(G'\). Hence neither does H meet \(f_{*}\pi _{*}E=f_{*}Z\). In particular if C is a curve in the non-klt locus, then there is an ample divisor H on Z not meeting \(f_{*}C\). This is possible only if \(f_{*}C\) is a point. \(\square \)

Lemma 5.5

Using the notation of Theorem 5.1 suppose that Z is a curve and write Y for the generic fibre of \(f:X\rightarrow Z\). If there is t such that \((Y,(1+t)\Delta _{Y})\) is klt, then every curve in the non-klt locus of \((X,(1+t)\Delta )\) is contracted by f.

Proof

This follows essentially as above. Take a log resolution \(\pi :(X',\Delta ') \rightarrow (X,\Delta )\). Write \(Y'\) for the generic fibre of \(X' \rightarrow Z\). Then \((Y',\Delta '|_{Y'}) \rightarrow (Y,\Delta _{Y})\) is a log resolution. Again write \(K_{X'}+B=\pi ^{*}(K_{X}+(1+t)\Delta )\). Then again if B has a component D with coefficient at least 1 then D cannot dominate Z, else it would pull back to \(G'\) to give a contradiction. Hence the non-klt locus of \((X,(1+t)\Delta )\) must be contracted as claimed. \(\square \)

Lemma 5.6

Using the notation of the previous lemmas. There is some \(\lambda \) independent of \((X,\Delta )\) and G for which the non-klt locus of \((X,(1+t)\Delta )\) is contracted for all \(t \le \lambda \).

Proof

We consider two cases.

Suppose first Z is a curve, so the generic fibre Y is a regular del Pezzo surface and \((G,\Delta _{G})\) is \(\epsilon \)-klt LCY. Then, by the work of Tanaka [32, Corollary 4.8], \((-K_{G})^{2} \le 9\). We write \(\Delta _{G}=\sum \lambda _{i}D_{i}\) and since G is regular we have \(D_{i}.K_{G} \ge 1\). Hence \(\sum \lambda _{i} \le \Delta _{G}.(-K_{G}) \le 9\) and \(\Delta _{G}^{2} =(-K_{G})^{2} \le 9\). We conclude the result holds by Theorem 5.3 with \(N=-K_{G}\) and \(A=-K_{G}\).

Suppose then that Z is a surface. Then by Lemma 4.14, G is a smooth surface, geometrically ruled over a general very ample divisor H on Z. Further by Corollary 4.18, \((G,\Delta _{G})\) is \(\epsilon \)-klt and by assumption \(K_{G}+\Delta _{G}\sim kF\) where F is the general fibre over H and \(H^{2}=k \le c\). Finally note that \(\Delta _{G}^{V} \sim _{f,\mathbb {Q}} 0\).

We may write \(\Delta _{G}= \sum \lambda _{i}D_{i}+ \sum \mu _{i}F_{i}\) where \(F_{i}\) are fibres over H and \(D_{i}\) dominate H. Since \(F_{i}\) is a fibre and G is smooth, each \(F_{i}\) is reduced by the genus formula and contains at most 2 components since \(-K_{X}.F_{i}=-2\). Further \(\Delta _{G}.F=(-K_{G}).F=2\) and hence \(\Delta _{G}^{2}=(-K_{G}+kF)^{2}=(-K_{G})^{2}-2kK_{G}.F +(kF)^{2} \le (-K_{G})^{2}+4c\) which in turn is bounded above by \(8+4c\) due to [3, Proposition 11.19], since G is a smooth geometrically ruled surface.

It remains then to show that the sum of the coefficients of \(\Delta _{G}\) is bounded. Note that \(\sum \lambda _{i} \le \sum \lambda _{i}D_{i}.F =\Delta _{G}.F =2\). We therefore need only bound \(\sum \mu _{i}\).

Suppose for contradiction that \(w=\sum \mu _{i} >3 +k\). Let \(B=\sum \lambda _{i}D_{i} +(1-\frac{3+k}{w})\sum \mu _{i}F_{i} \sim -K_{G}-(F^{1}+F^{2}+F^{3})\), for general fibres \(F^{i}\).

Then (GB) is klt and so by Lemma 3.13, \(D=F^{1}+F^{2}+F^{3}\) has 2 connected components, a clear contradiction.

Therefore we may choose A small and ample with \(A.\Delta _{G} < c\) and write \(N=kF+A\) to satisfy the conditions of Theorem 5.3. The result then follows as \(\Delta _{G}.N=kF.\Delta _{G}+A.B\le 3c\) is still bounded. \(\square \)

Corollary 5.7

There is some n such that for any \((X,\Delta ) \rightarrow Z\) and G as in Theorem 5.1 we have \(-K_{X}+nG\) is ample.

Proof

Take any \(n \ge \frac{7}{\lambda }\) for \(\lambda \) as in the previous lemma. Suppose R is a \(K_{X}+(1+\lambda )\Delta \equiv -\lambda K_{X}\) negative extremal ray. By construction, every curve in the nlc locus is contracted by \(X \rightarrow Z\) so \(-\lambda K_{X}\) is positive on \(\overline{NE(X)}_{nlc}\setminus \{0\}\). Hence by Theorem 3.9 any such ray is spanned by a curve C with \(0< -\lambda K_{X}.C \le 6\) and \(G.C >0\). Since G is Cartier, we have \(G.C > 0\) and hence \((-K_{X}+nG).G \ge \frac{1}{\lambda } >0\). In particular \(-K_{X}+nG\) is ample as claimed. \(\square \)

Theorem 5.8

Let \((X,\Delta ) \rightarrow Z\) and G be as in Theorem 5.1. Then there is t not depending on the pair \((X,\Delta )\) nor on G with \(-3K_{X}+tG\) ample and defining a birational map.

Proof

Consider first the case that \(\dim Z=1\). Then G is a smooth del Pezzo surface, so \(-3K_{G}\) is globally generated by [4, Proposition 2.14]. Let \(G_{1},G_{2}\) be other general fibres and consider

$$\begin{aligned}0\rightarrow & {} \mathcal {O}_{X}(-3K_{X}+kG-G_{1}-G_{2}) \rightarrow \mathcal {O}_{X}(-3K_{X}+kG)\\\rightarrow & {} \mathcal {O}_{G_{1}}(-3K_{G_{1}})\oplus \mathcal {O}_{G_{2}}(-3K_{G_{2}}) \rightarrow 0.\end{aligned}$$

Since X is globally F-split \(H^{i}(X,A)=0\) for all \(i>0\) and A ample by Lemma 2.13. In particular then \(H^{1}(X,\mathcal {O}_{X}(-3K_{X}+kG-G_{1}-G_{2}))\) vanishes when \(k\ge 3n+2\) for n as given by the proceeding corollary. Therefore we may lift sections of \(-3K_{G_{i}}\) to see that \(-3K_{X}+kG\) defines a birational map for any \(k \ge 3n+2\).

Suppose instead that \(\dim Z=2\), so G is a conic bundle. Choose a general \(H'\sim H\) on Z and let \(G'\) be its pullback. Consider \(A_{k}=(-K_{X}+kG)|_{G'}=(-k_{G'}+(k-1)dF)\) for \(d \ge 1\), where F is the general fibre of \(G'\rightarrow H'\). Then \(A_{k}\) is ample for \(k >n\) and is Cartier since G is smooth. In particular by the Fujita conjecture for smooth surfaces [33, Corollary 2.5], \(K_{G'}+4A_{k}\) is very ample. Choosing suitable \(k,k'\) we may write \(K_{G'}+4A_{k}=-3K_{G'}+4(k-1)dF=(-3K_{X}+k'G)|_{G'}\). Consider now

$$\begin{aligned}0 \rightarrow \mathcal {O}_{X}(-3K_{X}+(k'-1)G)\rightarrow \mathcal {O}_{X}(-3K_{X}+k'G)\rightarrow \mathcal {O}_{G'}(-3K_{G'}+4(k-1)dF) \rightarrow 0.\end{aligned}$$

Again the higher cohomology of \(-3K_{X}+(k'-1)G\) vanishes and we may lift sections to \(H^{0}(X,\mathcal {O}_{X}(-3K_{X}+k'G))\) from general fibres. In particular \(-3K_{X}+k'G\) separates points on a general \(G'\) so \(-3K_{X}+(k'+1)G\) separates general points and thus defines a birational map.

We may then pick some suitably large t for which the result holds as \(k,k'\) were chosen independently of \((X,\Delta ) \rightarrow Z\) and \(G,G_{1},G_{2}\). \(\square \)

Lemma 5.9

Let \((X,\Delta ) \rightarrow Z,S\) and G be as in Theorem 5.1 and t as in Theorem 5.8. Then there is some constant C with \((-3K_{X}+tG)^{3} \le C\) and \((X,\Delta )\in S\).

Proof

The anticanonical volumes \(Vol (X,-K_{X})\) are bounded by some V by Theorem 1.5 which is proved in the next section.

Suppose first \(\dim Z=1\). Then \(Vol (G,-K_{G})=(-K_{G})^{2} \le 9\) and so by Lemma 3.4

$$\begin{aligned} Vol (X,-3K_{X}+nG) \le Vol (X,-3K_{X}) + 3tVol (G,-3K_{G})\le 27(V+9t) \end{aligned}$$

as required.

Suppose instead then that \(\dim Z=2\). So G is a conic bundle over some H on Z with \(H^{2} \le c\). Hence we get

$$\begin{aligned} Vol (G,(-3K_{X}+tG)|_{G})= (-3K_{G}+(t+1)H^{2}F)^{2}=9K_{G}^{2}-2(t+1)H^{2}(K_{G}.F) \end{aligned}$$

where F is a general fibre of \(G \rightarrow H\). Hence F is a smooth rational curve and \(K_{G}.F=-2\) and \(Vol (G,(-3K_{X}+tG)|_{G})\le 72+4(t+1)c\). Then as before we may apply Lemma 3.6 to get

$$\begin{aligned} Vol (X,-3K_{X}+tG) \le Vol (X,-3K_{X}) + 3nVol (G,(-3K_{X}+tG)|_{G}) \end{aligned}$$

and boundedness follows. \(\square \)

Proof of Theorem 5.1

Suppose \((X,\Delta ) \in S\). Then \(A=-3K_{X}+tG\) is birational with bounded volume by the preceding results. Thus \(S'\) is birationally bounded by Corollary 2.22. \(\square \)

6 Weak BAB for mori fibre spaces

This section is devoted to providing a bound on the volume of \(-K_{X}\) under suitable conditions. Namely we show that the claim holds if X belongs to a suitable family of \(\epsilon \)-LCY Mori fibre spaces whose bases are bounded. We will work over fields of characteristic \(p>5\) as we will need to appeal to Theorem 3.12 at several points. In practice these results will be applied under the hypotheses of Theorem 1.5 with the constraints on characteristic needed to ensure X is a tame conic bundle, or a generically smooth del Pezzo fibration as appropriate. We consider first the case that X is a tame conic bundle over a surface.

Theorem 6.1

Pick \(\epsilon ,c >0\). Then there is \(V(\epsilon ,c)\) such that if \(f:(X,\Delta ) \rightarrow S\) is any projective, tame conic bundle over any closed field of characteristic \(p> 5\), \((X,\Delta )\) is \(\epsilon \)-klt and S admits a very ample divisor H with \(H^{2} \le c\), then \(Vol (-K_{X}) \le V(\epsilon ,c)\).

We may further assume that H and \(G=f^{*}H\) are smooth. Moreover H may be taken so that \((G,\Delta |_{G})\) is \(\epsilon \)-klt also by Corollary 4.18.

If \(Vol (-K_{X})=0\) the result is trivially true, so we may suppose that \(-K_{X}\) is big. In particular we may write \(-K_{X}\sim A+E\) where A is ample and \(E \ge 0\). Note that

$$\begin{aligned} -K_{X}-(1-\delta )\Delta \sim -\delta K_{X} \sim \delta A + \delta E \end{aligned}$$

for any \(0< \delta <1\). Choose \(\delta \) such that \((X,(1-\delta )\Delta +\delta E)\) and \((G,(1-\delta )\Delta |_{G}+\delta E|_{G})\) are \(\epsilon \)-klt and write \(B=(1-\delta )\Delta +\delta E\). Then (XB) is \(\epsilon \)-log Fano by construction. The proof follows essentially as in characteristic zero, which can be found in [19], but we include a full proof for completeness as some details are modified.

Lemma 6.2

[19, Lemma 6.5] With notation as above, \(Vol (-K_{X}|_{G}) \le \frac{8(c+2)}{\epsilon }\).

Proof

Suppose for contradiction \(Vol (-K_{X}|_{G}) >\frac{8(c+2)}{\epsilon }\) and choose r rational with \(Vol (-K_{X}|_{G})> 4r >\frac{8(c+2)}{\epsilon }\).

Write F for the general fibre of \(G \rightarrow H\). Then \(G|_{G}=H^{2}F=kF\) and for suitably divisible m and any n we have the following short exact sequence.

$$\begin{aligned} 0 \rightarrow \mathcal {O}_{G}(-mK_{X}|_{G}-nF) \rightarrow \mathcal {O}_{G}(-mK_{X}|_{G}-(n-1)F) \rightarrow \mathcal {O}_{F}(-mK_{F}) \rightarrow 0 \end{aligned}$$

In particular then \(h^{0}(G,-mK_{X}|_{G}-nF) \ge h^{0}(G,-mK_{X}|_{G}-(n-1)F)-h^{0}(F,-mK_{F})\). Hence by induction we have \(h^{0}(G,-mK_{X}|_{G}-nF) \ge h^{0}(G,-mK_{X}|_{G})-n\cdot h^{0}(F,-mK_{F})\).

Note however that, letting \(n=mr\) we have

$$\begin{aligned} \lim _{m \rightarrow \infty } \frac{2}{m^{2}}(h^{0}(G,-mK_{X}|_{G})-n\cdot h^{0}(F,-mK_{F}))= Vol (-K_{X}|_{G})-2rVol (-K_{F}) > 0 \end{aligned}$$

since F is a smooth rational curve. Hence \(-mK_{X}|_{G}-mrF\) admits a section for m sufficiently large and divisible. Choose an effective \(D\sim _{\mathbb {Q}} -K_{X}|_{G}-rF\).

Consider now

$$\begin{aligned}(G,(1-\frac{k+2}{r})B|_{G}+\frac{k+2}{r}D+F_{1}+F_{2})\end{aligned}$$

for two general fibres \(F_{1}, F_{2}\). This has

$$\begin{aligned}&-K_{G}+(1-\frac{k+2}{r})B|_{G}+\frac{k+2}{r}D+F_{1}+F_{2} \\ \sim&-(K_{X}|_{G}+kF)+\frac{k+2}{r})B|_{G}+\frac{k+2}{r}(-K_{X}|_{G}-rF)+F_{1}+F_{2} \\ \sim&-(1-\frac{k+2}{r})(K_{X}+B)|_{G} \\ \end{aligned}$$

and hence we may apply the Connectedness Lemma for surfaces, Theorem 3.11, to see that its non-klt locus is connected. Note that we have \(r> c+2 \ge k+2\) and so as \(-(K_{X}+B)\) is ample, this pair satisfies the assumptions of the Connectedness Lemma.

Since both \(F_{1}\) and \(F_{2}\) are contained in the non-klt locus, there must be a non-klt center W dominating H. Thus it follows that \((F,(1-\frac{k+2}{r})B|_{F}+\frac{k+2}{r}D|_{F})\) is non-klt. However \((F,(1-\frac{k+2}{r})B|_{F})\) is \(\epsilon \)-klt so we must have \(\deg (\frac{k+2}{r}D|_{F})\ge \epsilon \). Finally since \(D|_{F}\sim -K_{X}|_{F}=-K_{F}\) we have \(\deg (D|_{F})=2\) and hence \(\frac{2(c+2)}{r} \ge \frac{2(k+2)}{r} \ge \epsilon \), contradicting the choice of r. \(\square \)

Proof of Theorem 6.1

Take \(V(\epsilon ,c)=\frac{144(c+2)}{\epsilon ^{2}}\) suppose for contradiction that \( Vol (-K_{X}) > \frac{144(c+2)}{\epsilon ^{2}}\). Choose t with \(Vol (-K_{X})> t\cdot \frac{24(c+2)}{\epsilon } > \frac{144(c+2)}{\epsilon ^{2}}\) and consider the following short exact sequence.

$$\begin{aligned} 0 \rightarrow \mathcal {O}_{X}(-mK_{X}-nG)\rightarrow \mathcal {O}_{X}(-mK_{X}+(n-1)G)\rightarrow \mathcal {O}_{G}(-mK_{X}|_{G}-(n-1)G) \rightarrow 0 \end{aligned}$$

Arguing as before we see that \(h^{0}(X,-mK_{X}-tmG)\) grows like \(\frac{r}{6}m^{3}\) with \(r\ge Vol (-K_{X})-3tVol (-K_{X}|_{G}) >0\) by the previous lemma. In particular we may find \(D \sim _{\mathbb {Q}} -K_{X}+tG\).

Let \(\pi :Y \rightarrow X\) be a log resolution of \((X, (1-\frac{3}{t})B+\frac{3}{t}D)\). We may write \(K_{Y}+\Delta _{Y}+E=\pi ^{*}(K_{X}+(1-\frac{3}{t})B+\frac{3}{t}D)\) where \((Y,\Delta _{Y})\) is sub klt and E is supported on the non-klt places of \((X, (1-\frac{3}{t})B+\frac{3}{t}D)\).

As shown by Tanaka in [30, Theorem 1], since \(|L|=\pi ^{*}f^{*}|H|\) is base point free there is some m with \((Y,\Delta _{Y}+\frac{1}{m}(L_{1}+L_{2}+L_{3}))\) still klt for every choice of \(L_{i} \in |L|\). In particular, fixing some general \(z\in Z\) we may take \(H_{i} \in |H|\) meeting Z for \(1\le i \le 2m\) such that for any \(I \subseteq \{0,1,...,2m\}\) with \(|I| =3\) the following hold:

  • \((Y,\Delta _{Y}+\sum _{i \in I}\frac{1}{m}\pi ^{*}f^{*}H_{i})\) is klt;

  • \(\bigcap _{i\in I} H_{i}={z}\).

Thus we must have

$$\begin{aligned} Nklt (X,(1-\frac{3}{t})B+\frac{3}{t}D)=Nklt (X, (1-\frac{3}{t})B+\frac{3}{t}D+\frac{1}{m}f^{*}H_{i}) \end{aligned}$$

for each i.

Let F be the fibre over z and \(G_{1} = \sum _{i=1}^{2m} \frac{1}{m}H_{i}\). Then clearly \(\text {mult}_{F}(G_{1}) \ge 2\) and hence (XG) cannot be klt at F. By construction we have

$$\begin{aligned} Nklt (X, (1-\frac{3}{t})B+\frac{3}{t}D) \cup F = Nklt (X, (1-\frac{3}{t})B+\frac{3}{t}D+G_{1}). \end{aligned}$$

Similarly we may further take \(G_{2} \sim f^{*}H\) not containing F such that

$$\begin{aligned} Nklt (X, (1-\frac{3}{t})B+\frac{3}{t}D+G_{1}+G_{2}) = Nklt (X, (1-\frac{3}{t})B+\frac{3}{t}D+G_{1}). \end{aligned}$$

Now \(-(K_{X}+(1-\frac{3}{t})B+\frac{3}{t}D+G_{1}+G_{2}) \sim (1-\frac{3}{t})(K_{X}+B)\) is ample, so we may apply the Connectedness Lemma, Theorem 3.12, to see there is a curve in the non-klt locus of \((X,(1-\frac{3}{t})B+\frac{3}{t}D)\) meeting F. In particular then the non-klt locus dominates S. Hence we must also have that \((F,(1-\frac{3}{t})B|_{F}+\frac{3}{t}D|_{F})\) is not klt. However \((F,B|_{F})\) is \(\epsilon \)-klt and F is a smooth rational curve. Therefore by degree considerations, since \(-K_{X}|_{F} \sim D|_{F}\) we must have \(t \le \frac{6}{\epsilon }\), contradicting our choice of t. \(\square \)

Theorem 6.3

[Ambro-Jiang Conjecture for surfaces] [19, Theorem 2.8] Fix \(0<\epsilon <1\). There is a number \(\mu (\epsilon )\) depending only on \(\epsilon \) such that for any surface S over any closed field k, if S has a boundary B with (SB) \(\epsilon \)-klt weak log Fano then

$$\begin{aligned}\inf \{ulct (S,B;G) \text { where } G \sim _{\mathbb {Q}}-(K_{S}+B) \text { and } G+B \ge 0\}\ge \mu (\epsilon )\end{aligned}$$

Here \(ulct (S,B;G)= \sup \{t:(S,B+tG) \text { is lc and } 0 \le t \le 1\}\) and in particular it is at most the usual lct, if G is effective.

Though the proof is given for characteristic zero, it is essentially an arithmetic proof that the result holds for \(\mathbb {P}^{2}\) and \(\mathbb {F}_{n}\) for \(n \le \frac{2}{\epsilon }\). The arguments of the proof work over any algebraically closed field and as the bound is given explicitly in terms of \(\epsilon \) it is independent of the base field.

By applying this result to a general fibre of a Mori fibration over a curve we obtain the desired boundedness result.

Theorem 6.4

Pick \(\epsilon >0\). Suppose that \(f:X \rightarrow \mathbb {P}^{1}\) is a terminal threefold Mori fibre space with smooth generic fibre over a closed field of characteristic \(p> 5\). If there is a pair \((X,\Delta )\) which is \(\epsilon \)-LCY then \(Vol (-K_{X})\le W(\epsilon )\) for some \(W(\epsilon )\) depending only on \(\epsilon \).

Proof

By Corollary 5.7, there is some \(t(\epsilon )\ge 1\) depending only on \(\epsilon \) with \(-K_{X}+t(\epsilon )F\) ample, where F is a general fibre.

Let \(\mu =\mu (1)\) as given in Theorem 6.3 and take \(W(\epsilon )= \frac{27(t(\epsilon )+2)}{\mu }\). Suppose for contradiction \(Vol (-K_{X}) > W(\epsilon )\) and choose s rational with \(Vol (-K_{X})> 27s > W(\epsilon )\). Clearly \(s> \frac{(t(\epsilon )+2)}{\mu } > t(\epsilon )+2\).

For any n and for sufficiently divisible m, we have the following short exact sequence.

$$\begin{aligned}0 \rightarrow \mathcal {O}_{X}(-mK_{X}-nF) \rightarrow \mathcal {O}_{X}(-mK_{X}-(n-1)F) \rightarrow \mathcal {O}_{F}(-mK_{F})\rightarrow 0.\end{aligned}$$

This gives \(h^{0}(X,-mK_{X}-nF) \ge h^{0}(X,-mK_{X})-nh^{0}(F,-mK_{F})\) and subsequently

$$\begin{aligned}\lim _{m \rightarrow \infty } \frac{6}{m^{3}}(h^{0}(X,-mK_{X})-smh^{0}(F,-mK_{F})= Vol (-K_{X})-3sVol (-K_{F}).\end{aligned}$$

Since F is a smooth del Pezzo surface we have \(Vol (-K_{F})\le 9\). So by construction \(-mK_{X}-smF\) is effective for large, divisible m.

Choose \(D\ge 0\) with \(D \sim _{\mathbb {Q}} -K_{X}-sF\) and consider \((X,\frac{t(\epsilon )+2}{s}D+F_{1}+F_{2})\) for \(F_{1},F_{2}\) general fibres. By construction we have

$$\begin{aligned} -(K_{X}+\frac{t(\epsilon )+2}{s}D+F_{1}+F_{2})&\sim -(K_{X}-\frac{t(\epsilon )+2}{s}K_{X}-t(\epsilon )F)\\&\sim (1-\frac{t(\epsilon )+2}{s})(-K_{X}+t(\epsilon )F) + \frac{t(\epsilon )(t(\epsilon )+2)}{s}F \end{aligned}$$

which is ample since F is nef and \(-K_{X}+t(\epsilon )F\) is ample. Then the Connectedness Lemma, Theorem 3.12, gives that the non-klt locus is connected, and clearly contains \(F_{1},F_{2}\), so it must contain a non-klt center W which dominates \(\mathbb {P}^{1}\). Thus it must be that \((F,\frac{t(\epsilon )+2}{s}D|_{F})\) is not klt. However F is smooth, and equivalently terminal, with \(-K_{F}\sim D|_{F}\) ample, so by Theorem 6.3 it follows that \(\frac{t(\epsilon )+2}{s} \ge lct(F,0;D|_{F}) \ge \mu =\mu (1)\). Thus we have \(s \le \frac{t(\epsilon )+2}{\mu }\) contradicting our choice of s and proving the result. \(\square \)

7 Birational boundedness

We are now ready to prove the main theorems using the results of the previous sections.

Lemma 7.1

Suppose that \((X,\Delta )\) is an \(\epsilon \)-klt LCY pair in characteristic \(p>5\), with \(\Delta \ne 0\) and X both rationally chain connected and F-split. Then there is a birational map \(\pi :X \dashrightarrow X'\) such that \(X'\) has a Mori fibre space structure \(X' \rightarrow Z\) and \(\Delta '=\pi _{*}\Delta \) on \(X'\) making \((X',\Delta ')\) klt and \(\epsilon \)-LCY. Further both \(X'\) and Z are rationally chain connected and F-split and if X is terminal, so is \(X'\).

Proof

Replacing X by a \(\mathbb {Q}\)-factorialisation, we can assume X is \(\mathbb {Q}\)-factorial. This can be done by Theorem 3.8.

Since \((X,\Delta )\) is klt so is (X, 0) and hence we may run a terminating \(K_{X}\) MMP \(X=X_{0} \dashrightarrow X_{1} \dashrightarrow ... \dashrightarrow X_{n}=X'\) by Theorem 3.7. At each step \(X_{i} \dashrightarrow X_{i+1}\) we may pushforward \(\Delta _{i}\) to \(\Delta _{i+1}\), which is still klt since \(K_{X}+\Delta \equiv 0\). Similarly since \(X_{i}\) is F-split and rationally chain connected, so is \(X_{i+1}\) as these are preserved under birational maps of normal varieties. Since \(K_{X}\) cannot be pseudo-effective, \(X'\) has a Mori fibre space structure \(X' \rightarrow Z\), where Z is also rationally chain connected and F-split. If X is terminal we may run a \(K_{X}\) MMP terminating at a terminal variety, hence \(X'\) is terminal also. \(\square \)

Theorem 1.1

Fix \(0< \delta , \epsilon <1\). Let \(S_{\delta ,\epsilon }\) be the set of threefolds satisfying the following conditions

  • X is a projective variety over an algebraically closed field of characteristic \(p >7, \frac{2}{\delta }\);

  • X is terminal, rationally chain connected and F-split;

  • \((X,\Delta )\) is \(\epsilon \)-klt and log Calabi-Yau for some boundary \(\Delta \); and

  • The coefficients of \(\Delta \) are greater than \(\delta \).

Then there is a set \(S'_{\delta ,\epsilon }\), bounded over \(\text {Spec}(\mathbb {Z})\) such that any \(X\in S_{\delta ,\epsilon }\) is either birational to a member of \(S'_{\delta ,\epsilon }\) or to some \(X'\in S_{\delta ,\epsilon }\), Fano with Picard number 1.

Proof

Take any \((X,\Delta )\in S\) and replace it by a Mori fibre space \((X',\Delta ') \rightarrow Z\) by Lemma 7.1. Then Z is F-split and rationally chain connected. If Z is a surface then \(p>\frac{2}{\delta }\) ensures that \((X',\Delta ')\rightarrow Z\) is a tame conic bundle by Lemma 4.6. In particular Z admits a boundary \(\Delta _{Z}\) such that \((Z,\Delta _{Z})\) is \(\epsilon \)-LCY by Theorem 4.8. Hence by BAB for surfaces, Theorem 3.2, there is |A| a very ample linear system on Z with \(A^{2}\le c\) for some c independent of \(X,\Delta ,Z\).

On the other hand, if Z is a curve then it is a smooth rational curve and \(p>7\) gives that the general fibre of \(X \rightarrow Z\) is smooth by Theorem 3.16. Let then \(S'_{\delta ,\epsilon ,V}\) be set of such Mori fibre space \((X',\Delta ') \rightarrow Z\) with Z not a point and \(Vol (-K_{X})\le V(\epsilon ,c)\). In both cases we conclude by Theorem 5.1 that the set is birationally bounded. \(\square \)

Theorem 1.5

Fix \(0< \delta , \epsilon <1\) and let \(T_{\delta ,\epsilon }\) be the set of threefold pairs \((X,\Delta )\) satisfying the following conditions

  • X is projective over a closed field of characteristic \(p >7,\frac{2}{\delta }\);

  • X is terminal, rationally chain connected and F-split;

  • \((X,\Delta )\) is \(\epsilon \)-klt and LCY;

  • The coefficients of \(\Delta \) are greater than \(\delta \); and

  • X admits a Mori fibre space structure \(X \rightarrow Z\) where Z is not a point.

Then the set \(\{Vol (-K_{X}) :\exists \Delta \text { with } (X,\Delta ) \in T_{\delta ,\epsilon }\}\) is bounded above.

Proof

Take \((X,\Delta ) \in T_{\delta ,\epsilon }\) and let \(X \rightarrow Z\) be the associated Mori fibre space structure. If Z is a curve then we conclude that \(Vol (-K_{X})\) is bounded by Theorem 6.4 in light of Theorem 3.16. If instead Z is a surface then the set of possible such Z is bounded by Theorem 3.2 and Theorem 4.8 as above. Hence we conclude the claim by Theorem 6.1. \(\square \)