1 Introduction

Each species on earth is the product of some evolutionary history, both unique to itself and shared with other species. Phylogenetic diversity indices are a family of measures that quantify this history on a species-by-species basis. They do so by assigning to each species a numerical score that aims to indicate that species’ contribution to biodiversity. One characteristic of diversity indices is that they calculate this contribution based on species’ positions in a rooted phylogenetic tree. This is in contrast to other approaches such as measuring the mean ‘patristic distance’ to other species or the shortest such distance, the ‘pendant edge length’ approach (Redding et al. 2014). Moreover, unlike distance-based methods which measure the difference between species and their close relatives, diversity indices use phylogenetic information going right back to the root of the phylogeny under investigation.

Diversity indices and the scores they provide have found practical application in conservation biology. In particular, they have been used to suggest prioritisation rankings for biodiversity conservation, often in conjunction with other measures. For example, see Isaac et al. (2007), Bordewich et al. (2008), Jetz et al. (2014), Forest et al. (2018) and other work by the EDGE of Existence programme (EDGE of Existence Programme 2022). One aim of using diversity indices is to move beyond conservation that overwhelmingly protects the most charismatic species at the expense of others (Mace et al. 2003). The use of these measures can inform the distribution of conservation resources, prioritising those programmes that spread benefits and protection more widely, and properly reflect the full breadth of biodiversity.

That said, conservation efforts take place at a time heavily impacted by extinctions of species (Davis et al. 2018) and these extinction events impact the very measures that conservation biologists use to try to prevent them. Figuratively speaking, extinctions remove branches from the ‘tree of life’, thereby altering the phylogenetic tree structure on which these measures are based. For a conservation programme based on diversity indices, each extinction event necessitates the recalculation of diversity index scores for surviving species. This may lead to changes in relative rankings among species. Since conservation programmes often employ large amounts of scarce resources (White et al. 2022) and require community buy-in (Griffiths et al. 2019), it is useful to know if and when extinction events could lead to recalculations that give markedly different results. Large changes in priorities, and any subsequent reallocation of resources that followed, would not be conducive to the long-term planning required of many conservation programmes. Ideally, phylogenetic diversity indices would not only give informative prioritisation rankings but also be ‘robust’, that is, not particularly sensitive to changes caused by extinction.

Two diversity index methods have proven to be most popular, essentially to the exclusion of other approaches in this area. These are the Fair Proportion (FP) (Redding 2003; Isaac et al. 2007) and Equal-Splits (ES) (Redding 2003; Redding et al. 2014) indices. Nonetheless, it is possible to define many other diversity indices that follow our definition (see p. 5) and we give some examples of these in Sect. 2.1. But in current practice only FP and ES are diversity indices of any significance. A third diversity index based on the Shapley value from co-operative game theory (Shapley 1953) has also been considered. Yet it was subsequently shown that on rooted phylogenetic trees the Shapley value is identical to the Fair Proportion index (Fuchs and Jin 2015). Other evolutionary isolation measures exist, such as those evaluated in Redding et al. (2014) and a measure based on another co-operative game approach, the Banzhaf index (discussed in Supplementary material). However, none of these measures are phylogenetic diversity indices by the definition used here, largely ignoring edge lengths altogether. For this reason they will be set aside in the present discussion.

A recent paper by Fischer et al. (2023) assesses the robustness of the Fair Proportion index on rooted phylogenetic trees. Those authors showed that species rankings under the FP index can change markedly following certain extinction events. Furthermore, “for each phylogenetic tree, there are edge lengths such that the extinction of one leaf per cherry completely reverses the ranking” (Fischer et al. 2023, p. 2.) We call such an outcome a ‘ranking reversal’ induced by a set of extinctions. Their results raise concerns about the appropriateness of the Fair Proportion index and thus there would seem to be merit in developing an alternative. Some recent theoretical work has begun to look at properties of diversity indices in an abstract sense, with a view to evaluating their effectiveness as measures of species-level diversity (Bordewich and Semple 2024; Manson and Steel 2023). One motivating factor is that by taking a properties-first approach it may be that we can discover useful diversity index measures beyond FP and ES that do not share their flaws. However, our main results here (Theorems 5 and 9) show that a lack of robustness is exhibited by a large class of diversity indices. Moreover, we argue that those diversity indices outside this class have an unrealistic basis and thus any biologically reasonable diversity index used as an alternative to FP will not avoid the robustness problem. For this reason we do not claim that any alternative diversity index is unambiguously better than FP or ES for diversity index applications.

The FP index was a natural choice for Fischer et al. (2023) to begin the study of robustness of diversity indices. Given its relative importance, we also investigate the ES index specifically. The ES index generally requires more species to go extinct to induce a ranking reversal than FP does. This number may still be small though, and we give an example of a ranking reversal under ES caused by the extinction of just two species. Following Fischer and colleagues, we initially place no constraint on the size of edge lengths required to obtain such reversals. The examples contained in the original paper, the constructive methods appearing in the proofs of their Theorem 2 (Fischer et al. 2023, Supp. Mat. pp. 2–15) and Theorems 5 and 9 here all tend to place the leaves at quite varied distances from the root vertex. This leads to questions about whether similar results can be obtained when edge lengths obey an ultrametric constraint. In particular:

“if we restrict the analysis to ultrametric trees where all leaves have the same distance to the root, what are the worst-case scenarios in this setting?” (Fischer et al. 2023, p. 5)

By “worst-case” those authors refer to a combination of edge lengths and extinctions that re-orders the FP index score ranking as much as possible. In Sect. 5 we outline some necessary conditions for a reversal of FP index scores in the ultrametric context. We then present examples of ultrametric rooted phylogenetic trees for which FP index score rankings are completely reversible. This answers the question above by showing that the “worst-case scenario” on an ultrametric phylogenetic tree is as bad as possible.

The rest of this paper is organised as follows. We begin with a section of preliminary definitions and notation before describing a handful of diversity indices besides FP and ES. The following two sections contain our main results: that two large classes of diversity index, called ‘non-rigid interior’ and ‘rigid interior’ indices respectively, are not robust in the sense described above. Section 5 focusses on robustness under an ultrametric constraint on edge lengths and is followed by a short final section of concluding remarks.

2 Preliminaries

Let X be a non-empty set of taxa (e.g. species), with \(|X|= n\). A rooted phylogenetic X-tree is a rooted tree \(T = (V,E)\), where X is the set of leaves, and all edges are directed away from a distinguished root vertex \(\rho \). An interior vertex of T is any vertex that is not a leaf. A rooted phylogenetic tree is called binary if every interior vertex has out-degree 2. All edges drawn in this paper will be directed down the page.

Let \(P(T;\rho ,v)\) be the unique path in T from the root \(\rho \) to \(v \in V(T)\). For any edge \(e \in E(T)\), we write \(x \in c_T(e)\) if \(x \in X\) and \(P(T;\rho ,x)\) includes e. That is, \(c_T(e)\) is the set (cluster) of leaves descended from the terminal vertex of e. For \(v \in V(T)\), we also write \(x \in c_T(v)\) if \(x \in X\) and \(P(T;\rho ,x)\) includes v. If the (directed) edge (uv) appears in a phylogenetic tree T, we say that u is the parent of v. If two distinct leaves \(x_1\) and \(x_2\) in X have the same parent vertex v and no other vertex in V(T) has v as a parent then we call \(\{x_1,x_2\}\) a cherry. A rooted binary phylogenetic tree with exactly one cherry is called a caterpillar tree. If a set of \(m \ge 2\) leaves \(Y \subseteq X\) all have the common parent vertex v and no other vertex in V(T) has v as a parent then we call Y an m-cherry.

The term ‘pendant’ is used in two related senses in this paper. First, a pendant edge is an edge whose terminal vertex is a leaf. Second, a pendant subtree of T is any subtree that does not contain the root vertex and can be a connected component of a graph formed from T by the deletion of exactly one edge. We write \(P_e\) for the pendant subtree that would be formed from the deletion of edge e. A rooted phylogenetic tree has a number of maximal pendant subtrees equal to the out-degree of the root, each formed by the deletion of an edge incident with the root.

We denote the two maximal pendant subtrees of a rooted binary phylogenetic tree T by \(T_a\) and \(T_b\). Further subtrees, in the nonbinary case, shall be denoted similarly. The tree shape of T may be expressed in terms its maximal pendant subtrees by writing \(T=(T_a,T_b,\ldots )\). We may extend this notation to non-root non-leaf vertices of T, writing \(T_a(v)\), \(T_b(v)\), and so on, for the maximal pendant subtrees contained within the pendant subtree rooted at vertex v. The edge connecting the root vertex to \(T_a\) will be labelled a, and the set of leaves in \(T_a\) will be denoted \(X_a\), with \(|X_a |= n_a\). Parallel notation applies to the other maximal pendant subtrees of T. Figure 1 illustrates this notation for the simplest case, where \(\rho \) has out-degree 2.

Fig. 1
figure 1

Schematic diagram showing the notation for a rooted phylogenetic tree T and its maximal pendant subtrees \(T_a\) and \(T_b\), which are descended from the root vertex \(\rho \) via edges a and b

The edges of rooted phylogenetic trees in this paper are positively weighted. Let T be a rooted phylogenetic X-tree and let \(\ell : E(T) \rightarrow \mathbb {R}^{>0}\) be a function that assigns a positive real-valued length \(\ell (e)\) to each edge \(e \in E(T)\). Suppose that \(u,v \in V(T)\) are two vertices of T connected by a directed path from u to v. Then the distance from u to v, denoted d(uv), is the sum of the lengths of the edges in this path. If \(\ell \) is such that for every two distinct leaves x and y we have \(d(\rho ,x) = d(\rho ,y)\), we say that \(\ell \) satisfies the ultrametric condition.

A (phylogenetic) diversity index on a rooted phylogenetic tree T is a function that assigns a portion of the total edge length of T to each species. This can be seen as partitioning the total evolutionary history, or phylogenetic diversity (Faith 1992), of a phylogenetic tree among its species. Loosely speaking, these functions allocate the value of each edge length among that edge’s descendants in such a way that respects the symmetries of the tree shape. We write \(\varphi _{T,\ell }\) to denote the diversity index \(\varphi \) applied to the phylogenetic tree T given edge lengths by \(\ell \), although one or both of the subscripts may be omitted when clear from context. The formal definition of a diversity index builds on the related class of allocation functions (Manson and Steel 2023). Let \(T = (V,E)\) be a rooted phylogenetic X-tree with edge length assignment function \(\ell \). An allocation function \(\varphi _\ell : X \rightarrow \mathbb {R}^{\ge 0}\) is a real-valued function on the set of leaves of T that both satisfies the following equation:

$$\begin{aligned} \sum \limits _{x\in X} \varphi _\ell (x) = \sum \limits _{e \in E} \ell (e), \end{aligned}$$

and moreover may be expressed as \(\varphi _\ell (x) = \sum _{e \in E} \gamma (x,e)\ell (e)\), for every edge length assignment function \(\ell \), where all of the coefficients \(\gamma (x,e)\) are from the interval [0, 1]. Importantly, this means each such coefficient is non-negative.

A diversity index (on T) is an allocation function \(\varphi _\ell : X \rightarrow \mathbb {R}^{>0}\) given by \(\varphi _\ell (x) = \sum _{e \in E} \gamma (x,e)\ell (e)\) for every edge length assignment function \(\ell \), that additionally satisfies the conditions (DI\(_1\)) and (DI\(_2\)) below:

  • (DI\(_1\)) Descent condition: \(\gamma (x,e) = 0\) if x is not descended from e.

  • (DI\(_2\)) Neutrality condition: The coefficients \(\gamma (x,e)\) are a function of the tree shape of \(P_e\). Moreover, suppose that \(P_e\) and \(P_f\) are pendant subtrees of T with the same tree shape. If the leaves x in \(P_e\) and y in \(P_f\) appear in corresponding positions in their respective subtrees, then \(\gamma (x,e)~=~\gamma (y,f)\).

One important consequence of these conditions is that \(\gamma (x,e) = 1\) if e is the edge with leaf x as its terminal vertex. That is, the length of the edge incident to each leaf represents the evolutionary history unique to that leaf’s species, so its value must be allocated entirely to that species and no other.

We call \(\varphi (x)\) the diversity index score of x under \(\varphi \). When comparing these scores we use, and repeat here, the definitions of ranking, strict, and reversible as they appear in Fischer et al. (2023, pp. 2, 3). A ranking \(\pi (S,f)\) for a set \(S~=~\{s_1,\dots ,s_n\}\) based on a function \(f: S \rightarrow \mathbb {R}\) is an ordered list of the elements of S such that \(f(s_i) \ge f(s_j)\) if and only if \(s_i\) appears before \(s_j\) in \(\pi \). A ranking \(\pi (S,f)\) is called strict if none of the values of f in S are equal. A ranking \(\pi _T\) is called reversible for diversity index \(\varphi \) if there is a subset \(X^\prime \) of X whose removal from T leads to an induced subtree \(\tilde{T}\) whose corresponding ranking \( \pi _{\tilde{T}} = \pi (X\backslash X^\prime ,\varphi _{\tilde{T}})\) ranks the species in the opposite order to the ranking \(\pi _T= \pi (X\backslash X^\prime ,\varphi _T)\). We call the removal of species from X an extinction event and will write \(\tilde{X} = \{\tilde{x}_1, \tilde{x}_2, \ldots , \tilde{x}_t\}\) for the t species from X that survive an extinction event. The induced subtree \(\tilde{T}\) is the minimal subtree of T that spans \(\tilde{X}\) and the root vertex. Note that extinction events often cause vertices with out-degree one to appear. We suppress any such vertex and sum the lengths of its incident edges to give the length of the resultant edge in \(\tilde{T}\).

It will be useful to discuss diversity indices on T in terms of their ratios of allocations, which we now define. Let \(\Gamma _i(v,e) = \sum _{x \in T_i(v)} \gamma (x,e)\), that is, the sum of all coefficients associated with both edge e and some leaf in the i-th pendant subtree below vertex v. Let \(e=(u,v)\) be an edge of T. Then the ratio \(\Gamma _1(v,e): \cdots : \Gamma _d(v,e)\) is called the ratio of allocations at v, where d is the out-degree of v. Moreover, diversity indices in this paper will be assumed to be consistent in the sense that the values \(\Gamma _i(v,f)\) lie in the same ratio as the ratio of allocations at v for every edge f in the path between the root vertex and v. Each diversity index may be described by its (consistent) ratios of allocations, with Manson and Steel (2023) showing both how these ratios may be converted into \(\gamma (x,e)\)-type coefficients and that all diversity indices may be expressed in a consistent form.

Two types of diversity index will be considered further, defined in terms of their ratios of allocations. A boundary diversity index is a diversity index where it is possible for a term in a ratio of allocations to equal zero. In other words, boundary diversity indices contain ratios of allocations that mean some leaf is allocated no portion of the evolutionary history arising from one or more of its ancestral edges. We call any diversity index that is not boundary an interior diversity index. Equivalently, a diversity index \(\varphi (x) = \sum _{e \in E}\gamma (x,e)\ell (e)\) is interior if and only if \(\gamma (x,e)\) is strictly positive whenever x is descended from e. The names ‘boundary’ and ‘interior’ refer to the position of a diversity index in \(S(T,\ell )\), the compact convex space of diversity indices on the tree T under edge length assignment \(\ell \) (see Manson and Steel 2023 for details).

Given a rooted phylogenetic tree T, its non-root non-leaf vertices may be categorised by an equivalence relation \(\sim \), where for \(u,v \in V(T)\) we write \(u \sim v\) if and only if the multiset of tree shapes \(\{T_a(u),T_b(u), \ldots \}\) is the same as the multiset of tree shapes \(\{T_a(v),T_b(v),\ldots \}\). Each diversity index can be thought of as a rule that determines ratios of allocations for every possible \(\sim \)-equivalence class. For a \(\sim \)-equivalence class with representative v, we say that [v] has breadth equal to \(|c_T(v)|\). Let a singular \(\sim \)-equivalence class be one whose representative vertex is the parent of a leaf of T. Equivalently, for a singular \(\sim \)-equivalence class with representative v, the multiset of tree shapes \(\{T_a(v),T_b(v),\ldots \}\) contains a tree that consists of a single vertex. In Fig. 3, vertices q, s and v are from distinct singular \(\sim \)-equivalence classes. A rigid diversity index is a diversity index where the ratio of allocations is the same for at least two singular \(\sim \)-equivalence classes of different breadth on some rooted phylogenetic tree. We note that the majority of diversity indices are non-rigid interior indices. See Supplementary Material for details on the relative numbers of rigid, non-rigid and boundary indices.

2.1 Examples of diversity indices

In this section we present those diversity indices known from the literature and introduce some further examples to aid discussion. A small phylogenetic tree in Fig. 3 is used to compare the effects of various diversity index functions. We begin with the formal definitions of the Fair Proportion (or Evolutionary Distinctiveness) index and the Equal-Splits index on a rooted phylogenetic X-tree T. For each leaf \(x \in X\), the Fair Proportion (FP) index (Redding 2003; Isaac et al. 2007) of x in T is given by

$$FP_T(x) = \sum \limits _{e \in P(T;\rho ,x)} \frac{\ell (e)}{|c_T(e)|}.$$

Now let \(e =(u,v)\) be an edge of T. We define \(\pi (e,x)\) to be the product of the out-degrees of the interior vertices in the path P(Tvx) unless e is a pendant edge, in which case \(\pi (e,x) = 1\). Then for each leaf \(x \in X\), the Equal-Splits (ES) index (Redding 2003; Redding et al. 2014) of x in T is given by

$$ES_T(x) = \sum \limits _{e \in P(T;\rho ,x)} \frac{\ell (e)}{\pi (e,x)}.$$

These two indices are both interior diversity indices, but are different in terms of being rigid or not, as shown in the following lemmas.

Lemma 1

(i) The Equal-Splits diversity index is interior. (ii) The Fair Proportion diversity index is interior.

Proof

Let v be an interior non-root vertex of a rooted phylogenetic X-tree T. Suppose v has out-degree d and that \(T_1(v)\), \(T_2(v)\), ..., \(T_d(v)\) are the maximal pendant subtrees descended from v. Let \(|X |= n\) and write \(n_i\) for the number of leaves in \(T_i(v)\).

(i) Under Equal-Splits the ratio of allocations at v is \(\frac{1}{d}: \frac{1}{d}: \cdots : \frac{1}{d}\). Each term in this ratio is non-zero for every choice of v. Hence ES is an interior diversity index.

(ii) Under Fair Proportion the ratio of allocations at v is \(\frac{n_1}{n}: \frac{n_2}{n}: \cdots : \frac{n_d}{n}\). Each term in this ratio is non-zero for every choice of v. Hence FP is an interior diversity index. \(\square \)

Lemma 2

(i) The Equal-Splits diversity index is rigid. (ii) The Fair Proportion diversity index is not rigid.

Proof

Let T be a rooted phylogenetic tree with at least two singular \(\sim \)-equivalence classes of different breadth. Let u and v be vertices that are representatives of distinct \(\sim \)-equivalence classes where \(|c_T(u) |= n_u\) and \(|c_T(v) |= n_v\), and \(n_u \ne n_v\).

(i) Suppose T is binary. Then the ratio of allocations is \(\frac{1}{2}: \frac{1}{2}\) for every vertex under Equal-Splits. Thus the ratios of allocations at u and v are the same and hence ES is rigid.

(ii) Under Fair Proportion the term in the ratio of allocations pertaining to a leaf that has parent u is \(\frac{1}{n_u}\). Similarly, a leaf with parent v has the associated term of \(\frac{1}{n_v}\) in the ratio of allocations at v. As \(n_u \ne n_v\) these terms are different, and since these terms are necessarily the smallest terms in their respective ratios of allocations, the overall ratios must be different as well. Therefore FP is not rigid. \(\square \)

Fig. 2
figure 2

a The Equal-Splits index is a rigid diversity index, as the ratio of allocations is unchanged from u to v although the number of leaves in the left-hand subtree (represented by a triangle) is changed. b In contrast, the Fair Proportion index is not a rigid diversity index, as can be seen from the different ratio of allocations in each case

Figure 2 pictorially represents the rigid/non-rigid classifications for ES and FP, as per Lemma 2. These two diversity indices are essentially the only ones found in existing biodiversity literature. We give a handful of further examples to outline the breadth of the diversity index concept and to illustrate some of the terms introduced above. The problem of finding useful diversity indices to complement FP and ES does not lie in finding functions that satisfy the diversity index definition, but rather in finding such functions that are biologically justified. The examples should therefore not all be interpreted as practical solutions to the problem of partitioning evolutionary history. For simplicity’s sake, we momentarily restrict our attention to rooted binary phylogenetic trees. Our examples are defined by a rule that determines their ratio of allocations at an arbitrary vertex v with maximal descendant pendant subtrees \(T_a(v)\) and \(T_b(v)\). Table 1 displays the ratios of allocations when the example indices are applied to named vertices from the tree in Fig. 3. Note that, by the neutrality condition of the diversity index definition, the vertices labelled u, w and y in Fig. 3 will have a ratio of allocations of \(\frac{1}{2}: \frac{1}{2}\) for every diversity index.

Fig. 3
figure 3

A rooted binary phylogenetic tree T. Many diversity indices may be defined on T by specifying ratios of allocations at vertices p, q, r, s, t and v

Table 1 Ratios of allocation for diversity indices discussed in Sect. 2.1, at labelled vertices from the tree T in Fig. 3

Our first index, \(\alpha \), compares the two pendant subtrees \(T_a(v)\) and \(T_b(v)\) and prioritises the subtree that contains more leaves. If both \(T_a(v)\) and \(T_b(v)\) contain the same number of leaves, the \(\alpha \) index has a \(\frac{1}{2}: \frac{1}{2}\) ratio of allocations at v. Otherwise \(\alpha \) has a \(\frac{2}{3}: \frac{1}{3}\) ratio of allocations at v that allocates \(\frac{2}{3}\) of the preceding evolutionary history to the more populous subtree. Our second index, \(\beta \), follows the same approach, but takes the ratios to an extreme. Again, if both \(T_a(v)\) and \(T_b(v)\) contain the same number of leaves, the \(\beta \) index has a \(\frac{1}{2}: \frac{1}{2}\) ratio of allocations at v. Otherwise \(\beta \) has a 1 : 0 ratio of allocations at v that allocates all of the preceding edge’s length to the more populous subtree. Another similar diversity index, \(\gamma \), can be constructed by taking the other extreme, that is where each 1 : 0 ratio of allocations gives all of the preceding edge’s length to the less populous subtree. The indices \(\alpha , \beta \) and \(\gamma \) can be seen as three functions from a family of diversity indices that includes ES as its midpoint. All are rigid diversity indices, as the same ratio is used for many \(\sim \)-equivalence classes. Indices \(\beta \) and \(\gamma \) are also boundary diversity indices.

We can also choose other aspects of a tree’s structure on which to base a diversity index’s ratios of allocations. For example, the index \(\delta \) uses the number of cherries rather than the number of leaves: if both \(T_a(v)\) and \(T_b(v)\) contain the same number of cherries, the \(\delta \) index has a \(\frac{1}{2}: \frac{1}{2}\) ratio of allocations at v. Otherwise \(\delta \) has a \(\frac{3}{4}: \frac{1}{4}\) ratio of allocations at v that allocates \(\frac{3}{4}\) to the subtree with the greater number of cherries. (The value of \(\frac{3}{4}\) was chosen arbitrarily from the range [0, 1].) A further feature that could be used is the number of interior vertices in \(T_a(v)\) and \(T_b(v)\) that are parents to exactly one leaf. This feature is used by the \(\epsilon \) index, where the ratios of allocations are proportionate to the number of such vertices in each maximal pendant subtree descended from v, or a \(\frac{1}{2}: \frac{1}{2}\) ratio if no such vertices exist. The use of proportionate allocations in this way is closer to how the Fair Proportion index works than the use of fixed ratios in \(\alpha \), \(\beta \), \(\gamma \) and \(\delta \).

In addition to direct proportions of leaves, cherries, interior vertices or other features of tree topology, we can form ratios of allocations based on some function of these numbers. The (non-rigid) \(\zeta \) diversity index uses the proportions of the squares of the numbers of leaves in each subtree to determine its ratios of allocations. We can also combine existing diversity indices piecewise. For example, our \(\eta \) index has ratios of allocations that match ES for vertices that have five or more descendant leaves and ratios that match FP for each other vertex. A final type of diversity index that we mention here is the arbitrary diversity index \(\theta \). We sampled five integers between 0 and 8 and used each as the numerator of the left hand term in the ratios in our table. This is clearly not a biologically-relevant approach but does determine a legitimate diversity index. Lastly we note that linear combinations of diversity indices are themselves diversity indices, subject to some basic constraints (Manson and Steel 2023).

The indices above highlight the wide variety of possibilities using the diversity index concept. Each diversity index carries with it a set of assumptions about how evolutionary history is shared or embodied among descendants. The usefulness of each index depends on how biologically credible these assumptions are, because the differences in assumptions are reflected in the scores given to each species. For example, the indices in Table 1 give rise to the following scores for leaf \(x_3\) if all edges have unit length: \(FP(x_3) = 1.88\), \(ES(x_3) = 1.94\), \(\alpha (x_3) = 1.78\), \(\beta (x_3) = 1\), \(\gamma (x_3) = 2\), \(\delta (x_3) = 1.61\), \(\epsilon (x_3) = 2.88\), \(\zeta (x_3) = 1.63\), \(\eta (x_3) = 1.77\) and \(\theta (x_3) = 1.75\) (all rounded to 2 d.p.). Note the large range of values, from 1 to 2.88 units.

The Fair Proportion index assumes that each species descended from an edge exhibits that edge’s evolutionary developments equally. For Equal-Splits the assumptions are that at the time of the speciation event that terminated an edge, that edge’s developments were embodied equally among the new lineages, and that further speciation events do not alter this original separation. Index \(\beta \) effectively considers the less populous subtrees to be developing evolutionary history from scratch after a speciation event, whereas \(\gamma \) places this assumption on the more populous subtrees. We suggest that while the assumptions for FP and ES are reasonable, the final two are not readily justified.

It turns out that the \(\beta \) index can be more robust than interior indices (see Sect. 4, page 17 for details). Balanced against this observation is the fact that often it achieves this robustness by essentially ignoring much of the tree structure. Many \(\beta \) index scores are simply pendant edge lengths. This circumvents the entire reason for using phylogenetic trees to give a more structured and nuanced picture of evolution compared to plain distance methods. Hence we do not see \(\beta \) as a comprehensive solution to the robustness problem. Boundary indices can also act in an inconsistent and somewhat arbitrary fashion. A zero term in a ratio of allocations is making a strong claim about the way evolutionary history is embodied in living species. Each zero allocation excludes some species from a share of ancestral evolutionary developments, despite these species being descended from the ancestor where such developments arose. We therefore conclude that boundary diversity indices are not a good model for measuring the evolutionary history of species and are only interesting as the theoretical limit of the diversity index concept.

3 Non-rigid interior diversity indices

We begin our examination of robustness with a broad question: Given the freedom to choose any positive edge lengths for a known rooted phylogenetic tree, is there a set of extinctions that induces a strict and reversible ranking on the surviving species’ diversity index scores? Note that eventually, after more and more extinctions, we must arrive at a set of surviving leaves that form a strict and reversible ranking, regardless of the particular diversity index (even if this is a trivial set containing one or two leaves). Hence the pertinent question is: How many extinctions are necessary and sufficient to achieve this effect? The answer to this question depends on the diversity index being used and the structure of the given phylogenetic tree. For the FP index on rooted binary phylogenetic trees the extinction of one leaf per cherry is both necessary and sufficient (Fischer et al. 2023, Theorem 2). In this section we show that all non-rigid interior diversity indices exhibit the same ranking reversal behaviour as FP. That is, the number of necessary and sufficient extinctions, and their distribution among the leaves, is the same for any non-rigid interior index as it is for FP. Our results are given for rooted phylogenetic trees with unrestricted out-degree.

The separation of diversity indices into the classes of rigid and non-rigid is needed because the patterns of extinctions necessary to generate a diversity index ranking reversal differ in each case. The number of necessary and sufficient extinctions for rigid interior diversity indices is somewhat larger, and includes those extinctions necessary in the non-rigid case. Hence, rigid interior indices are slightly more robust than non-rigid indices, although still susceptible to complete ranking reversals as shown in Sect. 4.

The first result we establish concerns the number and type of extinctions necessary for a non-rigid index ranking reversal. The question of sufficiency will then be addressed, showing that the necessary extinctions suffice. Theorem 1 of Fischer et al. (2023) described the necessary number and type of leaf deletions for the Fair Proportion index on rooted binary phylogenetic trees. In fact, their result that at least one leaf per cherry must be deleted generalises directly to every diversity index. We now present a generalisation of that theorem, covering all phylogenetic diversity indices and also those rooted phylogenetic trees that are not binary. Proposition 3 includes the one-per-cherry extinction events as a special case and is proven in a very similar manner to Theorem 1 of Fischer et al. (2023).

Proposition 3

Let T be a rooted phylogenetic X-tree with \(|X |\ge 3\) and let \(\varphi \) be a diversity index on T. Suppose \(\pi _T\) is a strict and reversible ranking concerning the \(\varphi \) diversity index with respect to \(\tilde{X}\) and induced subtree \(\tilde{T}\). Then \(X^\prime = X \setminus \tilde{X}\) contains at least all but one of the leaves adjacent to each interior vertex of T.

Proof

With a view to contradiction, assume that leaves \(x_i\) and \(x_j\) are adjacent to the interior vertex v and that neither \(x_i\) nor \(x_j\) is in \(X^\prime \). Let \(e_i\) and \(e_j\) be the pendant edges incident with \(x_i\) and \(x_j\) respectively. By the neutrality condition in the definition of a diversity index, \(\varphi _T(x_i) = p + \ell (e_i)\) and \(\varphi _T(x_j) = p + \ell (e_j)\) for some real value p, and \(\varphi _{\tilde{T}}(x_i) = q + \ell (e_i)\) and \(\varphi _{\tilde{T}}(x_i) = q + \ell (e_i)\) for some real value q. Without loss of generality suppose that \(\pi _T(x_i) < \pi _T(x_j)\) and \(\pi _{\tilde{T}}(x_j) < \pi _{\tilde{T}}(x_i)\). But then substituting the values from above gives rise to the contradictory expressions \(\ell (x_i) < \ell (x_j)\) and \(\ell (x_j) < \ell (x_i)\). Hence our initial assumption was false and thus at least one of \(x_i\) and \(x_j\) is in the set \(X^\prime \). As \(x_i\) and \(x_j\) were chosen arbitrarily from the leaves adjacent to v, the result follows. \(\square \)

Fig. 4
figure 4

At least three out of the four leaves in A and one of the two leaves in each of B and C must be deleted in order for a strict reversal of diversity index score values to be possible. One such set of leaves is indicated above by the unfilled vertices

Figure 4 shows a rooted phylogenetic tree on nine leaves, where the leaves are grouped according to their parent vertex. The unfilled leaves represent one set of leaf deletions of the smallest size required for a strict and reversible ranking to be possible. This number of deletions can be a large proportion of the leaves for a nonbinary tree. However, it can be much smaller for some rooted binary phylogenetic trees, such as a single vertex for caterpillar trees.

Our task now is to show that this necessary set of extinctions is also sufficient to cause a non-rigid index ranking reversal. This was proven for the Fair Proportion index, as Theorem 2 in Fischer et al. (2023). The main result of this section is to show that that theorem can be extended beyond the FP index to apply to every non-rigid interior diversity index (Theorem 5). Hence, FP is not uniquely subject to robustness issues in light of ongoing extinctions.

Let \(\varphi \) be an interior non-rigid diversity index. Our inductive proof below relies on building up a collection of edge lengths for a phylogenetic tree T based upon edge lengths that make \(\varphi \) reversible on the maximal pendant subtrees of T. This follows the same general approach of Fischer and colleagues in the proof of their Theorem 2 (Fischer et al. 2023). Both results are proved inductively, building from small trees into larger ones. As the smaller trees are combined into larger ones and eventually into T as a whole, the edge lengths are repeatedly adjusted to have strict and reversible rankings at each step. The main difference here is that our approach abstracts the scores away from any particular diversity index and tree structure that creates them. We only require that all, or all but one, species experience a change in diversity score because of an extinction event. We begin with Lemma 4, a result that allows us to adjust edge lengths from the smaller trees in a useful way throughout the proof of Theorem 5.

Lemma 4

Let \(T=(T_a,T_b,\ldots )\) be a rooted phylogenetic X-tree where \(|X_a |\ge 2\). Let \(\varphi _T~=~\sum _{e \in E(T)} \gamma _T(x,e)\ell (e)\) be an interior diversity index on T that is not rigid. Suppose that \(\varphi \) induces a strict and reversible ordering of taxa in \(\tilde{X}_a \subset X_a\) after the extinction of all but one of the leaves adjacent to each interior vertex of \(T_a\). Let \(\tilde{T}\) be the rooted phylogenetic subtree of T induced by this extinction event. Then there exist edge lengths for T such that \(\varphi _T(\tilde{x}_i) \ne \varphi _{\tilde{T}}(\tilde{x}_j)\) for all \(\tilde{x}_i\) and \(\tilde{x}_j\) in \(\tilde{X}_a\).

Proof

Let \(\tilde{X}_a\) be indexed in such a way to give the \(\varphi \) diversity index score ranking \(\varphi _{T_a}(\tilde{x}_1)< \ldots < \varphi _{T_a}(\tilde{x}_{n_a-c_a})\), where \(c_a\) is the number of leaves deleted under this extinction event. We prove the result by describing how those edge lengths of \(T_a\) that allow the ranking reversal on \(\tilde{X}_a\) can be adjusted to give distinct \(\varphi \) index scores.

First, assume that \(\tilde{x}_i\) is a leaf with \(\varphi _T(\tilde{x}_i) = \varphi _{\tilde{T}}(\tilde{x}_i)\). As \(\varphi \) induced a strict and reversible ranking, \(\tilde{x}_i\) is the only leaf for which this equality holds. Let \(e_i\) be the pendant edge incident to \(\tilde{x}_i\) in T and let \(e_i^+\) be the edge in T whose terminal vertex is the parent of \(\tilde{x}_i\). Suppose that \(e_i^+\) is edge a, incident with the root of T. Since \(\varphi \) is an interior diversity index, \(\gamma _T(\tilde{x}_i,e_i^+)\) and \(\gamma _{\tilde{T}}(\tilde{x}_i,e_i^+)\) are both strictly positive. Moreover, as \(\varphi \) is not rigid we have \(\gamma _T(\tilde{x}_i,e_i^+) \ne \gamma _{\tilde{T}}(\tilde{x}_i,e_i^+)\) and because \(\tilde{x}_i\) is the only descendant of \(e_i\), we have \(\gamma _T(\tilde{x}_i,e_i) = \gamma _{\tilde{T}}(\tilde{x}_i,e_i) = 1\). But then

$$\varphi _T(\tilde{x}_i) = 1\cdot \ell (e_i) +\gamma _T(\tilde{x}_i,e_i^+)\ell (e_i^+) \ne 1\cdot \ell (e_i) + \gamma _{\tilde{T}}(\tilde{x}_i,e_i^+)\ell (e_i^+) = \varphi _{\tilde{T}}(\tilde{x}_i),$$

contradicting our supposition.

So it must be the case that \(e_i^+\) lies in \(T_a\). Now extending \(e_i^+\) will increase \(\varphi _{T_a}(\tilde{x}_i)\) and \(\varphi _{\tilde{T}_a}(\tilde{x}_i)\) by different amounts because \(\varphi \) is not rigid. Specifically, we choose to extend \(e_i^+\) by the lesser of \(\frac{1}{2}\left[ \varphi _{T_a}(\tilde{x}_{i+1}) - \varphi _{T_a}(\tilde{x}_i)\right] \) and \(\frac{1}{2}\left[ \varphi _{\tilde{T}_a}(\tilde{x}_{i-1}) - \varphi _{\tilde{T}_a}(\tilde{x}_i)\right] \) (or the defined value from these two choices if only one exists). This particular choice is small enough to ensure that neither the \(\pi _a\) ranking nor the \(\tilde{\pi }_a\) ranking is altered by the extension. Hence strict reversibility is maintained. Moreover, extending \(e_i^+\) in such a manner ensures that \(\varphi _T(\tilde{x}_i)\) and \(\varphi _{\tilde{T}}(\tilde{x}_i)\) will have distinct values.

Second, assume that \(\tilde{x}_i\) and \(\tilde{x}_j\) are distinct leaves with \(\varphi _T(\tilde{x}_i) = \varphi _{\tilde{T}}(\tilde{x}_j)\). Then we can extend \(e_i\), the pendant edge incident to \(\tilde{x}_i\), by the lesser of \(\frac{1}{2}\left[ \varphi _{T_a}(\tilde{x}_{i+1}) - \varphi _{T_a}(\tilde{x}_i)\right] \) and \(\frac{1}{2}\left[ \varphi _{\tilde{T}_a}(\tilde{x}_{i-1}) - \varphi _{\tilde{T}_a}(\tilde{x}_i)\right] \) (or the defined value from these two choices if only one exists). Extending the pendant edge incident to \(\tilde{x}_i\) increases both \(\varphi _T(\tilde{x}_i)\) and \(\varphi _{\tilde{T}}(\tilde{x}_i)\) by equal amounts but does not alter \(\varphi _{\tilde{T}}(\tilde{x}_j)\). This particular choice is small enough to ensure that neither the \(\pi _a\) ranking nor the \(\tilde{\pi }_a\) ranking is altered by the extension. Hence strict reversibility is maintained. Moreover, extending the pendant edge incident to \(\tilde{x}_i\) in such a manner ensures that \(\varphi _T(\tilde{x}_i)\) and \(\varphi _{\tilde{T}}(\tilde{x}_j)\) will have distinct values. We repeat this step as necessary for every such pair of leaves.

Finally, we choose \(\ell (a)\) to be short enough to ensure that it is less than the smallest difference between any two \(\varphi _{T_a}\) or \(\varphi _{\tilde{T}_a}\) index scores from our modified version of \(T_a\). So even if one leaf were to be allocated the entire length of edge a, the ranking orders \(\pi _a\) and \(\tilde{\pi }_a\) would not change.

Therefore, given a set of edge lengths for T, we can adjust them in the manner described above to ensure \(\varphi _T(\tilde{x}_i) \ne \varphi _{\tilde{T}}(\tilde{x}_j)\) for all \(\tilde{x}_i\) and \(\tilde{x}_j\) in \(\tilde{T}_a\). \(\square \)

We turn now to the first main result of the paper.

Theorem 5

Let \(T = (T_a,T_b,\ldots )\) be a rooted phylogenetic X-tree with \(|X |= n\), where \(n \ge 2\). Let \(\tilde{T}\) be the induced \(\tilde{X}\)-tree that results from deleting all but one of the leaves adjacent to each interior vertex of T. Then, for any non-rigid interior diversity index \(\varphi \), there exists an edge length assignment function \(\ell \) on T such that there is a strict \(\varphi \) ranking \(\pi _T\) for the leaves of T that is reversible with respect to \(\tilde{T}\) and such that \(\tilde{X}\) contains the species that has the highest \(\varphi \) index score in T.

Proof

We first prove the result for a rooted phylogenetic tree where the root vertex has out-degree 2 before describing how this technique can be extended to trees with larger out-degree at the root. We proceed by induction on the number of interior vertices present in a rooted phylogenetic tree. As a base case we consider a rooted phylogenetic ‘star’ tree with exactly one interior vertex, the root, and n edges of different lengths. If every taxon except that incident with the longest edge is deleted, then this remaining taxon is in a strict and reversible ordering (of size one) and the result holds.

Next, for the inductive step assume that the theorem holds for the non-rigid interior diversity index \(\varphi \) on all rooted phylogenetic trees with less than k interior vertices. Let \(T=(T_a,T_b)\) be a rooted phylogenetic tree with k interior vertices and n leaves, as drawn in Fig. 1. Without loss of generality we have \(n_a \ge n_b\). Let \(c_T\) be the number of leaves removed when deleting all but one of the leaves adjacent to each interior vertex of T and let \(c_a\) and \(c_b\) denote the number of these leaves contained in \(T_a\) and \(T_b\), respectively. We show that by using the edge lengths that allowed reversible orderings of \(\varphi \) on \(T_a\) and \(T_b\), we are able to choose edge lengths that make \(\varphi \) reversible on T as a whole.

As \(T_a\) has less than k interior vertices, the extinction of all but one of the leaves adjacent to each interior vertex of T induces a strict and reversible ranking \(\pi _a\) on the leaves of \(T_a\). Let \(\tilde{X} = \{\tilde{x}_1, \ldots , \tilde{x}_{n-c_T}\}\) be the set of leaves that is not deleted, labelled such that \(\{\tilde{x}_1, \ldots , \tilde{x}_{n_a-c_a}\}\) is contained in \(T_a\) and \(\{\tilde{x}_{n_a-c_a+1}, \ldots , \tilde{x}_{n-c_T}\}\) is contained in \(T_b\). Relabelling if necessary, we have

$$\varphi _{T_a}(\tilde{x}_1)< \varphi _{T_a}(\tilde{x}_2)< \ldots < \varphi _{T_a}(\tilde{x}_{n_a - c_a})$$

and

$$\varphi _{\tilde{T}_a}(\tilde{x}_1)> \varphi _{\tilde{T}_a}(\tilde{x}_2)> \ldots > \varphi _{\tilde{T}_a}(\tilde{x}_{n_a - c_a}).$$

As described in Lemma 4, we choose a small value for \(\ell (a)\) that does not alter any of these inequalities. Then we construct a set of combined \(\varphi \) index scores for \(\tilde{X}\) from both T and \(\tilde{T}\), denoted \(\Sigma _a \subset \mathbb {R}^{>0}\). In particular:

$$\Sigma _a = \{\varphi _T(\tilde{x}_i) \vert i = 1, \ldots , n_a - c_a\} \cup \{\varphi _{\tilde{T}}(\tilde{x}_i) \vert i = 1, \ldots , n_a - c_a\}.$$

By Lemma 4, since \(\varphi \) is non-rigid and interior, each element of \(\Sigma _a\) can be made to have a unique value by adjusting edge lengths within \(T_a\). Moreover, this is achievable in a way that the ranking \(\pi _T\) restricted to \(T_a\) is the same as \(\pi _a\). Thus we are able to ensure that \(\Sigma _a\) contains \(2(n_a-c_a)\) distinct positive real numbers without altering the ranking that we began with. See Fig. 5 for an illustration of this construction. Let \(\Sigma _*\) be the subset of \(\Sigma _a\) containing the \(n_a-c_a\) least elements of \(\Sigma _a\) under the usual ordering of real numbers and let \(\Sigma ^*\) be the subset of \(\Sigma _a\) containing the \(n_a-c_a\) greatest elements of \(\Sigma _a\) under the same ordering. Hence, \(\Sigma _a\) is the disjoint union of \(\Sigma _*\) and \(\Sigma ^*\).

Fig. 5
figure 5

Illustration of the first part of the construction described in the proof of Theorem 5. Each horizontal line represents the set of real numbers. Dotted lines indicate the reversal of index scores of leaves in \(T_a\) after the extinction of all but one of the leaves adjacent to each interior vertex of T, as per our induction hypothesis. The scores for both \(T_a\) and \(\tilde{T}_a\) are combined on the bottom number line, where, by Lemma 4, the \(2(n_a-c_a)\) scores may be made distinct without affecting the initial reversal. Finally the edge lengths of subtree \(T_b\) are adjusted and scaled to fit the index scores into the interval of size \(\delta \) between the vertical dashed lines

Next we consider the remaining taxa of T. The subtree \(T_b\) has less than k interior vertices and hence the extinction of all but one of the leaves adjacent to each interior vertex of T induces a strict and reversible ranking \(\pi _b\) on \(\tilde{X}_b\). Similar to \(T_a\), relabelling if necessary, we have

$$\varphi _{T_b}(\tilde{x}_{n_a-c_a + 1})< \varphi _{T_b}(\tilde{x}_{n_a-c_a+2})< \ldots < \varphi _{T_b}(\tilde{x}_{n-c_T})$$

and

$$\varphi _{\tilde{T}_b}(\tilde{x}_{n_a-c_a + 1})> \varphi _{\tilde{T}_b}(\tilde{x}_{n_a-c_a+2})> \ldots > \varphi _{\tilde{T}_b}(\tilde{x}_{n-c_T}).$$

As in the \(T_a\) case, we choose a small enough value for \(\ell (b)\) so that the above rankings are maintained. We construct the set

$$\Sigma _b = \{\varphi _T(\tilde{x}_i) \vert i = n_a-c_a+1, \ldots , n - c_T\} \cup \{\varphi _{\tilde{T}}(\tilde{x}_i) \vert i = n_a-c_a+1, \ldots , n - c_T\},$$

but, in contrast to \(\Sigma _a\), do not require that \(\Sigma _b\) contains \(2(n_b-c_b)\) distinct values. (For instance \(T_b\) may consist of a single vertex whose diversity index score is necessarily fixed as the length of its pendant edge.) However, we do note that, because diversity indices allocate the entirety of each pendant edge length to their incident leaf, every member of \(\Sigma _b\) is strictly positive.

Now let \(\delta = \min \Sigma ^*- \max \Sigma _*\) be the size of the gap between the two halves of \(\Sigma _a\) (see Fig. 5). We uniformly multiply the lengths of edges in \(E(T_b) \cup \{b\}\) by a positive real constant c, chosen so that \(c\left( \max \Sigma _b \right) < \delta \). That is, we scale (down) the entirety of \(T_b\) as well as edge b so that the range of values in the rescaled \(\Sigma _b\) extends less than \(\delta \). (In practice it may be tidier to scale up \(T_a\) and a to achieve the same result.) Call the resulting subtree \(T_b^\prime \). Finally, to each pendant edge in \(T_b^\prime \) add the value of \(\max \Sigma _*\). This shifts the index scores of \(T_b^\prime \) into the gap between \(\Sigma _*\) and \(\Sigma ^*\) without altering the ranking of these leaves.

Let \(\tilde{x}_\alpha \) be the taxon such that \(\varphi _{T_a}(\tilde{x}_\alpha )\) is the largest value in \(\Sigma _*\) among scores from \(T_a\) (as opposed to \(\tilde{T}_a\)). The result of the above construction is that we have

$$\begin{aligned} \varphi _T(\tilde{x}_1)< \ldots< \varphi _T(\tilde{x}_\alpha )&< \varphi _T(\tilde{x}_{n_a-c_a+1})\\ {}&< \ldots< \varphi _T(\tilde{x}_{n-c_T})< \varphi _T(\tilde{x}_{\alpha + 1})< \ldots <\varphi _T(\tilde{x}_{n_a-c_a}) \end{aligned}$$

and

$$\begin{aligned} \varphi _{\tilde{T}}(\tilde{x}_1)> \ldots> \varphi _{\tilde{T}}(\tilde{x}_\alpha )&> \varphi _{\tilde{T}}(\tilde{x}_{n_a-c_a+1})\\ {}&> \ldots> \varphi _{\tilde{T}}(\tilde{x}_{n-c_T})> \varphi _{\tilde{T}}(\tilde{x}_{\alpha + 1})> \ldots > \varphi _{\tilde{T}}(\tilde{x}_{n_a-c_a}). \end{aligned}$$

Therefore the extinction of \(X \setminus \tilde{X}\) has induced a strict and reversible ranking on \(\tilde{X}\) given our chosen edge lengths. Hence the induction is proved and the theorem holds for all rooted phylogenetic trees where the root has out-degree 2.

We now describe how to extend the above idea to phylogenetic trees where the root has greater out-degree. Let \(\tilde{T}\) be the rooted phylogenetic tree induced from T by the extinction of all but one leaf adjacent to each interior vertex. Suppose that \(\tilde{T}\) has maximal pendant subtrees \(\tilde{T}_1\), \(\tilde{T}_2\), ..., \(\tilde{T}_{\tilde{d}}\), where \(\tilde{d}\) is the out-degree of the root in \(\tilde{T}\). Without loss of generality we further suppose that \(\tilde{T}_{\tilde{d}}\) is the maximal pendant subtree with the fewest leaves from this list. Note that, since \(\tilde{T}\) contains at most one leaf adjacent to each interior vertex, at most \(\tilde{T}_{\tilde{d}}\) consists of a single leaf.

We first apply the above interleaving process to the subtrees \(\tilde{T}_1\) and \(\tilde{T}_2\), with one small change. Since \(\tilde{T}_2\) has at least two leaves we use Lemma 4 to ensure that the set \(\Sigma _2\) contains no repeated values. Thus the strict and reversible ranking obtained across leaves from \(\tilde{T}_1\) and \(\tilde{T}_2\) contains some interval \(\delta ^\prime \) between the lower half of diversity index scores and the upper half of these scores. It is into this interval, scaling as before, that we place the scores from \(\Sigma _3\), which itself will contain an inter-score interval \(\delta ^{\prime \prime }\) in which to place scores from \(\Sigma _4\), and so on. We proceed iteratively in this manner until finally we place the scores from \(\Sigma _{\tilde{d}}\) into the last interval. The result is a single completely reversible ranking across all leaves of \(\tilde{T}\), with edge lengths chosen according to the steps described. Therefore the statement of the theorem is proved for all rooted phylogenetic trees. \(\square \)

4 Rigid interior diversity indices

Theorem 5 has shown that the non-rigid interior diversity indices, to the same extent as FP, are not robust to extinctions. We would also like to establish the same understanding of rigid interior diversity indices. That is, which extinctions are required before a complete ranking reversal can occur and, given the freedom to choose positive edge lengths, which extinctions are sufficient? For a ranking derived from a rigid interior diversity index to undergo a strict reversal, the extinction of at least all but one leaf adjacent to each interior vertex is still required, as Proposition 3 includes all diversity indices. However, as an examination of (the quintessential rigid diversity index) ES will show, such an extinction event may not be sufficient to reverse the entire ranking of survivors’ index scores (Proposition 6). We then establish the necessary and sufficient sets of extinctions for this class of diversity index. This involves a new definition, categorising some leaves of a rooted phylogenetic tree as ‘isolated’. We show that the additional extinction of these isolated leaves is sufficient for inducing a reversible index ranking for any rigid diversity index (Theorem 9).

Proposition 6

Let T be a rooted caterpillar tree with at least four leaves. Then the extinction of one leaf from the cherry of T is necessary, but not sufficient, to cause a strict ranking of Equal-Splits index scores to reverse.

Moreover, let \(X = \{x_1,x_2,x_3, \dots , x_n\}\) be the leaves of T, where \(x_n\) is adjacent to the root vertex, and there is a path of length two from the root vertex to \(x_{n-1}\). Then the extinction of any proper subset of \(\{x_1,x_2,x_3, \dots , x_{n-2}\}\) cannot cause a strict ranking of Equal-Splits index scores to reverse.

Proof

Let \(x_1\) and \(x_2\) be the two distinct leaves that form the unique cherry of T. By Theorem 1 in Fischer et al. (2023), it is necessary for at least one of \(x_1, x_2\) to be deleted for the ranking of ES index scores to reverse.

The extinction of, say, \(x_1\) alone will lead to an increase in the ES index score of \(x_2\), but the ES index scores of all other vertices remain the same. If T contains at least four leaves, there are at least two leaves whose index scores are unaffected, and hence their ranking order does not reverse. Thus the extinction of \(x_1\) or \(x_2\) alone is not enough to reverse the ES index scores.

For a rooted caterpillar tree with \(n \ge 4\) leaves, suppose a strict subset of \(\{x_1,x_2,x_3, \dots , x_{n-2}\}\) becomes extinct. Note that the extinction of a leaf \(x_i\) affects the ES index score of precisely those leaves descended from the parent of \(x_i\) and no others. By this reasoning the ES index scores of \(x_{n-1}\) and \(x_n\) are unaffected by such an extinction and the ranking cannot reverse. Therefore the deletion of \(x_{n-1}\) or \(x_n\) is necessary for an ES ranking reversal. \(\square \)

This result confirms our claim above, that some rigid interior indices can be robust to the type of extinction events that cause ranking reversals on non-rigid interior indices. We would like to know how few extinctions beyond those specified in Proposition 3 are necessary to induce a ranking reversal. Let T be a phylogenetic X-tree rooted at \(\rho \) and let \(\tilde{T}\) be the \(\tilde{X}\)-tree induced by the extinction of \(X {\setminus } \tilde{X}\). A fixed leaf for the diversity index \(\varphi \) on T is any leaf x in X where \(\varphi _T(x) = \varphi _{\tilde{T}}(x)\) independent of the edge length assignment. For a strict and reversible \(\varphi \) ranking on \(\tilde{X}\) to exist, there must be at most one fixed leaf for \(\varphi \) on T. The focus on fixed leaves can help us to understand the robustness of various diversity indices. Consider the \(\beta \) diversity index defined in Sect. 2.1. When applied to caterpillar trees with n leaves (see \(Cat_n\) in Fig. 6), we require more than \(n-4\) extinctions to possibly induce a \(\beta \) ranking reversal on surviving leaves. This is because all leaves on \(Cat_n\), except the three leaves furthest from the root, are fixed leaves under \(\beta \) for every extinction event that they survive. Thus it is necessary for at least \(n-4\) of these fixed leaves to go extinct, plus one from the cherry. This makes \(\beta \) more robust on \(Cat_n\) than other indices we examine, however this advantage needs to be weighed against the disadvantages of boundary indices discussed earlier.

Next, let \(x \in X\) be a leaf vertex with parent \(v \in V(T)\) distinct from \(\rho \), where x is not contained in any m-cherry. Then we call x an isolated leaf if there is no vertex u in the path \(P(T;\rho ,v) \setminus \{\rho ,v\}\), such that u itself is the parent of a leaf vertex. For example, in the tree \(Cat_n\) in Fig. 6, leaf \(x_{n-1}\) is an isolated leaf because there is no additional vertex in the path from the root \(\rho \) to w, the parent of \(x_{n-1}\), other than the endpoints. Leaf \(x_n\) is not isolated because its parent vertex is the root. All other leaves of this tree are not isolated because the vertex w appears on all paths from \(\rho \) to the parents of other leaves. Proposition 7 below shows that in addition to all but one leaf adjacent to each interior vertex, we also require all but one isolated leaf to go extinct before a strict and reversible ranking on T can occur for rigid interior diversity indices.

Fig. 6
figure 6

A general caterpillar tree used to show that ES index score rankings can be reversed with as few as two extinctions. Observe that leaf \(x_{n-1}\) is the only isolated leaf in this tree

Proposition 7

Let T be a rooted phylogenetic X-tree with \(|X |\ge 3\) and let \(\varphi \) be a rigid diversity index on T. Suppose \(\pi _T\) is a strict and reversible ranking concerning the \(\varphi \) diversity index for T with respect to \(\tilde{X}\) and induced subtree \(\tilde{T}\). Then \(X^\prime = X \setminus \tilde{X}\) contains at least all but one of the leaves adjacent to each interior vertex of T and at least all but one of the isolated leaves of T. In the case that one of the maximal pendant subtrees of T contains a single leaf, then \(X^\prime \) must contain all of the isolated leaves of T.

Proof

Each isolated leaf may be a fixed leaf if \(\varphi \) is a rigid diversity index. Any leaf whose parent vertex is the root is also a fixed leaf, because its diversity index score is always just the length of the incident pendant edge. To induce a strict and reversible ranking on \(\tilde{X}\) under a diversity index at most one fixed leaf for \(\varphi \) on T can remain. Therefore, the deletion of at least all but one isolated vertex is required and if one of the maximal pendant subtrees of T contains a single leaf then the final isolated leaf must be deleted too. Since the extinction of at least all but one of the leaves adjacent to each interior vertex is needed to reverse any strict diversity index ranking, the result is shown. \(\square \)

Our next results show that the extra extinction of isolated vertices is sufficient to give a ranking reversal for rigid interior diversity indices. That is, those interior diversity indices not covered by Theorem 5 are also not robust given this larger set of extinctions. Lemma 8 describes how we can choose edge lengths for a rooted phylogenetic tree T so that, after the extinction of all isolated leaves and all but one leaf adjacent to each interior vertex of T, we can ensure that there is at most one fixed leaf for any rigid interior diversity index. Theorem 9 then uses this lack of multiple fixed leaves to determine edge lengths that induce a ranking reversal.

Lemma 8

Let T be a rooted phylogenetic X-tree and let \(\varphi _T~=~\sum _{e \in E(T)} \gamma (x,e)\ell (e)\) be an interior diversity index on T. Suppose that \(\tilde{T}\) is the rooted phylogenetic \(\tilde{X}\)-tree induced by the extinction of all isolated leaves and all but one leaf adjacent to each interior vertex of T. Then there exist edge lengths for T such that \(\tilde{X}\) contains at most one fixed leaf for \(\varphi \) on T.

Proof

We prove the result by describing how certain edge lengths can be extended to ensure that at most one fixed leaf for \(\varphi \) exists in T. Let \(\ell \) be an edge length assignment function on E(T). Assume that there exists vertex \(\tilde{x}_i \in \tilde{X}\) such that \(\varphi _T(\tilde{x}_i) = \varphi _{\tilde{T}}(\tilde{x}_i)\) and \(\tilde{x}_i\) is not adjacent to the root vertex. If \(\tilde{x}_i\) is a member of an m-cherry in T, let \(e_i^+\) be the edge whose terminal vertex is the parent of \(\tilde{x}_i\) in T. Then increasing the length of \(e_i^+\) will increase \(\varphi _{\tilde{T}}(\tilde{x}_i)\) by m times the amount that \(\varphi _T(\tilde{x}_i)\) increases. Hence, \(\tilde{x}_i\) is no longer a fixed leaf after extending \(e_i^+\).

If \(\tilde{x}_i\) is not a member of an m-cherry in T, then there exists some isolated leaf \(x_j \in X\) whose parent in T is in the path \(P(T;\rho ,\tilde{x}_i)\). This must be the case or \(\tilde{x}_i\) would itself be an isolated leaf of T, contradicting the fact that all isolated leaves have been deleted. Let \(e_j^+\) be the edge in T whose terminal vertex is the parent of \(x_j\). We set \(\gamma (x_j,e_j^+) = \alpha \), where \(\alpha \) lies in the open interval (0, 1) because \(\varphi \) is not a boundary index. Let \(\gamma (\tilde{x}_i,e_j^+) = p\). Since \(\varphi \) has a consistent form (Proposition 11 of Manson and Steel 2023), we can write \(p=(1-\alpha )q\) for some \(q \in (0,1)\). Increasing the length of \(e_j^+\) by, say, k units increases \(\varphi _T(\tilde{x}_i)\) by pk but increases \(\varphi _{\tilde{T}}(\tilde{x}_i)\) by qk, a different amount since \(\alpha \) is nonzero. Hence \(\tilde{x}_i\) is no longer a fixed leaf after extending \(e_j^+\).

Therefore we can extend particular edges as required to get a set of edge lengths for which \(\varphi _T(\tilde{x}_i) \ne \varphi _{\tilde{T}}(\tilde{x}_i)\) whenever \(\tilde{x}_i\) is not adjacent to the root vertex. \(\square \)

Theorem 9

Let T be a rooted phylogenetic X-tree and let \(\varphi _T=\sum _{e \in E(T)} \gamma (x,e)\ell (e)\) be an interior diversity index on T. Suppose that \(\tilde{T}\) is the rooted phylogenetic \(\tilde{X}\)-tree induced by the extinction of all isolated leaves and all but one leaf adjacent to each interior vertex of T. Then there exists an edge length assignment function \(\ell \) on T such that there is a strict \(\varphi \) ranking \(\pi _T\) for the leaves of T that is reversible with respect to \(\tilde{T}\).

Proof

The proof consists of two parts. We first construct an edge length assignment function \(\ell ^\prime \) for T. Next, we adjust the lengths of pendant edges to construct the edge length assignment function \(\ell \) with the desired reversal property.

Let \(F_{\text {iso}}\) be a set containing precisely those edges whose terminal vertex is the parent of an isolated leaf. Let \(F_{\text {ch}}\) be a set containing precisely those edges whose terminal vertex is the parent of leaves in an m-cherry, except any edge \(e=(u,v)\) for which some edge in \(F_{\text {iso}}\) lies in the path \(P(T;\rho ,u)\). We now combine these two sets, writing \(F_{\text {iso}} \cup F_{\text {ch}} = F = \{f_1,f_2,\ldots ,f_t\}\).

Let \(\ell ^\prime : E(T) \rightarrow \mathbb {R}^{>0}\) be an edge length assignment function on T given by \(\ell ^\prime (f_k) = M + k\varepsilon \) for each \(f_k \in F\) and \(\ell ^\prime (e) = \varepsilon \) for each \(e \in E(T) {\setminus } F\), where \(0 < \varepsilon \ll {M}\). We ensure that the value of M is chosen to be large enough that there are no fixed leaves in \(\tilde{X}\) except possibly one fixed leaf whose parent is the root vertex. It is possible to choose M with this property by Lemma 8.

Next, we define a number of values based on the \(\ell ^\prime \) edge lengths that will help define the function \(\ell \). For each surviving leaf we calculate the difference between their \(\varphi \) index scores, under \(\ell ^\prime \), before and after the extinction event. That is, for each \(\tilde{x} \in \tilde{X}\) we calculate \(\Delta _{\ell ^\prime }(\tilde{x}) = \varphi _{\tilde{T},\ell ^\prime }(\tilde{x}) - \varphi _{T,\ell ^\prime }(\tilde{x})\). By our choice of M, at most one \(\Delta _{\ell ^\prime }\) value is zero and including the \(k\varepsilon \) terms in the definition of \(\ell ^\prime \) ensures each \(\Delta _{\ell ^\prime }\) value is distinct. Hence we can label the leaves of \(\tilde{X}\) as \(\tilde{x}_1,\tilde{x}_2,\ldots ,\tilde{x}_m\) in such a way that \(\Delta _{\ell ^\prime }(\tilde{x_1})> \Delta _{\ell ^\prime }(\tilde{x_2})> \ldots > \Delta _{\ell ^\prime }(\tilde{x_m})\). Let \(c_i = \frac{1}{2}\left( \varphi _{\tilde{T},\ell ^\prime }(\tilde{x}) + \varphi _{T,\ell ^\prime }(\tilde{x})\right) \) and let \(c = \max \{c_i: 1 \le i \le m\}\). Finally, let \(e_i\) be the pendant edge incident with \(\tilde{x}_i\) and \(P = \{e_i: 1 \le i \le m\}\) be the set of these pendant edges.

Now we define the edge length assignment function \(\ell : E(T) \rightarrow \mathbb {R}^{>0}\), given by \(\ell (e) = \ell ^\prime (e)\) for all \(e \in E(T) {\setminus } P\) and \(\ell (e_i) = c - c_i + \varepsilon \) for all \(e_i \in P\). Using the edge lengths given by \(\ell \), we see that for each distinct \(x_i\) and \(x_j\) with \(i < j\) the following two inequalities hold:

$$\begin{aligned} \varphi _{T,\ell }(\tilde{x}_i)&= [\varphi _{T,\ell ^\prime }(\tilde{x}_i) -\varepsilon ] + c - c_i + \varepsilon \\&= c + \varphi _{T,\ell ^\prime }(\tilde{x}_i) - \frac{1}{2}\varphi _{T,\ell ^\prime }(\tilde{x}_i) - \frac{1}{2}\varphi _{\tilde{T},\ell ^\prime }(\tilde{x}_i)\\&= c - \frac{1}{2}\Delta _{\ell ^\prime }(\tilde{x}_i)\\&< c - \frac{1}{2}\Delta _{\ell ^\prime }(\tilde{x}_j) = \varphi _{T,\ell }(\tilde{x}_j)\\ ~&~\\ \varphi _{\tilde{T},\ell }(\tilde{x}_i)&= [\varphi _{T,\ell ^\prime }(\tilde{x}_i) -\varepsilon ] + \Delta _{\ell ^\prime }(\tilde{x}_i) + c - c_i + \varepsilon \\&= c + \frac{1}{2}\Delta _{\ell ^\prime }(\tilde{x}_i)\\&> c + \frac{1}{2}\Delta _{\ell ^\prime }(\tilde{x}_j) = \varphi _{\tilde{T},\ell }(\tilde{x}_j) \end{aligned}$$

Therefore \(\varphi _{T,\ell }(\tilde{x}_1)< \varphi _{T,\ell }(\tilde{x}_2)< \ldots < \varphi _{T,\ell }(\tilde{x}_m)\) before this extinction event and \(\varphi _{\tilde{T},\ell }(\tilde{x}_1)> \varphi _{\tilde{T},\ell }(\tilde{x}_2)> \ldots > \varphi _{\tilde{T},\ell }(\tilde{x}_m)\) afterwards. That is, there is a strict ranking for the leaves of T that is reversible with respect to \(\tilde{T}\). \(\square \)

We now give a concrete example of the construction used in the proof of Theorem 9. Consider the tree T in Fig. 3 under the Equal-Splits diversity index. The set F in this case consists of three edges: \(f_1 = (p,q)\), \(f_2 = (r,t)\) and \(f_3 = (r,u)\). Suppose that leaves \(x_2\), \(x_5\), \(x_7\), \(x_8\) and \(x_{10}\) are deleted, inducing the tree \(\tilde{T}\) that connects the six remaining leaves. Table 2 gives values for Equal-Splits on both T and \(\tilde{T}\) for the surviving species, using an edge length assignment \(\ell ^\prime \) that uses a value of \(M = 32\). This table also shows values for \(\Delta _{\ell ^\prime }(\tilde{x}_i)\) and \(c_i\) for each leaf \(\tilde{x}_i\). The value of c is indicated in bold. The next columns give the lengths of the pendant edges under the assignment function \(\ell \) and values for Equal-Splits on both T and \(\tilde{T}\) using an edge length assignment \(\ell \). We have chosen to simplify the table by not including the terms of order \(O(\varepsilon )\) where they do not impact the rankings we construct. Note that the ordering of the values is opposite in the final two columns, showing that this selection of edge lengths has indeed induced a reversible ranking of ES scores. Figure 7 shows the tree T with the edge lengths given by \(\ell \) labelled.

Table 2 Values used to establish edge lengths for the tree T in Fig. 7 that lead to a strict and reversible ranking of Equal-Splits index scores
Fig. 7
figure 7

Rooted phylogenetic tree T labelled with edge lengths. For these edge lengths, the extinction of leaves \(x_2\), \(x_5\), \(x_7\), \(x_8\) and \(x_{10}\) leads to a reversal of Equal-Splits rankings for the surviving species

5 Robustness of diversity indices under the ultrametric constraint

We now consider the rankings of diversity index scores on phylogenetic trees whose edge lengths obey the ultrametric constraint. Fischer and colleagues noted that for a rooted ultrametric caterpillar tree no set of extinctions is able to change the ranking order of the remaining leaves (Fischer et al. 2023, Proposition 1). However, we shall show here that this robustness does not extend to all ultrametric rooted phylogenetic trees, by giving examples of complete ranking reversals for FP and ES. To understand how these example trees were found, we first outline some necessary conditions for the reversibility of FP index scores in the ultrametric context. Note that, to simplify the presentation in this section, we restrict our attention to rooted binary phylogenetic trees.

For every diversity index \(\varphi \) the ranking \(\pi (X,\varphi _T)\) on an ultrametric rooted binary phylogenetic X-tree T cannot be strict, as both leaves from any ultrametric cherry must have equal index scores. With this in mind, we impose some further conditions that are not explicitly addressed by the earlier definition of a reversible diversity index. For \(x_i, x_j \in X\), whenever \(\pi _T(x_i) < \pi _T(x_j)\) we require \(\pi _{\tilde{T}}(x_i) > \pi _{\tilde{T}}(x_j)\) for \(\pi _{\tilde{T}}\) to be considered to be in the opposite order to \(\pi _T\). That is, it is not enough for \(\pi _{\tilde{T}}(x_i) = \pi _{\tilde{T}}(x_j)\). Moreover, to eliminate trivial reversals, for a ranking \(\pi _{\tilde{T}}\) to be considered to be in the opposite order to \(\pi _T\), we require \(\pi _T(x_i) < \pi _T(x_j)\) and \(\pi _{\tilde{T}}(x_i) > \pi _{\tilde{T}}(x_j)\) for at least one pair of distinct leaves \(x_i,x_j\). The necessary conditions for a Fair Proportion ranking reversal under the ultrametric constraint are given in Proposition 10 and Corollary 11. It can be easily seen that caterpillar trees do not meet the criterion below. However many other ultrametric rooted phylogenetic trees do, as shown in Sect. 5.1.

Proposition 10

Let T be a rooted binary phylogenetic X-tree whose edge lengths satisfy the ultrametric condition and let \(\pi _T\) be the ranking of FP index scores for T. Suppose \(x \in X\) is a leaf vertex contained in no cherry of T. Let v be the parent vertex of x and write \(c_T(v)\) as the disjoint union \(\{x\} \cup A\), where A contains at least two distinct leaves of T.

Then there exist ultrametric edge lengths and a set of extinctions for which the ranking \(\pi _T\) is reversible only if, whenever x survives, the entire set A has become extinct.

Proof

Let \(x \in X\) be a leaf of T that is not contained in any cherry, and let v be the parent vertex of x. As x is not in a cherry, \(c_T(v)\) must contain at least two distinct leaf vertices different from x, say y and z.

Then \(FP_T(x)\) must be larger than \(FP_T(y)\). To see this, first note that both x and y are allocated the same proportion of edge lengths along the path from the root vertex to v. As T satisfies the ultrametric condition, the length of edge (vx) is the same as the length of the path between v and y. Yet while leaf x is allocated the entire length of edge (vx), leaf y is only allocated part of the total length of the edges along the path from v to y. This is because y must share some of this total with z (from along the edges connecting v to the common parent of y and z).

Next, suppose a set of extinctions occurs that both x and y survive and that \(\tilde{T}\) is the resulting phylogenetic tree. Then \(FP_{\tilde{T}}(y)\) can be no larger than \(FP_{\tilde{T}}(x)\), with equality holding if and only if x and y form a cherry in \(\tilde{T}\). Hence x and y are not ranked in the opposite order in \(\pi _{\tilde{T}}\) compared to \(\pi _T\). Hence the ranking \(\pi _T\) is not reversible if y survives alongside x. Repeating this observation for all vertices in \(c_T(v)\) (other than x) gives the stated result. \(\square \)

Corollary 11

Let T be a rooted binary phylogenetic X-tree whose edge lengths satisfy the ultrametric condition and let \(\pi _T\) be the ranking of FP index scores for T. There exist ultrametric edge lengths and a one-per-cherry extinction event for which the ranking \(\pi _T\) is reversible only if every leaf in X is a member of some cherry.

Proof

Let \(x \in X\) be a leaf of T that is not contained in any cherry, and let v be the parent vertex of x. As x is not in a cherry, \(c_T(v)\) must contain at least two distinct leaf vertices different from x.

Choose two vertices from \(c_T(v)\) that form a cherry, say y and z. Suppose that, say, y survives a one-per-cherry extinction event. Since x is not contained in any cherry of T, then x must also survive the extinction event. Therefore, by Proposition 10, the ranking \(\pi _T\) is not reversible. \(\square \)

5.1 Examples of ultrametric phylogenetic X-trees with reversible rankings

We present a family of ultrametric rooted binary phylogenetic trees with reversible rankings under both the Fair Proportion and Equal-Splits indices. Moreover we give specific edge lengths to illustrate such reversals for both indices after the extinction of one leaf per cherry. The family is illustrated by a representative tree U in Fig. 8a, with twelve leaves: \(x_1,\ldots ,x_{12}\). A similar tree can be constructed for any even number of leaves n, where the sum of the edge lengths on the unique path from the root vertex to a leaf is \(n-1\) and the pendant edges right-to-left follow the pattern of increasing positive integer values as shown. A second reversible ultrametric family, consisting of balanced phylogenetic trees, is presented in supplementary material, as well as some further individual examples.

Let U be the rooted phylogenetic tree in Fig. 8a with ultrametric edge lengths as marked. The tree \(\tilde{U}\) in Fig. 8b is obtained from U after the extinction of every species with an even subscript. The Fair Proportion index scores for each surviving leaf are given in Table 3. Let \(\pi _U\) and \(\pi _{\tilde{U}}\) be rankings of FP index scores for the trees U and \(\tilde{U}\) respectively. Observe that \(\pi _{\tilde{U}}\) ranks the species in the opposite order to \(\pi _U\), hence \(\pi _U\) is reversible. (The scores of leaves \(x_1\) and \(x_3\) remain equal before and after the extinctions.) Table 3 also shows that the same behaviour occurs under the Equal-Splits index on U. This example answers the question posed in Fischer et al. (2023) by showing that the ‘worst-case’ scenario is possible under the ultrametric constraint.

Fig. 8
figure 8

An ultrametric rooted phylogenetic tree U before and after a set of extinctions that reverses both the FP and ES index score rankings for surviving species. See Table 3 for details

Table 3 Fair Proportion and Equal-Splits index values for U and \(\tilde{U}\), rounded to three decimal places

6 Concluding remarks

Diversity indices offer us the ability to take a rooted phylogenetic tree that describes the evolution of a set of species and quantify the evolutionary history of species individually. Helpfully, the ranking of these diversity index scores can indicate priorities for conservation. However, the usefulness of these rankings is diminished somewhat if they are inconsistent over time. Were the changes to diversity rankings small in nature, they might be easily ignored. But the potential for the complete reversal of these rankings could cause uncertainty over the usefulness of diversity indices in general. Fischer et al. (2023) do not conclude that reversibility was grounds for disregarding the use of the Fair Proportion index entirely, but that it was an effect that needed to be kept in mind. They suggest that taking various extinction events into account could be an important consideration before applying diversity indices. Based on Theorems 5 and 9 here, we suggest that the lack of robustness is an unavoidable part of using phylogenetic diversity indices, apart from possibly some boundary indices that seem quite unrealistic. These results indicate that the lack of a diversity index that is both robust and biologically reasonable is because such an index does not exist.

As such, reversibility is less a property to be held against FP or ES in preference to other diversity indices and rather more an aspect of measuring at a species-by-species level starting from a phylogenetic tree. We should not be surprised that a species’ contribution to the phylogenetic diversity of a larger set changes (usually increasing) given the demise of close relatives. Thus a diversity index that sensibly measures this contribution cannot be expected to fix its scores in the face of ongoing extinction events. Combinations of extinctions will likely lead to many score changes (of differing magnitudes) that could upset the initial ranking. In the extreme, we have seen complete reversals of rankings are possible.

The examples given in this paper have shown that ranking reversals may be induced not only given quite varied edge lengths but also on rooted phylogenetic trees under the ultrametric constraint. It may be useful for further investigation to determine the minimum number of extinctions required to cause an FP ranking reversal on ultrametric trees. This number will likely depend on the total number of leaves in a tree, as well as the number of leaves allowed to have the same FP index score despite not sharing a cherry. An understanding of these minimal extinction events could help to determine whether the converse of Proposition 10 holds and the extent to which real, time-based phylogenies may be susceptible to reversals.

Our presentation has largely been concerned with the theoretical worst-case scenarios. For this reason, the examples provided may not seem particularly realistic or relevant, especially given the large number of simultaneous extinctions required for the effects shown. The one-per-cherry extinction for binary non-rigid indices, the further isolated leaf extinctions for rigid indices and the even more extinctions required on nonbinary trees all push the limits of plausibility. Furthermore, Proposition 10 demands an even larger number of extinctions before an ultrametric ranking reversal can occur, namely at least half of all species. This is likely an unrealistic number of extinctions to occur together, but the point is that the extreme scenario is theoretically possible. We hope that the negative effects of ranking disruption in real scenarios are brought into focus by the stronger results shown in theory. Indeed, while simultaneous extinctions may seem unlikely, the timescale of progressive extinctions is still possibly shorter than that on which the related conservation efforts are able to adapt and reprioritise. Short of a complete reversal are many types of ranking alterations that would be quite unsettling to a co-ordinated programme of conservation. In addition, Fischer and colleagues investigated the effect of one-per-cherry extinctions on 575 real phylogenies under Fair Proportion and found dramatic re-orderings of FP index scores were indeed possible with real data (Fischer et al. 2023).

Finally, phylogenetic diversity indices, as have been used until now and as were described in Manson and Steel (2023), have been based on the assumption that, while edge lengths are used to calculate particular score values, the method of calculation should be independent of the lengths themselves. This has not been adequately justified apart from on the grounds of mathematical simplicity and the fact that the two diversity indices used in practice (FP and ES) both have this property. The widespread nature of diversity index ranking reversals suggests there may be value in using a measure that takes into account edge lengths more directly, if doing so could minimise ranking disruptions.