Paper The following article is Open access

Catalysis in action via elementary thermal operations

and

Published 18 March 2024 © 2024 The Author(s). Published by IOP Publishing Ltd on behalf of the Institute of Physics and Deutsche Physikalische Gesellschaft
, , Citation Jeongrak Son and Nelly H Y Ng 2024 New J. Phys. 26 033029 DOI 10.1088/1367-2630/ad2413

1367-2630/26/3/033029

Abstract

We investigate catalysis in the framework of elementary thermal operations (ETOs), leveraging the distinct features of such operations to illuminate catalytic dynamics. As groundwork, we establish new technical tools that enhance the computability of state transition rules for ETOs. Specifically, we provide a complete characterisation of state transitions for a qutrit system and special classes of initial states of arbitrary dimension. By employing these tools in conjunction with numerical methods, we find that by adopting a small catalyst, including just a qubit catalyst, one can significantly enlarge the set of state transitions for a qutrit system. This advancement notably narrows the gap of reachable states between ETOs and generic thermal operations. Furthermore, we decompose catalytic transitions into time-resolved evolution, which critically enables the tracking of nonequilibrium free energy exchanges between the system and bath. Our results provide evidence for the existence of simple and practicable catalytic advantage in thermodynamics while offering insight into analysing the mechanism of catalytic processes.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 license. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Catalysts are auxiliary states that enter a process in a way such that they are recovered at the end. They are particularly useful when one is restricted to a limited set of operations (motivated by fundamental principles or practical considerations). The general understanding is that the participation of such states helps in mitigating dynamical constraints, by expanding the working Hilbert space, without consuming resources in the catalyst. Since catalysts ideally suffer no deterioration, they can be used repeatedly to activate state transitions.

Since the discovery of quantum catalysis, first reported in entanglement theory [16], it has been extended to various quantum resource theories [7, 8], such as coherence [912], thermodynamics [1319], randomness [2024], and quantum teleportation [25]. Previous studies on catalysis focus predominantly on the initial and final states of state transformation, either via the mathematical structure of trumping [3, 5], majorisation [4], or resource monotones [14, 2628]. These approaches are powerful because they guarantee the existence of a catalytic operation given initial and target states. Nevertheless, most of them do not provide insight into the required catalyst state, or the process necessary for achieving the transformation. This knowledge gap poses an obstacle towards practical demonstrations of catalytic processes. A handful of recent advancements have been made to address these concerns. For example, a significant finding is that given any catalytically possible state transition, nearly any quantum state can serve as a catalyst, as long as many copies are used [29]. This insight comes from the reversible convertibility in the i.i.d. limit, combined with the fact that catalyst states are not altered by the activation process. While this result offers incredible insight towards the catalytic power of i.i.d. states, the construction in [29] always prescribes the usage of a very high-dimensional catalyst, which is excessive for processes that might require only a small catalyst and simple operations 1 . These factors reduce the implementability of catalytic processes and highlight the need for a complementary study that emphasises the extent to which small catalysts and straightforward operations can be effective.

Another challenge towards practical demonstrations of catalytic thermal processes comes from the genericity of thermal operations (TOs) [31] in the resource-theoretic setting [14, 3235]. TOs potentially involve intricate control over the global system-bath composite, in particular its joint energy eigenstates. Interestingly, crude operations [36] acting on a system plus a qubit Gibbs ancilla were proven to be sufficient to generate all TO state transitions to incoherent final states. Achieving target states with higher resourcefulness, however, can still be demanding, e.g. requiring the extreme lowering/raising of system-bath composite energy level, or arbitrary energy preserving unitaries in the system subspace. Therefore, the full employment of resource theoretic models in real world remains challenging [37].

Elementary TOs (ETO) [38] alleviate the above issues by considering a subset of TO that can be decomposed into series of two-level swaps. This decomposition offers a natural way to prescribe a process to the experimenter, analogous to how a complex n-qubit computation is decomposed into a small gate set 2 . Moreover, physical models, such as the collision model [39] or the Jaynes–Cummings (JC) model [40] can emulate the two-level swap. The intensity-dependent JC model [41] can furthermore generate all energy-preserving two-level swaps, making the setup more realistic. Another advantage of ETO is that it opens up the opportunity to analyse intermediate states, which are found by partially applying the swap sequence—providing a time-resolved description of the system dynamics, rarely possible in resource-theoretic thermodynamics. Nonetheless, ETO has its limitations: the only existing method to decide the feasibility of state transitions is to iteratively find all extreme points of the set of reachable states. The number of iterations grows extremely quickly with the system dimension.

Our aim is to leverage the operational simplicity of ETO to find examples of easy-to-realise catalytic evolutions. This goal is nontrivial since the conditions for ETO transitions, even without catalysts, are not well-characterised. Similar to the case of the resource theory of magic [42], our result offers a straightforward recipe for implementing catalytic protocols. This strengthens the connection between catalytic studies in quantum thermodynamics and physically motivated, relatively practical thermodynamic operations.

In section 2, we provide an overview of TO and ETO to set the foundation for our study. We expound on the concept of tight-majorisation and neighbouring β-swaps, elucidating them with figures 3 and 4. Additionally, we develop a fundamental technical tool, i.e. lemma 3 which plays a pivotal role in the development of our primary findings. In section 3, we present our initial examples of catalytic advantages in ETO and display the complete set of reachable states facilitated by any qubit catalyst. The catalytic regime that we study possesses several desirable properties: it can be entirely achieved through two-level swaps, both system and catalyst are small in dimension, and the catalyst is fully recovered without any errors or remnant correlations with the system.

Section 4 extends our investigation to a higher-dimensional regime, pushing the total system dimension from six, as explored in section 3, up to ninety. In general, the case of d = 9 already exceeds our computational capability. Nevertheless, by imposing restrictions on our initial states, we develop a significantly more efficient algorithm to determine the feasibility of (catalytic) ETO transitions. We focus our attention on the problem of cooling one thermal state to another under catalytic ETO and compare its performance to non-catalytic ETO and TO. Our results underscore that even relatively small catalysts suffice to overcome the TO limit.

In section 5, we present the analytical results that form the basis for our numerical methods in sections 3 and 4. This includes, for example, a full characterisation of the set of states reachable by ETO for systems of dimension d = 3 (theorem 5), as well as a universal upper bound, tighter than previous bounds (theorem 9). Notably, we identify a class of initial states where characterising the set of ETO-reachable states is computationally as efficient as checking thermomajorisation relations (theorem 8). This result essentially resolves the characterisation challenge of ETO for specific sets of initial states, opening the door for systematic investigations into operationally important tasks, such as the cooling scenario in section 4.

2. Background

In this section we define the terminology and notation used throughout the paper and provide an overview of the state-of-the-art knowledge and techniques applied to develop our results. Our approach in this work is grounded in the resource-theoretic framework of quantum thermodynamics, which explores the fundamental constraints on thermodynamic state transitions concerning a set of free operations and free states.

2.1. Thermal operations

The most well-known and studied version of the thermodynamic resource theory is the paradigm of TO. These are quantum channels that can be written in the form

Equation (1)

where

  • (i)  
    ${\tau}^\beta_B = \frac{\text{exp}\left[-\beta H_B\right]}{Z_B}$ is the Gibbs state at temperature $1/\beta$ given an arbitrary choice of a bath Hamiltonian HB ,
  • (ii)  
    $Z_B = \textrm{Tr}\text{exp}\left[-\beta H_B\right]$ is the partition function of the bath,
  • (iii)  
    USB is a global energy-preserving unitary, and therefore satisfies $[H_S+H_B,U_{SB}] = 0$, and
  • (iv)  
    Bʹ is a subsystem of choice within SB which is discarded at the end of the process 3 .

Several remarks can be made to compare the framework of TO with complementary approaches to quantum thermodynamics. First of all, in contrast to models that adapt a more dynamical viewpoint [43], i.e. open quantum systems, TOs model a closed system. In particular, TOs adopt the approach of explicitly modeling all systems in which energy flow occurs. For example, work extraction or injection is described via TO by including a battery system [33, 44, 45], to accurately account for work. Temperature gradients, on the other hand, can be modelled by explicitly adding a Gibbs state with $\beta^{^{\prime}} \neq \beta$ as part of the system [44, 46], or by applying TOs with different ambient temperatures sequentially. It is also possible to incorporate time-dependent Hamiltonians in this framework, by considering an enlarged picture with a controller system [47]. As a result, as long as the dynamics of interest satisfies energy preservation across the explicitly modelled systems, e.g. chiral molecules [4850], collision models [51], JC models [52], they are encompassed within the TO framework. On the other hand, the resource theoretic approach has also proven fruitful in rigorously identifying finite-size effects and non-negligible fluctuations in quantum thermodynamics [14, 5360]. In relation to fluctuation theorems [6163], resource theoretic frameworks were shown to not only provide similar predictions [64], but also refinements from an information-theoretic perspective [65].

The complete state transition conditions of determining whether an initial ρS can be transformed into some $ \rho_S^{^{\prime}} $ via TOs have remained a long-standing open problem, in particular when both initial and final states are energy-coherent. However, the situation simplifies in the quasi-classical case, where either ρS or $ \rho_S^{^{\prime}} $ is energy incoherent. Given a system S, let us describe any given initial state $ \rho_S = \sum_{ij} a_{ij} \vert E_i \rangle \langle E_j \rvert $, with respect to the ascending-ordered energy eigenbasis, namely $ E_i \unicode{x2A7D} E_j $ for all $ i \unicode{x2A7D} j $. In the quasi-classical regime, the state is fully characterised by the population vector for each energy eigenbasis, which we denote by $ \mathrm{p} = \lbrace a_{ii} \rbrace_i $. Henceforth, we refer to the population vectors as states living in some probability simplex $ \mathcal{V}^d $. The set of reachable final states from $ \mathrm{p} $ is then denoted as $ \mathcal{S}_\mathrm{TO}(\mathrm{p}) $. If $ \mathrm{q} \in \mathcal{S}_\mathrm{TO}(\mathrm{p}) $, then $ \rho_S~ \xrightarrow{\textrm{TO}}~\rho_S^{^{\prime}} = \sum_{i} q_i \vert E_i \rangle \langle E_i \rvert $, that is, $ \rho_S^{^{\prime}} $ is achievable from ρS via TOs.

For quasi-classical states, the possibility of state transition is fully characterised by thermomajorisation relations [33]. It is furthermore known that the action of TO on such quasi-classical states can equivalently be described by Gibbs preserving matrices G acting on the population vectors [31, 33]. That is, there exists G that preserves the Gibbs state population vector ${\tau}^\beta = \{e^{-\beta E_i}/Z_S\}_i$, i.e. $G{\tau}^\beta = {\tau}^\beta$, and $\mathrm{q} = G\mathrm{p}$.

Given two states $ \rho,\rho^{^{\prime}} $ represented by their population vectors $\mathrm{p}, \mathrm{q}$, and the system Hamiltonian (which in turn determines τβ ), thermomajorisation relation, denoted by $ \succ_\beta $, is a pre-order between states, defined according to their respective thermomajorisation curves $ \mathcal{L}_\mathrm{p},\mathcal{L}_\mathrm{q} $ (definition 2.2). We say that $\mathrm{p}\succ_\beta \mathrm{q}$ if $\mathcal{L}_\mathrm{p}(a)\unicode{x2A7E}\mathcal{L}_\mathrm{q}(a)$ for all $ a\in [0,1] $.

Definition 2.1 (β-order). Given $\mathrm{p}\in\mathcal{V}^d$ and a Gibbs state ${\tau}^\beta\in\mathcal{V}^d$, let us denote the element-wise ratio of the two vectors as

Equation (2)

Then the β-order ${\pi}(\mathrm{p},{\tau}^\beta) $ is a particular ordering of the energy eigenbasis labels $ (1,\ldots,d) $, such that the ratios according to this ordering is non-increasing, i.e.

Equation (3)

with $ \pi_k = {\pi}(\mathrm{p})_k$, where we omit τβ in the argument when it is obvious from context.

Definition 2.2 (thermomajorisation curve). For a state $\mathrm{p}\in\mathcal{V}^d$, the thermomajorisation curve $\mathcal{L}_\mathrm{p}:[ 0,1]\rightarrow [ 0,1]$ is a piecewise-linear function that interpolates between the coordinates $\{(0,0)\}$ and elbow points $\{(\sum_{k = 1}^{l}{\tau}^\beta_{\pi_k}, \sum_{k = 1}^{l}p_{\pi_k})\}_{l = 1}^{d}$.

Earlier works have shown that the set $ \mathcal{S}_{\mathrm{TO}}(\mathrm{p}) $ characterised by thermomajorisation can be constructed quite efficiently. This is summarised in the theorem below.

Theorem 1 ([38], lemma 12 and [66], theorem 2). For any given $ \mathrm{p} $, the set of reachable states $\mathcal{S}_{\mathrm{TO}}(\mathrm{p})$ is a convex combination of $ d! $ unique extreme points that correspond to distinct β-orders and are tightly thermomajorised by $ \mathrm{p} $.

An example of tight thermomajorisation

Given a qubit Hamiltonian ${\textrm{H}} = (E_1,E_2)$ with its corresponding Gibbs state ${\tau}^\beta = ({\tau}^\beta_1 , {\tau}^\beta_2)$ of some fixed temperature $1/\beta$, consider the pure ground state $ \mathrm{p} = (1,0) $, which has a β-order ${\pi}(\mathrm{p}) = (1,2)$ and a simple thermomajorisation curve

Equation (4)

A state $\mathrm{q} = (q_1,q_2)$ has ${\pi}(\mathrm{q}) = (2,1)$ if $q_2\gt{\tau}^\beta_2$, which leads to a thermomajorisation curve

Equation (5)

Then $\mathrm{p}\succ_\beta\mathrm{q}$ if $q_2\unicode{x2A7D} \text{exp}[\beta(E_1-E_2)]: = \Delta_{12}$. Furthermore, when $r_2 = \Delta_{12}$, $\mathrm{r} = (r_1,r_2)$ is tightly thermomajorised by $\mathrm{p}$, i.e. the elbows of $\mathcal{L}_\mathrm{r}$ coincide with the curve $\mathcal{L}_\mathrm{q}$. See figure 1 for an example of tight-thermomazorisation.

Figure 1.

Figure 1. Thermomajorisation curves of $ \mathrm{p},\mathrm{r},\mathrm{q} $, and τβ , where $ \mathrm{p}\succ_{\beta}\mathrm{r}\succ_{\beta}\mathrm{q}\succ_{\beta}{\tau}^\beta $. In particular, $ \mathrm{r} $ is tightly thermomajorised by $ \mathrm{p} $, implying that $ \mathrm{r} $ thermomajorises any $ \mathrm{q}\in\mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ with the same order. Here we plotted one such state $ \mathrm{q} = (\mathrm{r}+{\tau}^\beta)/2 $. Temperature and energy levels are set to be E1 = 0, $ \beta E_2 = 0.2$.

Standard image High-resolution image

With theorem 1, determining $\mathcal{S}_{\mathrm{TO}}(\mathrm{p})$ is computationally inexpensive. This theorem follows from the fact that if a state is tightly-majorised by $ \mathrm{p} $, then it thermomajorises any other state in $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ that has the same β-order ${\pi}^{(m)}$. The importance of tightly-majorised states can also be seen from another perspective:

Lemma 2 (theorem 12 of [36]). If two states $\mathrm{p}$ and $\mathrm{q}$ have the same β-order ${\pi}(\mathrm{p}) = {\pi}(\mathrm{q})$ and $\mathrm{p}\succ_{\beta}\mathrm{q}$, then $\mathrm{q}$ can be obtained from $\mathrm{p}$ by a sequence of two-level partial level thermalisations.

Partial level thermalisations [36] between two levels can be conceptualised as a process where these two levels are coupled to a Markovian heat bath, bringing their population ratio closer to that of a thermal state; see figure 2. Nevertheless, partial level thermalisations preserve the β-order of the state.

Figure 2.

Figure 2. Lemma 2 can be intuitively understood as follows: partial level thermalisation between levels ${\pi}(\mathrm{p})_{i}$ and ${\pi}(\mathrm{p})_{i+1}$ does not alter any elbow points of $\mathcal{L}_{\mathrm{p}}$ except for the ith one. The ith elbow point can move downward in the y-direction as long as the entire curve remains concave, while the x-coordinate is fixed. Panels (a)–(d) illustrate thermomajorisation curves of such a transformation in four steps. At each step, two levels of $\mathrm{p}$ (marked by pink shades) undergo partial thermalisation.

Standard image High-resolution image

Another important aspect of state transition conditions comes from functions that behave monotonically whenever TOs are applied. The most commonly known set of such monotones is dubbed generalised free energies [14]

Equation (6)

for all $\alpha\in(-\infty,\infty)$. The Rényi divergence for a quasi-classical state 4 $\mathrm{p}\in\mathcal{V}^d$ reduces to $ D_\alpha (\mathrm{p} \Vert {\tau}^\beta) = \frac{{\textrm{sgn}}(\alpha)}{\alpha-1}\text{log}\sum_{i}p_i^\alpha ({\tau}^\beta_i)^{1-\alpha}$. When $\alpha \rightarrow 1$, Fα reduces to the nonequilibrium free energy

Equation (7)

where $\langle\cdot\rangle_\rho$ denotes average with respect to the state ρ, and $S(\rho)$ is the von Neumann entropy—a recovery of the second law of thermodynamics.

Monotones only provide necessary conditions for state transitions via TO and its subsets. Yet, the set of monotones in equation (6) becomes sufficient in the quasi-classical regime [14], when catalysts are involved. In other words, it is straightforward to determine the existence of a catalytic TO (CTO) that achieves $ \rho\xrightarrow{\textrm{CTO}}~\rho^{\prime} $: one simply needs to check that equation (6) is monotonic for all α with respect to such a state transition.

2.2. Elementary thermal operations

Special subsets of TO, which take into account more realistic limitations on feasibility, have been developed in the past few years. Two prominent examples are Markovian TOs (MTOs) [67], which we will not explicitly describe in this paper, and ETO [38], which is the focus of this paper. See table 1 for a hierarchy between these operations.

Table 1. Comparison between Markovian thermal operations, elementary thermal operations, and thermal operations, when applied to the quasi-classical cases.

OperationsMTO       $\subsetneq$      ETO      $\subsetneq$       TO
Construct Ext$[\mathcal{S}]$ Neighbouring β-swap series (theorem 4 of [67]) β-swap series (theorem 9)Tight thermomajorisation (theorem 1)
$\mathrm{p}\rightarrow\mathrm{q}$ iff. $\mathrm{p}$ continuously thermomajorises $\mathrm{q}$ Unknown $\mathrm{p}$ thermomajorises $\mathrm{q}$
CatalysisExamples of Gibbs state catalysts [68]Qubit examples (section 3); higher-dimensional examples (section 4) $\exists$ catalytic transformation iff. Fα does not increase $\forall\alpha$ [14]

ETO is a subset of TO with an additional restriction on the unitary transformation USB in equation (1). Unlike TO, where the simultaneous manipulation of all energy levels of the system is allowed, ETO concatenates a series of operations where each individual step involves only two energy levels. Formally,

Equation (8)

where j and k are the system energy levels that the unitary aims to manoeuvre. As in TO, the state transition from $\mathrm{p}$ to $\mathrm{q}$ via a single ETO step acting on two levels can be written as $ \mathrm{q} = \mathcal{M}^{(j,k)}_\lambda\mathrm{p} $, where 5

Equation (9)

and

Equation (10)

This channel is also constructed to preserve the Gibbs state τβ . Note that the set of equation (10) for $\lambda\in[0,1]$ is equivalent to the set of all transition probabilities between two levels that satisfy detailed balance, i.e. $P_{i\vert j}e^{-\beta E_{j}} = P_{j\vert i}e^{-\beta E_{i}}$. Extremal cases of λ = 1 are named β-swaps (throughout the manuscript we also refer to them simply as swaps),

Equation (11)

By definition, it is clear that both concatenations and convex combinations of TOs are themselves also TO. This is obviously not true for ETO, whenever they act non-trivially over different energy levels. Therefore, most of the interesting transitions would arise only when we include all sequences of ETOs, and also arbitrary convex combinations as allowed operations. Such processes are always TO, hence,

Equation (12)

but the converse is not true [38, 69].

Since no efficient way of determining the possibility of ETO transition from $ \mathrm{p} $ to $ \mathrm{q} $ is known, one must construct $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ starting from the initial state $\mathrm{p}$, and then verify whether $ \mathrm{q}\in \mathcal{S}_\mathrm{ETO}(\mathrm{p})$. However, existing theoretical constructions of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p})$ (by its extreme points) grow rapidly in the system dimension, and become computationally infeasible for the simplest examples of catalysis. We partially address this issue with our results in section 5.

2.3. Neighbouring swaps

To find the extreme points of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $, neighbouring swaps are employed—these are β-swaps of the form $\beta^{(\pi_j,\pi_{j+1})}$ and applied to a state $\mathrm{p}$ with ${\pi}(\mathrm{p})_k = \pi_k$, i.e. they swap two consecutive levels in the β-order of the input state. Neighbouring swaps are important because they typically minimise dissipation/change in athermality (as captured by thermomajorisation, see lemma 3).

Lemma 3. For any state $ \mathrm{p} $, the state obtained from a neighbouring β-swap $\mathrm{q}^{(j)} = \beta^{(\pi_j, \pi_{j+1})}\mathrm{p}$ is tightly thermomajorised by $\mathrm{p}$. Furthermore, states obtained by a single but non-neighbouring β-swap are never tightly-majorised by $\mathrm{p}$ unless two swapped levels have the same energy.

Lemma 3 is proven in appendix section B.3, and furthermore, one can see examples of the lemma visualised in figures 3(b), (d) and (f). From theorem 1, it follows that the neighbouring swapped states $\mathrm{q}^{(j)}$ are identified as extreme points of $\mathcal{S}_{\mathrm{TO}}(\mathrm{p})$, and therefore thermomajorise all states $\mathrm{r}\in\mathcal{S}_{\mathrm{TO}}(\mathrm{p})$ such that ${\pi}(\mathrm{q}) = {\pi}(\mathrm{r})$. This leads to the following corollary:

Corollary 4. If a state $\mathrm{q} \in \mathcal{S}_\mathrm{ETO} (\mathrm{p})$ is tightly thermomajorised by p0, then it is a unique extreme point of $\mathcal{S}_\mathrm{ETO} (\mathrm{p})$ among states of the same β-order.

Therefore, all states $\mathrm{q}^{(j)}$ obtained from a single neighbouring swap are also unique extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ having order ${\pi}(\mathrm{q}^{(j)})$. Numerical examples such as figure 3(c) show that when multiple neighbouring swaps are applied, the final state is no longer necessarily extremal. Nevertheless, most of the extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ are of the form $\mathrm{q}^{(\overrightarrow{j})}$ 6 , or contain only a small number of non-neighbouring swaps in their construction.

Figure 3.

Figure 3. Thermomajorisation curves of a state undergoing β-swaps. The initial state population is set to $\mathrm{p} = (0.75,0.2,0.05)$ and the Hamiltonian is set to $\beta{\textrm{H}}_S = (0,0.2,0.5)$. Panel (a) shows a three-dimensional initial state $\mathrm{p}$. Panels (b) and (d) each shows the state after a single neighbouring swap, while (f) shows a single non-neighbouring β-swap. One observes that both (b) and (d) thermomajorise (f). Furthermore, panels (c) and (e) show $ \mathrm{p} $ after two consecutive neighbouring β-swaps. Observe that even (c) and (e) thermomajorise (f), since the non-neighbouring swap produced a much larger deviation from the initial thermomajorisation curve.

Standard image High-resolution image

Figure 4 exemplifies this preference towards neighbouring swaps—there, the free energy of the state is still higher after three neighbouring swaps, compared to a single non-neighbouring swap. In (a), when a neighbouring β-swap (blue arrow) is applied, the state moves to the adjacent cell of β-order, whereas the non-neighbouring swap (red arrow) transfers a state through multiple boundaries of different orders (black dashed lines), resulting in a state less resourceful than $\mathrm{q}^{(2,1,2)} = \beta^{(1,2)}\beta^{(2,3)}\beta^{(1,3)}\mathrm{p}_1$, that experienced three neighbouring swaps. Fast exhaustion of the athermality also occurs in different initial states, rendering non-neighbouring swaps typically suboptimal.

Figure 4.

Figure 4. Initial d = 3 state $\mathrm{p}_1 = (0.35,0.55,0.1)$ (blue plus marker) undergoing ETO, where its Hamiltonian is fixed as $\beta\mathrm{H}_S = (0,0.2,0.5)$, and all energies are scaled in a unit of $1/\beta$. Panel (a): ETOs in barycentric coordinate, where only relevant states $ \mathrm{q} $ with $F (\mathrm{q},{\tau}^\beta) \unicode{x2A7D} F (\mathrm{p}_1,{\tau}^\beta)$ are marked in colour. Triangles label the extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p}_1)$, achieved by β-swaps indicated by blue arrows. The red X marker is the state $\beta^{(2,3)}\mathrm{p}_1$. Different β-order cells are separated by black dashed lines connecting pure states and the thermal state. Panel (b): the free energy differences from the equilibrium state τβ , in two different paths corresponding to (a). Dashed lines show the continuous values of $ \Delta F $ from points on arrow paths in (a). Straight lines connect the discrete values obtained from endpoints denoted with triangle and X symbols. x-axis is the total length of the path taken from the initial state as plotted in (a), e.g. x-coordinate of the second triangle is the summation of the first and the second blue arrow lengths starting from the plus symbol.

Standard image High-resolution image

Moreover, it is useful to note that figure 4 displays the real time evolution of the state during ETO transitions. By construction, only two levels of the state undergo change in time, i.e. all the other populations are fixed during that period, imposing the system to follow the straight lines in barycentric representations as in (a). In (b), dashed lines correspond to the free energies during the real time evolution when the intensity-dependent JC model is assumed (See appendix A for the real time reduced state dynamics). Suppose we apply $ \beta^{(j,k)} $ to a state $ \mathrm{p} $ with $\mathrm{g}(\mathrm{p})_j\gt\mathrm{g}(\mathrm{p})_k$. According to equations (A.9) and (A.10), the state evolves as $ \mathrm{q}(\lambda(t)) $, where $ \mathrm{q}(\lambda = 1) = \beta^{(j,k)}\mathrm{p} $ with $ \mathrm{g}(\mathrm{q})_{k}\gt\mathrm{g}(\mathrm{q})_{j} $. Since the evolution is continuous, there exists a state $ \mathrm{q}(\lambda^{*}) $, such that $ \mathrm{g}(\mathrm{q})_{j} = \mathrm{g}(\mathrm{q})_{k} $. This point is given by $\lambda^* = 1/(1+\Delta_{jk})$ ($\lambda^* = 1/(1+\Delta_{kj})$) when $k\unicode{x2A7E} j$ ($k\unicode{x2A7D} j$). The state $ \mathrm{q}(\lambda^{*}) $ is the closest to being thermal, and achieves minimal generalised free energies—minima of dashed lines in (b)—among states $ \mathrm{q}(\lambda(t)) $. In other words, during a single β-swap, free energy decreases until the minimum point $\mathrm{q}_{\lambda^*}$ is reached, and then increases until the end of the β-swap. On the other hand, if only the endpoints values are considered (solid lines of (b)), free energy cannot increase after each ETO step. These intermediate increases within a single swap evince the non-Markovian effect of thermal reservoirs at each step, differentiating ETO with the strictly Markovian set, MTO [67, 68]. See section 6 for more discussion.

3. Qubit catalysts for qutrit systems

In this section we demonstrate the following:

  • (i)  
    Existence of the catalytic advantage in the ETO framework. In particular, we report the discovery of simple qubit catalysts that are sufficient to produce a non-trivial advantage for state transitions in a qutrit system.
  • (ii)  
    A time-step resolved tracking of the dynamical path for system and catalyst, that reveals snapshots of the inner workings during the catalytic process.

These results could be obtained only by a more systematic understanding of the set of reachable states via ETO, which are detailed in section 5, where we establish various simplifications that enhanced the computability of $ \mathcal{S}_\mathrm{ETO} $. Throughout the rest of the manuscript, we use the vectors $\mathrm{p}, \mathrm{q}, \mathrm{r}$ to denote system states and ${\textrm{c}}$ is reserved for catalyst states.

3.1. Catalytic advantages

For a qutrit system and qubit catalyst, the composite state $\mathrm{p}\otimes{\textrm{c}}$ lives in a six-dimensional probability space $ \mathcal{V}^6 $. Our goal is to construct a set obtainable by catalytic ETOs with a qubit catalyst (CETO2)

Equation (13)

By definition, $\mathcal{S}_\mathrm{ETO}(\mathrm{p})\subset \mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p}) $, but qubit catalytic advantage exists iff. $\mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p})\not\subset\mathcal{S}_\mathrm{ETO}(\mathrm{p})$. For a number of limited cases, given a fixed catalyst state ${\textrm{c}} = (c_1,1-c_1)$, some parts of the set

Equation (14)

can be evaluated analytically (appendix F). Nonetheless, $ \mathcal{S}_\mathrm{CETO}(\mathrm{p};{\textrm{c}}) $ is in general constructed by numerically evaluating the extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})$ and imposing exact catalyst recovery conditions. The set $\mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p})$ is then given by iterating the process for different values of c1. In this first work, we focus on the case where S remains uncorrelated to C to affirm that catalytic advantages exist even in the most conservative catalytic setting. We expect the catalytic power to further increase when system-catalyst correlation persists as has been shown in different resource theories [11, 15, 18, 19, 21, 24, 70], but leave this case for future work.

In figure 5, we present two $\mathcal{S}_\mathrm{CETO}^{(2)}$ sets corresponding to two different initial states that have distinct β-orders. The sets are displayed in comparison with the non-catalytic set of reachable states $\mathcal{S}_\mathrm{ETO}$ and $\mathcal{S}_{\mathrm{TO}}$. In particular, notice that in figure 5(a), where the initial state is set to have ${\pi} = (2,1,3)$, we observe that $ \mathcal{S}_{\mathrm{TO}}\subsetneq\mathcal{S}_\mathrm{CETO}^{(2)} $. This feature persists for a number of randomly chosen initial states with ${\pi} = (2,1,3)$ and $(3,1,2)$. When this happens, we can on one hand reproduce every TO transition using ETO with a single qubit catalyst; and on the other hand, combat some of the finite-size effects and enable a larger set of transitions than previously allowed by an arbitrary TO. Likewise, consider figure 5(b), for an initial state ${\pi} = (1,2,3)$. Here, we observe that $\mathcal{S}_\mathrm{CETO}^{(2)}$ overlaps almost entirely with $\mathcal{S}_{\mathrm{TO}}$, but neither is fully contained by the other. This qualitative characteristic is again present for different initial states with the same β-ordering.

Figure 5.

Figure 5. Barycentric representation of $\mathcal{S}_\mathrm{ETO}$ (blue solid lines), $\mathcal{S}_{\mathrm{TO}}$ (violet dashed lines), and $\mathcal{S}_\mathrm{CETO}^{(2)}$ (black dotted lines), for the initial states (a) $ \mathrm{p}_1 $ in figure 4 and (b) $\mathrm{p}_2 = (0.7,0.2,0.1)$, such that ${\pi}(\mathrm{p}_2) = (1,2,3)$. The initial state, thermal state τβ , and β-order regimes are denoted similarly as to figure 4. Pink shaded areas mark catalytic advantage within $\mathcal{S}_{\mathrm{TO}}$, while yellow shaded areas are states that can be reached via ETO+qubit catalyst, but not by TO without catalysis.

Standard image High-resolution image

In figure 5, the set of states that go beyond $\mathcal{S}_{\mathrm{TO}}$ (yellow) highlights the additional advantage brought forward by CETO2. In addition, the set of states between $\mathcal{S}_{\mathrm{TO}}$ and $\mathcal{S}_\mathrm{ETO}$ (pink) also has operational merits—there exist states which require genuine multi-level TO to achieve, but can be obtained by an alternative pathway that involves only basic, JC-like interactions when a catalyst is present. One natural question follows: how does the gap between CETOn and ETO change with n being the dimension of the allowed catalyst? The question of a generally fixed n is difficult to answer, however, in section 4, some hints are given.

3.2. Tracking catalytic processes

Here we explicitly construct a simple series of ETO swaps that leads to the transformation of an initial state $\mathrm{p}$ into an extreme point of $\mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p})$, and track changes in the system free energy throughout the process.

In general, a state $\mathrm{q}$ being an extreme point of $\mathcal{S}_\mathrm{CETO}(\mathrm{p})$ does not guarantee $\mathrm{q}\otimes{\textrm{c}}$ to be an extreme point of the enlarged $\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})$ in system-catalyst composite space. Thus, (non-trivially) different paths of catalytic processes exist. In figure 6, we choose the shortest swap series, which comprises seven ETO steps, to realise the CETO transition from an initial state $\mathrm{p}_1$ to a new extreme state of $\mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p}_1)$. Note that such a choice may not always be unique; see appendix F for details. Since adopting a qubit catalyst doubles the total dimension, and recovering the catalyst also requires additional swaps, more two-level swaps are needed to achieve a similar final state (as compared to a non-catalytic ETO). For the first four steps, the nonequilibrium free energy of the system, FS decreases; see figure 6(b). After two consecutive ETO steps on the joint system, the catalytic path seems identical to a non-catalytic ETO on the system. Nevertheless, on the global picture, correlations are already starting to build up with the catalyst, as shown in figure 7. In the third and fourth steps, FS is further reduced until it has almost a half of the original nonequilibrium contribution. Recalling that free energies are the monotones of TO, this rapid depletion of the athermality would have been irreversible, if we had access only to the system. Yet, during catalysis, some of the free energy is transferred to either 1) the catalyst populations or 2) correlations between system and catalyst. This build-up is critical for the sixth and final steps of the process: stored free energy is what enables the system to again increase in local nonequilibrium free energy, therefore achieving a final state $ \mathrm{q}$ outside $\mathcal{S}_\mathrm{ETO}(\mathrm{p}_1)$ (and in this case, $ \mathrm{q} $ is even outside $\mathcal{S}_{\mathrm{TO}}(\mathrm{p}_1)$). See appendix F for a more detailed analysis. Similar behaviours are observed for other monotones, such as generalised free energies with α ≠ 1. Note that the most conservative form of catalysis is assumed: the catalyst is returned exactly and without correlations with the system. Hence, catalyst local free energy goes back to its original level, and mutual information also returns to zero, as shown in (b) and (d) of figure 7. Their role is restricted to temporary free energy storages. Furthermore, the total free energy always decreases after each swap, which is expected, since the process is an ETO on the joint system.

Figure 6.

Figure 6. (a) Illustration of system reduced state evolutions in a CETO process (diamond markers and dashed lines) starting from an initial state $\mathrm{p}_1$ with final state lying beyond $\mathcal{S}_\mathrm{ETO}(\mathrm{p}_1)$. This state is chosen to be an extreme point of $\mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p}_1)$. The rest of the plot is the zoomed-in view of figure 4(a) around β-orders $(2,1,3)$ and $(2,3,1)$. (b) Free energy differences from the equilibrium state τβ . Solid lines connect the values after each swap and dashed lines mark continuous change between them. The fifth swap applied during the catalytic evolution does not alter the system reduced state, making the point before and after that swap not distinguishable in the system reduced picture of (a) and (b). These points are marked with red circles in both (a) and (b). Initial state $\mathrm{p}_1$ and its Hamiltonian $\beta\mathrm{H}_S$ are the same to those of figure 4.

Standard image High-resolution image
Figure 7.

Figure 7. Nonequilibrium free energies of CETO evolution described in figure 6. The symbol $\tilde{\Delta}$ indicates the difference from the values of the initial state. (a) System local free energy identical to black lines in (b) of figure 6; (b) catalyst local free energy; (c) total free energy; (d) mutual information which, in this case, is identical to the difference between the total free energy and the sum of two local free energies. x-coordinates of points are identical to ones in figure 6(b).

Standard image High-resolution image

4. Higher-dimensional catalysts for transitions between thermal states

In general, identifying the whole set of reachable states $\mathcal{S}_\mathrm{CETO}^{(n)}(\mathrm{p})$ with n-dimensional catalysts is highly challenging, even when $ \mathrm{p} $ is three-dimensional and n = 3. However, by developing a theoretical tool, namely theorem 8, we show that the numerical cost is dramatically reduced for initial states $ \mathrm{p} $ whose β-orders are monotonic in energy levels. A particularly important class of states that satisfy this property is the set of thermal states with different temperatures. When $ \beta_{h}\lt\beta $, i.e. the state $ \tau^{\beta_{h}} $ is hotter than the environment with temperature β−1, the β-order $ {\pi}(\tau^{\beta_{h}}) = (d,d-1,\ldots,1) $. Similarly, colder states with $ \beta_{c}\gt\beta $ have the order $ {\pi}(\tau^{\beta_{c}}) = (1,2,\ldots,d) $. If we further employ a catalyst $ {\textrm{c}} $ from the set of states which are sufficiently thermal, the monotonicity of β-order would be preserved, i.e. $ {\pi}(\mathrm{p}\otimes{\textrm{c}}) $ is also monotonic in the total energy. In such cases, the analysis of higher-dimensional catalysts becomes computationally tractable. We will refer to such catalyst states as minimally-disturbing catalysts, in the sense that they do not disturb the β-ordering of the system-catalyst composite.

To demonstrate the benefit of this reduction, let us consider the cooling process via (C)ETO starting from a high temperature thermal state $ \tau^{\beta_{h}} $ with $ \beta_{h}\lt\beta $ to a colder thermal state $ \tau^{\beta_{c}} $. For any temperature $ \beta_{h}^{-1} $, it is always possible to reach the ambient temperature β−1 by a full thermalisation with the environment. However, with ETO and TO, colder temperatures $ \beta_{c}\gt\beta $ can be achieved, which has been studied in [71] for the non-catalytic case of a qubit. One of our main theses is to showcase the effectiveness of small catalysts with practicable procedures. To corroborate this claim and investigate the scaling of catalytic advantage with respect to catalyst size, we apply theorem 8 to find the limits of the cooling performance for a qutrit when using catalysts of varying dimensions, ranging from two to thirty. The catalyst Hamiltonian is trivial for simplicity.

Figure 8 shows the coldest achievable βc from minimally-disturbing catalysts, where the worst and the best cases are marked with purple circles and blue diamonds, respectively. Even with qubit catalysts, almost half of the gap between the TO limit (dashed line) and the ETO limit ($ \textrm{dim}({\textrm{c}}) = 1 $) is covered. The maximal catalytic advantage (blue solid line) gradually increases with the catalyst size, and at $ \textrm{dim}({\textrm{c}}) = 16 $, best catalysts among the sample surpass TO limit, whilst at $ \textrm{dim}({\textrm{c}}) = 30 $, most of the samples perform better than TO. Note that we have limited the range of catalyst distribution to fix the initial composite state β-order; hence there might exist (not minimally-disturbing) catalyst states that activate a better cooling process than the ones marked in figure 8. Also, even the worst case catalysts provide some advantage for the same reason. Usually, if the catalyst is almost pure or pure, catalytic advantage vanishes. Nevertheless, our results give an efficiently computable lower bound to the achievable amount of cooling when any n-dimensional catalyst is allowed. One heuristic approach for exploring catalyst distributions that are not minimally-disturbing, is to search over sequences of β-swaps satisfying the conditions of theorem 8. This method is computationally efficient and yields a subset of $\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})$, typically strictly smaller than the full set. Nonetheless, we have observed instances where catalytic advantage is still present even within this restricted set.

Figure 8.

Figure 8. The cooling performance of CETO from a thermal state to another thermal state, quantified by final inverse temperature βc attainable when the catalyst state is fixed. We set the initial system inverse temperature to be $ \beta_{h} = 0.5 $, and the ambient temperature to be β = 1. The system of interest is three-dimensional with energy levels $ (0,0.4,0.5) $ and catalysts of dimension $ \textrm{dim}({\textrm{c}}) $ from two to nine, sixteen, and thirty are used. $ \textrm{dim}({\textrm{c}}) = 1 $ case corresponds to non-catalytic ETO, and the blue dashed line above marks the inverse temperature $ \beta_{\mathrm{TO}} $ that non-CTO can achieve. We searched over catalyst state distributions among minimally-disturbing catalysts. The results from the worst performing catalysts in each dimension are marked with purple circles, while blue diamonds are from the best catalysts in each dimension.

Standard image High-resolution image

To estimate the true minimum and maximum cooling performance achievable by the given set of catalysts, we employed multiple strategies, including uniform random sampling and gradient-descent-like search. Interestingly, regardless of the catalyst dimension, the worst catalysts are given by maximally mixed states which are the Gibbs states for the trivial Hamiltonian. Using Gibbs states as a catalyst can provide no advantage in the framework of TO, since they form the free states. However, for a set of operations with innate Markovianity, such as ETO, Gibbs states can activate state transitions as a catalyst by providing additional non-Markovianity. This is reminiscent of the discussion in section 3.2, where the catalyst functions as a temporary storage during the evolution, see [72] for a similar setup. The best catalyst distributions we obtained are non-trivial and would be of interest for future study.

Overall, the results in this section display that small catalysts do provide substantial advantage in the setting of ETO, where simple two-level swaps are sufficient to execute the procedure. Also, even when the catalysts are not fine-tuned, when they are in a certain regime, catalysis can still enhance ETOs considerably. In particular, Gibbs states are assumed to be easily preparable, and thus can be good candidate states for a more realistic catalytic protocol. Naturally, the characterisation of CETO with higher-dimensional catalysts or generic initial states are very interesting questions, and we anticipate the new tools developed in this manuscript to aid the addressing of such problems in the future. In fact, in a separate follow-up study [73], we were able to fully uncover $\mathcal{S}_\mathrm{CETO}$ when no restrictions are placed on either catalyst dimensions or states. In particular, we show that when the catalyst dimension is allowed to be unbounded, then ETOs with Gibbs state catalysts can achieve all TO transitions. Furthermore, when full freedom on choosing the catalyst is given to both ETOs and TOs, we also show that these two operation classes become equivalent. This confirms that the convergent behaviour observed in figure 8 is indeed a generic feature of CETO.

5. Characterising $\mathcal{S}_\mathrm{ETO}$

Currently, the only known way to determine whether $\mathrm{p} \xrightarrow{\textrm{ETO}} \mathrm{q}$ is to construct the full set $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ by finding all the extreme points of this convex polytope and check if $\mathrm{q}\in\mathcal{S}_\mathrm{ETO}(\mathrm{p})$. Lostaglio et al [38] provided a systematic way of finding all extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ for an arbitrary dimension d, which involves an exhaustive search among all possible β-swap sequences with a bounded length $\ell_\mathrm{max}$. In other words, this procedure identifies an upper bound on the number of extreme points

Equation (15)

In [38], $ \ell_\mathrm{max}\unicode{x2A7D} d! $ is shown, which means that equation (15) grows super-exponentially with the dimension of the system. This presents a serious roadblock to both understanding and determining the possibility of state transitions via ETO, a reason why ETO, despite its strong physical motivation, has not been extensively studied.

We improved this result in the following ways:

  • (i)  
    For the simplest non-trivial case of d = 3, we provide a full characterisation of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$, showing in particular that $\mathcal{S}_\mathrm{ETO}$ has at most 8 extreme points, in contrast to the upper bound in [38] of $3^6 = 729$.
  • (ii)  
    For d > 3,
    • (a)  
      we tighten the upper bound on $\ell_\textrm{{max}}$ analytically by the factor of d − 3 (see theorem 9).
    • (b)  
      for a subset of initial states where the β-ordering is monotonic in energy, we derive a tight upper bound for the number of extreme points, and fully identify the corresponding β-swaps (theorem 8).
  • (iii)  
    We use computational algorithms to obtain heuristics by random sampling of initial states. We present a comparison of these results with theoretical analysis in table 2.

Table 2. Theoretical bounds and numerical results for maximum β-swap lengths $l_{\text{max}}$ needed to construct all extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ are compared up to d = 7. Unlike the factorial scaling of the theoretical bound, heuristics suggest polynomial growth of the length. $N_{\mathrm{Ext}}$ is the typical number of extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$, also obtained from numerical search.

$d = $ dim($\mathrm{p}$) $\ell_{\text{max}}$ ([38]) $\ell_{\text{max}}$ (theorem 9) $\ell_{\text{max}}$ (heuristics) $N_{\mathrm{Ext}}$ (heuristics)
363 (theorem 5)3 $\mathcal{O}(10^0)$
424208 $\mathcal{O}(10^1)$
51205816 $\mathcal{O}(10^2)$
672023823 $\mathcal{O}(10^3)$
75040125938 $\mathcal{O}(10^5)$

5.1. An exact characterisation for d = 3

A qutrit system is the simplest setup where $\mathcal{S}_\mathrm{ETO}$ deviates from $\mathcal{S}_{\mathrm{TO}}$. Furthermore, the no-go results established for qutrits, which rule out certain swap series from generating extreme points of $\mathcal{S}_\mathrm{ETO}$, hold true for any three levels of higher-dimensional scenarios. Here, we provide a full characterisation of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ for any qutrit state $\mathrm{p}$ by defining two simple sets.

Let us start from an example in figure 4(a), where the initial state $\mathrm{p}$ has the β-order $(\pi_{1},\pi_{2},\pi_{3}) = (2,1,3)$. We take two natural strategies for each β-order: favouring neighbouring swaps and minimising the number of swaps in a series. For the original order, $\mathrm{p}$ is optimal with zero swaps. For orders $(\pi_{2},\pi_{1},\pi_{3})$ and $(\pi_{1},\pi_{3},\pi_{2})$, i.e. $(1,2,3)$ and $(2,3,1)$ for this example, a single neighbouring swap, yielding $\beta^{(1,2)}\mathrm{p}$ and $\beta^{(1,3)}\mathrm{p}$, is sufficient. Two other β-orders are obtained through length-2 neighboring swap series. The resulting states are $\beta^{(2,3)}\beta^{(1,2)}\mathrm{p}$, corresponding to $\beta^{( \pi_1, \pi_3)}\beta^{( \pi_1, \pi_2)}\mathrm{p}$, and $\beta^{(2,3)}\beta^{(1,3)}\mathrm{p}$, corresponding to $\beta^{( \pi_1, \pi_3)}\beta^{( \pi_2, \pi_3)}\mathrm{p}$. Collecting all these points, we form the set

Equation (16)

These states, marked in figure 4(a), occupy five β-order cells.

For the last β-order, i.e. the completely inverted one $(\pi_{3},\pi_{2},\pi_{1})$, two strategies diverge. Prioritising neighbouring swaps yields $\beta^{( \pi_2, \pi_3)}\beta^{( \pi_1, \pi_3)}\beta^{( \pi_1, \pi_2)}\mathrm{p}$ and $\beta^{( \pi_1, \pi_2)}\beta^{( \pi_1, \pi_3)}\beta^{( \pi_2, \pi_3)}\mathrm{p}$, while choosing the shortest series results in $\beta^{( \pi_1, \pi_3)}\mathrm{p}$. These three states, all marked in the leftmost β-order cell in figure 4(a), comprise the set

Equation (17)

In fact, these two sets include all extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ when $\mathrm{p}\in\mathcal{V}^{3}$.

Theorem 5. For any $\mathrm{p}\in\mathcal{V}^3$ with ${\pi}(\mathrm{p})_k = \pi_k$, let $ \Theta_\mathrm{ETO}^{(3)} $ and $ \Xi_\mathrm{ETO}^{(3)} $ be defined as in equations (16) and (17), where we have dropped the explicit $ \mathrm{p} $ for notational convenience. Then,

Equation (18)

One of the important techniques used in proving theorem 5 is identifying the β-swap series that produce states which are extreme points of $ \mathcal{S}_{\mathrm{TO}} $. As a result of theorem 1 and corollary 4, such states are unique extreme points for final states of their particular β-ordering. This identification utilises results established in the language of biplanar transportation matrices in [66]. We leave the full proof for appendix C and focus on a few notable points made there.

Firstly, observe that even non-neighbouring swaps can generate extreme points. An elucidative example is the state $\mathrm{p} = (0.5,0.5,0)$ with the Hamiltonian $\mathrm{H} = (0,E,E)$ and β-order ${\pi}(\mathrm{p}) = (2,1,3)$. The non-neighbouring swap yields a state $\beta^{(2,3)}\mathrm{p} = (0.5,0,0.5)$ without any entropy production, i.e. generalised free energies remain identical for all α. This reversible transformation cannot be achieved by neighbouring-swap series that necessarily involve the ground state level. This fact rules out the naïve attempt of working only with states of the form $\mathrm{q}^{(\overrightarrow{j})}$, and forces us to consider the entire set of swap series, which is in contrast to the case of MTO, where the extreme point verifying algorithm only needs to search for neighbouring swaps [67]. However, a necessary condition for $\beta^{( \pi_1, \pi_3)}\mathrm{p}$ to be extremal in $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ exists and alleviates the complications.

Lemma 6. For any $\mathrm{p}\in\mathcal{V}^d$ with ${\pi}(\mathrm{p})_k = \pi_k$, the state $\beta^{( \pi_i, \pi_{i+c})}\mathrm{p}$ can be an extreme point of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ only if ${\pi}(\beta^{( \pi_i, \pi_{i+c})}\mathrm{p}) = S_{i,i+c}({\pi}(\mathrm{p}))$, where $ S_{i,j} $ is the operation that swaps the ith element with the jth element.

This lemma can be proven by straightforward calculations of swapped states, as described in appendix B.4. Figure 4(a) offers visual intuition. In the figure, a non-neighbouring swap transforms the state along the red arrow and it crosses three black dashed lines that demarcate distinct β-orders. If ${\pi}(\beta^{( \pi_i, \pi_{i+c})}\mathrm{p}) \neq S_{i,i+c}({\pi}(\mathrm{p}))$, the arrow halts before reaching the third black dashed line, resulting in a state closer to the centre of the polytope and thus not extremal. Two simplifications emerge from this result. Firstly, in numerical computations, all non-neighbouring swaps that give orders different from what the lemma imposes can be removed. Secondly, when analytical studies are carried out without specifying an initial state, the ambiguity of β-orders after the swap vanishes when the resulting state is shown to be an extreme point of $\mathcal{S}_\mathrm{ETO}$.

5.2. Improved characterisation for higher-dimensions

Finding the extreme points of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ is generally a hard task. In particular, multiple extreme points that correspond to the same β-ordering often exist, and the total number of extreme points is only known to be upper bounded by equation (15), which yields an extremely loose number, scaling as $d^{2d!/(d-3)}$ even with our improved result theorem 9. Lemma 7 provides some intuition into the lower bound on the number of extreme points for generic initial states.

Lemma 7. For any state $ \mathrm{p}\in\mathcal{V}^d $, the reachable state set $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ has at least one extreme point $ \mathrm{r} $ having $ {\pi}(\mathrm{r}) = \psi $ for any ordering ψ.

The proof can be found in appendix B.5. Note that lemma 7 does not imply that all initial states have at least $ d! $ distinct extreme points. For instance, a Gibbs state τβ cannot be transformed into any other state via ETO and thus $ \mathcal{S}_\mathrm{ETO}({\tau}^\beta) $ has a single (extreme) point τβ . In this case, $ \mathrm{g}({\tau}^\beta) $ is fully degenerate and $ {\pi}({\tau}^\beta) $ can be any permutation of $ (1,\ldots,d) $.

Similar to the τβ example, some initial states admit a simpler $ \mathcal{S}_\mathrm{ETO} $ structure. One of our main analytical results shows that the situation simplifies drastically and achieves the lower bound of lemma 7, when $ \mathrm{p} $ is known to have a particular β-ordering, i.e. when it is monotonic in energy,

Equation (19)

Theorem 8. If ${\pi}(\mathrm{p}) $ is monotonic in energy, extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ are achieved if and only if the corresponding β-swap series that produce them are

  • (i)  
    always neighbouring,
  • (ii)  
    containing no repetition of each swap.

Furthermore, when $ \overrightarrow{\beta}_1\mathrm{p}, \overrightarrow{\beta}_2\mathrm{p} \in \mathrm{Ext}[\mathcal{S}_\mathrm{ETO}(\mathrm{p})] $ and $ {\pi}(\overrightarrow{\beta}_1\mathrm{p}) = {\pi}(\overrightarrow{\beta}_2\mathrm{p})$ for such $ \mathrm{p} $, the two series are identical ($ \overrightarrow{\beta}_1 = \overrightarrow{\beta}_2 $).

See appendix D for the proof. Several important simplifications follow from the above theorem, whenever $\pi(\mathrm{p})$ is monotonic in energy. From the no-repetition condition, $\ell_\mathrm{max} = d(d-1)/2$ is obtained. The equivalence of β-swaps outputting the same target state β-order also guarantees the uniqueness of extreme points of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ at each order, setting the maximum number of extreme points to be $ d! $. More importantly, given the target β-order, one can immediately identify a corresponding extreme point without the need of searching over all possible series, since we developed an explicit algorithm to evaluate this extreme point, which we call the standard formation (see definition appendix D.1 for details).

Notably, all thermal states $ \tau^{\beta^{^{\prime}}} $, with temperature $ 1/\beta^{^{\prime}}$ that might be different from the ambient temperature $ 1/\beta $, satisfy the monotonicity of the β-ordering. The analysis in section 4 deals with at most ninety dimensional system-catalyst composites. Without theorem 8, evaluating the extreme points for such high dimensional systems is practically impossible. However, leveraging the nice ordering of the initial state, we were able to simulate over two million ninety dimensional systems in just a few hours.

For the case of generic initial states, we derive an improved upper bound on the β-swap series length that generates the ETO cone.

Theorem 9 (improvement of theorem 6 of [38]).  $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ is the convex hull of all final states generated by

Equation (20)

with $l_\mathrm{max} \unicode{x2A7D} \left\lfloor\frac{d!-4}{d-3}\right\rfloor$ for $d\unicode{x2A7E} 4$ and all possible combinations of $\{(j_n,k_n)\}$.

The proof of theorem 9 can be found in appendix E. The main technique is again to identify a set of states that thermomajorise all the other states in $ \mathcal{S}_\mathrm{ETO} $ that share the same β-order, either via applying the results of [66], or showing that they are tightly-majorised by the previous state.

However, even for generic initial states, most elements in equation (20) are not extremal in $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$. As discussed in section 2.3, non-neighbouring swaps often deplete athermality too rapidly and the resulting state ends up being close to equilibrium and far from extremity.

5.3. Gap between current theoretical characterisation and heuristic analysis

When naïvely constructing all β-swaps as given in theorem 9, the number of the series that need to be checked scales as $(d/\sqrt{2})^{2d!/(d-3)}$, which is practically infeasible after d = 4. However, a simplification can be made by exploiting the fact that if a series $\overrightarrow{\beta}$ yields a non-extreme point $\overrightarrow{\beta}\mathrm{p} = [\lambda_1\overrightarrow{\mathcal{M}}_1 + (1-\lambda_1)\overrightarrow{\mathcal{M}}_2]\mathrm{p}$, all states $\overrightarrow{\beta}^{\prime}\overrightarrow{\beta}\mathrm{p}$ with any additional β-swap series $\overrightarrow{\beta}^{\prime}$ are also non-extreme. Thus by starting from length-1 series and checking extremity after each time the series is lengthened, we can greatly reduce the computational cost.

There are two ways to verify the extremity of final states. For $d = 3,4$ and qutrit-qubit cases, we explicitly used the fact that the resulting set is convex:

  • (i)  
    Apply length-1 β-swaps to the set $\mathcal{S}_0 = \{\mathrm{p}\}$ to get $\mathcal{S}_1 = \{\beta^{(k,l)}\mathrm{p}\}_{\forall k,l}$. Construct the hull $\mathrm{Conv}[\mathcal{S}]$, where $\mathcal{S} = \bigcup_{j}\mathcal{S}_j$.
  • (ii)  
    Find extreme points $\mathrm{Ext[Conv[\mathcal{S}]]}$ and update $\mathcal{S}_1 \rightarrow \mathcal{S}_1 = \mathrm{Ext[Conv[\mathcal{S}]]} - \mathcal{S}_0$.
  • (iii)  
    Iterate steps 1 and 2 by applying single swaps to $\mathcal{S}_i$, save newly obtained extreme states as $\mathcal{S}_{i+1}$, and update $\mathcal{S}_{j\unicode{x2A7D} i+1}$ by eliminating non-extreme points. Update $\mathcal{S} = \bigcup_{j}\mathcal{S}_j$.
  • (iv)  
    Stop when $\mathcal{S}_{i+1} = \emptyset$.

The strength of this algorithm is its straightforwardness. It does not include any non-extreme points nor does it miss an extreme point of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$. However, especially in the higher-dimensional cases, constructing convex hulls and characterising extreme points are highly demanding.

The second algorithm, which we used for $d = 5,6,7$, is a slight modification of the one presented in [67] for MTO and utilises lemma 2:

  • (i)  
    Apply length-1 β-swaps to the set $\mathcal{S}_0 = \{\mathrm{p}\}$ to get $\mathcal{S}_1 = \{\beta^{(k,l)}\mathrm{p}\}_{\forall k,l}$.
  • (ii)  
    Apply length-1 β-swaps to the set $\mathcal{S}_1$ to obtain $\mathcal{S}_2$ and check thermomajorisation relations between states in $\mathcal{S} = \bigcup_{j}\mathcal{S}_j$ that share the same β-order. Eliminate the ones that are thermomajorised by other states.
  • (iii)  
    Iterate step 2 for i times until $\mathcal{S}_{i+1} = \emptyset$. Update $\mathcal{S} = \bigcup_{j}\mathcal{S}_j$.

The difference from the algorithm for MTO is that our algorithm also considers non-neighbouring swaps. The second strategy has an important edge over the previous one: there are only $m(d-1)$ inequalities to check for each final state, where m is the number of already obtained extreme points having the same order. Hence for higher-dimensional systems, the second approach is preferable. Yet, non-extreme states that are achievable by convex combinations of different extreme states, but not thermomajorised by either of them, can be included in the final set. When interested only in the final set $ \mathcal{S}_\mathrm{ETO} $, counting some non-extreme states is still permissible.

To search for the maximum length $\ell_{\text{max}}$, randomly generated initial states and energy levels were used to construct the set $\mathcal{S}_\mathrm{ETO}$. For $d\unicode{x2A7D} 6$, $\sim\mathcal{O}(10^4)$ random samples were tested for each dimension. For d = 7, only 200 cases were calculated due to its high computational cost (∼ few hours for each initial state). Table 2 presents results from the search, where $\ell_\mathrm{max}$ scales much slower than the theoretical prediction. Moreover, the typical number of extreme points also deviates significantly from the naïve expectation, i.e. they do not scale exponentially with $\ell_\textrm{{max}}$.

6. Discussions and conclusions

In search of more practical thermal processes, we analyse catalysis in ETOs with small catalysts. These operations offer a clearer path to implementation and less stringent experimental control requirements, compared to the more general and well-studied framework of TOs. However, it is known that ETO is only a strict subset of TO, and in particular, some of the states reachable via TO are no longer achievable via ETO. We try to alleviate this limitation by allowing catalysis while maintaining ease of execution by limiting the catalyst size.

Depending on the physical system of interest, modifications can be made to our catalytic ETO framework. We assumed the ability to apply any two-level TOs to any pair of levels in the system. Stricter limitations could be imposed on the power of operations: MTOs [67] or JC models [38] may replace ETO when using certain thermal baths. In other cases, some pairs of levels might lack a resonant thermal environment, preventing direct swapping. In both scenarios, the set of reachable states would shrink. On the other hand, one might explore more relaxed definitions of catalysis. If correlations between the system and catalyst are not detrimental, or if catalysts are allowed to change by a small amount, the set of states reachable via CETO can be expanded [8]. Nonetheless, we believe that our main message, namely the existence of useful yet realistically implementable catalytic operations in thermodynamics, would remain robust under these modifications.

Several roadblocks had to be removed in order to tackle this problem. Firstly, there is currently no efficient way of characterising the set of allowed state transitions under ETO. We partially overcome this challenge through two approaches. First, we fully solve the characterisation problem for three-dimensional systems and for a subset of initial states of generic dimension. Additionally, we improve the analytical upper bound of the computational cost needed for the most general cases. Armed with these tools, we demonstrate the existence of catalysis in ETO, where even small catalysts prove to be remarkably useful. In fact, the extreme case of employing a qubit catalyst alone nearly eliminates the gap between TO and ETO for qutrit system states, sometimes even providing additional advantages beyond TO. At the same time, our physically relevant example of a cooling protocol highlights the power of relatively small catalysts, approaching the TO limit without the need for fine-tuning the catalyst states.

Another obstacle is the initial lack of clarity on why catalysts work, despite reports of their existence (or non-existence) in various resource theories. To address this, we leverage the step-wise structure of ETO to track and analyse catalytic evolutions, by capturing snapshots of states after each ETO step. This approach opens up a new avenue for understanding the underlying origins of catalytic advantage. In our example, the catalyst's role was to receive the free energy flowing out from the system, either through reduced state population changes or correlations with the system. Without the catalyst, all changes in system free energy would dissipate into the surrounding bath, which thermalises after each swap. This interpretation could potentially be extended to catalysts in different resource theories. For instance, it would be intriguing to further investigate how residual correlations between catalyst and system (for the scenario of correlating catalysis) are exemplified in this picture.

Among our results, the theoretical upper bound presented in theorem 9 is expected to have more room for tightening. The β-swapping operations (unless the energy levels are degenerate) are highly resource-depleting in general. Therefore, the application of these operations exponentially many times, e.g. using $\ell_{\text{max}}$ number of times, is unlikely to produce a final state that is extreme in terms of athermality. Our heuristic approach strongly suggests that the maximum length scales polynomially, rather than exponentially. Yet, we currently do not have rigorous proof of this scaling, and we leave it for future studies.

Acknowledgments

We thank Matteo Lostaglio for insightful discussions. This work was supported by the start-up grant of the Nanyang Assistant Professorship of Nanyang Technological University, Singapore.

We organise the appendix as follows: appendix A illustrates the real time dynamics of one ETO step, assuming intensity-dependent JC interaction between two levels and a harmonic oscillator in Gibbs state.

Appendix B contains various technical results we developed on ETOs, which are necessary for the analytical results in the main text. In particular, the proof of lemma 6 is provided in this section.

In appendix C, we develop the full characterisation of ETO with d = 3 by proving theorem 5. For general high-dimensional cases, we prove theorem 9 in appendix E.

Appendix D proves theorem 8 claiming the drastic simplification of the $ \mathcal{S}_\mathrm{ETO} $ extreme points construction for two special initial β-orders that are monotonic with the energy eigenvalues.

In appendix F, we detail the methods of constructing the set of reachable states via catalytic ETO with a qubit catalyst. Here we also analyse an example transition obtained from this method.

Data availability statement

The data cannot be made publicly available upon publication because they are not available in a format that is sufficiently accessible or reusable by other researchers. The data that support the findings of this study are available upon reasonable request from the authors.

Appendix A: Dynamics of an elementary thermal operation

In this section, we show that any two-level swap $ \mathcal{M}_{\lambda}^{(i,j)} $ can be implemented through the intensity-dependent JC interaction between two levels to be swapped and a harmonic oscillator with the matching energy gap. Our model comprises two-level system, harmonic oscillator, and an interaction between them, which respectively read

Equation (A.1)

Equation (A.2)

Equation (A.3)

where

Equation (A.4)

is a Pauli X-like operator in the subspace spanned by energy eigenvectors $ \lvert1\rangle_{S}\lvert n\rangle_{B} $ and $ \lvert2\rangle_{S}\lvert n-1\rangle_{B} $ possessing the same energy and g is the coupling strength parameter. Note that the interaction term is energy-preserving, i.e.

Equation (A.5)

For simplicity, we shift to the interaction picture. We always consider the initial state to be a product of incoherent system state and a Gibbs state w.r.t. some inverse temperature β,

Equation (A.6)

which also commutes with $ H_{S}+H_{B} $. Then, the state ϱSB and the Hamiltonian HSB remain invariant in the interaction picture. The time-evolution operator is given by

Equation (A.7)

making use of the properties of $ X^{(n)} $,

Equation (A.8)

After time t, the system reduced state becomes

Equation (A.9)

Since the reduced state on S is quasi-classical at all time, we focus on the population vector $ \mathrm{q}(t) = (q_{1}(t),q_{2}(t)) $, which can be written as a ETO channel

Equation (A.10)

with $ \lambda(t) = \text{sin}^{2}(gt) $. Equation (A.10) specifies the continuous evolution during the two-level swap process, achieving the β-swap at $ gt = \pi/2 $.

Appendix B: Elementary thermal operations for d-dimensional systems

B.1. Basic transformation of thermomajorisation curves after a β-swap

Given a system characterised by Hamiltonian H, we denote its thermal state as τβ , and describe the initial state with respect to its energy population vector $ \mathrm{p} $. Under a β-swap $\beta^{(k,l)}$ as defined in equation (11),

Equation (B.1)

when $E_k\unicode{x2A7D} E_l$. Then the element-wise ratios, $ \mathrm{g}(\mathrm{q}) $ as defined in equation (2), are transformed accordingly

Equation (B.2)

Equation (B.3)

Equation (B.4)

where

Equation (B.5)

Furthermore, equalities for the above equations hold only under the following circumstances:

Equation (B.6)

Equation (B.7)

Naturally, the β-swap operations also alter β-orderings of states. Let us denote the initial β-order as ${\pi}(\mathrm{p}) = (\pi_1,\ldots,\pi_d)$. If $k = \pi_i$ and $l = \pi_{i \pm 1}$ for some i, that is if $\beta^{(k,l)}$ is a neighbouring swap for a state $\mathrm{p}$, then ${\pi}(\mathrm{q})$ can be easily determined:

Equation (B.8)

where $ S_{i,j} $ is a swap between i'th and j'th elements as introduced in lemma 6.

B.2. Useful technical remarks

Here we present sundry remarks on β-swaps that are utilised in proofs of lemmas and theorems in the later part of the appendix. These results hold as equalities in the channel level and do not depend on the states these channels are acting on.

Remark 1. The β-swap series $\beta^{(k,l)}\beta^{(k,l)} = \mathcal{M}_{\lambda}^{(k,l)}$ for some $\lambda\neq 0,1$, and thus always produces a non-extreme point except for in the trivial case where $E_k = E_l$. In that trivial case, two repeated swaps always result in identity, and λ = 0.

Remark 2. If $E_k = E_l$ for some levels $k,l$, $\beta^{(k,l)}$ always connect one extreme point of $\mathcal{S}_\mathrm{ETO}$ to another.

Proof. If this statement does not hold, there exists a state $\mathrm{q}$ that is extremal for $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ but $\beta^{(k,l)}\mathrm{q} = \sum_i\lambda_i \mathrm{r}_i$ for some $\lambda_i\unicode{x2A7E} 0$ and $\mathrm{r}_i\in\mathcal{S}_\mathrm{ETO}(\mathrm{p})$. But then $\mathrm{q} = (\beta^{(k,l)})^2\mathrm{q} = \sum_i\lambda_i\beta^{(k,l)}\mathrm{r}_i$. Since $\beta^{(k,l)}\mathrm{r}_i\in\mathcal{S}_\mathrm{ETO}(\mathrm{p})$, $\mathrm{q}$ cannot be extremal. □

Remark 3. When $k,l,m,n$ are all distinct, β-swaps commute, i.e.

Equation (B.9)

Remark 4. If $E_{k}\unicode{x2A7D} E_{l}\unicode{x2A7D} E_{m}$,

Equation (B.10)

The equality is obtained through direct calculations,

Equation (B.11)

where we omit identities acting on irrelevant levels when writing ETO maps (and do so consistently in the rest of the appendix for notational brevity).

B.3. Proof of lemma 3

Proof. For the first part of the lemma: by equation (B.8), a single neighbouring swap $ \beta^{\pi_{j},\pi_{j+1}} $gives us

Equation (B.12)

which, combined with equations (B.2)–(B.4), yields

Equation (B.13)

Each $ (a_m,\mathcal{L}_\mathrm{q}(a_{m})) $ corresponds to an elbow point of $ \mathcal{L}_\mathrm{q} $, which coincides with $ \mathcal{L}_\mathrm{p} $, implying that $\mathrm{q}$ is tightly thermomajorised by $\mathrm{p}$.

For the second case when $\beta^{(\pi_{i},\pi_{i+c})}$ acts on non-neighbouring levels (c > 1) in $\mathrm{p}$, the change in $ {\pi}(\mathrm{q}) $ becomes a little less straightforward. That is,

Equation (B.14)

in general. First suppose $ E_{\pi_{i}} \lt E_{\pi_{i+c}}$. Then

Equation (B.15)

which implies $ \pi_{i+c} = {\pi}(\mathrm{q})_{i} $ and $ \pi_{i} = {\pi}(\mathrm{q})_{i+c^{^{\prime}}} $ with $ 1\unicode{x2A7D} c^{^{\prime}}\unicode{x2A7D} c $. Here we imposed

Equation (B.16)

to make πi and $ \pi_{i+c} $ truly non-neighbouring. From the first equality of equation (B.15), $ \mathcal{L}_{\mathrm{q}}(a_{i}) = \mathcal{L}_{\mathrm{p}}(a_{i}) $ follows. However, $ (i+1) $th elbow for $ \mathcal{L}_\mathrm{q} $ is strictly separated from $ \mathcal{L}_\mathrm{p} $, which is sufficient to prove the second part of the lemma. We demonstrate this for each possible case:

Case i: $a_{i+1} - a_{i-1} \unicode{x2A7D} {\tau}^\beta_{\pi_{i}}$. Since $\mathrm{g}(\mathrm{p})_{\pi_{i}}\gt\mathrm{g}(\mathrm{q})_{{\pi}(\mathrm{q})_{i+1}}$,

Equation (B.17)

Case ii-a): $a_{i+1} - a_{i-1} \gt {\tau}^\beta_{\pi_{i}}$ and $c^{^{\prime}}\neq1$. Then $ {\pi}(\mathrm{q})_{i+1} = \pi_{i+1}$. Using $\mathrm{g}(\mathrm{p})_{\pi_{i}} \gt \mathrm{g}(\mathrm{p})_{\pi_{i+1}}$,

Equation (B.18)

Case ii-b): $ {\pi}(\mathrm{q})_{i+1} = \pi_{i} $, i.e. $a_{i+1} = a_i + {\tau}^\beta_{\pi_{i}}$. From $p_{\pi_{i}}+p_{\pi_{i+c}} = q_{\pi_{i}}+q_{\pi_{i+c}}$,

Equation (B.19)

The strict concavity of $\mathcal{L}_\mathrm{p}$ (equation (B.16)) imposes

Equation (B.20)

which leads to $\mathcal{L}_\mathrm{p}(a_{i+1})\gt \mathcal{L}_\mathrm{q}(a_{i+1})$.

The argument above is easily generalizable for $E_{\pi_{i}}\gt E_{\pi_{i+c}}$, so this concludes the proof. □

B.4. Proof of lemma 6

Here, we want to prove that a β-swap involving two levels $ E_j,E_k $ produces an extreme point only if its sole effect on the β-ordering on the final state is a swap of j and k. This technical result is later used in establishing lemma 10, a key tool used throughout in several subsequent proofs. We start by proving the lemma for the case of $\mathrm{p}\in\mathcal{V}^3$. Denote the initial β-order as ${\pi}(\mathrm{p}) = (\pi_1,\pi_2,\pi_3)$. Consider the following three cases:

  • (i)  
    Suppose $E_{\pi_1} \lt E_{\pi_3}$ and ${\pi}(\mathrm{q}) = (\pi_3,\pi_1,\pi_2)$, where $\mathrm{q} = \beta^{(\pi_1,\pi_3)}\mathrm{p}$. The same final ordering is obtained after two neighbouring swaps, i.e.
    Equation (B.21)
    Note that $q_{\pi_3} = q^{^{\prime}}_{\pi_3}$ (this is seen from equations (9) and (B.1)). Therefore, the thermomajorisation curves $ \mathcal{L}_\mathrm{q},\mathcal{L}_{\mathrm{q}^{^{\prime}}}$ are identical up to the first elbow
    Equation (B.22)
    The third elbow also coincides as $\mathcal{L}_\mathrm{q}(1) = \mathcal{L}_{\mathrm{q}^{^{\prime}}}(1) = 1$. Finally, the second elbow points of $\mathrm{q}$ and $\mathrm{q}^{^{\prime}}$ curves are given by
    Equation (B.23)
    from $q^{^{\prime}}_{\pi_2} = (\beta^{(\pi_2,\pi_3)}\mathrm{p})_{\pi_2}\lt p_{\pi_2}$. Therefore, $\mathrm{q}^{^{\prime}}$ strictly thermomajorises $\mathrm{q}$ and the non-extremity of $\mathrm{q}$ then follows from lemma 2.
  • (ii)  
    For $E_{\pi_3} \gt E_{\pi_1}$, comparison between $\mathrm{q}$ with ${\pi}(\mathrm{q}) = (\pi_2,\pi_3,\pi_1)$ and $\mathrm{q}^{^{\prime}} = \beta^{(\pi_3,\pi_1)}\beta^{(\pi_1,\pi_2)}\mathrm{p}$ gives the same result.
  • (iii)  
    If $E_{\pi_3} = E_{\pi_1}$, we always get $ {\pi}(\mathrm{q}) = (\pi_3,\pi_2,\pi_1) $.

Note that from equations (B.2)–(B.5), cases 1 and 2 cover all possible ways of obtaining $ {\pi}(\beta^{\pi_{1},\pi_{3}}\mathrm{p}) \neq S_{1,3}({\pi}(\mathrm{p})) $.

For the general case of $\mathrm{p}\in\mathcal{V}^d$, if ${\pi}(\mathrm{q} = \beta^{(\pi_i,\pi_{i+c})}\mathrm{p}) \neq S_{i,i+c}({\pi}(\mathrm{p}))$, then the equivalents of $ \mathrm{q}^{^{\prime}} $ above can be chosen as follows:

  • (i)  
    If $E_{\pi_i}\lt E_{\pi_{i+c}}$ and $ \pi_{i}\neq {\pi}(\mathrm{q})_{i+c} $, then $ \mathrm{q}^{^{\prime}} = \beta^{(\pi_i,\pi_{i+c})}\beta^{(\pi_{i+c-1},\pi_{i+c})}\mathrm{p} \succ_\beta \mathrm{q}$. Although $ {\pi}(\mathrm{q})\neq{\pi}(\mathrm{q}^{^{\prime}}) $ in general, $ \mathrm{q} $ can always be obtained from $ \mathrm{q}^{^{\prime}} $ by partial level thermalisation between πi and $ \pi_{i+c-1} $. To see this, notice that $ q_{\pi_{k}} = q^{^{\prime}}_{\pi_{k}}$, $\forall k\neq i,i+c-1 $, i.e.
    Equation (B.24)
    Using the same argument to equation (B.23), $ q^{^{\prime}}_{\pi_{i}}\gt q_{\pi_{i}} $, and thus
    Equation (B.25)
    Combining with equation (B.24), $ \mathrm{q} $ is obtained $ \mathrm{q}^{^{\prime}} $ by partial thermalisation between levels i and $ i+c-1 $.
  • (ii)  
    If $E_{\pi_i}\gt E_{\pi_{i+c}}$ and $ \pi_{i+c}\neq{\pi}(\mathrm{q})_{i} $, then $ \mathrm{q}^{^{\prime}} = \beta^{(\pi_{i+c},\pi_{i})}\beta^{(\pi_{i},\pi_{i+1})}\mathrm{p} \succ_\beta \mathrm{q}$. Likewise, $ q_{\pi_{k}} = q^{^{\prime}}_{\pi_{k}}$, $\forall k\neq i+1,i+c $, and $ q_{\pi_{i+1}} + q_{\pi_{i+c}} = q^{^{\prime}}_{\pi_{i+1}} + q^{^{\prime}}_{\pi_{i+c}}$. With
    Equation (B.26)
    $ \mathrm{q} $ is obtained $ \mathrm{q}^{^{\prime}} $ by partial thermalisation between levels i + 1 and i + c.
  • (iii)  
    If $ E_{\pi_{i}} = E_{\pi_{i+c}} $, we always get $ \pi_{i} = {\pi}(\mathrm{q})_{i+c} $.

Therefore, the state $ \mathrm{q} $ with order ${\pi}(\mathrm{q}) \neq S_{i,i+c}({\pi}(\mathrm{p}))$ is always non-extremal in $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $.

B.5. Each β-order has at least one $ \mathcal{S}_\mathrm{ETO} $ vertex: proof of lemma 7

In this section, we show that it is impossible to have a β-order that has no vertex of $ \mathcal{S}_\mathrm{ETO} $. Lemma 7 hints the lower bound scaling of the number of extreme points in worst cases.

Proof. To start, we construct a series of sets $ S_d\subset S_{d-1}\subset\cdots\subset S_0 $ defined as follows:

  • $ S_0 = \{\mathrm{q}\vert \mathrm{q}\in\mathcal{S}_\mathrm{ETO}(\mathrm{p})\ \text{and}\ {\pi}(\mathrm{q}) = \psi\} $,
  • $ S_j = \{\mathrm{q}\vert\mathrm{q}\in S_{j-1}\ \text{and}\ q_{\psi_j}\unicode{x2A7E} q^\prime_{\psi_j}, \forall \mathrm{q}^{\prime}\in S_{j-1}\} $ for $ 1\unicode{x2A7D} j\unicode{x2A7D} d $.

In other words, S1 is the set of states in $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ with a specific β-order ψ, and with maximal ψ1 population $ r_{\psi_1} $. Likewise, S2 is a subset of S1, having additionally the maximal ψ2 population, and so on. Note that Sd always has a single element for each fixed choice of ψ, which we denote as $ \mathrm{r} $ 7 .

Suppose there is no extreme point of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ corresponding to ψ. Then $ \mathrm{r} $ can be written as a strict convex combination of extreme states $ \mathrm{e}^{(i)} $, i.e.

Equation (B.27)

Starting from j = 1, check the following:

  • (i)  
    For j > 1, we have $ e^{(i)}_{\psi_k} = r_{\psi_{k}} $, $ \forall k\lt j $ from the last iteration. For j = 1, we do not need any condition yet.
  • (ii)  
    If $ e^{(i)}_{\psi_j}\gt r_{\psi_j} $ for some i, a state $ \mathrm{e}^{\prime} = \overrightarrow{\beta}\mathrm{e}^{(i)} $ with $ {\pi}(\mathrm{e}^{\prime}) = \psi $ and $ e^\prime_{\psi_k} = e^{(i)}_{\psi_k} $, $ \forall k\unicode{x2A7D} j $ can be found. Let us show how to do this:
    • If $ \mathrm{g}(\mathrm{e}^{(i)})_{\psi_{j-1}} = \mathrm{g}(\mathrm{r})_{\psi_{j-1}} \unicode{x2A7E} \mathrm{g}(\mathrm{e}^{(i)})_{\psi_{j}} \gt\mathrm{g}(\mathrm{r})_{\psi_{j}}$, we can simply thermalise all the levels ψk , $ \forall k\gt j $ of $ \mathrm{e}^{(i)} $ to have the same slope, which is smaller than $ \mathrm{g}(\mathrm{e}^{(i)})_{\psi_{j}} $. Then we obtain the desired state $ \mathrm{e}^{^{\prime}} $, since levels with degenerate slopes—all $ \psi_{k\gt j} $ in this case—can be permuted within themselves in the β-order.
    • If $ \mathrm{g}(\mathrm{e}^{(i)})_{\psi_{j}}\gt \mathrm{g}(\mathrm{r})_{\psi_{j-1}} \gt\mathrm{g}(\mathrm{r})_{\psi_{j}}$ 8 , we can first reduce $ e^{(i)}_{\psi_{j}} $ by partially thermalising with populations of levels $ \psi_{k\gt j} $ until $ \mathrm{g}(\mathrm{r})_{\psi_{j-1}} \unicode{x2A7E} e^{^{\prime}}_{\psi_{j}} \gt \mathrm{g}(\mathrm{r})_{\psi_{j}} $. Then, as in the previous case, thermalising all the levels $ \psi_{k\gt j} $ will give $ \mathrm{e}^{^{\prime}} $.
    However, such $ \mathrm{e}^{^{\prime}} $ satisfies $ \mathrm{e}^{\prime}\in S_{j-1} $ and $ e^\prime_{\psi_j}\gt r_{\psi_j} $, which contradicts the assumption that $ \mathrm{r} \in S_j $.
  • (iii)  
    If $ e^{(i)}_{\psi_j}\unicode{x2A7D} r_{\psi_j} $ for all i, from convexity of the combination, $ e^{(i)}_{\psi_j} = r_{\psi_j} $ for all i. Proceed to $ j\rightarrow j+1 $.

If $ e^{(i)}_{\psi_j} = r_{\psi_j} $ for all i and j, $ \mathrm{e}^{(i)} = \mathrm{r} $, which contradicts the assumption that $ \mathrm{r} $ is not extremal. □

Appendix C: Full characterisation of elementary thermal operations for d = 3 and the proof of theorem 5

To prove theorem 5, we first establish three preliminary results for d = 3.

  • Remark 5 identifies two-neighbouring-swap series that generate $\mathcal{S}_{\mathrm{TO}}$ extreme points, which are uniquely extremal for $\mathcal{S}_\mathrm{ETO}$ in their orders by corollary 4. This remark is used repeatedly to prove the other two lemmas.
  • Lemma 10 shows that non-neighbouring swaps cannot be used in series with more than one swap.
  • Lemma 11 sets the maximum length of swap series to be three.

The set of candidates for extreme points is achieved after ruling out all the swaps yielding provably non-extreme states.

Remark 5. For $\mathrm{p}\in\mathcal{V}^3$,

  • (i)  
    the states $\beta^{(2,3)}\beta^{(1,2)}\mathrm{p}$ and $\beta^{(2,3)}\beta^{(1,3)}\mathrm{p} $ are extreme points of $\mathcal{S}_{\mathrm{TO}}(\mathrm{p})$, if ${\pi}(\mathrm{p}) = (2,1,3)$ or $(3,1,2)$;
  • (ii)  
    the state $\beta^{(1,3)}\beta^{(2,3)}\mathrm{p}$ is an extreme point of $\mathcal{S}_{\mathrm{TO}}(\mathrm{p})$, if ${\pi}(\mathrm{p}) = (1,2,3)$ or $(3,2,1)$; and
  • (iii)  
    the state $\beta^{(1,2)}\beta^{(2,3)}\mathrm{p}$ is an extreme point of $\mathcal{S}_{\mathrm{TO}}(\mathrm{p})$, if ${\pi}(\mathrm{p}) = (1,3,2)$ or $(2,3,1)$.

Proof. Direct calculation gives

Equation (C.1)

By using the algorithm in definition 6 of [66], one can verify that the above channels are biplanar extreme points of the set of thermal processes. According to theorem 4 in [66], such biplanar extremal channels generate extreme points of $\mathcal{S}_{\mathrm{TO}}(\mathrm{p})$, when the initial state corresponds to a particular β-ordering that can be found in the process of decomposing the graph structure of the channel matrix. Performing this procedure according to [66] reveals that for $ \beta^{(2,3)}\beta^{(1,2)} $ and $ \beta^{(2,3)}\beta^{(1,3)} $, the relevant input state β-order is given by $ (2,1,3) $ and $ (3,1,2) $; similarly for statements 2 & 3 in remark. □

Lemma 10. Given $\mathrm{p}\in\mathcal{V}^3$ with ${\pi}(\mathrm{p}) = (\pi_1,\pi_2,\pi_3)$, $\overrightarrow{\beta}\mathrm{p}$ is extremal for $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ only if i) $\overrightarrow{\beta}$ is always neighbouring when applied to $\mathrm{p}$ or ii) $\overrightarrow{\beta} = \beta^{(\pi_1,\pi_3)}$.

Proof. To prove this, we need to show that i) a neighbouring swap following a non-neighbouring swap produces non-extreme point and ii) a non-neighbouring swap following a neighbouring one also yields a non-extreme point. Since only extreme points are of our interest, using lemma 6, we can safely assume that $ {\pi}(\beta^{(\pi_{i},\pi_{j})}\mathrm{p}) = S_{i,j}(\pi) $ for any $ \pi = {\pi}(\mathrm{p}) $ and $ i,j $. We tackle each problem by further dividing cases.

Case i-(a): ${\pi} = (1,2,3)$ or $(3,2,1)$ experiencing a neighbouring swap followed by a non-neighbouring swap. Swap $ \beta^{(2,3)}\beta^{(1,2)} $ produces final states with order $ (3,1,2) $ or $ (2,1,3) $. From remark 5, these orders have a unique extreme point produced from $ \beta^{(1,3)}\beta^{(2,3)} $. Another swap gives

Equation (C.2)

whereas two consecutive neighbouring swaps give

Equation (C.3)

with order $ (2,3,1) $ or $ (1,3,2) $ depending on the initial order. Note that $ r_{2} = q_{2} $ and

Equation (C.4)

i.e. $ q_{3} \unicode{x2A7E} r_{3} $ if initial $ \pi = (1,2,3) $, and $ q_{3} \unicode{x2A7D} r_{3} $ if initial $ \pi = (3,2,1) $. Either way, $ \mathrm{q}\succ_\beta\mathrm{r} $ while $ {\pi}(\mathrm{q}) = {\pi}(\mathrm{r}) $ and thus $ \mathrm{r} $ cannot be extremal from lemma 2.

Case i-(b): ${\pi} = (1,2,3)$ or $(3,2,1)$ experiencing a non-neighbouring swap followed by a neighbouring swap. We can in fact prove that non-neighbouring swap already always produces non-extreme points. Compare two states

Equation (C.5)

Equation (C.6)

with ${\pi}(\mathrm{r}) = {\pi}(\mathrm{q}) = (3,2,1)$ or $ (1,2,3) $ depending on the initial π. Then we may observe the following:

Equation (C.7)

Equation (C.8)

Equation (C.9)

i.e. $\mathrm{q} \succ_{\beta} \mathrm{r}$ and thus $\mathrm{r}$ is not extremal for $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$.

Case ii-(a): ${\pi} = (1,3,2)$ or $(2,3,1)$ experiencing a neighbouring swap followed by a non-neighbouring swap. The two possible neighbouring swaps for these initial orders are $\beta^{(2,3)}$ and $\beta^{(1,3)}$. First, consider the neighbouring swap $\beta^{(2,3)}$: this modifies the order into $(1,2,3)$ or $(3,2,1)$ and Case i-(b) forbids a non-neighbouring swap to come next. The other neighbouring and non-neighbouring swap pair $\beta^{(2,3)}\beta^{(1,3)}$ gives output orders $(2,1,3)$ or $(3,1,2)$. But remark 5 states that $\beta^{(1,2)}\beta^{(2,3)}\mathrm{p}$ is a unique extreme point for that output order.

Case ii-(b): ${\pi} = (1,3,2)$ or $(2,3,1)$ experiencing a non-neighbouring swap followed by a neighbouring swap. After a non-neighbouring swap, ${\pi}(\beta^{(1,2)}\mathrm{p}) = (2,3,1)$ or $(1,3,2)$. If a following neighbouring swap is $\beta^{(1,3)}$, the resulting orders are $(2,1,3)$ or $(3,1,2)$, which again cannot be extremal from the remark 5. The remaining possibility is to apply $\beta^{(2,3)}$, which results in

Equation (C.10)

Compare this with a state having the same β-order $ {\pi}(\mathrm{r}) = {\pi}(\mathrm{q}) = (3,2,1) $ or $ (1,2,3) $,

Equation (C.11)

Then

Equation (C.12)

for initial $\pi = (2,1,3)$ and becomes positive for $(3,1,2)$. Plus, $ q_3 = r_3 $, which in turn gives $\mathrm{q}\succ_{\beta}\mathrm{r}$; thus, $\mathrm{r}$ is not extremal for $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$.

Case iii-(a): ${\pi} = (2,1,3)$ or $(3,1,2)$ experiencing a neighbouring swap followed by a non-neighbouring swap. If the first neighbouring swap is $\beta^{(1,2)}$, the output β-order becomes $ (1,2,3) $ or $ (3,2,1) $, which does not allow non-neighbouring swap to follow as stated in Case i-(b). The other series, $\beta^{(1,2)}\beta^{(1,3)}$ outputs orders $(1,3,2)$ or $(2,3,1)$, but these orders have unique extreme points for $\mathcal{S}_\mathrm{ETO}$ given by remark 5.

Case iii-(b): ${\pi} = (2,1,3)$ or $(3,1,2)$ experiencing a non-neighbouring swap followed by a neighbouring swap. Two candidate series are $\beta^{(1,3)}\beta^{(2,3)}$ and $\beta^{(1,2)}\beta^{(2,3)}$, which respectively produces orders $(1,3,2)$ and $(3,2,1)$ when applied to an initial state with $\pi = (2,1,3)$; $(2,3,1)$ and $(1,2,3)$ when applied to $\pi = (3,1,2)$. All output states obtained here have different unique extreme points for $\mathcal{S}_\mathrm{ETO}$ given in remark 5, and the states generated by the considered swaps are therefore non-extremal.

We exhausted all possible cases and none of the swaps can create an extreme point of $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$. □

Lemma 11. For $\mathrm{p}\in\mathcal{V}^3$ having non-degenerate energy levels, $\prod_{i = 1}^{l}\beta_i\mathrm{p}$ can be extremal in $\mathcal{S}_\mathrm{ETO}(\mathrm{p})$ only when $l\unicode{x2A7D} 3$.

Proof. We prove this lemma by showing l = 4 swaps always produce non-extreme states. From lemma 10, non-neighbouring swaps do not need to be considered. Then the remaining two final states are $\mathrm{q}^{(1,2,1,2)}$ and $\mathrm{q}^{(2,1,2,1)}$.

Case i: ${\pi}(\mathrm{p}) = (1,2,3)$ or $(3,2,1)$. There are only three distinct β-swaps for three-dimensional systems. Then length-four series always include a repetition of the same swap, and theorem 8 forbids such series to make extreme points.

Case ii: ${\pi}(\mathrm{p}) = (1,3,2)$ or $(2,3,1)$. Corresponding swaps are $\beta^{(1,3)}\beta^{(2,3)}\beta^{(1,2)}\beta^{(1,3)}$ or $\beta^{(2,3)}\beta^{(1,3)}\beta^{(1,2)}\beta^{(2,3)}$. From remark 4, the second swap becomes

Equation (C.13)

and thus the channel itself is non-extremal. If the first swap is applied, the resulting orders are $(2,1,3)$ for $\pi = (1,3,2)$ and $(3,1,2)$ for $\pi = (2,3,1)$. Different states obtained from remark 5 are known to be uniquely extremal for these orders, rendering the states after the first swap non-extremal.

Case iii: ${\pi}(\mathrm{p}) = (2,1,3)$ or $(3,1,2)$. Corresponding output orders are $(1,3,2)$ and $(3,2,1)$ for $\pi = (2,1,3)$; $(2,3,1)$ and $(1,2,3)$ for $\pi = (3,1,2)$. Again, remark 5 provides different states that are uniquely extremal in the orders above. □

When there is a degeneracy in energy levels, e.g. $E_k = E_l$, the equality $\beta^{(k,l)}\beta^{(k,m)} = \beta^{(l,m)}\beta^{(k,l)}$ let us rearrange the swap to make $\beta^{(k,l)}$ appear only at the end of the series. From $(\beta^{(k,l)})^2 = \unicode{x1D7D9}$, we can always make $\beta^{(k,l)}$ never appear or appear only once in $\overrightarrow{\beta}$. After this reduction, lemma 11 also holds for degenerate energy systems.

Ruling out series with length $\gt1$ having non-neighbouring swaps (lemma 10) and series with length l > 3 (lemma 11), we are only left with $\beta^{(\pi_1,\pi_3)}$ and all-neighbouring swaps with length $\unicode{x2A7D} 3$. $\Theta_\mathrm{ETO}^{(3)}(\mathrm{p}) \cup \Xi_\mathrm{ETO}^{(3)}(\mathrm{p})$ is the collection of all states after such swaps, and this proves $\mathrm{Ext}(\mathcal{S}_\mathrm{ETO})~\subset~\Theta_\mathrm{ETO}^{(3)}(\mathrm{p}) \cup \Xi_\mathrm{ETO}^{(3)}(\mathrm{p})$ part of theorem 5. To prove the other part, notice that elements of $\Xi_\mathrm{ETO}^{(3)}(\mathrm{p})$ all have the same β-order $(\pi_3,\pi_2,\pi_1)$, while the ones in $\Theta_\mathrm{ETO}^{(3)}(\mathrm{p})$ all have distinct β-orders different from $(\pi_3,\pi_2,\pi_1)$. Since each β-order possesses at least one extreme state (lemma 7), all $\Theta_\mathrm{ETO}^{(3)}(\mathrm{p})$ elements should be extremal, which concludes the proof.

Appendix D: Simplifications for initial orderings monotonic in energy levels: proof of theorem 8

The technical proof of theorem 8 can be sketched in the following steps:

  • Firstly, we introduce a specific series of β-swaps that transforms an initial β-ordering to a target ordering (definition appendix D.1).
  • This structure, which we refer to as the standard formation, is then shown to be equivalent to any β-swap series, where (i) all swaps are neighbouring to initial states of the form equation (19) and (ii) each swap is applied at most once (lemma 12).
  • Finally, to prove theorem 8, we show that whenever the initial β-ordering is monotonic in energy, then a transformation that is not according to a standard formation always leads to a non-extreme state. This Lemma allows us to conclude that the number of extreme points for such a $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ is at most $ d! $, similar to that of $ \mathcal{S}_{\mathrm{TO}} (\mathrm{p})$.

Definition D.1 (standard formation). Given a tuple of d-dimensional β-orderings $ (\pi,\pi^{^{\prime}}) $, a standard formation is a β-swap series $\overrightarrow{\beta}_\mathrm{sf}$ that transforms an initial state $ \mathrm{p} $ with an order π into some final state $ \mathrm{p}^{^{\prime}} $ having an order $\pi^{\prime}$, with the construction below:

  • (i)  
    Set an initial index of j = 1, and identify m such that $\pi^{\prime}_j = \pi_{m}$. If m = j, define $\overrightarrow{\beta}^{(j)}$ as an identity. Otherwise, since $ \pi_{1},\ldots,\pi_{m-1} $ are already occupied by $ \pi^{^{\prime}}_{1},\ldots,\pi^{^{\prime}}_{m-1} $, we get m > j. Then, define a swap-series $\overrightarrow{\beta}^{(j)} = \beta^{(\pi_{j},\pi_{m})}\beta^{(\pi_{j+1},\pi_{m})}\cdots\beta^{(\pi_{m-1},\pi_{m})}$. Note that these swaps are always neighbouring when applied to a state initially having the order π, due to equation (B.8). After j = 1 round, this swap series will take the initial ordering π to the new ordering $ (\pi_1^{^{\prime}},\pi_1,\ldots,\pi_{m-1},\pi_{m+1},\ldots,\pi_d) $ if $ \overrightarrow{\beta}^{(1)}\neq\unicode{x1D7D9} $.
  • (ii)  
    Iterate the above step for $ j = 2,\ldots,d-1 $, defining $\lbrace \overrightarrow{\beta}^{(j)}\rbrace_{j = 1}^{d-1}$.

The standard formation series is then simply the concatenation

Equation (D.1)

By construction, there is no repetition of a swap in this series and they are all neighbouring when applied to an initial state with the order π.

An example of the standard formation

For the orderings π = 1234 and $ \pi^{^{\prime}} = 4231 $, the standard formation is $ \overrightarrow{\beta}_\mathrm{sf} = \overrightarrow{\beta}^{(3)}\overrightarrow{\beta}^{(2)}\overrightarrow{\beta}^{(1)} $, where

Equation (D.2)

The intermediate β-orderings given by this process are

Equation (D.3)

This formation has a nice property, namely all swaps in each block $\overrightarrow{\beta}^{(j)}$ acts on level $\pi^{\prime}_j$ and the ones in $\overrightarrow{\beta}^{(k)}$ for any k > j does not act on level $\pi^{\prime}_j$. In the lemma below, we illustrate how certain classes of swap series, which turns out to be the ones producing extreme states, can always rearranged into a standard formulation.

Lemma 12. Given an initial state $ \mathrm{p} $ with ordering π monotonic in energy, denote a β-swap series

Equation (D.4)

and $ \pi^{^{\prime}} $ to be the final β-ordering of the state $\mathrm{p}^{^{\prime}} = \overrightarrow{\beta}\mathrm{p} $. If $ \overrightarrow{\beta} $ is such that:

  • (1)  
    each βi is a distinct swap, and
  • (2)  
    when applied to $ \mathrm{p} $, is always a neighbouring swap,

then $ \overrightarrow{\beta} $ can always be expressed in the form of a standard formation for $ (\pi,\pi^{^{\prime}}) $.

Proof. We prove this by induction. Suppose the above lemma is true for $\mathrm{p}\in\mathcal{V}^{d-1}$. The first goal is to prove this for initial ordering $\pi = (1,2,\ldots,d)$ and $ \overrightarrow{\beta} $ that satisfies the conditions in the statement of the lemma. By identifying the first and the last swaps acting on the level d, one can decompose $ \overrightarrow{\beta} $ into

Equation (D.5)

Here, $\overrightarrow{\beta}^{(\text{Pre})}$ are the swaps coming before the first swap acting on the level d, and $\overrightarrow{\beta}^{(\text{Post})}$ are the ones after the last swap acting on the level d.

Case i: $d = \pi^\prime_{m}$, where m ≠ 1 is the position of level d in the β-ordering of the final state $ \mathrm{p}^{^{\prime}} $. To make a rearrangement of swaps, we first remark a few points using sets $A = \{i\vert\mathrm{g}_i\gt\mathrm{g}_d\}$ and $B = \{i\vert\mathrm{g}_i\lt\mathrm{g}_d\}$. We will update these sets after each swap. At the beginning, there is no element in B and all the other levels except d are in the set A. Next, we note the following:

  • (i)  
    Swapping $i\in A$ and $j\in B$ is not allowed, since they are non-neighbouring.
  • (ii)  
    Any $i\in A$ can move to B only when $ \beta^{(i,d)} $ is implemented.
  • (iii)  
    Since initially $ B = \emptyset $, any given level either stays in A at all times, or it moves to B at some point and remains so thereafter. This comes from the restriction that in order for i to move between A and B, the swap $ \beta^{(i,d)}$ must be used.
  • (iv)  
    If $k,l \in A$ when $\beta^{(k,l)}$ is applied, $\beta^{(k,l)}$ precedes $\beta^{(k,d)}$ and $\beta^{(l,d)}$ when they exist in $\overrightarrow{\beta}$.

The sets A and B after the whole transformation is determined by the target β-ordering $ \pi^{^{\prime}} $, where we denote them as $A_f = \{\pi^{\prime}_i\vert i\lt m\}$ and $B_f = \{\pi^{\prime}_i\vert i\gt m\}$.

Given the constraints above, we know that $ \overrightarrow{\beta} $ describes a special process. See figure D1, for instance, for a visualisation of this operation. A and B are separated by level d (point 1 above). Starting from $B = \emptyset$, some elements $i\in A$ are transferred to B whenever $ \beta^{(i,d)} $ is implemented. Once this happens, i cannot go back to A, since it would require the repetition of $ \beta^{(i,d)} $ to do so. Visually, this is understood by saying that the bar representing level d in figure D1 is penetrable from the left only. At the end, $A = A_f$ and $B = B_f$, where elements of Af never passed through level d (point 3 above). Lastly, if $k,l\in A$, then they have not experienced a swap with level d yet, explaining the point 4 above.

The next step we want to show is that w.l.o.g. a rearrangement, where $ \overrightarrow{\beta}^\mathrm{(Pre)} $ contains all β-swaps that are part of $ \overrightarrow{\beta} $, acting on elements $ i\in A_f $, but not involving level d, is possible. To do so, we identify all swaps $\beta^{(k,l)}$ in $\overrightarrow{\beta}^{(\text{Post})}\overrightarrow{\beta}^{(d-\text{rel})}$ such that $k,l\in A$ at the time of swap. From the rightmost one, make a decomposition

Equation (D.6)

Notice that $\overrightarrow{\beta}^{(b)}$ only contains swaps among $B\cup\{d\}$, which includes neither k nor l. From remark 3, we then have that $ \beta^{(k,l)} $ and $ \overrightarrow{\beta}^{(b)}$ commute, i.e. $\beta^{(k,l)}\overrightarrow{\beta}^{(b)} = \overrightarrow{\beta}^{(b)}\beta^{(k,l)}$. Therefore, w.l.o.g.

Equation (D.7)

$ \beta^{(k,l)} $ can be integrated into $ \beta^{(\text{Pre})} $ to update $ \beta^{(\text{Pre})} \rightarrow \beta^{(k,l)}\beta^{(\text{Pre})} $ and $ \overrightarrow{\beta}^{(\text{Post})}\overrightarrow{\beta}^{(d-\text{rel})} \rightarrow \overrightarrow{\beta}^{(a)}\overrightarrow{\beta}^{(b)} $.

If $k\in A_f$, the existence of $\beta^{(k,l)}$ in $\overrightarrow{\beta}$ indicates $l\in A$ when the swap is applied. Hence, by repeating this until the end, all swaps $\beta^{(k,l)}$ with $k\in A_f$ are merged into $\overrightarrow{\beta}^{(\text{Pre})}$. Since $\overrightarrow{\beta}^{(\text{Pre})}$ does not contain swaps acting on level d, it acts on at most d − 1 levels and can be reordered in the standard formation by the assumption that the lemma holds for states in $ \mathcal{V}^{d-1} $. Until now, there is no swap acting on level d and thus $ d = {\pi}(\overrightarrow{\beta}^{(\text{Pre})}\mathrm{p})_{d} $. Then levels in Bf , which should be swapped with d in $ \overrightarrow{\beta}^{(d-\text{rel})} $, occupy later d − m slots in the β-order: $ {\pi}(\overrightarrow{\beta}^{(\text{Pre})}\mathrm{p})_{m}, {\pi}(\overrightarrow{\beta}^{(\text{Pre})}\mathrm{p})_{m+1},\ldots,{\pi}(\overrightarrow{\beta}^{(\text{Pre})}\mathrm{p})_{d-1}$. By construction of the standard formation, $\overrightarrow{\beta}^{(\text{Pre})}$ then can be decomposed into $\overrightarrow{\beta}^{(\text{Pre})} = \overrightarrow{\beta}^{(B_f)}\overrightarrow{\beta}^{(A_f)}$ with $\overrightarrow{\beta}^{(B_f)}$ swapping only between Bf elements.

Finally, $\overrightarrow{\beta}^{(\text{Post})}\overrightarrow{\beta}^{(d-\text{rel})}\overrightarrow{\beta}^{(B_f)}$ consists of swaps among the levels in $B_f\cup \{d\}$ ($d-m+1\lt d$ elements). Again, by assumption this swap can be rearranged as a standard formation. Concatenating standardised series $\overrightarrow{\beta}^{(\text{Post})}\overrightarrow{\beta}^{(d-\text{rel})}\overrightarrow{\beta}^{(B_f)}$ and $ \overrightarrow{\beta}^{(A_f)} $, we obtain the standard formation for the entire series.

Case ii: $d = \pi^{\prime}_1$. The only difference here is that level d swaps with every other level and Af is an empty group. Again, we locate $\beta^{(k,l)}$ such that $\overrightarrow{\beta}^{(\text{Post})}\overrightarrow{\beta}^{(d-\text{rel})} = \overrightarrow{\beta}^{(a)}\beta^{(k,l)}\overrightarrow{\beta}^{(b)}$ from the rightmost swap. If $k,l\in A$ after $\overrightarrow{\beta}^{(b)}$, move $\beta^{(k,l)}$ to be included in $\overrightarrow{\beta}^{(\text{Pre})}$ as before. In addition, repeat this process for $ \overrightarrow{\beta}^{(d-\text{rel})} $ but starting from the leftmost swap $\beta^{(k,l)}$ with $k,l\neq d$ in $\overrightarrow{\beta}^{(d-\text{rel})} = \overrightarrow{\beta}^{(a)}\beta^{(k,l)}\overrightarrow{\beta}^{(b)}$. Since we already moved all $k,l\in A$ swaps to go before $ \overrightarrow{\beta}^{(d-\text{rel})} $, at the point of swap $ \beta^{(k,l)} $ all $ k,l\in B $, which leads to the equality $\overrightarrow{\beta}^{(a)}\beta^{(k,l)} = \beta^{(k,l)}\overrightarrow{\beta}^{(a)}$ and enables $ \overrightarrow{\beta}^{(\text{Post})} \rightarrow \overrightarrow{\beta}^{(\text{Post})}\overrightarrow{\beta}^{(k,l)} $. After merging all such $\beta^{(k,l)}$ into $\overrightarrow{\beta}^{(\text{Pre})}$ or $\overrightarrow{\beta}^{(\text{Post})}$, we get

Equation (D.8)

where $\delta = (\delta_1,\delta_2,\ldots,\delta_{d-1},d)$ is the β-order after $\overrightarrow{\beta}^{(\text{Pre})}$. Both $\overrightarrow{\beta}^{(\text{Pre})}$ and $\overrightarrow{\beta}^{(\text{Post})}$ act at most d − 1 levels, and can be modified into the standard formation. Now to put the entire series into the standard formation, δ need to be rearranged. We do this starting from j = 1.

  • (i)  
    Find m such that $\delta_m = j$. If m = j, proceed to the last step. If not, previous iterations guarantee that $ \delta_{k} = k $, $ \forall k\lt j $, which leads to m > j and $ \delta_{m-1}\gt j $. Defining $L = \{\delta_n\vert n\lt m-1\}$ and $R = \{\delta_n\vert n\gt m\}$, we get
    Equation (D.9)
    where $ \overrightarrow{\beta}^{dL(R)} $ denotes the series swapping d and elements of L(R). Moreover, from the standardisation $\overrightarrow{\beta}^{(\text{Pre})} = \overrightarrow{\beta}^{(R)}\beta^{(j,\delta_{m-1})}\overrightarrow{\beta}^{(L)}$, where $\overrightarrow{\beta}^{(R)}$ does not act on levels j and $\delta_{m-1}$ and we can rearrange it into $\overrightarrow{\beta}^{(dR)}\overrightarrow{\beta}^{(\text{Pre})} = \beta^{(j,\delta_{m-1})}\overrightarrow{\beta}^{(dR)}\overrightarrow{\beta}^{(R)}\overrightarrow{\beta}^{(L)}$. From remark 4, $\beta^{(\delta_{m-1},d)}\beta^{(j,d)}\beta^{(j,\delta_{m-1})} =$ $\beta^{(j,\delta_{m-1})}\beta^{(j,d)}\beta^{(\delta_{m-1},d)}$ since $ d\gt\delta_{m-1}\gt j $. This procedure updates $\delta \rightarrow (\cdots,\delta_{m-2},j,\delta_{m-1},\delta_{m+1},\cdots )$.
  • (ii)  
    Repeat the first step until $\delta_{j} = j$.
  • (iii)  
    Repeat the first and the second step with $ j\rightarrow j+1 $.

At the end, one gets $ \delta = (1,2,\ldots,d-1) $ and $ \overrightarrow{\beta}^{(\text{Pre})} = \unicode{x1D7D9} $. By standardizing $ \overrightarrow{\beta}^{(\text{Post})} $ and concatenating with $ \overrightarrow{\beta}^{(d-\text{rel})} $, the standard formation is obtained.

For d = 2, the lemma is trivially true. Thus by induction, the lemma is proved for $\pi = (1,2,\ldots,d)$. Following the same logic, this can also be proven for $\pi = (d,d-1,\ldots,1)$. □

Figure D1.

Figure D1. Illustration of changes in β-order starting from $\pi = (1,2,\ldots,d)$ undergoing a swap series $\overrightarrow{\beta}$, where it is assumed that $ \overrightarrow{\beta} $ is always neighbouring and allows no repetition of a particular swap. Level d is depicted as a bar, dividing the levels into sets A and B. For an initial state whose β-order is monotonically increasing in energy, $ A = \lbrace 1, \cdots, d-1\rbrace $ while $ B = \emptyset$. Three different types of swaps are possible: (i) swapping between level d and its neighbouring element which is in A (1st swap above), (ii) swapping among neighbouring levels in A (2nd, 3rd steps above), and (iii) swaps among levels in B (4th explicit step above). When the type (ii) and (iii) swaps occur, A and B remain the same. On the other hand, type (i) swaps move one element of A into B. The process of elements going from B to A is forbidden due to the no-repetition constraint. At the end of applying $\overrightarrow{\beta}$, A and B becomes Af and Bf . The elements of Af never experience a swap with level d, while the elements of Bf experienced exactly once swapping with level d.

Standard image High-resolution image

Now we prove theorem 8.

Proof. We prove the theorem for the case ${\pi}(\mathrm{p}) = (1,2,\ldots,d)$ and argue that the proof also holds for ${\pi}(\mathrm{p}) = (d,d-1,\ldots,1)$.

We first prove the only if statement of the theorem, as follows:

  • (i)  
    We show that the repetition of any particular β-swap always leads a to non-extreme state if the series is all-neighbouring. This is done by contradiction: suppose that $\overrightarrow{\beta} = \beta^{(k,l)}\overrightarrow{\beta}^{\prime}$, w.l.o.g. assuming k < l. Furthermore, assume $\overrightarrow{\beta}^{\prime}$ to be a series satisfying the only if part of the statement but contains $\beta^{(k,l)}$, causing $\beta^{(k,l)}$ to occur twice in $\overrightarrow{\beta}$. From lemma 12, $\overrightarrow{\beta}^{\prime}$ can be written in a standard formation, which reads
    Equation (D.10)
    Notice that since $\overrightarrow{\beta}$ is an all-neighbouring swap, this implies that after $\overrightarrow{\beta}^{\prime}$, levels k and l should be neighbouring with $\mathrm{g}(\overrightarrow{\beta}^{\prime}\mathrm{p})_k\unicode{x2A7D}\mathrm{g}(\overrightarrow{\beta}^{\prime}\mathrm{p})_l$, which then implies
    Equation (D.11)
    for some set of levels $M \subset \{m\vert m\lt k\lt l\}$, by construction of the standard formation. Here, $\overrightarrow{\beta}^{(\text{Irrel})}$ is a series that acts on neither k nor l. Using remarks 3 and 4,
    Equation (D.12)
    which always generates a non-extreme point from remark 1.
  • (ii)  
    Now we show that non-neighbouring swaps are also not allowed.
    • (a)  
      For d = 3: from equations (C.5)–(C.9), $ \beta^{(1,3)} $ yields non-extreme point of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ with $ {\pi}(\mathrm{p}) = (1,2,3) $ or $ (3,2,1) $. Also, lemma 10 forbids any other occasions having a non-neighbouring swap.
    • (b)  
      For d>3 : suppose that $\beta^{(k,m)}$ is the only non-neighbouring swap in the series $ \overrightarrow{\beta} $, i.e.
      Equation (D.13)
      and
      Equation (D.14)
      after all-neighbouring series $ \overrightarrow{\beta}^{(\text{NS})} $. From the first part of the proof, $ \overrightarrow{\beta}^{(\text{NS})} $ also cannot include any repetition. Using lemma 12, we rearrange $ \overrightarrow{\beta}^{(\text{NS})} $ into the standard formation.
      • (1)  
        $ \exists l $ satisfying $k\lt l\lt m$ or $k\gt l\gt m$: firstly, note that the procedures from equations (C.5)–(C.9) can be generalised for higher dimensions. If $\mathrm{g}(\mathrm{r})_k\gt\mathrm{g}(\mathrm{r})_l\gt\mathrm{g}(\mathrm{r})_m$ and $k\lt l\lt m$ for some $\mathrm{r}$,
        Equation (D.15)
        Equation (D.16)
        and
        Equation (D.17)
        which implies that $\beta^{(k,m)}\mathrm{r}$ can be obtained from $\beta^{(l,m)}\beta^{(k,m)}\beta^{(k,l)}\mathrm{r}$ via partial thermalisation of levels k and l (cf d-dimensional case for B.4), and thus not extremal in $ \mathcal{S}_\mathrm{ETO}(\mathrm{r}) $. Similarly, the result also holds when $ k\gt l\gt m $. By putting $ \mathrm{r} = \overrightarrow{\beta}^{(\text{NS})}\mathrm{p} $, the state $ \overrightarrow{\beta}\mathrm{p} $ is not extremal in $ \mathcal{S}_\mathrm{ETO}(\overrightarrow{\beta}^{(\text{NS})}\mathrm{p}) $ and thus not extremal in $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $.
      • (2)  
        $ \exists l $ satisfying m < l: l initially has a smaller slope than m in $ \mathrm{p} $ and thus swapped with m during $ \overrightarrow{\beta}^{(\text{NS})} $. Denote the last such l swapped with m as l0.
        • (A)  
          If l0 and m are neighbouring in $ \overrightarrow{\beta}^{(\text{NS})}\mathrm{p} $, standard formation indicates
          Equation (D.18)
          for some set of levels $ N\subset\{n\vert n\lt m\} $. Here, $ \overrightarrow{\beta}^{(\text{Irrel})} $ acts on neither m nor l0. Then as in equation (D.12),
          Equation (D.19)
          $ {\pi}(\overrightarrow{\beta}^{(\text{Pre-}2)}\mathrm{p}) = (\cdots,k,\ldots,m,l_{0},\cdots) $ and thus $ \beta^{(k,m)}\beta^{(m,l_{0})} $ produces a non-extreme state from lemma 10.
        • (B)  
          If l0 and m are not neighbouring and there is no l satisfying $k\lt l\lt m$ or $k\gt l\gt m$, the levels between l0 and m are $ \{j_{i}\} $ such that $ j_{i}\lt k,m $. By the construction of the standard formation and the resulting order
          Equation (D.20)
          we know that for all ji : i) $ \beta^{(j_{i},l_{0})} $ and $ \beta^{(j_{i},k)} $ exist in $ \overrightarrow{\beta}^{(\text{NS})} $, ii) $ \beta^{(j_{i},k)} $ proceeds $ \beta^{(j_{i},l_{0})} $, and iii) $ \beta^{(j_{i},m)} $ does not exist in $ \overrightarrow{\beta}^{(\text{NS})}$. The last condition also implies that after $ \beta^{(j_{i},l_{0})} $, swaps acting on ji are only acting within the set $ \{j_{i}\} $, which we will denote as $ \overrightarrow{\beta}^{(\{j_{i}\})} $. Then
          Equation (D.21)
          since $ \beta^{(k,m)}\overrightarrow{\beta}^{(\text{Irrel})} $ does not act on levels l0 and all ji . Finally, l0 is again neighbouring to m in $ {\pi}(\overrightarrow{\beta}^{(\text{Irrel})}\beta^{(l_{0},m)}\overrightarrow{\beta}^{(\text{Pre})}\mathrm{p}) $ and we can use the argument from case ii.A to prove this state is non-extremal.
      • (3)  
        All $ l\lt k,m $: denote the last such l swapped with k as l0 and they are neighbouring from the structure of the standard formation. Similar to equation (D.19), we get $ \beta^{(k,m)}\beta^{(k,l_{0})} $ part, which produces a non-extreme state by lemma 10.
      As a result, non-neighbouring swaps are completely ruled out from the candidate of extreme point producing β-swaps when starting from monotonic order states.

The sufficient condition of the theorem can be shown by recalling two properties: (i) there exists at least one extreme point of $\mathcal{S}_\mathrm{ETO}$ for each β-order (lemma 7) and (ii) β-swap series satisfying the conditions of the lemma are all equivalent if the pair ($\pi,\pi^{\prime}$) is identical (lemma 12), which makes them the only candidate for an extreme point with order $ \pi^{^{\prime}} $.

Lastly, we note that the proof holds for $\pi = (d,d-1,\ldots,1)$ initial states since all we have used are remark 4, lemmas 10 and 12, which hold even when the energy ordering is inverted. □

Appendix E: Proof of theorem 9

Before proving theorem 9, we establish a result connecting lower dimensional results to higher dimensions. Lemma 13, combined with the fact that two disjoint neighbouring β-swaps produce $ \mathcal{S}_{\mathrm{TO}} $ extreme points (using remark 3 twice), shows that states after two different neighbouring β-swaps are always uniquely extremal in their order.

Lemma 13. For any $ \mathrm{p}\in\mathcal{V}^d $ with the corresponding β-ordering $ \pi = {\pi}(\mathrm{p}) $, and for any index $ j\in [2,d-1] $, the extreme states of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ that have β-orders

Equation (E.1)

Equation (E.2)

are unique and given by the respective states

Equation (E.3)

Equation (E.4)

Proof. One can prove the above statement for the following independent cases:

  • (i)  
    $S_{j-1,j}S_{j,j+1}\pi $ and $ S_{j,j+1}S_{j-1,j}\pi $ admit unique extreme points for $ \pi_{j}\lt \pi_{j+1},\pi_{j-1} $: out of the three consecutive levels in β-ordering, the middle term corresponds to the lowest energy level (e.g. a β-ordering (2,1,3)),
  • (ii)  
    $ S_{j-1,j}S_{j,j+1}\pi $ for $ \pi_{j-1}\lt \pi_{j},\pi_{j+1} $,
  • (iii)  
    $ S_{j,j+1}S_{j-1,j}\pi $ for $ \pi_{j+1}\lt\pi_{j-1},\pi_{j} $,
  • (iv)  
    $ S_{j,j+1}S_{j-1,j}\pi $ for $ \pi_{j-1}\lt \pi_{j},\pi_{j+1} $,
  • (v)  
    $ S_{j-1,j}S_{j,j+1}\pi $ for $ \pi_{j+1}\lt\pi_{j-1},\pi_{j} $.

Cases 1–3 follow from a slight generalisation of remark 5. If a matrix M is a biplanar extremal thermal process (see [66] definition 6 and section IV), then $M\oplus \unicode{x1D7D9} $ is also a biplanar extremal thermal process, since an identity connects i'th element only to i'th element and thus biplanar; and all elements of thermal processes are non-negative and upper bounded by 1, making an identity extremal. Hence, we only need to prove for cases 4 & 5.

Case 4: This can be proven by showing that all states $ \mathrm{q}^{^{\prime}}\in\mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ such that $ {\pi}(\mathrm{q}^{^{\prime}}) = {\pi}(\mathrm{q}^{(j-1,j)}) $ are thermomajorised by $ \mathrm{q}^{(j-1,j)} $. We will show this by contradiction. To do so, first note that $ \mathrm{q}^{(j-1,j)} $ is very close to being tightly-majorised by the initial $ \mathrm{p} $—in particular, all elbow points of $ \mathcal{L}_{\mathrm{q}^{(j-1,j)}} $ lie on $ \mathcal{L}_{\mathrm{p}} $ except for the j'th one. Therefore, for any $ \mathrm{q}\in\mathcal{S}_\mathrm{ETO}(\mathrm{p}) $ with the same β-ordering as $ \mathrm{q}^{(j-1,j)} $, their elbows are aligned, and we know that $ \mathcal{L}_{\mathrm{q}^{(j-1,j)}} \unicode{x2A7E} \mathcal{L}_{\mathrm{q}^{^{\prime}}} $ at all elbows other than j'th. Now, assume that in fact $ \mathrm{q}^{(j-1,j)}\nsucc_\beta\mathrm{q}^{^{\prime}} $. However, this can only happen if $ \mathcal{L}_{\mathrm{q}^{^{\prime}}} \gt \mathcal{L}_{\mathrm{q}^{(j-1,j)}} $ at the j'th elbow. Since $ {\pi}(\mathrm{q}^{(j-1,j)}) = (\cdots,\pi_{j},\pi_{j+1},\pi_{j-1},\cdots) $, the above condition translates into

Equation (E.5)

where we have simply used the fact that $ \mathcal{L}_{\mathrm{q}^{^{\prime}}} $ at the j-th elbow is equal to one minus the probability mass corresponding to the j + 1-th up to d-th elements in the β-order $ {\pi}(\mathrm{q}^{^{\prime}}) $.

Let us denote $A_R = \sum_{k\in R}p_{k} $ to be the probability mass over a set of levels R. Note that this quantity changes whenever a level $ i\in R $ is β-swapped with a level in $ i^{^{\prime}}\in R^{\mathsf{c}} $. When level iʹ has a steeper slope than i, then AR increases after the β-swap; conversely, $ \sum_{k\in R}p_{k} $ decreases. To make use of this observation for equation (E.5), take

Equation (E.6)

The only β-swaps that can reduce AR would be $ \beta^{(\pi_{j-1},\pi_{j})} $ and $ \beta^{(\pi_{j-1},\pi_{j+1})} $. Among different combinations of two swaps, $ \mathrm{q}^{(j-1,j)} = \beta^{(\pi_{j-1},\pi_{j+1})}\beta^{(\pi_{j-1},\pi_{j})}\mathrm{p} $ achieves the minimum probability mass over a set of levels R as can be seen from theorem 5. Thus, equation (E.5) is impossible.

Case 5: $ \pi_{j+1}\lt \pi_{j-1},\pi_{j} $, we must try to maximise $ \sum_{k = 1}^{j-2}q_{\pi_k} + q_{\pi_{j+1}} $, which is achieved when $ \mathrm{q} = \mathrm{q}^{(j,j-1)} $ as in Case 4. This concludes the proof. □

Now we prove theorem 9.

Proof. Given an initial state $ \mathrm{p}_0 \in\mathcal{V}^d$, suppose that a β-swap series $\overrightarrow{\beta} = \beta^{(j_l,k_l)}\cdots\beta^{(j_1,k_1)}$ of length l produces an extreme point $\mathrm{p}_l = \overrightarrow{\beta}\mathrm{p}_0$ of $\mathcal{S}_\mathrm{ETO}(\mathrm{p}_0)$. To prove theorem 9 is to prove that $ \ell \unicode{x2A7D} \left\lfloor\frac{d!-4}{d-3}\right\rfloor$.

We start by denoting

Equation (E.7)

Because of the assumption that $ \mathrm{p}_l $ is extremal, by necessity all the intermediate states $ \mathrm{p}_1,\ldots,\mathrm{p}_{\ell-1} $ are also extreme points of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}_0) $. Furthermore, we may use the fact that

Equation (E.8)

since together with equation (E.7) this would imply that $ \mathrm{p}_{n}\succ_{\beta}\mathrm{p}_{m} $ if n < m. Then the original bound $\ell_\textrm{{max}} \unicode{x2A7D} d! - 1$ of [38] is immediately obtained.

This bound can be improved by carefully studying the total number of feasible β-orders that $\mathrm{p}_{n+1}$ can take, given its history $(\mathrm{p}_0,\ldots,\mathrm{p}_n)$, which we denote as R(n). For instance, $R(0)\unicode{x2A7D} d! - 1$, where the upper bound refers to choosing any β-order for $\mathrm{p}_1$ except ${\pi}(\mathrm{p}_0)$. This is an upper bound, likely to be a loose one, because for a fixed $ \mathrm{p} $, not all R(0) orders are possible, given that $ \mathrm{p}_1 $ is obtained only by a single β-swap. If $R(n)\lt1$ for all trajectories $(\mathrm{p}_0,\ldots,\mathrm{p}_n)$, then $p_{n+1}$ cannot be extremal and therefore $\ell_\mathrm{max}\unicode{x2A7D} n$.

Now, given $\mathrm{p}_1$, we know the following:

  • (i)  
    The state $ \mathrm{p}_2 \in \mathcal{S}_\mathrm{ETO}(\mathrm{p}_1) \subsetneq \mathcal{S}_\mathrm{ETO}(\mathrm{p}_0)$,
  • (ii)  
    Since $ \mathrm{p}_2 $ is extremal for $\mathcal{S}_\mathrm{ETO}(\mathrm{p}_0)$, it is also extremal for $\mathcal{S}_\mathrm{ETO}(\mathrm{p}_1)$,
  • (iii)  
    By lemma 3, the states $\mathrm{q}_0^{(j)} = \beta^{({\pi}(\mathrm{p}_0)_j,{\pi}(\mathrm{p}_0)_{j+1})}\mathrm{p}_0$ are unique extreme points of $\mathcal{S}_\mathrm{ETO}(\mathrm{p}_0)$ with order ${\pi}(\mathrm{q}_0^{(j)})$.

Firstly, by equation (E.8), ${\pi}(\mathrm{p}_2) \neq {\pi}(\mathrm{p}_0),{\pi}(\mathrm{p}_1)$, leading to $R(1)\unicode{x2A7D} d! - 2$. The third observation above allows us to conclude that ${\pi}(\mathrm{p}_2) \neq {\pi}(\mathrm{q}_0^{(j)})$ for $j = 1,\ldots, d-1$. However, we should not subtract d − 1 orders from R(1) for all $(\mathrm{p}_0,\mathrm{p}_1)$. When $\mathrm{p}_1 = \mathrm{q}_0^{(k)}$ for some k, ${\pi}(\mathrm{q}_0^{(k)})$ and ${\pi}(\mathrm{p}_1)$ must not be double-counted, giving upper bound $R(1)\unicode{x2A7D} d!-d$ for any trajectory $(\mathrm{p}_0,\mathrm{p}_1)$.

Similar as to the above reasoning, for $ \mathrm{p}_{i+1} $, we know that

  • (i)  
    The state $ \mathrm{p}_{i+1} \in \mathcal{S}_\mathrm{ETO}(\mathrm{p}_{i}) \subsetneq \cdots \subsetneq \mathcal{S}_\mathrm{ETO}(\mathrm{p}_0)$,
  • (ii)  
    Since $ \mathrm{p}_{i+1}\in\mathrm{Ext}(\mathcal{S}_\mathrm{ETO}(\mathrm{p}_0)) $, therefore $ \mathrm{p}_{i+1}\in\mathrm{Ext}(\mathcal{S}_\mathrm{ETO}(\mathrm{p}_{k})) $ for all $ k\unicode{x2A7D} i $,
  • (iii)  
    By lemma 3, $ \mathrm{q}_{i}^{(j)} = \beta^{({\pi}(\mathrm{p}_{i})_{j}, {\pi}(\mathrm{p}_{i})_{j+1})}\mathrm{p}_{i} $ are unique extreme points of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}_{i}) $ with order $ {\pi}(\mathrm{q}_{i}^{(j)}) $.

Therefore, given $(\mathrm{p}_0,\ldots,\mathrm{p}_n)$, we can repeat the same using ${\pi}(\mathrm{p}_{n+1}) \neq {\pi}(\mathrm{p}_{k})$ for $k\unicode{x2A7D} n$, and ${\pi}(\mathrm{p}_{n+1}) \neq {\pi}(\mathrm{q}_{k}^{(j)})$ for $k\unicode{x2A7D} n-1$ and $1\unicode{x2A7D} j\unicode{x2A7D} d-1$. The former condition excludes n + 1 orders and the latter one occupies $n(d-1)$ orders. Again, we need to account for double counting.

  • (i)  
    ${\pi}(\mathrm{p}_i) = {\pi}(\mathrm{q}_{i^{\prime}}^{(j)})$ for some $i,i^{\prime}$, and j.
  • (ii)  
    ${\pi}(\mathrm{q}_i^{(j)}) = {\pi}(\mathrm{q}_{i^{\prime}}^{(j^{\prime})})$ for some $j, j^{\prime}$ and $i^{\prime}\gt i$.

We will show that above coincidences can happen only when $\mathrm{p}_{i^{\prime}\pm1} = \mathrm{q}_{i^{\prime}}^{(j)}$ (case 1), or $\mathrm{q}_{i^{\prime}-2}^{(j,j^{\prime})} = \mathrm{p}_{i^{\prime}}$ (case 2).

If $\mathrm{p}_{i} = \mathrm{q}_{i^{\prime}}^{(j)}$ for $i\unicode{x2A7D} i^{\prime} -2$, ${\pi}(\mathrm{p}_{i^{\prime}}) = S_{j,j+1}({\pi}(\mathrm{p}_i)) = {\pi}(\mathrm{q}_i^{(j)})$. But ${\pi}(\mathrm{q}_i^{(j)})$ is already removed from $ R(i^{\prime}-1) $ for all $i^{\prime}\unicode{x2A7E} i+2$. $\mathrm{p}_{i} = \mathrm{q}_{i^{\prime}}^{(j)}$ for $i\unicode{x2A7E} i^{\prime} +2$ is also not allowed from the same reason.

The second case implies that $S_{j,j+1}({\pi}(\mathrm{p}_i)) = S_{j^{\prime},j^{\prime}+1}({\pi}(\mathrm{p}_i^{\prime}))$ or equivalently, there exists some β-ordering χ such that

Equation (E.9)

Consider then three cases:

  • (a)  
    $j^{\prime} = j$: ${\pi}(\mathrm{p}_i) = {\pi}(\mathrm{p}_{i^{\prime}})$ and $\mathrm{p}_{i^{\prime}}$ is not extremal for $\mathcal{S}_\mathrm{ETO}(\mathrm{p}_0)$.
  • (b)  
    $j^{\prime} = j\pm1$: by lemma 13, we know that $\mathcal{S}_\mathrm{ETO}(\mathrm{p}_i)$ has a unique extreme point that has the β-ordering χ. This is given by the state $\mathrm{q}_{i}^{(j,j^{^{\prime}})}$, that is, $\mathrm{p}_{i^{\prime}} = \mathrm{q}_{i}^{(j,j^{^{\prime}})}$, $i^{\prime} = i+2$, and $ {\pi}(\mathrm{q}_i^{(j)}) = {\pi}(\mathrm{q}_{i^{\prime}}^{(j^{\prime})}) = {\pi}(\mathrm{p}_{i+1}) $. We do not have to worry about double counting $ {\pi}(\mathrm{q}_{i^{\prime}}^{(j^{\prime})}) $ since it is already dealt as case 1: ${\pi}(\mathrm{p}_{i^{\prime}-1}) = {\pi}(\mathrm{q}_{i^{\prime}}^{(j^{\prime})})$.
  • (c)  
    if $j^{\prime} \neq j, j\pm1 $, $\beta^{(j^{\prime},j^{\prime}+1)}$ and $\beta^{(j,j+1)}$ commute and $\beta^{(j^{\prime},j^{\prime}+1)}\beta^{(j,j+1)}\mathrm{p}_i$ is tightly thermomajorised by $\mathrm{p}_i$, again, making $\mathrm{p}_{i^{\prime}} = \beta^{(j^{\prime},j^{\prime}+1)}\beta^{(j,j+1)}\mathrm{p}_i$ and $i^{\prime} = i+2$. Since $ \beta^{(j,j+1)} $ and $ \beta^{(j^{^{\prime}},j^{^{\prime}}+1)} $ commute, $ \mathrm{p}_{i+1} $ can be both $ \beta^{(j,j+1)}\mathrm{p}_i $ or $ \beta^{(j^{\prime},j^{\prime}+1)}\mathrm{p}_i $. We do not have to worry about the first case since $ {\pi}(\mathrm{q}_i^{(j)}) =$ ${\pi}(\mathrm{q}_{i^{\prime}}^{(j^{\prime})}) = {\pi}(\mathrm{p}_{i+1}) $ as before. In the other case, $ {\pi}(\mathrm{q}_i^{(j)}) = {\pi}(\mathrm{q}_{i^{\prime}}^{(j^{\prime})}) \neq {\pi}(\mathrm{p}_{i+1}) $ and we should not count $ {\pi}(\mathrm{q}_{i^{\prime}}^{(j^{\prime})}) $ again.

Then, newly removed orders in $ R(i^{\prime}+1) $ are: $ {\pi}(\mathrm{p}_{i^{\prime}+1}) $, $ {\pi}(\mathrm{q}_{i^{\prime}}^{(k)})$. The worst case is when $ \mathrm{p}_{i^{^{\prime}}} = \mathrm{q}_{i^{^{\prime}}-2}^{(j^{^{\prime}},j)} $, $ \mathrm{p}_{i^{^{\prime}}-1} = \mathrm{q}_{i^{^{\prime}}-2}^{(j^{^{\prime}})} $, and $ \mathrm{p}_{i^{^{\prime}}+1} = \mathrm{q}_{i^{^{\prime}}}^{(j^{^{\prime\prime}})} $ with $ j,j+1,j^{^{\prime}},j^{^{\prime}}+1,j^{^{\prime\prime}} $ all distinct, which then gives

Equation (E.10)

Excluding these double-counted orders, one can remove at least d − 3 orders at each step.

To sum up, $R(2) \unicode{x2A7D} d! - d - (d-2)$ since $\mathrm{q}_1^{(j)}$ eliminates d − 2 new levels. For n > 3, at least d − 3 new β-orders can be eliminated from R(n) after applying each β-swap. As a result, we get

Equation (E.11)

which bounds the length of β-swap series as $l_\mathrm{max} = \left\lfloor\frac{d!-4}{d-3}\right\rfloor$ when $ d\unicode{x2A7E} 4 $. □

Appendix F: Catalytic elementary thermal operations: an example

We tackle the problem by the following procedure: given $ \mathrm{p}\in \mathcal{V}^3$, we choose a qubit catalyst $ {\textrm{c}} \in \mathcal{V}^2 $, and construct the reachable state set $\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})$. The catalyst Hamiltonian is assumed to be w.l.o.g. degenerate, and hence a choice of $ {\textrm{c}} = (c_1,1-c_1) $ is simply determined by a real-valued parameter c1. We then find the full characterisation of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}}) $ by identifying its extreme points, and denote $\mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p};{\textrm{c}})$ to be the set of all states $\mathrm{q}$ such that $\mathrm{q}\otimes{\textrm{c}}\in\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})$.

Lastly, this procedure is iterated for different choices of ${\textrm{c}}$, by varying the choice of c1 in a sufficiently fine-grained manner.

The first step, where $\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})$ is found, is done with the first algorithm of section 5.3 where convex hull is explicitly constructed. Since degenerate Hamiltonian is assumed for catalysts, this process is easier than a generic search. The resulting set $\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})$ also encompasses states $p^{^{\prime}}\in\mathcal{V}^6$ that are not in product form. Final states of the form $ \mathrm{q}\otimes{\textrm{c}} $ can be distilled by forming plane equations imposing catalyst state ${\textrm{c}}$, and performing half-space intersections with the full set $\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})$ numerically. In general, extreme points of $\mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p};{\textrm{c}})$ are not necessarily extremal in $\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})$.

Note that the catalytic ETO operation that produces $ \mathrm{p}\rightarrow\mathrm{q} $ is often non-unique. First of all, there may be multiple catalyst states that enable a particular transition. Nevertheless, given $ \mathrm{p},\mathrm{q} $ and a choice of $ {\textrm{c}} $, if $ \mathrm{q}\otimes{\textrm{c}} \in \mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}}) $, then decomposing the transformation into a particular convex combination of different β-swap series is straightforward: writing $ \mathrm{q}\otimes{\textrm{c}} $ as a convex combination of $\mathrm{Ext}[\mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}})] $ can be done by linear programming and each extreme point is obtained from a β-swap series. This gives us a particular algorithm for implementation.

Now we will examine one specific extreme point $ \mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p};{\textrm{c}}) $, plotted in figures 6 and 7, following the procedure described above. This example is also an extreme point of $ \mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p}) $ having the smallest ground state population among reachable states.

An extreme point of $ \mathcal{S}_\mathrm{CETO}^{(2)} $

The initial state $ \mathrm{p} = (0.35,0.55,0.1) $ and the Hamiltonian $\beta\mathrm{H}_S = (0,0.2,0.5)$, giving ${\pi}(\mathrm{p}) = (2,1,3)$. The initial catalyst distribution is fine-tuned to be ${\textrm{c}} = (c_1,1-c_1)$, with

Equation (F.1)

which provides the maximum advantage when minimising the ground state population. Then the total β-order becomes

Equation (F.2)

where $a*b$ implies energy eigenstate $\lvert a\rangle_{S}\lvert b\rangle_{C}$ of a system (S) plus catalyst (C) state.

From $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}}) $, we obtain new extreme points of $ \mathcal{S}_\mathrm{CETO}^{(2)}(\mathrm{p};{\textrm{c}}) $, including our example point $ \mathrm{q} $. Further, by solving a linear programming problem, four extreme points of $ \mathcal{S}_\mathrm{ETO}(\mathrm{p}\otimes{\textrm{c}}) $, $\mathrm{q}^\prime_{1,2,3,4}\in\mathcal{V}^6$, such that

Equation (F.3)

and corresponding β-swaps $ \mathrm{q}^{\prime}_i = \overrightarrow{\beta}_i(\mathrm{p}\otimes{\textrm{c}}) $ can be identified. $ \overrightarrow{\beta}_i $ differ only slightly from each other (differences marked red),

Equation (F.4)

Equation (F.5)

Equation (F.6)

Equation (F.7)

facilitating the recombination into

Equation (F.8)

with $ \lambda_1 = \alpha_3/(\alpha_1+\alpha_3) = \alpha_4/(\alpha_2+\alpha_4) $ and $ \lambda_2 = \alpha_2/(\alpha_1+\alpha_2) = \alpha_4/(\alpha_3+\alpha_4) $.

Given equation (F.8), we can analyse system-catalyst interplay during the catalytic evolution, starting from

Equation (F.9)

The swap series can be grouped into three phases:

  • (i)  
    The first four swaps all involve the $3*2$ population. The first and the third swaps ($ \beta^{(1*1,3*2)} $ and $ \beta^{(2*1,3*2)} $) work to shift population from the first level of the catalyst to the second level and thus intensify the non-uniformity of the catalyst reduced state, as reflected in corresponding steps of figure 7(b). The ratio between $3*2$ and $3*1$ populations is increasing more rapidly than the one between levels 2 and 1 of catalyst reduced state, correlating the system and catalyst as shown in the mutual information. The fourth swap is chosen to be λ ≠ 1 swap to prevent $3*2$ population to become too large to recover $ {\textrm{c}} $.Since both catalyst local free energy and mutual information increase, system local free energy should always decrease at this stage. The total state after these swaps is
    Equation (F.10)
    Equation (F.11)
  • (ii)  
    The fifth swap balances catalyst distribution in the degenerate block $\lvert2\rangle_S$ while system reduced populations are fixed, yielding
    Equation (F.12)
    Equation (F.13)
    Catalyst local free energy decreases as a result of mixing, while correlation increases.
  • (iii)  
    Last two swaps increases $3*1$ level population to recover the original ratio between $3*1$ and $3*2$, while at the same time further reducing the system ground state population. For this particular choice of catalyst, we have a simplification in the sense that these swaps also balance $1*1$ and $1*2$, leading to
    Equation (F.14)
    Equation (F.15)
    with vanishing correlation and retrieval of the original catalyst. The system free energy increases here, since level $\lvert1\rangle_S$, which already has the lowest slope, loses more population and the new state will thermomajorise the old state. This behaviour of increasing free energy after swaps is strictly forbidden in non-catalytic setting, and allowed in this case by sacrificing correlations and catalyst free energy stored from previous operations.

Although there is a room for some change in orders between swaps that are commuting, the above is a typical strategy for constructing catalytic transformations:

  • (i)  
    exploits expanded dimensionality to swap a system plus catalyst level with more numbers of levels, paying i) temporary correlations and ii) local variations on catalyst as a cost;
  • (ii)  
    resolves correlations by mixing in the degenerate energy subspace; and
  • (iii)  
    recovers original catalyst distribution while increasing system local free energy.

Even for generalised free energies Fα with different α values, similar behaviours are observed. In figure F1, α = 0.5 and 2 are presented as representative examples. In both of the cases, Fα for a system reduced state (see (c) and (d) of figure F1) share the decreasing/increasing trend, albeit with different slopes.

Figure F1.

Figure F1. Replica of figure 6 with generalised free energies.

Standard image High-resolution image

Footnotes

  • See [30] for an alternative direction discussing catalysts that activate the transitions that are not possible even between multiple copies of initial and target states.

  • This analogy is not exact because, while in quantum computing, universal 2-qubit gate sets exist, it is known that TO cannot always be decomposed into ETO [38].

  • For most cases, we are interested in $B = B^{\prime}$.

  • Multiple generalisations for energy-coherent states exist, and can be found in [14].

  • Whenever the order of Ej and Ek is known, we denote ETO such as $ \mathcal{M}^{(j,k)}$ or $\beta^{(j,k)} $, using the convention that $E_j\unicode{x2A7D} E_k$.

  • This is a state after l neighbouring swaps. We use the notation $\mathrm{q}^{(\overrightarrow{j},j_{l+1})} = \beta^{({\pi}(\mathrm{q}^{(\overrightarrow{j})})_{j_{l+1}}, {\pi}(\mathrm{q}^{(\overrightarrow{j})})_{j_{l+1}+1})}\mathrm{q}^{(\overrightarrow{j})}$, where $\overrightarrow{j} = (j_1,\ldots,j_{l})$.

  • It should be noted at this point that $ \mathrm{r} $ may not be the only extreme point that has β-ordering ψ—there could be states $ \mathrm{r}^{^{\prime}} $ of the same β-order, where the first elbow is lower that of $ \mathrm{r} $, and the second elbow higher. Such states are not, however, contained in Sd .

  • Requiring $ \mathrm{g}(\mathrm{r})_{\psi_{j-1}} \gt\mathrm{g}(\mathrm{r})_{\psi_{j}} $ is always possible by putting all the levels having the same slope $ \mathrm{g}(\mathrm{r})_{\psi_{j}} $ to come after ψj in the order ψ.

Please wait… references are loading.