Higher-order interference and single-system postulates characterizing quantum theory

We present a new characterization of quantum theory in terms of simple physical principles that is different from previous ones in two important respects: first, it only refers to properties of single systems without any assumptions on the composition of many systems; and second, it is closer to experiment by having absence of higher-order interference as a postulate, which is currently the subject of experimental investigation. We give three postulates -- no higher-order interference, classical decomposability of states, and strong symmetry -- and prove that the only non-classical operational probabilistic theories satisfying them are real, complex, and quaternionic quantum theory, together with 3-level octonionic quantum theory and ball state spaces of arbitrary dimension. Then we show that adding observability of energy as a fourth postulate yields complex quantum theory as the unique solution, relating the emergence of the complex numbers to the possibility of Hamiltonian dynamics. We also show that there may be interesting non-quantum theories satisfying only the first two of our postulates, which would allow for higher-order interference in experiments while still respecting the contextuality analogue of the local orthogonality principle.


I. INTRODUCTION
Quantum theory underpins much of modern physics, as well as being essential in many other scientific fields and countless technological applications. However, by most accounts quantum phenomena remain rather mysterious: there is no generally accepted intuitive picture of the underlying reality, and the standard textbook introductions of the mathematical formalism lack a simple conceptual motivation.
With the rise of quantum information processing and the ever more refined control of quantum phenomena, there has recently been a surge of diverse attempts to tackle such foundational questions. These range from studies of the information processing capabilities of theories similar to quantum theory [7,12,[34][35][36], to reconstructions of the formalism from information-theoretic principles [40,41,46,48,49,67], to no-go theorems regarding interpretations and generalizations of the formalism [5,6,56], to novel experiments testing various predictions of the theory [2][3][4].
In this paper we give several closely related reconstructions of the mathematical structure-Hilbert space, Hermitian observables, positive operator-valued measuresof finite-dimensional quantum theory from simple postulates with clear physical significance and generality.
Providing such an explanation for the Hilbert space * Electronic address: hnbarnum@aol.com † Electronic address: markus@mpmueller.net ‡ Electronic address: cozster@gmail.com structure of quantum theory in terms of physically (not just mathematically) natural postulates is important for several reasons. First, deeper and more reasonable principles might help to dissolve the mysteries of quantum phenomena and make them more intelligible and easier to teach. Two well-known examples of this approach are Kepler's laws of planetary motion and their explanation through Newton's laws of motion and gravitation, and the Lorentz transformations and their explanation in Einstein's two relativity postulates. Second, it can be argued that this approach will be essential in making progress on problems such as formulating a theory unifying quantum and gravitational physics, as well as for developing potentially more accurate and more fundamental theories. In the absence of a picture of the underlying reality, we can use first principles to proceed toward the next physical theory in a careful, conceptual fashion. More practically, this approach can shed light on what is responsible for the power of quantum information processing and cryptography.
As quantum theory applies to an extremely broad range of physical systems and phenomena, and its probabilistic structure seems essential, we therefore work within a broad framework for studying probabilistic physical theories (usually called operational probabilistic theories). These are theories that describe sets of experiments and efficiently assign probabilities to measurement outcomes. More precisely, we imagine that physicists, or nature, prepare physical systems in various states, and then observe these systems in various ways. The outcomes of these observations occur with certain probabilities, which are predicted by the theory. It is important to emphasize that we do not assume that these probabilities are described by quantum theory; instead our postulates will allow us to derive their structure as represented by quantum theory.
Our postulates are as follows: 1. Classical Decomposability: Every state of a physical system can be represented as a probabilistic mixture of perfectly distinguishable states of maximal knowledge ("pure states").
2. Strong Symmetry: Every set of perfectly distinguishable pure states of a given size can be reversibly transformed to any other such set of the same size.

No Higher-Order Interference:
The interference pattern between mutually exclusive "paths" in an experiment is exactly the sum of the patterns which would be observed in all two-path subexperiments, corrected for overlaps.

Observability of Energy:
There is non-trivial continuous reversible time evolution, and the generator of every such evolution can be associated to an observable ("energy") which is a conserved quantity.
Before discussing their physical interpretation and motivation in more detail, we point out that all of our postulates refer to single systems only. This is in contrast to earlier reconstructions of quantum theory [40,41,46,48] which rely heavily on properties of composite systems. Our motivation to rely on single systems is as follows. It is not clear that the notion of subsystems and their composition, as it is often used in information-theoretic circuit diagrams and category-theoretic considerations, has fundamental physical significance. Quantum field theories, for example, need not have this kind of structure. Assigning subsystems may turn out to be a derived concept, contingent on the ability of an observer to control certain degrees of freedom in isolation from others, and independent of possibly more fundamental divisibility notions such as bosonic or fermionic particles.
Moreover, there has recently been a surge of interest in finding compelling physical principles that explain the specific contextuality behavior of quantum theory as compared to other probabilistic theories. This line of research aims at analyzing the single-system analogue of quantum non-locality, and understanding its specific characteristics in terms of principles such as "consistent exclusivity" [52]. Our results also contribute to this line of research by showing that Postulates 1 and 2 are sufficient to guarantee that systems satisfy consistent exclusivity.
We do not claim that our postulates are the only reasonable ones, but we think that they -like other recent reconstructions -are more natural than the usual abstract formulations which simply presume Hilbert spaces, complex numbers, and operators. Moreover, as we discuss below, we think that our formulation is especially FIG. 1: Higher-order interference. Consider a particle which can pass one of M (here: M = 4) slits, where some of the slits may be blocked by the experimenter (indicated by the black bars). After passing the multi-slit setup, the particle may trigger a certain event, for example the click of a detector localized in a certain area of the screen. We are interested in the probability pJ of the event, given that slits J ⊂ {1, 2, . . . , M } are open (for example p23 in the depicted setup). Classically, the probability of such an event given that all four slits are open, p1234, equals p1 + p2 + p3 + p4, where pi is the probability assuming than only slit i is open. This is violated in quantum theory due to interference. However, even in quantum theory, the total probability can be computed from contributions of pairs of slits only: we have It is in this sense that quantum theory has second-, but no third-or higher-order interference. The definition of interference that we use is not restricted to spatially arranged slits, but is formulated generally for any set of M perfectly distinguishable alternatives in a probabilistic theory. suitable in the search for interesting and physically reasonable modifications of quantum theory; that is, state spaces that are not described by the Hilbert space formalism but are otherwise consistent and physically plausible.
Comparison to other reconstructions can help uncover logical relations between physical structures of our world. For example, our fourth postulate (observability of energy) is used to rule out non-complex Hilbert spaces in this work; in other reconstructions, this role is usually played by the the postulate of tomographic locality, which states that joint states on composite systems are uniquely determined by local measurement statistics and their correlations. Thus, one may argue that there is a logical relationship between tomographic locality and observability of energy, and thus ultimately with the fact that we observe Hamiltonian mechanics in our world.
We will now give a short discussion of the interpretation of our postulates.
To clarify the terms in Postulate 1, a set of states is perfectly distinguishable if there is a measurement whose outcomes can be paired one-to-one with the states so that each measurement outcome has probability one when its corresponding state has been prepared, and probability zero when any of the other states have been prepared. A state of maximal knowledge (aka "pure state") is a state ω which cannot be written as a nontrivial convex combination of states, i.e. as ω = pσ + qτ where p + q = 1, p, q > 0, and σ = τ . That is, it cannot be viewed as arising from a lack of knowledge about which of two distinct states has been prepared.
Postulate 1 can be viewed as a generalization of the spectral decomposition of every quantum density matrix as a convex combination of orthogonal rank-one projectors onto orthogonal eigenstates of the density matrix. However, our postulate is stated purely in terms of the convex structures of the set of states and of measurement outcomes; the notion of spectrum of an operator is not involved. An important part of the physical significance of this postulate is that it appears likely to be needed for an information theory and probably a statistical mechanics that share desirable and physically fundamental properties with those supported by quantum theory. In particular, it is a plausible conjecture that this postulate implies the correspondence of two natural ways of defining entropies for states in generalized probabilistic theories [62,63]: the first as the minimal entropy of the outcomes of a fine-grained measurement made on the state, and the second as the minimal entropy of a preparation of the state as a mixture of pure states.
Postulate 2 expresses a fundamental symmetry: given any integer n, all n-level systems are equivalent. That is, we can transmit (not necessarily copy) the state of any n-level system to any other system without losing information, at least in principle. This implies a certain minimal amount of possible reversible dynamics or computational power.
Postulate 3, that the system exhibits at most "secondorder interference," is based on the notion of multi-slit interference introduced by Rafael Sorkin [1]. This is a manifestly physical assumption which is currently under experimental investigation [2,3]. The precise notion of an interference experiment will be defined in Section V below; an illustration is given in Figure 1.
This postulate suggests a possible route towards obtaining concrete predictions for conceivable third-order interference in experiments: drop the third postulate, and work out the new set of theories that satisfy only Postulates 1 and 2 (and possibly 4). As we will show, any system of this kind -if it exists -has a set of "filtering" operations that represent an orthomodular lattice known from quantum logic [37], but these filters do not necessarily preserve the purity of states as they do in quantum theory (equivalently, the lattice does not satisfy the "covering law"). However, these systems still satisfy the principle of "Consistent Exclusivity" [52], bringing their contextuality behavior close to quantum theory, despite the appearance of (non-quantum) third-order interference.
In this way, our results hint at possible physical properties of conceivable alternative theories against which quantum theory can be tested in interference experiments, and which may be of independent mathematical interest. In particular, the existence of theories exhibiting higher-order interference and containing quantum theory as a subtheory has been conjectured for several years. Preliminary results indicate interesting physical properties of those theories [54], but the concrete construction of the corresponding state spaces is still an open problem. We hope that our approach can help to make progress on this question.
We obtain our main result by first showing that the first three postulates bring us very close to quantum theory: they imply that systems are described by finitedimensional irreducible formally real Jordan algebras, or are classical. Moreover, these three postulates precisely characterize this class of theories, since classical systems and irreducible Jordan systems all satisfy Postulates 1-3. As Jordan, von Neumann and Wigner [10] showed, the formally real irreducible Jordan algebraic systems are the real, complex, and quaternionic quantum theories (for all finite dimensions), one exceptional case (the 3 × 3 octonionic "density matrices") and the spin factors (ballshaped state spaces) of all finite dimensions. Standard complex quantum theory is the only one among these which also satisfies the fourth postulate.
The association of energy with a conserved physical quantity is an important principle of both quantum and classical theory, exhibited for example in the Lagrangian formulation of classical mechanics in the guise of Noether's theorem; this provides some motivation for our energy observability postulate.
Further, Postulates 1, 2 and 4 seem likely to be necessary-or at least sufficient-to run standard statistical mechanics arguments, a possibility we will explore in further work. We have already mentioned the conjecture that Postulate 1 implies the equivalence of measurement and preparation entropy, which likely has relevance to thermodynamic processes and Maxwell's demon arguments. Reversible processes, the subject of Postulate 2, are even more crucial in classical and quantum thermodynamics.

II. OPERATIONAL PROBABILISTIC THEORIES
In this section, we summarize the standard convex framework for operational probabilistic theories, and give needed definitions and mathematical facts about convexity and cones. References for the mathematics include [11] and [33]. More details on the framework can be found in e.g. [36], [48], [35], [34], [13]; also, [71,72] offer accessible introductions. This review is primarily to fix notation and clarify the specific version used here.
The primitive elements are experimental devices and probabilities. In particular, experimental devices can be classified into preparations, transformations, and measurements. With each use, a preparation device (such as an oven, antenna, or laser) outputs an instance of a physical system, denoted by A, in some state ω specified by the type of device and its various settings. The system then passes through a transformation device (such as a beam splitter, or Stern-Gerlach magnet) which modifies the state of the system, in a potentially non-deterministic fashion. Finally, a measurement device takes in the system, and one of a distinct set of outputs (such as a light flashing, or a pointer being in some range of possible positions) signals the measurement outcome. Even though we motivate the formalism by example of such laboratory devices, the resulting operational framework is not restricted to this and may also be used to describe other physical processes.
A main purpose of a physical theory in this framework is to specify the probabilities of the outcomes of any measurement made on a system that has been prepared in a given state. To this end, single measurement outcomes, called effects, will be denoted by lowercase letters such as e. The probability of obtaining an outcome e, given state ω, will be denoted e(ω).
By standard arguments, each state can be specified by a minimal list of measurement outcome probabilities, which contains sufficient information to predict the probabilities of all measurements that can be in principle performed on the system. Using this idea and a further convexity argument, states can therefore be represented as elements of a real linear space of some finite dimension K A , which we denote also by A. Further, for each system A there is a convex compact subset, Ω A ⊂ A, of normalized states in a real affine space of dimension K A − 1 which is embedded in A as an affine plane not intersecting the origin. The nonnegative multiples of elements of Ω A form a regular (closed, generating, pointed) cone A + ⊂ A, of unnormalized states.
Effects then become linear functionals from A to R such that 0 ≤ e(ω) ≤ 1 for all ω ∈ Ω A , i.e. they give valid probabilities on normalized states. The nonnegative multiples of effects constitute the dual cone A * + := {e ∈ A * : ∀γ ∈ A + e(γ) ≥ 0}. Given our embedding of Ω A in A, there is a unique unit functional u A ∈ A * that evaluates to 1 on every element of Ω A . The set of all mathematically valid effects is the unit order interval, This notation uses the ordering obtained from the regular cone A * + , writing x ≤ y for y − x ∈ A * + . For a given system, not all mathematically valid effects may be "operationally possible" measurement outcomes, so we define a subset E of the full set of effects on Ω, which we call the allowed effects. Thus, we are not assuming the "no-restriction hypothesis" [61]. We make weak, operationally natural assumptions on the subset E: it is convex and topologically closed, contains u A , and for every x ∈ E, u A − x is also in E (so that x can be part of at least one complete measurement, namely {x, u A − x}). We also assume that E has full dimension (otherwise, there would be states ϕ = ω that give the same outcome probabilities for all allowed measure-ments, which means that we would not have called them "different states" to start with). We think of any set of allowed effects e i such that i e i = u A as a possible measurement we could make on the system, with outcomes e i 1 . Since we can imagine post-processing the output of such a measurement such that a chosen pair e i and e j of outcomes are grouped together as a single outcome (a "coarse-graining" of the measurement), we also assume that e i + e j is allowed. In brief, we assume that whenever e i , e j are allowed effects with e i + e j ≤ u A , e i + e j is allowed. From our assumptions, it follows that the set of allowed effects is the unit order interval [0, u A ] in a regular subcone A + (containing u A ) of the dual cone. If A + = A * + , we say that all effects are allowed. Associated with every system there is also a set of allowed transformations, which are linear maps T : A → A, taking states to states, namely, T (A + ) ⊆ A + (a property called positivity). Transformations are required to be normalization-nonincreasing, i.e. u A (T (ω)) ≤ 1 for all ω ∈ Ω A . The set of allowed transformations is also closed topologically and under composition. If all effects are allowed, it follows from positivity and normalization that e • T ∈ E for all allowed effects e (all elements of E); otherwise we explicitly require this (i.e., that T * (E) ⊆ E).
Since E is the unit order interval in A + , it is equivalent (for normalization-nonincreasing T ) to require that T * (A + ) ⊆ A + . We note also that the normalizationnonincrease condition is equivalent to the dual condition T * (u A ) ≤ u A . An allowed transformation T is called reversible if its inverse T −1 exists and is also an allowed transformation. It follows that reversible transformations T preserve normalization: u A (T (ω)) = u A (ω) for all ω ∈ A + (though these are not in general the only normalization-preserving transformations). The set of all reversible transformations on a system A is a compact group G A . For a transformation T , the number u A (T (ω)) can be interpreted as the probability of transformation T occurring, if a system prepared in state ω is subjected to a process that has as a possible outcome the occurrence of T . In other words, transformations can be part of an instrument in the sense of [68].
A system described by standard complex ndimensional quantum theory fits into this framework. Its ambient real vector space A is the n 2 -dimensional space of complex Hermitian n × n-matrices, the cone of states A + is the set of positive semidefinite matrices, Ω is the set of density matrices (the intersection of A + with the affine plane {ρ : trρ = 1}), the order unit is the functional ρ → trρ, and the allowed effects are the unit order interval in the dual cone, i.e., the functionals ρ → tr(Eρ) where 0 ≤ E ≤ 1. The allowed transformations are the trace-nonincreasing completely positive maps A → A, and the reversible transformations are the maps ρ → U ρU † for unitary matrices U .
We now describe some further important notions and facts about this type of theory and the relevant mathematical structures that will be used in our discussion.
A cone A + is reducible if the ambient space decomposes into two nontrivial subspaces such that every extremal ray of the cone lies in one or the other of these subspaces. A system is called reducible if its cone of unnormalized states is reducible. Intuitively, information about which of these two summands the state is in, is classical information. Every cone in finite dimension has a decomposition as a finite sum ⊕ n i=1 A i of irreducible cones, and if these irreducible components are all onedimensional any base for the cone is affinely isomorphic to the simplex of probability measures over n outcomes, so we say the system is classical. Its faces are the subsimplices generated by the subsets of outcomes, its reversible transformations are the permutations of the vertices, and more general transformations are given by substochastic matrices.
One can identify A * with A by introducing an inner product ., . on A, and interpreting the inner product as functional evaluation: e(ω) = e, ω . Via this isomorphism the dual cone A * + is identified with the "internal dual cone" relative to the given inner product, A * int + := {y ∈ A : ∀x ∈ A + y, x ≥ 0}. Often, such an inner-product-space formulation is used as the basic framework for presenting probabilistic systems and theories; see for example [46,66]. If an inner product can be introduced in such a way that A * int + = A + , the cone is said to be self-dual and the inner product self-dualizing; a cone in an inner product space is said to be manifestly self-dual if the inner product is the one that identifies the cone with its dual.
A set of states ω 1 , . . . , ω n ∈ Ω A is called perfectly distinguishable if there are allowed effects e 1 , . . . , e n ∈ A + which can appear in a common measurement, i.e. e 1 + . . . + e n ≤ u A , such that e i (ω j ) = δ i,j , that is 1 if i = j and 0 otherwise 2 .
A face F of a convex set C is a subset of C such that α ∈ F and α = i λ i ω i , ω i ∈ C, λ i > 0, i λ i = 1 implies that all ω i ∈ F . In other words F is closed under inclusion of anything that can appear in a convex decomposition of an element of F . An exposed face of a convex set is the intersection of a supporting hyperplane with the set, easily seen to be a face. The faces of A + and those of Ω A are in 1-1 correspondence: the face of A + corresponding to face F of Ω is just {λF : λ ≥ 0}. The relation "is a face of" is transitive: If G is a face of C, and F is a face of G, then F is a face of C. The orderings of the set of faces and of the set of exposed faces by subset inclusion each form a lattice, with greatest lower bound F ∧ G = F ∩ G, and least upper bound F ∨ G, which is the smallest face containing both F and G. If a lattice has an upper bound, this is conventionally called 1, and a lower bound is called 0; for Ω we have 1 = Ω and 0 = ∅, while for A + , 1 = A + and 0 = {0}, where 0 is the 0 of the vector space A. (We adopt the convention that the empty set ∅ is not counted as a face of A + .) An atom is a minimal non-zero element of the lattice; the atoms of the face lattice of a regular finite-dimensional cone are the extremal rays, Ray(ω) := {λω : λ ≥ 0} for ω extremal in Ω. An element of A + may be called ray-extremal if it is a nonnegative multiple of a pure state of Ω.
Quantum systems are self-dual, with all effects allowed, and with the self-dualizing inner product usually chosen to be X, Y = tr(XY ). (For this reason, the dual cone is often identified with the positive semidefinite operators, and the effects with operators E such that 0 ≤ E ≤ 1, rather than with the functionals ρ → trEρ associated with such operators.) The faces of a quantum system, which are all exposed, correspond to the subspaces S of the underlying Hilbert space: the face F S of Ω corresponding to such a subspace S consists of the density matrices ρ whose images, when viewed as linear operators on that Hilbert space, are contained in S. Equivalently, they are those density matrices whose convex decompositions into rank-one projectors involve nonzero probabilities only for projectors onto subspaces of S.

III. CONSEQUENCES OF POSTULATES 1+2
We call a list of n perfectly distinguishable pure states a frame, of size n. The convex hull of such a set of states is a simplex, isomorphic to the space of probability measures on n alternatives, which we call a "classical subsystem" of the state space. For every finite-dimensional system A, there is a largest frame size N A ; frames of this size are called maximal. In quantum theory, a frame corresponds to a set of mutually orthogonal pure states, and it is maximal if the corresponding state vectors are an orthonormal basis of the underlying Hilbert space.
Using the concepts we have introduced, our first two postulates can be stated as follows: Postulate 1 is that every state ω ∈ Ω has a decomposition of the form ω = i p i ω i , for some probabilities p i and some n-frame ω 1 , . . . , ω n , for some n ∈ N.
Postulate 2 is that every n-frame can be taken to every other n-frame by a reversible transformation of Ω.
We could paraphrase Postulate 1 as "every state lies in some classical subsystem", and Postulate 2 as "all classical subsystems of a given size are equivalent". Proposition 1. Postulate 1 implies that all effects are allowed.
Proof. We show each effect e ∈ A * + that generates an exposed ray of A * + is allowed, i.e. an element of A + . It follows that all effects are allowed, since the exposed rays generate A * + via convex combinations and closure.
By the definition of exposed ray of A * + , there is an x ∈ A + such that the non-negative multiples of e are the unique nonzero effects with e(x) = 0. We may choose x to be normalized. Since e exposes a proper face containing x in A + , x is in the boundary of Ω. By Postulate 1, x belongs to some classical subsystem ∆ = conv{ω 1 , . . . , ω n } ⊆ Ω. It must be in ∆'s boundary since it is in Ω's. By the definition of classical subsystem there are allowed effects e 1 , ..., e n with e i (ω j ) = δ ij . Since x is in the boundary of ∆, there must be at least one i such that e i (x) = 0. Because e and its multiples are the unique effects for which e(x) = 0, this i is unique, and e is a multiple of e i , hence allowed. Proposition 2. Postulates 1 and 2 imply that every face of Ω is generated by a frame. Any two frames that generate the same face F have the same size, called the rank of F , and denoted |F |. Moreover, if G F then |G| < |F |, and every frame of size |F | in F generates F . Proof. A face is generated by any element of its relative interior. By Postulate 1, such an element is in the convex hull of a frame; this frame also generates the face.
Let F be any face, and suppose there are two frames ϕ 1 , . . . , ϕ m and ω 1 , . . . , ω n with m < n that both generate F , and e 1 , . . . , e n effects such that e i (ω j ) = δ i,j and i e i ≤ u. Let F be the face generated by ω 1 , . . . , ω m , Since T F is a proper face of F , it must have smaller dimension, which contradicts the invertibility and thus reversibility of T . Similarly, if we had G F and |G| ≥ |F |, then a reversible transformation could map F into G, which is a contradiction, too.
If ω 1 , . . . , ω |F | is any frame on F , and G the face that it generates, then G ⊆ F , and some reversible transformation T will map it to some other frame of the same size that generates F . Hence T G = F , and this contradicts G F . Proposition 3. Postulates 1 and 2 imply that A + is selfdual, with a corresponding self-dualizing inner product that satisfies T ϕ, T ω = ϕ, ω for all reversible transformations T . The inner product can be chosen so that the corresponding norm ω := ω, ω attains the value 1 on all pure states, and is strictly less than 1 for all mixed states.
Proof. Ref. [31] shows that bit symmetry and the fact that all effects are allowed imply this proposition. Bit symmetry is the 2-frame case of Postulate 2, and we have shown that all effects are allowed in Proposition 1. Proof. Reversible transformations T resp. T −1 preserve the normalization, i.e. u A (T −1 ω) = u A (ω) for all ω, which we can now write as u A , ω = u A , T −1 ω = T u A , ω . Since this is true for all ω, we must have T u A = u A for all reversible transformations T .
Proof. Let ϕ 1 , . . . , ϕ N be any frame that generates all of A + , with effects e 1 , . . . , e N such that e j (ϕ i ) = δ ij and j e j = u A . Then ϕ 1 , . . . , ϕ n is itself a frame of size n; thus, according to Postulate 2, there is a reversible transformation T with T ϕ i = ω i for i = 1, . . . , n. For i > n, define ω i := T ϕ i . Set e j := e j • T −1 , then e j (ω i ) = δ ij and j e j = u A , and so we have extended ω 1 , . . . , ω n to a frame with N elements.
The following proposition will turn out to be useful in several proofs. Proposition 6. Postulates 1 and 2 imply that if ω 1 , . . . , ω n are mutually orthogonal pure states, then they are a frame, and Proof. We have to find effects e 1 , . . . , e n with e i (ω j ) = δ ij and n i=1 e i ≤ u A . To this end, we will first construct a decomposition of the order unit. By self-duality and Proposition 3 , . . , N }, the states ϕ π(1) , . . . , ϕ π(N ) are again a frame; thus, there is a reversible transformation T π with T π ϕ i = ϕ π(i) . Hence (using the invariance of u A under reversible transformations) Taking the inner product with ϕ j shows that λ π −1 (j) = λ j ; since this is true for all permutations, all λ j are equal to some λ > 0. Finally, 1 = u A , ϕ 1 = u A 2 λ, and Thus, we have shown that every maximal frame adds up to the order unit. Now we show the statement of the proposition by induction on n. Start with n = 1. Any pure state ω 1 is by definition a frame of size 1. Moreover, if ϕ ∈ Ω, then the Cauchy-Schwarz inequality yields hence ω 1 ≤ u A . Now suppose the statement of the proposition is true for some n, and consider pure mutually orthogonal states ω 1 , . . . , ω n+1 . Set e 1 := ω 1 , . . . , e n := ω n , and e n+1 := u A − n i=1 e i . By the induction hypothesis, e n+1 ≥ 0, and so e 1 , . . . , e n+1 is a measurement with e i (ω j ) = δ ij for 1 ≤ i, j ≤ n + 1. Thus, ω 1 , . . . , ω n+1 is a frame. According to Proposition 5, it can be extended to a maximal frame ω 1 , . . . , ω N , and then Proposition 7. Postulates 1 and 2 imply that for every face F of A + , the set F : . . ϕ n is any frame that is contained in some face F , then it can be extended to a frame ϕ 1 , . . . , ϕ n , . . . , ϕ |F | that generates F .
As mentioned in Section I, Postulates 1 and 2 imply that there is a special transformation called a filter associated with each face of the state space. The next theorem shows that certain projections are positive (recall that a linear map is positive if it maps the cone A + into itself), and in Section IV we will further show that these projections have the additional properties required of filters.
Theorem 8. Postulates 1 and 2 imply that for every face F of A + , the orthogonal projection P F onto the linear span of F is positive.
Proof. Iochum ([29], see also [30]) has shown that positivity of all P F is equivalent to perfection. (For the reader's convenience, and the authors' peace of mind, a proof is included in Appendix A.) A cone is called perfect if all faces F of A + , regarded as cones in the linear span lin F , are themselves self-dual with respect to the inner product inherited from A. We will therefore show this property, establishing the claim.
So let F be any face of A + , and F * ⊂ lin F be the dual cone with respect to the inner product inherited from A.
The properties that we have proven so far turn out to give an interesting structure known from the field of quantum logic, indeed sometimes taken as a definition of a quantum logic [50]. As noted above, the set of faces ordered by subset inclusion is a bounded lattice. However, from Postulates 1 and 2, we recover more of the logical structure of quantum theory: Theorem 9. Postulates 1 and 2 imply that the lattice of faces of A + is an orthomodular lattice.
Before giving the proof, recall that orthomodularity is the property that (1) Note that in [32] it is shown that for self-dual cones, orthomodularity of the face lattice in the above sense is equivalent to the property of perfection mentioned in the proof of Theorem 8. Furthermore, in [16] it is shown that orthomodularity of the face lattice, according to an orthocomplementation which agrees with ours in case Postulates 1 and 2 hold, follows from a property called projectivity. In the next section we will define projectivity and establish that state spaces satisfying Postulates 1 and 2 are projective, giving us an alternative proof of orthomodularity. Here, we proceed with the direct proof.
Proof. Constructing F as the face generated by the extension of a frame generating F shows easily that (F ) = F (as already shown in Proposition 7), and that F ⊆ G implies F ⊇ G , as well as F ∨ F = 1 ≡ A + and F ∧ F ≡ 0 ≡ {0}. These properties mean that the operation is an orthocomplementation on the lattice of faces. It remains to show that this orthocomplemented lattice satisfies the orthomodular law, Eq. (1). To this end, assume F ⊆ G, and let ω 1 , . . . , ω |F | be a frame on F . Extend this to a frame on G, and further extend the result to a frame on A + , yielding ω 1 , . . . , ω N . Then ω |F |+1 , . . . , ω |G| is a frame on G∩F ; if it did not generate G ∩ F , it could be extended in G ∩ F , and to this extension we could append ω 1 , . . . , ω |F | to obtain a frame of size larger than |G| in G, which is a contradiction. Hence H := G∩F is generated by ω |F |+1 , . . . , ω |G| . Since F ∨H is the smallest face containing F and H, it is the smallest face containing ω 1 , . . . , ω |G| , hence equal to G.
Systems that satisfy Postulates 1 and 2 are operationally close to quantum theory also with respect to their contextuality behavior: they satisfy the principle of consistent exclusivity [52], the single-system generalization of the recently introduced postulate of local orthogonality [53]. This is also called Specker's Principle [55], and comes in slightly different versions, depending on assumptions of the validity of the principle in situations where one has more than one copy of a state. Here we are interested in the single-system version that is called CE 1 in [52].
In order to talk about contextuality, we need a notion of "sharp measurements": the analogs of projective measurements in quantum theory. Following [55], we call an effect 0 ≤ e ≤ u A sharp if it can be written as a sum of normalized extremal effects; that is, if there are pure states ω 1 , . . . , ω n such that and if an analogous decomposition exists for u A −e. This definition does not assume that the ω i are mutually orthogonal; however, they have to be as a consequence of Postulates 1 and 2. To see this, note that for all j hence ω i , ω j = 0 for all i = j. The corresponding effects e can also be characterized in two further ways, namely as projective units and as the extremal points of the unit order interval, giving further weight to the interpretation as the analogue of orthogonal projectors in quantum theory. This is the content of the next lemma. We start with a definition.
Definition 10 (Projective units). Let A be any system satisfying Postulates 1 and 2. Then, to every face F of A + , define the projective unit u F as where P F is the orthogonal projection onto the linear span of F . A projective unit u F is called atomic if |F | = 1.
This is now used in the following lemma: Lemma 11. Let A be any system satisfying Postulates 1 and 2. Then, to every face F of A + , there is a unique effect u F with 0 ≤ u F ≤ u A such that u F (ω) = 1 for every ω ∈ F ∩ Ω A , and u F (ϕ) = 0 for all ϕ ∈ F ∩ Ω A , namely the projective unit from Definition 10. If ω 1 , . . . , ω |F | is any frame that generates F , then Furthermore, every effect e ∈ A + with 0 ≤ e ≤ u A is a convex combination of projective units, and we have Proof. As in Definition 10, set If ϕ ∈ F , then P F ϕ = 0, and an analogous computation proving that there exists some frame ω 1 , . . . , ω |F | with decomposition (2) of u F , and showing the inequality 0 ≤ u F ≤ u A . If ϕ 1 , . . . , ϕ |F | is any other frame on F , then there exists a reversible transformation T with T ω i = ϕ i . Since both frames generate F , T must preserve the face F (and also its orthogonal complement because T is orthogonal). Hence proving that u F can be decomposed into any frame in the claimed way. If 0 ≤ e ≤ u A is any effect, then it has a frame decomposition e = |A+| i=1 λ i ω i , where ω i ∈ Ω A are mutually orthogonal pure states, and 0 ≤ λ i ≤ 1. Thus, the vector λ := (λ 1 , . . . , λ |A+| ) is an element of the |A + |dimensional unit cube, and can thus be written as a convex combination of extremal points of the (convex) cube, corresponding to vectors µ = (µ 1 , . . . , µ |A+| ) where all µ i ∈ {0, 1}. Hence e can correspondingly be decomposed into effects of the form |A+| i=1 µ i ω i , which are projective units. This also shows that the u F are the unique effects with the properties stated in the lemma.
Following the definition of [56], expressed in the language of [52], every system satisfying Postulates 1 and 2 defines a contextuality scenario given by a hypergraph H, where the vertices of H are the projective units u F (F = {0} any face of A + ), and the edges are collections of effects u F1 , . . . , u Fn with n i=1 u Fi = u A . These edges describe contexts, i.e. sharp measurements (given by projective units) that are compatible (i.e. jointly measurable).
Theorem 12. Any system satisfying Postulates 1 and 2 also satisfies the principle of Consistent Exclusivity CE 1 as given in [52, Def. 7.1.1] and [57].
Proof. We have to show the following: if I is any set of vertices of the hypergraph H such that every two elements of I belong to a common edge, then e∈I e(ω) ≤ 1 for all ω ∈ Ω. In the context of Postulates 1 and 2, I is then a set of projective units This proves the claim.
As mentioned in Section I, the classification of the set of all state spaces that satisfy Postulates 1 and 2 remains an open problem with interesting physical and mathematical implications. Now we show that one additional assumption brings us into the realm of Jordan algebra state spaces. Before postulating the absence of third-order interference, we study another postulate which turns out to be equivalent in our context.

IV. JORDAN SYSTEMS FROM POSTULATES 1+2 AND PURITY PRESERVATION BY FILTERS
The main tool for our argumentation will be the following consequence of Alfsen and Shultz [16,Thm. 9.33], first published in [17]. One of the conditions will be important in its own right in what follows, and is therefore called Postulate 3'. ω is a pure state, then P ω is a nonnegative multiple of a pure state.
Then A + is the state space of a formally real Jordan algebra.
The original theorem in [16,17] is formulated in terms of "compressions", with similar results in finite dimensions given by Gunson [42] and by Guz [43][44][45]. Theorem 13 above is an adaptation to our language and to finite dimension, using the notion of filters instead of compressions. The conjunction of (b) and (c) is what Alfsen and Shultz [16] call the "pure state properties" (their (3)), while their (2) is a technical condition that is automatically satisfied in finite dimension, and their (1) follows from our (a).
The definitions used are the following.
Definition 14 (Filters and projectivity). Let A be any state space with cone A + . Projections are linear operators P : A → A with P 2 = P . Positive projections P and Q are called complementary if im + P = ker + Q and vice versa, where im + P := im P ∩ A + and ker + Q := ker Q ∩ A + . A positive projection P is complemented if there exists a positive projection Q such that P and Q are complementary.
A filter is a positive linear projection P : A → A which (i) is complemented, (ii) has a complemented adjoint P * , and (iii) is normalized, i.e. satisfies u A (P ω) ≤ u A (ω) for all ω ∈ A + . 3 The state space A is called projective if every face of A + is the image of a filter.
As we define them, filters are precisely the adjoints of compressions as defined in [16,Def. 7.22]. (In fact, in the context of Postulates 1 and 2, we do not need to apply the adjoint -filters are compressions.) As described above, in standard quantum theory the face associated with a subspace S of Hilbert space consists of the density matrices whose support is contained in S. Quantum state spaces are projective: there is a filter onto each face, namely the linear map ρ → P S ρP S , where P S is the orthogonal projector onto S. The complementary projection is ρ → P S ⊥ ρP S ⊥ .
We will explain condition (b) of Theorem 13 while proving the following theorem.
Theorem 15. In finite dimension, Postulates 1 and 2 imply that the system is projective. Assuming in addition Postulate 3 implies that the system is either irreducible Jordan-algebraic, or classical.
Proof. We have to prove conditions (a) and (b) of Theorem 13 from Postulates 1 and 2. We start with (b).
Observe that given (a), for each atomic projective unit p, which is associated [16,Prop. 7.28] with a unique filter P for which P * u = p, the associated face {x | p(x) = 1} of Ω contains a single pure state. Call this statep. The map p →p is a one-to-one map from the set of atoms of the lattice of projective units onto the set of extremal points of Ω. The system is said to satisfy (b), symmetry of transition probabilities, if for all pairs a, b of atoms of the lattice of projective units, a(b) = b(â). In the context of Postulates 1 and 2, atomic projective units are u F for |F | = 1, where F is generated by a pure state (frame of size 1) ω 1 , such that u F (ϕ) = ω 1 , ϕ according to Lemma 11, soû F = ω 1 = u F in the notation just introduced. Thus It remains to show condition (a). In Theorem 8, we have already shown that the orthogonal projections onto faces are positive. Now we show that they are filters, which establishes that A + is projective. For any face F , the corresponding projection P F satisfies im P F = lin F , im + P F = F , and ker + P F = F = F ⊥ ∩ A + . So also im + P F = F and ker + P F = F = F , and we see that P F and P F are complements, establishing property (i) in the definition of filter. Since by self-duality, P F = P * F , P F has complemented adjoint, property (ii). P F and P F are positive by Theorem 8. To see property (iii), i.e. normalization of P F , recall from Lemma 11 that To see that the only reducible Jordan-algebraic cones this allows are the classical ones (corresponding to direct sums of the one-dimensional formally real Jordan algebra), note that the cone of a direct sum of Jordan algebras is the direct sum C 1 ⊕C 2 of their cones. Write our Jordan algebra as a direct sum of irreducible ones, obtaining a direct sum ⊕ i C i of irreducible cones. Suppose one of the summands, say C j , is not one-dimensional. The face generated by two ray-extremal points, ω j ∈ C j and ω k ∈ C k , with k = j, is a direct sum of one-dimensional cones, i.e. a classical bit. Since C j is irreducible and not one-dimensional, it is not classical, so it contains perfectly distinguishable pure states ω j and ω i that generate a face that is not a direct sum. Since we have another rank-2 face that is a direct sum, in light of Proposition 2 this violates Postulate 2. Hence either the cone is irreducible, or all summands are one-dimensional (i.e. it is classical).
The following proposition will be needed later.
Proposition 16. Assume Postulates 1 and 2. Then, to every face F 1 of A + with complementary face F 2 ≡ F 1 and corresponding projections P 1 and P 2 , the space A has an orthogonal decomposition where A i := im P i , A c 12 := ker P 1 ∩ ker P 2 . Proof. By construction, A 1 = lin F 1 ⊥ lin F 1 = A 2 , and by elementary linear algebra,

V. THIRD-ORDER INTERFERENCE
Rafael Sorkin defined a notion of k-th order interference [1], which can be manifested in analogues of the two-slit experiment involving k or more slits. This notion was adapted to projective convex systems in [13,14], and the k = 3 case explored in [15]. Quantum theory exhibits k = 2 interference, but no higher interference. In this section, we show that Postulates 1 and 2, plus the assumption of no third-order interference, characterize Jordan algebraic systems.
We start with the definition of an M -slit experiment. For any given set of faces with the properties stated above, the corresponding set of orthogonal projections P J := P F J will be called an M -slit mask. It is called complete if P 12···M = 1.
The interpretation is that we build an experiment in which a prepared state ω passes an array of M slits, followed by a final measurement with the event "detector clicks" being described by effect e. In every run of the experiment a certain subset J of slits will be open, and e J (ω) is the total probability of passing the slits and giving a detector click. If ω is a state that passes the slits unchanged (which is true for states in the face F J ), then the total probability is just e(ω). On the other hand, if ω is a state that is filtered out by the slit mask (which happens for states in the face F J ), then this probability is zero. We run the experiment a sufficient number of times, for each subset J, to estimate each of the probabilities e J (ω).
We will soon show that the effect of these slits on the state can be described by the orthogonal projections P F J . However, note that Definition 17 does not assume that we can actually build or implement those maps operationally; all we have to be able to do is to measure the effects e J , which we know can be done due to Proposition 1.
Such an experiment exhibits second-order interference (say, for M = 2) if the overall interference pattern e 12 (ω) fails to be the sum of the one-slit patterns e 1 (ω), e 2 (ω). 4 If it exhibits second-order interference, it may in addition exhibit irreducibly third-order interference. Third-order interference occurs if the overall pattern e 123 (ω) fails to be the sum of the double-slit patterns e ij (ω), corrected for overcounting by subtracting suitable multiples of the single-slit patterns e i (ω). Unless otherwise specified we use the notation i<j to mean the double sum i j>i .
The second term in (3) corrects for the overlaps of the sets {i, j}; each index occurs M − 1 times in pairs i < j. Sorkin's [1] original definition, and the discussion in [14,15], used the M = 3 case as their definition of thirdorder interference, but the two can straightforwardly if somewhat tediously be shown to be equivalent. Sorkin showed that if a scenario lacks k-th order interference, it cannot have l-th order interference for any l > k.
Proof. We have absence of third-order interference if for any choice of faces (as described in the statement of the lemma) and choice of effect e as well as state ω, (3) holds with equality. Since the states span the space, this is equivalent to the statement As this must hold for all effects e, and the effects span the space, we obtain the statement of the lemma. Now we are ready to prove one of our main results about the absence of third-order interference together with Postulates 1 and 2: Theorem 21. A system satisfies Postulates 1, 2 and 3 if and only if it is an irreducible Jordan system or a classical system.
Proof. We begin with the "if" direction: irreducible Jordan systems and classical systems satisfy Postulates 1, 2 and 3. For classical systems it is well-known and easy to see that Postulates 1 and 2 are satisfied: indeed, finitedimensional classical state spaces Ω are often defined as those for which every state has a unique decomposition into extremal points, and in this case Postulate 2 follows from the fact that any permutation of the extreme points in this unique maximal frame is an affine automorphism of Ω. Classical systems do not even have 2nd-order interference [1] (the first level that is actually interference), so they cannot have any higher order of interference. It follows directly from a fairly standard orthogonal decomposition in formally real Jordan algebras (cf. e.g. [8]) that finite-dimensional Jordan systems satisfy Postulate 1; and it is also well-known that the Jordan algebra automorphisms are affine automorphisms of the normalized state space, and act transitively on the set of ordered sets of orthogonal extremal states in the irreducible case [8]. In Proposition 27 below, we show that in the context of Postulates 1 and 2, absence of third-order interference is equivalent to the property that filters preserve purity of states. Since the latter property is well-known for a class of Jordan systems including the finite-dimensional ones [16,Thm. 9.38], this shows that they satisfy also Postulate 3.
The "only if" direction is an immediate consequence of Proposition 27-to be proved in the remainder of this section-which states that the absence of third-order interference implies that all filters preserve purity, together with Theorem 15, which states that Postulates 1, 2, and purity-preservation by filters imply that systems are irreducible Jordan, or classical.
The proof of the crucial Proposition 27 proceeds via several other propositions and lemmas. The following property is also mentioned in [13] and [15].

Lemma 22.
It follows from Postulates 1 and 2 that P J P K = P J∩K for any M -slit mask.
Proof. First note that if F , G and H are faces such that F ⊥ H and G ⊥ H, then (F ∨ G) ⊥ H. This is because (F ∨ G) ∩ H ⊥ is a face which contains F and G, and is also a subset of F ∨ G, hence equal to F ∨ G.
Defining the projective units u j := u Fj and u J := u F J , it follows from Lemma 11 that u K = k∈K u k . Hence For j ∈ J and l ∈ K \ J we have F j ⊥ F l , thus F J = ∨ j∈J F j ⊥ F l , and so P J u l = 0. On the other hand, if k ∈ K ∩ J then P J u k = u k , so According to [16,Prop. 7.39], this implies that P J P K = P K P J , which in turn implies [16,Thm. 8.3] that P J P K = P J ∧ P K = P J∩K .
The next proposition uses the decomposition described in Proposition 16 to derive a similar decomposition corresponding to a complete M -slit mask.
Proposition 23. Let P i with i ∈ {1, ..., m} be a complete M -slit mask on a system A satisfying Postulates 1 and 2. Then there is an orthogonal decomposition where A i := im P i , A c ij := ker P i ∩ ker P j ∩ im P ij and A (3) := i<j ker P ij .
Proof. Using Proposition 16 and the fact that each face is itself a system satisfying Postulates 1 and 2, we decompose each im P ij as A i ⊕ A j ⊕ A c ij . (Note that we still have A c ij orthogonal to A i ⊕ A j because it is contained in ker P i ∩ ker P j .) For k, l / ∈ {i, j} we have A c ij ⊥ A c kl , since im P ij ⊥ im P kl . Furthermore, for i = k, A c ij ⊥ A c jk , because for x ∈ A c ij , y ∈ A c jk x, y = P ij x, P jk y = x, P ij P jk y = x, P j y = 0 , where the first equality follows from x ∈ im P ij , y ∈ im P jk due to the definitions of A c ij , A c jk , the last equality from y ∈ ker P j due to the definition of A c jk , and the second last equality from Lemma 22. Now we just have to show that A (3) := i<j ker P ij is the orthogonal It is interesting to note that the pairwise intersections ker P i ∩ ker P i represent "coherences" associated with the two-slit experiment P i , P i [15], and that intersecting this with im P ij gives the part associated with the twoslit experiment P i , P j . The decomposition is thus into the spans of the faces im + P i , M (M − 1) spaces associated with interference between these faces, and a further space, which as the next Proposition shows, is associated with three-way interference. Proposition 23 is stated as a decomposition of the vector space A. However, note that every face of A + (with group of reversible transformations given by the restriction of those global reversible transformations that preserve that face) is itself a state space satisfying Postulates 1 and 2. Thus, if we have an incomplete M -slit mask with F := im P 12...M and corresponding face F + := F ∩ A + , we obtain a decomposition where F i = im P i ⊆ F , F c ij = ker P i ∩ ker P j ∩ im P ij ⊆ F , and F (3) = i<j ker P ij ∩F . This is used in the following proposition. Proof. From Lemma 20, the absence of third order interference is equivalent to for all x ∈ A. However, since P ij = P ij P 12···M and P i = P i P 12···M , this is equivalent to (7) holding for all x ∈ im P 12···M =: F . Since the pure states in F span F , this is equivalent to for all pure states ω ∈ F . By Proposition 23 and its generalization (6), P ij ω = P i ω + P j ω + ω c ij , where ω c ij is the component of ω in F c ij . So absence of third-order interference is equivalent to for all pure states ω ∈ F . Noting that i<j (P i + P j ) contains, for each fixed value of k, M − 1 occurrences of P k , this becomes: In other words, ω (3) = 0 in F (3) in (6).
Definition 25. The impurity I(ω) of any unnormalized state ω ≥ 0 is defined as: For normalized states ω ∈ Ω, we have u(ω) = 1, and ω ≤ 1, with equality if and only if ω is a pure state. Extending this to the unnormalized states by multiplication ω → λω with λ ≥ 0 shows that I(ω) ≥ 0 for all ω ≥ 0, with equality if and only if ω is ray-extremal. Proposition 26. Let P i with i ∈ {1, ..., M } be an M -slit mask on a system satisfying Postulates 1 and 2. Then for any state ω ∈ F := im P 12...M (not necessarily pure or normalized) (6).
While we use this equation directly in what follows, its significance is underlined by noting its immediate corollary: that if ω and each of the P i ω are pure, and there is no third-order interference, then (by the nonnegativity of impurity) each of the P ij ω is also pure. In other words: in the absence of third-order interference, if the P i are each purity-preserving, so also are the P ij . Proof of Proposition 26: First we expand ω ∈ F via (6): Taking squared norms and using orthogonality of the decomposition, we get In order to get results about the purity of P i ω and P ij ω, we use P ij ω = P i ω + P j ω + ω c ij to eliminate ω c ij by substituting ||ω c ij || 2 = ||P ij ω|| 2 − ||P i ω|| 2 − ||P j ω|| 2 in (9), obtaining: Since a given k appears (as i or j) in M − 1 of the pairs i < j, and the last sum in the above expression has a ||P k ω|| for each such appearance, this becomes Note that ||P ij ω|| 2 = u(P ij ω) 2 − I(P ij ω), so Using Again using the fact that a given i appears in M − 1 of the pairs i < j, and writing i j =i in place of 2 i<j , this becomes: Substituting this into (10) and rearranging gives (8).
We will use this result several times in an inductive argument to establish that all filters are purity preserving.
Proposition 27. Let a system A satisfy Postulates 1 and 2. Then it has no third-order interference if and only if all its filters are purity-preserving.
Proof. Suppose that all filters are purity-preserving. Then, if P i , i ∈ {1, . . . , M } is any M -slit mask and ω is a pure state in im P 12...M , we have I(ω) = I(P i ω) = I(P ij ω) = 0, and so (8) implies that the component ω (3) of ω in F (3) in (6) is zero. Then Proposition 24 implies that there is no third-order interference.
To show the converse direction, note first that it follows from [16,Prop. 7.28] in the context of Postulates 1 and 2 that all filters are of the form P F for some face F ; thus, we only have to show that these orthogonal projections are purity-preserving. Let N be the size of A's largest frame. The proof that all filters are purity-preserving will be inductive on the rank of filters. The base case is rank-1 filters, which holds because a rank-one filter projects the state onto the span of an extremal ray of A + .
We now prove the induction step, which states that if for some fixed rank k ≤ N − 1 all filters are puritypreserving, then all filters of rank k + 1 are purity preserving. Suppose filters of rank k are purity-preserving and consider any mask consisting of a rank-k filter P 1 and N − k rank-1 filters P i , i ∈ {2, ..., N − k + 1}. Then for any pure state ω each P i ω is pure. So with ||ω (3) || 2 = 0 by the absence of third-order interference, (8) becomes: Since impurity is nonnegative, each of the P ij ω is pure too. So all the P i ∨ P j , and in particular the rank-(k + 1) filters P 1 ∨P i , i ∈ {2, ..., N −k+1}, are purity-preserving. Since every rank-(k + 1) filter on A has the form P ∨ Q for some rank-k P and some rank-1 Q orthogonal to P , all rank-(k + 1) filters on A are purity-preserving, and the induction step is established for k ≤ N − 1. Hence all filters of rank up to N − 1 are purity-preserving.
In the context of assumptions (a) and (b) of Theorem 13, Postulate 3' is known [16,43] to be equivalent to another postulate: that the lattice of exposed faces has the covering property. We say that an element F of a lattice covers another element G if G is below F and there is nothing between them. Hence an atom is an element that covers 0. By definition, a lattice has the covering property if for every element F and atom a, either F ∨ a = F or F ∨ a covers F .
In the context of Postulates 1 and 2, the covering property can be formulated as follows: if F is any face of A + , and ω a pure state, then the face G generated by both has rank |G| ≤ |F | + 1. Since we have shown that (a) and (b) of Theorem 13 follow from Postulates 1 and 2, the covering property can replace the absence of third order interference (or Postulate 3 ).

VI. STANDARD QUANTUM THEORY FROM OBSERVABILITY OF ENERGY
In standard quantum mechanics, we are used to treating the generator of time evolution as an observable: evolution of any closed quantum system with initial state ρ 0 is given by where H = H † is the system's Hamiltonian. The righthand side, as a one-parameter group acting on ρ, is generated by the superoperator X : ρ → −i[H, ρ], so that ρ(t) = e tX ρ 0 . We are used to associating the observable E : ρ → tr(Hρ) with this generator, and call it the "expectation value of energy".
It is an interesting question why such an association is possible -what is the operational relation between E and X? The following properties characterize this relation: • If X and X are two different generators, then the corresponding observables satisfy E = E. That is, the observable determines the generator uniquely.
• The observable E is a conserved quantity of the time evolution generated by X: E(ρ(t)) = E(ρ 0 ).
• If time evolution is not trivial (i.e. ρ(t) not constant), then E is also not a trivial observable: there are at least two states ρ, σ such that E(ρ) = E(σ).
• The map X → E is linear -in particular, larger values of E correspond to "faster" time evolution.
These properties allow us to define a notion of "observability of energy" for arbitrary probabilistic theories, which will turn out to be a rather restrictive property. Definition 28. Let A be any state space with a group of reversible transformations G A that has a non-trivial Lie algebra g A = {0}. An energy observable assignment is an injective linear map φ : g A → A * such that the observable φ(X) is conserved under the time evolution generated by X, but not under all time evolutions unless X = φ(X) = 0. We say that "energy is observable" in A if there exists an energy observable assignment.
Writing the time evolution in initial state ω 0 explicitly as ω(t) := e tX ω 0 , a conserved quantity E ∈ A * is a linear functional with E(ω(t)) = E(ω 0 ). It is easy to check that this is equivalent to E • X = 0, where "•" is for composition of linear maps. If E were equal to the order unit, i.e. E = u A , then E(ω(t)) = u A (ω(t)) = 1 for all t and all time evolutions, since all elements of G A preserve the normalization. Thus, Definition 28 implies the conditions Our notion is related to Alfsen and Shultz's notion of a "dynamical correspondence" [16], except that they require an injection of observables into dynamical generators, rather than vice versa, and in addition to a conservation condition, impose a condition relating reversible transformations to general automorphisms of the cone of states which is formulated in the Jordan-algebraic setting. Our setting is more general, and we impose no such relation between the reversible transformations and cone automorphisms. Connes [22] used a notion of orientation related to dynamical correspondence to characterize the state spaces of von Neumann algebras (one of the infinite-dimensional generalizations of standard quantum systems) among those of JBW-algebras (one infinite-dimensional generalization of finite-dimensional formally real Jordan algebras). Other work making use of similar notions to characterize quantum and classical theory in different settings, can be found in [23][24][25][26]. References [23], [24], and [25] all derive relations between energy and observables, and thence that the theory must essentially be standard quantum or classical, from considerations involving dynamics on composites, so our work is complementary to theirs in that we avoid assumptions about composite systems. The identification of dynamical generators with conserved observables that exists in classical and quantum theories is central to many physical phenomena and arguments, providing motivation for our postulate. We mention in particular that standard formulations of the statistical mechanics underlying thermodynamics use a conserved energy observable in the definition of free energy.
Our goal is to show the following: Theorem 29. Postulates 1, 2, absence of third-order interference, and observability of energy imply that the state space is an N -level state space of standard complex quantum theory, for some N ∈ N, and all conjugations ρ → U ρU † with U ∈ SU (N ) are contained in the group of reversible transformations.
Proof. We show that complex quantum N -level state spaces are the only finite-dimensional irreducible formally real Jordan algebra state spaces that have observability of energy. This is enough due to Theorem 21. First, consider the d-dimensional ball state spaces ("spin factors") The Lie algebra is non-trivial only for d ≥ 2. Consider the case that the group of reversible transformations g d contains the full orthogonal group, such that g d = so(d). If the energy is observable, then there must be an injective map φ from g d to R d+1 . But dim(so(d)) = d(d−1)/2 which is larger than d + 1 for d ≥ 4, so no such map can exist for d ≥ 4. If d = 2, we have and calling this matrix X, it is easy to see that φ(X)•X = 0 implies that φ(X) = c · u d for the normalization functional u d (x 1 , x 2 , x 3 ) = x 1 . This contradicts the definition of an energy observable assignment.
If d is even or d = 7, there are compact connected subgroups of SO(d) that are transitive on the pure states of Ω d , and thus satisfy Postulate 2, (see Ref. [47] for the list of groups; they have been classified in [38,39]). As we show in Appendix B, all of these cases except for one can be ruled out by dimension counting, exactly as the cases d ≥ 4 above; the only case where this does not work is d = 4 with transformation group G 2 = SU (2). But there, it can be shown that there are time evolutions which only have the normalization as their conserved observable, contradicting Definition 28. Now let A be the state space of the 3 × 3 octonionic matrices. Due to Postulate 2 (in the special case of 1frames), the group of reversible transformations G A acts transitively on the pure states, which is the Cayley plane P 2 (O), hence so does its connected component at the identity [48]. According to [20] and [21], the only compact connected Lie group which acts transitively and effectively on it is the exceptional Lie group F 4 . But dim(F 4 ) = 52 > dim(A) = 27, so there is no injective linear map from g A to A * .
For N ≥ 3, consider the N -level state space A N of quaternionic quantum mechanics, with any group of reversible transformations G N satisfying Postulate 2. Then dim(A N ) = 2N 2 − N . The pure states define the quaternionic projective space P N −1 (H), and so G N must act transitively on it. According to [19,21], the only possibility is G N ⊇ Sp(N ), and dim(sp(N )) = N (2N + 1), which is larger than dim(A N ).
The only remaining cases are the N -level state spaces A N of real quantum mechanics for N ≥ 3, which are more difficult to rule out -dimension counting does not work. First, it can be shown from the classification results of [20] that Postulate 2 implies that the group of reversible transformations contains all maps of the form ρ → OρO T with M ∈ SO(N ); consequently, every map X(ρ) := ρ → [M, ρ] with M ∈ so(N ) is a valid generator. An energy observable assignment φ maps these generators (resp. the matrices M ) to observables (that is, symmetric matricesM ) such that [φ(X)](ρ) = tr(M ρ); the conservation condition φ(X) • X = 0 becomes [M,M ] = 0. However, as we show in the appendix by considering certain special generators X, all maps of this kind must haveM = 1 in their range, yielding the normalization functional, which contradicts the definition of an energy observable assignment.
In the standard case of complex N -level quantum theory, it remains to show that the group of reversible transformations G N contains all unitaries (it might also contain anti-unitaries; due to Wigner's theorem [27,28], these are the only possibilities). Postulate 2 implies transitivity of the connected subgroup of G N on the pure states, that is, on the projective space P N −1 (C); according to [21], for odd N , the only possibility is SU (N ); but if N is even, say N = 2n, there is a second possibility: Sp(N ). But consider two N -frames |e 1 e 1 |, . . . , |e N , e N | and |f 1 f 1 |, . . . , |f N , f N |, where e 1 , . . . , e N are defining vectors of the basis in which and only if U T JU = J. Moreover, suppose that f 1 = e 1 . If Postulate 2 is satisfied, there is U ∈ Sp(N ) such that U |e i e i |U † = |f i f i | for all i, so U e 1 = e iϕ e 1 for some ϕ ∈ R. Since Je 1 = e n+1 , it is easy to see that the symplectic constraint on U , together with U † e n+1 = (U T e n+1 ), implies that U e n+1 = e −iϕ e n+1 , so |f n+1 f n+1 | = |e n+1 e n+1 |, which contradicts frame transitivity, i.e. Postulate 2.
The fact that energy observability rules out classical systems in this theorem is a consequence of our finitedimensional setting, for which classical reversible dynamics are a discrete group. The probabilistic representation of phase-space classical mechanics involves an infinitedimensional space of Liouville distributions, and does, of course, have continuously parametrized reversible dynamics.

VII. DISCUSSION AND CONCLUSIONS
We have given four principles that we argue have, to various degrees, the virtues of conceptual clarity, important physical implications, intuitive appeal, and interesting experimental consequences. We have shown that while they are formulated in the setting of an extremely broad class of probabilistically described systems together they constrain the abstract structure of such a system to be that of the usual Hilbert space quantum theory over the complex field. Our demonstration was limited to finite dimension, a limitation which we believe to be primarily technical. This reconstruction of quantum theory differs interestingly from several previous ones in avoiding any postulates concerning the structure or even existence of composite systems.
Another desirable feature of our reconstruction is its stepwise structure, in which conceptually and often physically significant properties appear even as a consequence of the first postulate, and additional such properties appear at each step.
Thus we saw that the fact that all effects are allowed arises as a consequence of our first postulate of "classical decomposability" or "generalized spectrality". This is a much weaker requirement than classical decomposability, of course, and we expect Postulate 1 to have further physically interesting consequences.
Postulates 1 and 2 together further have very strong consequences: they imply that every face of the state space is the image of a filter, i.e., that the state space is projective, and also that it is self-dual. Filters allow one to verify that a state is in a claimed face of the state space without (if the claim is true) disturbing the state. They are likely to be important ingredients of both information-processing and thermodynamical protocols; possibilities which are under investigation. Filters can also be used to equip a system with operations destroying coherence between any set of mutually orthogonal faces. In other words, the existence of filters ensures the possibility of a process of decoherence similar to the one in quantum theory.
Self-duality is another strong property of state spaces that is independent of projectivity (for a self-dual state space that is not projective see the "house-shaped" state space in [60]; for a projective state space that is not selfdual, take any compact strictly convex set which is not an ellipsoid, symmetric with respect to reflections x → (−x), and add an additional component to make it a set of normalized states Ω in a cone of unnormalized states). Self-duality introduces a correspondence between atomic measurement outcomes and pure states that is exploited in quantum steering and teleportation, for example. It is also known to be linked, in some special contexts such as polygonal state spaces, to correlations satisfying the Tsirel'son bound on violations of Bell locality [60].
The lattice of faces given Postulates 1 and 2 is orthomodular-as is implied, indeed, by projectivity. This expresses a kind of "local classicality", which one sees also in the topos-theoretic approach of e.g. [59], and also relates our work to the classic "quantum logic" approach initiated by Birkhoff and von Neumann [37]. Postulate 2 imposes a high degree of symmetry on this lattice-it would be interesting to investigate lattices with such high symmetry using purely lattice-theoretic methods.
There is a close connection between Postulate 2 and certain properties of the circuit model for quantum computation. In this model it is standard to start with an input n−level system in a particular state, as well as a number of other n−level systems which can without loss of generality be taken to be in the |0 state. Then we implement the circuit representing the computation we wish to carry out, and at the end we must measure a specific observable to determine the (probability of the) output of the computation. This last measurement step can be done without loss of generality by first reversibly transforming the (generally entangled) logical n−level system of interest into an individual physical n−level system, and then doing the desired measurement on this system alone. This transfer is possible because quantum theory satisfies Postulate 2. Postulate 1 and 2 together can be understood as generalizing this idea by demanding that every state (not just pure ones) of a system can be transferred to any other system (with the same or larger number of distinguishable states) by a suitable reversible interaction, provided both are subsystems of a common larger system. Our third postulate provides, in the context set by the first two postulates, a perhaps surprising link between the absence of irreducibly three-slit interference, currently under experimental scrutiny, and mathematical notions: the Jordan algebraic structure of quantum theory on the one hand, and the satisfaction of the covering law by its lattice of faces, on the other. In the context of our first two postulates, these are all equivalent. The known equivalence (even in the broader context of projective systems) of the latter two with the requirement that filters preserve purity is further food for thought. An interesting question is whether the equivalence of no higher-order interference with these two principles still holds in the broader projective context. Looking to operational consequences, perhaps the failure of purity preservation might give rise to an extra source of noise or irreversibility in information processing or thermodynamical protocols-though this might be circumvented if the protocols are designed so the states being filtered are "compatible" with the filters.
Most interesting, perhaps, is the possibility that there exist families of systems satisfying our first two postulates but not the third: these would still have an extremely regular structure and likely support interesting information processing, but so far no examples are known. Should they be shown not to exist, we would then know that Jordan systems are singled out by Postulates 1 and 2 alone.
The final step, narrowing things down from Jordan systems to complex quantum systems via energy observability, is not so surprising. Similar postulates have been used for this purpose by Connes and by Alfsen and Shultz. We require an injection of dynamical generators into the space of observables, each injected generator conserved by the dynamics it generates, whereas Alfsen and Shultz require the converse and also impose ancillary conditions. In contrast, our condition, though applied only to Jordan algebraic systems, is formulated in greater generality where the ancillary conditions do not make sense. It is likely that in the Jordan-algebraic setting, the ancillary conditions, as well as a bijection, are obtained automatically. Exploration of conditions of this type-either ours, or abstractions of Connes' or Alfsen and Shultz's-in a broader context are desirable. Indeed, as we have mentioned, others have explored similar principles, though some of these investigations have made use of composite systems which appear to us to be required to satisfy local tomography. In the context of our Postulates 1 and 2, locally tomographic composites and the existence of stand-alone 2-level systems would imply that the systems are standard quantum systems; indeed one reason for our interest in energy observability is as an alternative to local tomography.
The fact that energy observability rules out classical systems in this theorem is an artifact of our finitedimensional setting, for which classical reversible dynamics are a discrete group. Since infinite-dimensional classical systems do have continuous one-parameter groups of reversible transformations, however, it is important to point out that there are numerous alternative assumptions which would allow us to rule out classical systems in the finite-dimensional case without assuming the existence of continuous reversible dynamics. Such alternatives are likely to retain their usefulness in infinite dimensions. For example, we could postulate the existence of a tradeoff between information gained in a measurement and disturbance to the measured state [66], or the existence of at least one state that has two distinct convex decompositions into pure states, or the existence of interference; the existence of nonclonable or nonbroadcastable sets of states [35,36] might also work.
Although we are not aware of work using the set of postulates we use, several authors have used one or more related principles. In Wilce's characterization in [64], a symmetry principle reminiscent of our Postulate 2 (but concerning test spaces rather than state spaces) was used, along with reversible transitivity on pure states (a special case of Postulate 2). In his most recent reconstruction, Hardy [65] uses a postulate ("filters are nonflattening") which relies on a definition of filters that is equivalent to ours (at least in the context of our Postulates 1 and 2), and which implies Postulate 3' (that filters are purity-preserving). Niestegge has also used the absence of higher-order interference as one ingredient in deriving Jordan algebraic systems [9]. In [14] it was established that finite-dimensional Jordan systems do not have higher-order interference, a result also found by Niestegge in [9]. We have already mentioned other work postulating connections between observables and dynamical generators. More work understanding the connections between the various approaches would likely be fruitful.
Besides providing an understanding of the Hilbert space structure of quantum theory from first principles, our reconstruction suggests a variety of open questions, such as the existence of systems with strong symmetry and classical decomposability, but also with higher-order interference. Furthermore, we think that the naturalness of our postulates allows us to make closer contact with other aspects of physics, a direction we consider important to pursue. This is evident from the postulates themselves -Postulate 3 considers a property that is under direct experimental investigation, and so solving the aforementioned open problem might provide concrete consistent models that can be tested against quantum theory in experiments. Postulate 4 relates the probabilistic structure to the existence of a notion of energy of the form physicists are used to. Furthermore, consequences of the postulates -such as projectivity -seem crucial for thermodynamic reasoning. In fact, weaker versions of Postulates 1 and 2, in conjunction with local tomography, are enough to make sense of the general-probabilistic thermodynamics results in [69,70].
In this sense, our result is part of a broader research program: analyze the structure of physics -that is, the way that the different parts of physics fit together -by rigorously assessing the consequences of changing some of its parts. One part of physics is quantum theory, and seeing how a more general probabilistic theory could still harmonize with thermodynamics or Hamiltonian mechanics is one of many ways to gain insights into the way our world works. Given the current quest for a theory that unifies quantum and gravitational physics, in a situation where conclusive experimental results are mostly absent, it seems particularly promising to rigorously analyze the logical and conceptual structure of what is known, hoping thereby to glimpse a path towards the unknown. Proof. We write F * + for the dual of F + in F , according to the restriction of the self-dualizing inner product for A + ; thus perfection means that F * + = F + for every face. We begin with "only if". Let P be the orthogonal projector onto F , x ∈ A + , y ∈ F + . Now y, P x = P * y, x ; since P is Hermitian this equals P y, x = y, x . The latter is nonnegative because both y and x are in A + , which is self-dual. So we have shown ∀y ∈ F + y, P x ≥ 0, i.e. P x ∈ F * + . But by perfection F * + = F + . Thus P x ∈ F + for any x ∈ A + , i.e. P is positive.
For "if", we begin by observing that given positivity of P , P A + = F + . This is because P x = x for any x ∈ F , so P F + = F + , whence P A + ⊇ F + ; on the other hand P A + ⊆ F + by positivity.
Note that F + ⊆ F * + as a consequence of self-duality of A + : since everything in F + is in A + , it must have nonnegative inner product with everything in A + , hence with everything in F + , and since it is in addition in F , it is in F * + . Recall that y ∈ F * + is defined as y ∈ F and satisfying ∀x ∈ F + y, x ≥ 0. Since P A + = F + , the latter part of this condition is equivalent to ∀z ∈ A + y, P z ≥ 0. Again moving the projector to act on y, using its Hermiticity and that y ∈ F so P y = y, this is equivalent to ∀z ∈ A + y, z ≥ 0, i.e. y ∈ A * + . Since A * + = A + and y was also assumed in F , y ∈ F + , establishing that F * + ⊆ F + . We have now shown F * + = F + , i.e. perfection.

Appendix B: Calculations for observability of energy
The goal of this section is to show the following: However, among those, only the complex quantum theory state spaces (including Ω 3 , the qubit) satisfy Postulate 4, that is, observability of energy.
In complex quantum theory, the group of reversible transformations G can actually be larger: it may also contain the antiunitary transformations according to Wigner's theorem (but not more). Similarly, real quantum theory may also contain the conjugations with O ∈ O(N ), but for quaternionic quantum theory, we have G = G 0 [58]. We do not know whether octonionic 3 × 3 quantum theory may contain additional element in its transformation group, and we do not know the complete classifications of possible compact transformation groups G ⊃ G 0 for the ball state spaces. Lemma 31 will be proven step by step. We start by showing that the only ball state space with transitive group of reversible transformations that has observability of energy is the qubit. Proof. If G d acts transitively on the pure states, then so does its connected component at the identity [48]. According to [47], the list of groups is the following. Since the group action is locally effective [18], the dimensions of g d are just the dimensions of the corresponding groups.
• For all d ≥ 2: SO(d). We have shown in the main text that an energy observable assignment only exists if d = 3.
• For d = 4, 6, 8, . . . : SU (d/2). We have dim(su(d/2)) = (d/2) 2 − 1, and this is larger than d + 1 if d ≥ 6. Thus, no injective map φ : su(d/2) → R d+1 defining an energy observable assignment can exist. However, we have to treat d = 4 separately. In this case, the transformation group is (up to similarity) such that the Lie algebra is at least Let X ∈ g 4 be a generator corresponding to the choice of parameters a = 1 and b = c = 0. If φ is any energy observable assignment, we can write the functional φ(X) as a vector ϕ ∈ R 5 such that [φ(X)](y) = ϕ, y for all y ∈ R 5 , and the condition φ(X) • X = 0 translates into X T ϕ = 0. The kernel of X T is one-dimensional, with unique solution (up to some factor) of ϕ = λ · (1, 0, 0, 0, 0) T , λ ∈ R. But this represents the normalization functional: ϕ, y = u 4 (y) for all y, so φ(X) = u 4 , contradicting the definition of an energy observable assignment.
• For d = 2, 4, 6, 8, . . .: U (d/2). The case d = 2 is already covered in the main text; in all other cases, this representation contains the corresponding representation of SU (d/2) as a subgroup, and this has already been treated.
This proves the claim.
As mentioned in the main text, it is more difficult to rule out N -level real quantum mechanics for N ≥ 3. This needs a sequence of lemmas.
, and let S ∈ R 2×2 such that JS = αSJ for some α ∈ R.
We omit the proof; it is a simple exercise in linear algebra.
Lemma 34. Consider any antisymmetric matrix of the form Proof. Define the 2 × 2 block matrices Λ i := 0 λ i −λ i 0 , and divide S into 2 × 2 block matrices S i,j : If this is the zero matrix, then 0 = [Λ i , S i,i ] = −λ i [J, S i,i ] for all i. It follows from Lemma 33 that there exists s i ∈ R such that S i,i = s i · 1. Similarly, for all i = j, we have Λ i S i,j = −λ i JS i,j = S i,j Λ j = −S i,j Jλ j , hence JS i,j = αS i,j J with α = (λ j /λ i ) ∈ {−1, +1}. Thus, Lemma 33 yields that S i,j = 0.
Before applying this, we need to show that real quantum mechanics is necessarily equipped with all reversible transformations (conjugations with orthogonal matrices) to comply with Postulate 2: Lemma 36. For N ≥ 3, let Ω N be the state space of N -level real quantum mechanics, and G N be a group of reversible transformations on it such that Postulate 2 is satisfied. Then Proof. Every G ∈ G N is an automorphism of the cone of positive semidefinite symmetric real matrices, and thus of the form ρ → QρQ T [51]; preservation of the trace implies that Q T Q = 1, i.e. that Q is orthogonal. Define G as the set of all orthogonal Q such that the map ρ → QρQ T is contained in G N . Clearly G is a subgroup of O(N ); since G N is topologically closed, so is G. Now we show that G contains all of SO(N ). Let a, b ∈ R be irrational numbers such that their difference a − b is also irrational. Define the unit vectors e i := (0, . . . , 1 i , 0, . . . , 0) T , and v 1 := (cos(aπ), − sin(aπ), 0, . . . , 0) T , v 2 := (sin(aπ), cos(aπ), 0, . . . , 0) T , v 3 = e 3 , . . . , v N = e N , w 1 := (cos(bπ), − sin(bπ), 0, . . . , 0) T , w 2 := (sin(bπ), cos(bπ), 0, . . . , 0) T , w 3 = e 3 , . . . , w N = e N .
Then the sets of vectors {v 1 , . . . , v N } and {w 1 , . . . , w N } are both orthonormal bases of R N , and so the sets of pure states {|v 1 v 1 |, . . . , |v N v N |} and {|w 1 w 1 |, . . . , |w N w N |} are both N -frames in N -level real quantum mechanics, and so is {|e 1 e 1 |, . . . , |e N e N |}. Thus, according to Postulate 2, there are two orthogonal matrices V, W ∈ G such that V |e i e i |V T = |v i v i | and W |e i e i |W T = |w i w i | for i = 1, . . . , N.
It follows that there are signs σ 1 , . . . , σ N , τ 1 , . . . , τ N ∈ {−1, +1} such that V |e i = σ i |v i and W |e i = τ i |w i . Hence In both cases, we have established the existence of a matrix in G that acts as cos θ sin θ − sin θ cos θ in the e 1 − e 2 -subspace, where θ is an irrational multiple of π. But any matrix of this form generates all of SO(2) by composition and closure. We can argue similarly for all other e i − e j -subspaces. The corresponding SO(2) rotations in all these planes generate all special orthogonal matrices, hence SO(N ) ⊆ G.
Theorem 37. Energy is not observable on any N -level real quantum mechanics state space.
Proof. The case N = 1 is trivial; N = 2 is shown in the main text, so let N ≥ 3. First, consider the case that N is even. Let H ⊂ so(N ) be the subspace of matrices Similar argumentation as in the even case, now using Lemma 35, shows thatφ(M ) is a diagonal matrix for every M ∈ H; the same conclusion holds true if the subspace H is defined by appending the zero in the top-left corner instead of the bottom-right. But then, by linearity, the matrix also has the property thatφ(M ) is a diagonal matrix. Suppose that all λ i = 0, then the only diagonal matrix S that commutes with M is of the form S = diag(s 1 , s 1 , s 2 , s 2 , . . . , s k−1 , s k−1 , s k , s k , s k ). Again, arguing analogously to the even case, the subspace of all matrices M of the given form (dropping the condition λ i = 0) is mapped byφ injectively into the subspaces of all diagonal matrices S of that form. Since both are of dimension k, there is M = 0 such that φ(M ) = 1, violating the definition of an energy observable assignment.