Risk excess measures induced by hemi-metrics

The main aim of this paper is to introduce the notion of risk excess measure, to analyze its properties, and to describe some basic construction methods. To compare the risk excess of one distribution Q w.r.t. a given risk distribution P, we apply the concept of hemi-metrics on the space of probability measures. This view of risk comparison has a natural basis in the extension of orderings and hemi-metrics on the underlying space to the level of probability measures. Basic examples of these kind of extensions are induced by mass transportation and by function class induced orderings. Our view towards measuring risk excess adds to the usually considered method to compare risks of Q and P by the values ρ(Q), ρ(P) of a risk measure ρ. We argue that the difference ρ(Q)−ρ(P) neglects relevant aspects of the risk excess which are adequately described by the new notion of risk excess measure. We derive various concrete classes of risk excess measures and discuss corresponding ordering and measure extension properties.


Motivation
The evaluation and comparison of risks are basic tasks of risk analysis. For the evaluation of risks, the notion of risk measures-in particular of coherent and convex risk measures-has been introduced in an axiomatic way for real risks in [1], [6], [11] and has been extended to vector risks in [13], [3], and many others. This notion leads to the comparison of two risks X, Y (resp., distributions Q, P ) by ρ(X) − ρ(Y ) (resp., ρ(P ) − ρ(Q)). If the main interest is to compare a risk X to a benchmark risk Y w.r.t. a common risk measure ρ, then the one-sided distance is the induced comparison of risks (where x + = max(x, 0) denotes the positive part of x).
We argue that the comparisons in (1.1), (1.2) neglect some relevant part of measuring the risk excess. This deficit can be seen in the analog simple case where for the basic space E = R d , the risk of a vector x = (x 1 , . . . , x d ) ∈ R d is measured by the Euclidean norm ρ(x) = |x|. In this case, gives a quantitative comparison of the new risk x w.r.t. a benchmark risk y, which is not informative enough. If |x| = |y|, then the comparisons in (1.3) would not take into account whether some or many components of x might be essentially larger than those of y. A better measure for the risk excess would be Another motivation comes from the fact that some concepts which have an impact on the notion of risk are better defined in a relative manner than in absolute terms: for example, the concept of "heavy tailedness" of a distribution (and the subsequent idea of "tail risk") is easier to define by comparing the "size of the tail" or "speed of decrease of the density" of the distribution F to the corresponding "size of the tail" or "speed of decrease of the density" of a benchmark distribution G (say, the standard Gaussian one). These comparisons can be operationalized in a quantitative measure of tail risk, e.g., by computing the difference of mass of the distribution F over an α-quantile w.r.t. to the corresponding mass for the benchmark distribution G over the same α-quantile, viz., see, e.g., [5, p. 45], [25]. See also the motivation in section 4.

Outline
In this paper, we propose to measure the risk excess of a risk distribution Q over a given risk distribution P by a hemi-metric on the space of probability measures. Hemi-metrics are a suitable tool for one-sided comparison of risks. When measuring the risk excess of Q compared to P , it is natural to associate a one-sided distance D + (Q, P ) on the space (M 1 (E), ) of probability measures, where is a given stochastic (pre)order (see the forthcoming definition 2.3 in section 2). The stochastic order is related to the ordering ≤ on the underlying space E. This allows to consider for a quantitative one-sided comparison of risks at the level of probability measures as an extension of the order and distance structure on E.
We discuss several classes of risk excess measures D + (Q, P ) and consider the question when these are given as order extensions of hemi-distances d + on the underlying space E. Several relevant hemi-distances are induced by mass transportation and thus give access to natural interpretation. One particular extension is given by a version of the Kantorovich-Rubinstein theorem for hemi-distances. The paper develops basic tools and notions for measuring the one-sided risk excess of a risk distribution Q compared to P .
The paper is organized as follows: in section 2, we introduce the notion of hemimetrics which are basic for obtaining a quantitative description of one-sided distance in a preordered space (E, ≤). The risk excess measure D + (Q, P ) of Q w.r.t. P is then introduced as a one-sided hemi-metric on the space of probability measures M 1 (E). The ordering on M 1 (E) is chosen consistent with the preorder ≤ on E and describing a positive risk excess, i.e., Q P if Q has no positive risk excess w.r.t. P . We discuss several examples to describe the meaning of this notion and the interplay of order and distance.
In section 3, we study several classes of interesting risk excess comparison measures and corresponding extension properties of the preorderings on the underlying space. A general class of risk comparison measures is introduced by considering worst-case comparison over suitable classes of increasing functions. This is analog to the worstcase representation of convex and coherent risk measures. There are several classes of examples.
In section 4, we describe risk excess measures D + (X, Y ) on the space of random variables. The class of compound risk excess measures is obtained for those measures which depend only on the joint law of the random elements (X, Y ). Mass transportation gives a natural way to obtain minimal extensions of compound risk excess measures to risk excess measures in the space of distributions, i.e., which depend only on the marginal laws of X and Y . Dual representations of these risk excess measures are obtained by a version of the Kantorovich-Rubinstein theorem for hemi-metrics. Several examples illustrate these constructions.
In section 5, we introduce the concept of weak risk excess measure, which is a risk excess measure without the weak identity property. Similarly to section 4, a mass transportation formulation gives a way to obtain weak risk excess measures as the maximal extension of compound risk excess measures. We also give a dual representation of this risk excess measure and introduce several examples of weak excess risk measures constructed from mass transportation problems.
Finally, in section 6, we consider dependence restrictions on the class of risk pairs (X, Y ) and consider maximal and minimal excess risks with these restrictions. These maximal and minimal excess risks do not define risk excess measures, but give relevant and well-motivated bounds. For one and two-sided restrictions, we obtain explicit formulas for the bounds.
2 Hemi-metrics and measuring risk excess 2.1 Hemi-metrics As a motivation for the introduction of measuring the risk excess of distributions, one could argue that, from the structural and phenomenological point of view, the concept of risk combines aspects of the metric structure (a risk measure evaluates some "size" or "norm" on the space of distributions) and of the order structure (there is an underlying preorder structure on the space of distributions which allows one to say when one risk is larger than another). Such "quantitative measure of the order" is encapsulated in the notion of hemi-metric, see [12,Chap. 6,p. 203]. (The terminology is not completely standard and the notion of hemi-metric is also known of as pseudo quasi-metric in the topology literature, while [19, p. 61] calls it a semimetric). We use the following definition: Definition 2.1 (Hemi-metric) A hemi-metric or hemi-distance d + on a set E is an application d + : E × E → R which satisfies the following axioms: for all x, y, z ∈ E, (A1) positivity: d + (x, y) ≥ 0; (A2) weak identity: The main difference with the notion of metric is the omittance of the symmetry condition, and assuming only the weak identity property. For establishing a connection with a preorder ≤ on E, we introduce the notion of a one-sided hemi-metric.
For two comparable elements, the one-sided hemi-metric of a smaller element x to a larger element y is zero.

Remark 2.1
1 If E is a set and d + a hemi-metric on E, one can endow E with a preorder structure by setting Then, by construction of ≤, we obtain that d + is a one-sided hemi-metric on E. 2 Hemi-norms and hemi-metrics: When E has a vector space structure, a metric d can be induced in a natural way by a norm ρ, as d(x, y) := ρ(x − y). Similarly, a hemi-norm ρ + on E, (i.e., a subadditive, positive homogeneous, non-negative functional ρ + : E → R satisfying the weak separation condition x = 0 E ⇒ ρ + (x) = 0) defines a hemimetric d + by setting In addition, if E has a preorder ≤ and ρ + is a hemi-norm which has the property that More generally, if (E, ≤, ρ) is a lattice-ordered normed vector space, one can construct a one-sided hemi-metric compatible with ≤ by setting where ∨ is the least upper bound operation. 3 To any hemi-metric d + on E, one can associate its dual hemi-metric d − , obtained by symmetrization of d + , When d + is a one-sided hemi-metric associated with the order ≤ on E, d − is a one-sided hemi-metric associated with the corresponding dual order ≥ on E.
A hemi-metric d + induces a distance d by symmetrization or by taking the positive linear combination, say More generally, a hemi-metric allows defining a "one-sided" topology by setting the open balls as The concept of a hemi-metric is implicit in several notions encountered in analysis, probability, and statistics. For example, recall that a real-valued function f on a metric space (E, d) is upper semi-continuous at x 0 iff is the usual basic one-sided hemi-metric on (R, ≤, |.|) (see Example 2.3 and (2.9) below).

Risk excess measures
After the discussion of hemi-metrics, we are now in a position to introduce the main object of this paper, which is a measure of the risk excess of a distribution Q w.r.t. P . To that aim, we assume that a preorder is defined on the set M 1 (E) of probability measures on a measurable space (E, E): P Q describes that Q has more risk than P . Definition 2.3 (Risk excess measure) A risk excess measure D + is defined as an one-sided hemi-metric on the preordered space (M 1 (E), ), (or on a subset M ⊂ M 1 (E)). D + (Q, P ) is called the risk excess of Q w.r.t. P .
We illustrate below this concept with the following examples. A general class of risk excess measure will be presented in a systematic way in section 3.
Example 2.1 (Stochastic ordering) On E = R d , we consider the componentwise order ≤, which is closely connected with the stochastic order st : for a measurable set B ⊂ E, define B ↑ = {y ∈ E; ∃x ∈ B s.t. y ≥ x} and say that B is an increasing set if B = B ↑ . Denote by I(E) the set of measurable increasing sets of E.
The stochastic order st is defined on for all measurable sets B ∈ I(E). A corresponding risk excess measure is given by There exists no risk excess of Q w.r.t. P , i.e., By the well-known Strassen theorem (see [31] and e.g., [29, Theorem 1.18, p. 22] ), this is equivalent to the existence of random vectors X ∼ Q, Y ∼ P s.t. X ≤ Y a.s. In other words, the distribution Q is considered more safe than P if one can construct representations X of Q and Y of P s.t. all coordinates of X are lower than those of Y. Q has a positive risk excess w.r.t. P if some of the components of any representation X of Q exceed the corresponding components of any representation Y of P . Of course, this gives a very strict notion of no risk excess. Example 2.2 (Levy-Prokhorov) Let E be a space with a hemi-metric d + . Define a "one-sided" topology on E by setting the open balls as in (2.5). Let E be the corresponding Borel σ−algebra. For two probability measures P, Q ∈ M 1 (E, E) , define

Examples of hemi-metrics
Hemi-metrics are suitable tools to measure one-sided distances. We illustrate the meaning of this notion and the interplay of order and distance via the following example, which will be used constantly throughout the paper. 1 Discrete one-sided hemi-metric: Let (E, ≤) be a preordered space, then defines a one-sided hemi-metric on (E, ≤), which we call the discrete one-sided hemi-metric on (E, ≤). 2 l p hemi-metric: On E = R 1 , one can decompose the absolute value into its positive and negative parts |x| = x + + x − = ρ + (x) − ρ + (−x), viz., into two hemi-norms satisfying (2.3). As a consequence of (2.2), the metric is decomposed as a sum of two one-sided hemi-metrics (d + , d − ) associated with the dual orders (≤, ≥). The basic one-sided hemi-metric describes in a quantitative way the ordering relationship ≤. Compared to the discrete hemi-metric (2.8), it also contains information on the magnitude of the one-sided departure of two elements. Similarly on (E, ≤) = (R d , ≤) supplied with the componentwise (product) order the l p hemi-norms, defined as induce the one-sided l p hemi-metrics Several of the hemi-metrics have a direct interpretation and extensions as risk measures for probability distributions. We give two examples: 1 τ −quantiles: Consider on the real line E = R 1 , the hemi-norm induces, by Remark 2.1 and (2.2), a hemi-metric (2.12) It is well known that this hemi-metric can be used to define τ −quantiles q τ (Y ) (viz., the Value at Risk) of a random variable Y as a minimizer of see [15, p. 5]. Note, however, that the order induced by d τ reduces to the trivial order =, as d τ (x, y) = 0 iff x = y. 2 Half-space depth, departure in direction u: A multivariate generalization of the preceding example can be defined as follows. On E = R d , we define for any unit vector u an ordering (the length in the direction u), by where x T denotes the transpose of x. With this ordering, defines, as in (2.8), a one-sided hemi-metric. It is one if the length of y in direction u is greater than that of x, and is zero else. This one-sided hemi-metric has, as basic application, the definition of the halfspace depth function, which describes the degree of outlyingness of a point x ∈ R d w.r.t. a probability measure P on R d . It is defined as is the part of the unit sphere in the positive cone x ≥ 0. We mention that a very general approach to multivariate quantiles can be found in [10].
At last, we briefly mention some examples of one-sided hemi-metrics which may appear in related contexts.

Example 2.5
1 Schur-order ≤ S on R d : The majorization, or Schur order ≤ S , is useful to compare vectors x, y ∈ R d with identical sums w.r.t. their degree of dispersion, see e.g., [17]. In a natural way, this ordering extends to an ordering on M 1 (R d ), comparing the relative degree of dispersions of two measures. Let where γ, β ∈ Γ(d) are the decreasing rearrangements of x and y: ≤ S is a preorder: x ≤ S y and y ≤ S x only imply that the components of each vector are equal, but not necessarily in the same order. Geometrically, x ≤ S y if and only if x is in the convex hull of all vectors obtained by permuting the coordinates of y. When x, y stands for a pair of discrete probability measures on the same set of d-points, the norming condition (2.19) is satisfied as the sum is normalized to one.
Say that x and y are Schur-comparable if The degree of dispersion is measured by the following one-sided hemi-metric: for Schurcomparable elements x, y, define One has, for Schur-comparable elements: x ≤ S y iff d + (x, y) = 0.
Specialized to discrete probability measures, this gives a one-sided hemi-metric measuring the degree of dispersion or "variance".

Motivation and definition
For a law invariant, convex risk measure ρ on M 1 (R d ), one has a representation of the form where X ∼ Q, A is a class of scenario measures and α(ν) is a penalization term, see [11]. This representation suggests to consider for a class F of real functions on E the following hemi-metric Then, D F + is a risk excess measure on (M F , F ). Another motivation comes from the theory of probability metrics, where some metrics on the space of probability measures are defined by duality from a class of functions: D F + in (3.2) is the natural one-sided analog of the probability metrics D F induced by a functional class F, which go under the name of probability metrics with a ζ-structure in [21] or integral probability metrics in [18]. We are thus naturally inclined to define: Remark 3.1 On a probability space (Ω, B, µ), let X be a random variable with image measure µ X = Q. By (3.1), any law-invariant convex coherent risk measure ρ has a representation of the form D F , ν ∈ A}, µ is an underlying measure dominating A, µ X and ν X the image measures of µ, ν by X. Indeed, So the notion of risk excess measure can be seen as an extension of the notion of risk measures.

Extension and restrictions of orders and hemi-metrics
For risk excess measures, an important aspect is to have a kind of consistency w.r.t. some ordering ≤ on E, i.e., F consists of increasing functions w.r.t. ≤. In this respect, the following order extension properties are useful.

Proposition 3.2 (Extension and restriction of order)
1 If is a preorder on M 1 (E), then, the relation ≤ r , defined, for x, y ∈ E, by defines a preorder on E. ≤ r is called the restriction of the preorder on M 1 (E). 2 Conversely, if ≤ is a preorder on E, then the stochastic order st defines a partial order on M 1 (E), such that its restriction ≤ r is identical to ≤.
Proof 1 The proof follows by direct verification.
2 By definition, we have In particular, restricted to principal up-sets B = {z} ↑ , the implication (3.5) becomes Analogously, we can also extend and restrict in a consistent way the discrete onesided hemi-metric d ≤ + of Example 2.3, equation (2.8) into the risk excess measure of Example 2.1.

Proposition 3.3 (Extension and restriction of discrete hemi-metrics)
1 If D + is a risk excess measure on (M 1 (E), ), then Proof 1 The proof follows by direct verification and Proposition 3.2.
2 The restriction of D st which is {0, 1}−valued and a one-sided hemi-metric on E by Proposition 3.3 part 1. By Proposition 3.2 part 2,

Remark 3.3
The construction of the previous proposition, based on the D st + of Example 2.1, which encodes the order ≤ into st , is consistent w.r.t. the order ≤, in the sense that the restriction of D st + is the discrete one-sided hemi-metric d r + = d ≤ + , which encodes the original order ≤. However, for a one-sided hemi-metric d + on (E, ≤) different from the discrete one, the extention D st + is in general inconsistent w.r.t. the hemi-metric d + , in the sense that the restriction of the risk excess measure D st + is not the original d + but is again the discrete one-sided hemi-metric d ≤ + . This is illustrated in the following diagram: The question of consistently extending/restricting a one-sided hemi-metric d + into a risk excess measure D + , according to the diagram, will be treated by mass transportation in section 4.
It is interesting to observe that, in general, there may exist many extensions of a one-sided hemi-metric on E to a risk excess measure on M 1 (E), as seen in the following example. We will discuss some general extensions in section 4.
where F (z) = Q([z, ∞)) and G(z) = P ([z, ∞)) stand for the survival functions of Q and P . So it will be easier for Q to be less risky than P for this order than for the stochastic order, where the comparison has to be made for all increasing sets. The F uo -induced risk excess measure D Fuo + is given by Note that the restriction ≤ uo on E = R d of the partial order uo in the sense of Proposition 3.2 is identical to the usual componentwise ordering, i.e., ≤ uo =≤. The restriction d uo + of the risk excess measure D uo + in the sense of Proposition 3.3 is the discrete one-sided hemi-metric d ≤ + (see Example 2.3 and (2.8)): As a consequence, both risk excess measures D uo + and D st + of Example 2.1 induce the same componentwise ordering ≤ on E = R d and also induce the same restriction as hemi-metric on E. D uo + and D st + are both extensions of the same discrete onesided hemi-metric d ≤ + on E from Example 2.3 (a), as is illustrated in the diagram below: Then, on the class of distributions M 1 1 with finite first moment, the induced ordering Ficx is identical to the increasing convex order, are the mean excess functions. D icx + measures the risk excess of Q w.r.t. P in terms of the corresponding mean excess functions. When restricted to the class of probability measures with identical first moments, Ficx is also identical to the convex ordering, In this example, the restriction d icx On the one hand, On the other hand, if x > y, then d icx + (x, y) = sup t∈R (π t (x) − π t (y)). By considering all cases, t ≤ y, y ≤ t ≤ x, and x ≤ t, one sees that the supremum takes the value x − y. Hence, the restriction d icx + of D icx + is given by which is the basic one-sided hemi-metric of (2.9).
: Ω → E} (see e.g., [11]), it is natural to define risk excess measures D + : X × X → R, also on a space X of random variables.
This allows to consider the risk of a random element X ∈ E as a relative property: there is a joint modeling of the vector (X, Y ) ∈ X 2 , defined on a common probability space (Ω, A, µ), so that the risk of X : Ω → E can be considered in relation to the random element Y : Ω → E, regarded as a benchmark. In the context of insurance and financial mathematics, Y can stand for the value of an alternative portfolio, of a hedge, of a market indicator, or the wealth of an insurer. For example, an insurer, facing the prospect of losing a claim amount X, may wish to evaluate its perceived risk with respect to its reserve capital Y : the "risk" X does not have the same potential consequences whether Y is small or large compared to X. In the same vein of reasoning, because of the fluctuating and (usually) inflating nature of fiat money in the post-1973, petro-dollar based, current monetary system, one may be interested in evaluating the value of a financial asset X w.r.t. the price of a commodity Y considered as a standard, like gold or oil, whose supply is limited in essence.
For X ⊂ L 0 E = L 0 E (Ω, A, µ) a set of random variables on (Ω, A, µ) with values in (E, ≤), we consider the pointwise ordering on X induced by ≤. We identify random elements in L 0 E which are identical a.s. and similarly X ≤ Y means that X ≤ Y µ-a.s.   1 An example of a risk excess measure on X which is not compound is However, since random elements in L 0 E which are identical µ-a.s are identified, it is natural to consider only compound risk excess measure, e.g., the essential supremum version (Ω, A, µ), let A 0 ∈ A, with 0 < µ(A 0 ) < 1, be a class of scenarios considered as "low risk", while its complement A 1 := Ω \ A 0 is considered as "high risk". Then, for some safety coefficient α > 1, define non-compound risk excess measures, which values α times more the risk excess (X − Y ) + for the high risk scenarios than for the low risk ones.
Remark 4.1 1 The notation D c + in Definition 4.2 stresses that D c + depends on the joint distribution µ (X,Y ) and not solely on the marginals µ X , µ Y of (X, Y ), as is the case in Definition 2.3. See also [34], [21] for the similar notion of compound probability metric. For risk measures ρ(X) on X, there is the analog notion of law-invariant risk measures which depend only on the law µ X of the random variable. for X, Y ∈ X. For details in the case of probability metrics, see [21].
4.2 Construction of a compound risk excess measure from a one-sided hemi-metric d + on E There is a natural way to construct such a compound risk excess measure on a set X of r.v. in (E, ≤): let d + be a one-sided hemi-metric on (E, ≤), and let X be the set of random variables X s.t. there exists x, y ∈ E s.t. Ed + (X, x) < ∞ and Ed + (y, X) < ∞. The notion of excess risk of Y w.r.t. X is measured by d + (X, Y ). The latter can be turned into a deterministic value, e.g., by taking its expectation, so that one obtains a hemi-metric on X, Note that (4.1) depends only on the joint distribution of (X, Y ): it is indeed a compound risk excess measure defined on a space X of random variables. Indeed, one has: Proposition 4.3 For any measurable one-sided hemi-metric d + on (E, ≤), (4.1) defines a finite one-sided compound risk excess measure on X.
Proof For all X, Y ∈ X, there exists x, y ∈ E s.t. Ed + (X, x) < ∞ and Ed + (y, Y ) < ∞. Hence, by the triangle inequality, Remark 4.2 Formula (4.1) gives a natural way to obtain a compound excess risk measure from a one-sided hemi-metric d + on the ambient space E. Note that not all compound excess risk measures can be written in this form. For example, let (d +,i ) i∈I be a countable family of one-sided hemi-metrics on E, then defines a compound excess risk measure which can not be written as in (4.1) for some d + .

Minimal extension of a compound risk excess measure
A compound risk excess measure D c + , depending on the joint distribution µ (X,Y ) , can be turned by mass transportation into a risk excess measure on M 1 (E), i.e., depending only on the pair of marginals µ X , µ Y , where M 1 (E) is supplied with the stochastic ordering st consistent with the underlying order ≤ on X.
The fact that D inf + is indeed a one-sided risk excess measure on the space of probability measures is given in the following proposition: is a Polish space with a closed partial order, and if D c + is weakly lower-semicontinuous, in the sense that 2 If D c + (X, Y ) = Ed + (X, Y ), as in (4.1), for d + a lower semi continuous onesided hemi-metric on (E, ≤), then D inf + is a one-sided risk excess measure on (M 1 (E), st ).
By the gluing lemma, see e.g., [33, p. 208], there exists a trivariate measure γ = γ QP R s.t. its projection on the first two marginals is α and its projection on the last two marginals is β. In addition, γ can be obtained as the image measure of µ for some measurable mapping. In other words, there exists a joint construction of a random vector (X,Ỹ ,Z) on the probability space (Ω, A, µ) s.t. µX ,Ỹ ,Z = γ and By (A3) for the compound risk excess D c + , which gives with (4.4), Letting ↓ 0 gives (A3) for D inf + . For the one-sidedness property (A4), if D inf + (Q, P ) = 0, then there exists a sequence (X n , Y n ) of random variables on (Ω, A, µ), all with fixed marginals Q, P , s.t. D c + (X n , Y n ) → 0. Since M 1 (Q, P ) the set of probability measures on E × E with marginals Q, P is weakly compact in M 1 (E 2 ), one can extract a subsequence n s.t. (X n , Y n ) d → (X, Y ) for some (X, Y ) with marginals Q, P . By the assumption on D c + ,  Y ). Therefore, lower semi-continuity of d + and Fatou's lemma entails, i.e., (4.3) is satisfied.  .2), admits a representation as a F-induced risk excess measure, as in (3.2), which is given by the following Kantorovich-Rubinstein-type theorem for hemi-metrics: Theorem 4.6 (Kantorovich-Rubinstein theorem for minimal risk excess measure) On a Polish space E, supplied with a closed order ≤, and a lower semi-continuous one-sided hemi-metric d + , the minimal extension D inf + of the compound risk excess measure D c + (X, Y ) = Ed + (X, Y ) has the dual form

Dual representations of minimal extensions
In other words, D inf + is identical to a F-induced risk excess measure D F + of (3.2), with F = Lip 1 b , the class of bounded Lipschitz functions w.r.t. d + .
Proof The proof is similar to the method used to prove the Kantorovich-Rubinstein theorem for metric spaces, see e.g., [24], [33], with some slight modifications. Let M 1 (Q, P ) be the set of probability measures π on E × E with marginals Q, P . For (f, g) ∈ L 1 (Q) × L 1 (P ), set , y), for all x, y ∈ E} , and C 2 b be the set of pairs of real-valued functions (f, g) which are continuous and bounded. Set (4.6) • Step one: One has the easy inequality, Indeed, for all f ∈ Lip 1 (d + ) ∩ L 1 and π ∈ M(Q, P ), Taking the inf on the right and the sup on the left entails the stated inequality (4.7). One obviously has f (x) + f * (y) ≤ d + (x, y), for all x, y ∈ E. Therefore, if x → d + (x, y) is bounded l.s.c. and f ∈ C b , then f * is well defined and bounded. Moreover, by the triangle inequality, one also has Taking the infimum on x on both sides yields where d − is the opposite dual hemi-metric defined in (2.4): . One has f * * (x) ≥ f (x): by definition, by taking x = x in the last equation. Moreover, f * * is this time d + -Lipschitz: the triangle inequality d We obtain:

Examples of minimal risk excess measures
The following propositions give explicit representations of the minimal risk excess measure for several hemi-metrics. We first consider the discrete hemi-metric d ≤ + : Proposition 4.7 (Minimal risk excess measure arising from the stochastic order) This induces, as minimal extension by mass transportation on M 1 (R d ), the stochastic ordering one-sided risk excess measure of (2.6): 2 A dual representation of (4.10) is given by y} is an open set and d ≤ + (x, y) = 1 C (x, y) is a {0, 1}-valued l.s.c. function. By [14] and [26,Lemma 1], (see also [33,Theorem 1.27]), 2 By Kantorovich-Rubinstein Theorem 4.6, Note that one can restrict to the set of increasing functions such that 0 ≤ f ≤ 1 by shifting the function by a constant.
Next, we consider, for E = R, the basic one-sided hemi-metric d b + (x, y) = (x−y) + , introduced in (2.9), describing the magnitude of one-sided departure in a quantitative way. For X = L 1 (µ) the set of random variables on (Ω, A, µ) with finite first moment, d + induces the compound one-sided risk excess measure on X. The corresponding minimal risk excess is given in the following result: Proposition 4.8 (Minimal risk excess arising from mean exceedance) 1 The minimal extension of (4.13) to a risk excess measure on M 1 (R) by mass transportation is given by where Lip 1,↑ the class of increasing, 1-Lipschitz functions (w.r.t. |.|). The ordering induced by D inf is the stochastic order st . 2 One has the following explicit representation: where F, G are the distribution functions of Q, P , and U ∼ U [0,1] is uniformly distributed on [0, 1].
Proof 1 With the assumption on X, Kantorovich-Rubinstein Theorem 4.6 specializes to e., f increasing and 1-Lipschitz w.r.t. the absolute value |.| norm. The fact that the order induced by D inf + on M 1 (R) is the stochastic order st follows from Proposition 4.5. Alternatively, a direct proof is as follows: let n ≥ 1 be a positive integer, X ∼ Q, Y ∼ P . By Markov's inequality, Taking the infimum over X ∼ Q, Y ∼ P yields that D inf + (Q, P ) = 0 implies that X − Y < n −1 with probability one. Letting n → ∞ yields X ≤ Y a.s. Hence, and the latter is equivalent to Q st P , by Strassen theorem. 2 f (x) = x + is convex, hence f (x − y) is submodular (or quasi-antitone in the terminology of [4], or supernegative or 2-negative in the terminology of [32]). This implies (4.14) by results of [4, Theorem 2], or [32, Corollary 2.3 ] (see also [29]).

Remark 4.4 (Comparison with the stop-loss metric)
Note that for t ∈ R, the compound one-sided risk excess measure D c is the average risk excess over the threshold t, which stands for the stop-loss premium of a reinsurer in insurance theory. Rachev and Rüschendorf [23] consider the stop loss metric as the difference of two stop loss premiums, which would write with our conventions of notations (see equation (2.2) in [23]) as, One could obtain from it the corresponding hemi-metric which was introduced in (3.6), in relation to the increasing convex order, which is distinct from the minimal risk excess D inf + . This follows from the triangle inequality for (X − t) + : and taking the infimum yields that In other words, the hemi-metric obtained by a one-sided comparison of risks through their stop-loss premiums is always majorized by the minimal risk excess. See also remark 4.5 for similar considerations for the tail risk.
In risk theory, it is also of interest to compare the expected risks above their distributional α-quantiles: this is the basis for the conditional tail expectation where q α (X), q α (Y ) denote the corresponding α−quantiles of X ∼ Q with c.d.f. F , Y ∼ P , with c.d.f. G. In order to obtain a coherent risk measure and to generalize to possibly non-continuous distributions (see [3]), it is useful to instead consider the expected shortfall. Define, for λ ∈ [0, 1], the extended c.d.f.s of F , G as Define also the distributional transforms of X and Y as where V ∼ U (0,1) is independent of (X, Y ), see [28]. The expected shortfalls are then defined as ES α (X) : For the one-sided comparison of the risk excess of X w.r.t. Y over their α-quantiles, we therefore consider the excess risk of their expected shortfall defined by the following one-sided compound risk excess measure D α,c where U 1 , U 2 are as in (4.16). We obtain the following result: Proposition 4.9 (Minimal tail risk excess) 1 The minimal extension of (4.17) to a risk excess measure on M 1 (R) by mass transportation has the representation which corresponds to the classical stochastic order restricted to the upper tail.

Remark 4.5
It is interesting to note that the expected shortfall of X is given by As expected, the minimal extension risk excess measure dominates the normalized one-sided difference of expected shortfalls: where Y ∼ P, X ∼ Q.

Motivation and definition
In view of the mass transportation approach of (4.2), one may inquire whether there exist other schemes of obtaining a risk excess measure D + (Q, P ), in the sense of Definition 2.3, from a compound risk excess measure D c + (X, Y ), in the sense of Definition 4.2. In particular, it is natural to investigate the following "maximal extension" in the sense of mass transportation, Obviously, D inf + (Q, P ) ≤ D sup + (Q, P ). However, D sup + is not a risk excess measure: although (A1) and (A3) are obviously satisfied, (A2) is not. Indeed, This implies that X ≤ Y a.s. for all possible realizations X ∼ Q, Y ∼ Q. But for X, Y independent with the same law Q, this would require that X ≤ Y a.s. which is only true for Q being a one-point distribution. These considerations imply that D sup + can not be compatible with a reflexive order relation: axiom (A4) can not be satisfied either.
Nonetheless, D sup + , as a supremum over all joint constructions of (X, Y ) ∼ (Q, P ), gives the best possible upper bound on the compound risk excess measure in the sense of mass transportation, and therefore has a natural interpretation as a worst-case comparison, which is appealing for risk applications. These considerations motivate the introduction of a weakened notion of risk excess measure, without axiom (A2) and with axiom (A4) restricted to a strict order ≺, i.e., a transitive and irreflexive relation. Therefore, we propose the following definitions:  Remark 5. 1 1 The concept of one-sided weak risk excess measure is an asymmetric analog of the concept of moment function in the theory of probability metrics, see [21,Chap. 3.3], or [22,Chapters 3.4. and 8.2]. In addition, the adjunction of axiom (A4) makes it compatible with a notion of order. Obviously, a one-sided risk excess measure for a preorder is a one-sided weak risk excess measure for the strict order ≺ defined by P ≺ Q ⇔ P Q and P = Q.
2 The relation between the minimal D inf + and maximal D sup + extensions obtained from a compound risk excess measure D c + , is given in the following improved triangle inequality: where P, Q, R are three probability measures on E, see [22,Theorem 3.4.1].
Define on M 1 (E) the following strict order ≺ sup by where supp(.) denotes the support of a distribution. The analog of Proposition 4.5 for the maximal extension, which shows that D sup + is indeed a one-sided weak risk excess measure, is given in the following proposition: 2 if d + is upper semi-continuous, then the supremum is attained for some probability measure.
Therefore, if f takes some negative values, then, setting γ 1 = inf f (x) entails f ≥ 0 and the infimum in (5.3) can be restricted to By symmetry, the infimum in (5.3) can further be restricted to Hence, J(f, g) ≥ J(f * * , f * ). Moreover, by the triangle inequality, and taking the supremum in y yields .
The general case, for d + unbounded, proceeds by approximation, as in Theorem 4.6. 2 Follows from Theorem 2.3.10 in [24].

Examples of maximal extensions
We discuss for some of the examples in section 4 the corresponding worst-case risk excess D sup + . First, we consider the discrete one-sided hemi-metric d ≤ + of (2.8) on E = R d , supplied with the product order ≤. The associated compound risk excess measure is given by (4.9): for X ∼ Q, Y ∼ P , and its minimal extension (4.11) coincides with the induced risk excess measure D st + (see (2.6)) compatible with the stochastic order. The maximal extension is given in the following proposition: Remark 5.2 Comparing this result with those of Proposition 3.3 and Example 3.2, one sees that the discrete one-sided hemi-metric d ≤ + (x, y) = 1 y x and the corresponding compound risk excess measure has many extensions on M 1 (R d ) and, in particular, we obtain The following diagram illustrates the different embeddings of structures, through their hemi-metrics: Next, we investigate the maximal one-sided weak risk excess extension for the basic hemi-metric (2.9): on E = R, for X ∼ F, Y ∼ G, let D c + (X, Y ) = E(X − Y ) + be the average risk excess as in (4.13). The maximal risk excess extension by mass transportation is given by the following proposition.
where F, G are the c.d.f.s of Q, P , respectively.
Proof The argument for the maximal risk excess extension is similar to that of the minimal risk excess extension.
In the previous propositions, the order induced by the maximal extension is very strong. For insurance applications, in particular for comparing tail risk, it is of interest to restrict the comparisons to the upper tails of the distributions, see Proposition 4.9 in section 4. Finally, we give the result for the tail excess compound risk measure D c,α + (X, Y ) in (4.17), which induces a more interesting order: Proposition 5.7 (Tail risk excess) 1 Let 0 < α < 1, then the maximal extension D α,sup + is given by where Q α , P α are the conditional distributions of Q, P on their upper αquantiles intervals [q α (Q), ∞), [q α (P ), ∞). 2 Correspondingly, a suitable consistent ordering ≺ α on M 1 (R) is given by 6 Extensions with dependence constraints 6.1 Setup In sections 4 and 5, we considered risk excess measures D + (Q, P ) obtained as minimal and maximal extensions obtained by mass transportation of a compound risk excess measure, i.e., over the class of all dependence structures of (Q, P ). In this section, we consider a relevant modification of this method by restricting the class of possible dependence structures. This setup allows to take into consideration some known side information on the dependence structure of (Q, P ), like various bounds on positive or negative dependence, see e.g., [29, Chapter 5].
We consider the setup E = R with hemi-metric d + and the compound excess risk measure D c + (X, Y ) = Ed + (X, Y ) of the kind (4.2), where X, Y ∈ X have marginals Q, P . If C = C X,Y is a copula of (X, Y ), we also write E C d + (X, Y ) to stress the dependence on C, and we denote by C the set of all bivariate copula functions. Let D ⊂ C denote a subclass of copulas which describe the information on the dependence structure. Then, it is natural to consider the worst and best-case extension of D c + over D.
Definition 6.1 (Minimal and maximal extension with dependence restriction) For a subclass D ⊂ C: • The minimal extension with dependence restriction D of D c + is defined as • Similarly, the maximal extension with dependence restriction D is defined as In the case without dependence restriction, i.e., when D = C, we get the minimal and maximal extensions D inf + , D sup + of (4.2) and (5.1) considered in sections 4 and 5. Despite that, the extensions (6.1) and (6.2) have a natural motivation as best, resp., worst-case excess risk taking into account the dependence restrictions. On the level of random variables, the class of pairs (X, Y ) with C XY ∈ D and X ≤ Y may be empty even if Q st P . Therefore, the unrestricted extensions D inf + , resp., D sup + , would under, resp., over estimate the real risk excess. As a consequence, this is a strong indication for the relevance of the notion of minimal, resp., maximal risk excess with dependence restriction D.

Explicit results for extensions with positive and negative dependence restriction
We now consider two particular classes of dependence restrictions D which allow determination of the minimal, resp., maximal, extensions in explicit form. Denote for copulas C 0 , C 1 ∈ C by the class of all copulas which are smaller than C 0 , resp., bigger than C 1 , in the lower orthant ordering lo (equivalently in the upper orthant ordering uo ). (6.3) describes a negative dependence restriction, (6.4) a positive dependence restriction: for the case C 0 = C 1 = Π, the independence copula Π(u, v) = uv, 0 ≤ u, v ≤ 1, these restrictions correspond to negatively quadrant dependent (NQD), resp., positively quadrant dependent (PQD), random variables, as defined by [16], see [20, p. 186]. Then, for d + (x, y) = (x − y) + , we obtain the following explicit result. where X 0 ∼ Q, Y 0 ∼ P and C X 0 ,Y 0 = C 0 .
2 For D = D ≥ (C 1 ), we obtain the explicit formula for the maximal risk excess extension where X 1 ∼ Q, Y 1 ∼ P and C X 1 ,Y 1 = C 1 .
Proof 1 For (X, Y ) with X ∼ Q, Y ∼ P and C X,Y = C ≤ C 0 , it follows from the submodularity argument, as in the proof of Proposition 4.8 that since f (x − y) = (x − y) + is submodular and (X, Y ) ≤ sm (X 0 , Y 0 ), with ≤ sm the supermodular ordering. Taking the infimum yields the result. 2 The argument is similar.

Remark 6.2
• Taking for D the two-sided dependence information we obtain for D D,inf + the same formula as in (6.5) and for D D,sup + the same formula as in (6.6). Thus, this information simultaneously shrinks the upper and the lower bound for the risk excess.
• The concept of minimal, resp., maximal risk excess can also be introduced for the general case (E, ≤) and general compound risk excess measures D c + . In this case, D denotes a class of dependence structures of random elements X, Y ∈ E. Even if D inf + and D sup + do not satisfy on the level of distributions the risk excess measure axioms (A2) and (A4), they describe the relevant bounds for the risk excess with dependence information D.

Conclusion
We proposed a quantitative one-sided comparison of probabilistic risks via the concept of risk excess measures, obtained as order extensions of hemi-metrics on the underlying space E. Like for the case of risk measures, the choice of a suitable hemi-metric and corresponding excess risk measure for a particular application will depend on the problem considered and the notion of order one wants to quantify. For reliability, insurance mathematics, finance, epidemiology, etc... different notions of orders and distances are related to the problem at hand. In this regard, the examples proposed, together with their explicit formulas, are helpful. Together with the extension/restriction properties of section 3, and the dual representations of sections 4 and 5, they can serve as a guide for the interpretation of the excess risk measure and coherence w.r.t. order and distance on the ambient space E.
We leaved aside the statistical aspects, but let us just mention that one can obtain empirical versions of the various risk excess measures D + (P, Q) presented here by replacing P, Q in their definitions by the corresponding empirical measures P n , Q n .
For excess risk measures which have an explicit formula, statistical estimation is straightforward by plugging in the empirical measures P n , Q n instead of P, Q. For the F-induced risk excess measures of section 3, and for risk excess measures obtained by minimal and maximal extensions (sections 4 and 5) of a compound one, their dual representation as a supremum (or infimum) over a functional class allows to consider their estimation via Glivenko-Cantelli-type theorems indexed by function classes. This is one supplementary interest of these dual formulations. For example, for the F-induced risk excess measure of (3.2), since x + ≤ |x|, one has obviously that i.e., the risk excess measure is majorized by the corresponding integral probability metric and the convergence of the latter follows from classical results on abstract empirical process, see e.g., [30].