Refinements of the integral Jensen’s inequality generated by finite or infinite permutations

There are a lot of papers dealing with applications of the so-called cyclic refinement of the discrete Jensen’s inequality. A significant generalization of the cyclic refinement, based on combinatorial considerations, has recently been discovered by the author. In the present paper we give the integral versions of these results. On the one hand, a new method to refine the integral Jensen’s inequality is developed. On the other hand, the result contains some recent refinements of the integral Jensen’s inequality as elementary cases. Finally some applications to the Fejér inequality (especially the Hermite–Hadamard inequality), quasi-arithmetic means, and f-divergences are presented.


Introduction
The significance of convex functions is rightly due to Jensen's inequality. A real function f defined on an interval C ⊂ R is called convex if it satisfies f αt 1 + (1α)t 2 ≤ αf (t 1 ) + (1α)f (t 2 ) for all t 1 , t 2 ∈ C and all α ∈ [0, 1].
Let the set I denote either {1, . . . , n} for some n ≥ 1 or N + . We say that the numbers (p i ) i∈I represent a discrete probability distribution if p i ≥ 0 (i ∈ I) and i∈I p i = 1. It is called positive if p i > 0 (i ∈ I). A permutation π of I refers to a bijection from I onto itself.
The following discrete and integral versions of Jensen's inequality are well known. Theorem 1 (discrete Jensen's inequalities, see [16] and [17]) (a) Let C be a convex subset of a real vector space V , and let f : C → R be a convex function. If p 1 , . . . , p n represent a discrete probability distribution and v 1 , . . . , v n ∈ C, then (1) (b) Let C be a closed convex subset of a real Banach space V , and let f : C → R be a convex function. If p 1 , p 2 , . . . represent a discrete probability distribution and v 1 , v 2 , . . . ∈ C such that the series ∞ i=1 p i v i and ∞ i=1 p i f (v i ) are absolutely convergent, then Theorem 2 (integral Jensen's inequality, see [16]) Let ϕ be an integrable function on a probability space (X, A, μ) taking values in an interval C ⊂ R.
Remark 3 It follows that Theorem 1 (b) can be generalized in case of V = R: if C ⊂ R is an interval (not necessarily closed) and the other conditions of the statement are satisfied, then ∞ i=1 p i v i lies in C and (2) holds.
There are many papers dealing with refinements of discrete and integral Jensen's inequalities (see the book [12] and the references therein).
In papers [2] and [13] there are special refinements of the discrete Jensen's inequality of the form in Theorem 1 (a) (so-called cyclic refinements). These led the author to the following refinement of the discrete Jensen's inequality which is a significant generalization of the previously mentioned results.
Theorem 4 (see [11]) (a) Let k, n ≥ 2 be integers, and let p 1 , . . . , p n and λ 1 , . . . , λ k represent positive probability distributions. For each j = 1, . . . , k, let π j be a permutation of the set {1, . . . , n}. If C is a convex subset of a real vector space V , f : C → R is a convex function, (b) Let the set J denote either {1, . . . , k} for some k ≥ 2 or N + . Let p 1 , p 2 , . . . and (λ j ) j∈J represent positive probability distributions. For each j ∈ J, let π j be a permutation of the set N + . If C is a closed convex subset of a real Banach space (V , · ), f : C → R is a convex In the paper [13] we obtain refinements of the integral Jensen's inequality by using cyclic refinements of the discrete Jensen's inequality, but these results are not natural obverses of the discrete one's.
In this paper we give the integral version of Theorem 4 when V = R. On the one hand, a new method to refine the integral Jensen's inequality is developed (totally different from earlier techniques, see e.g. [10] and [19]) and our result contains Theorem 4 when V = R. On the other hand, we can have from it some recent refinements of the integral Jensen's inequality (see [7], [5], [6]) as elementary cases. Finally, some applications to the Fejér inequality (especially the Hermite-Hadamard inequality), quasi-arithmetic means, and f -divergences are presented.

Preliminary result
We give an extension of Theorem 4 if V = R.

Proposition 5
Let the index set I denote either {1, . . . , n} for some n ≥ 1 or N + . Let the index set J denote either {1, . . . , k} for some k ≥ 1 or N + . For each j ∈ J, let π j be a permutation of the set I. Let (p i ) i∈I and (λ j ) j∈J represent positive probability distributions. If C is an interval in R, f : C → R is a convex function, and (v i ) i∈I is a sequence from C such that the series i∈I p i v i and i∈I p i f (v i ) are absolutely convergent, then Proof By using Remark 3, we can copy the proof of Theorem 4 in [11].
The positive part f + and the negative part fof a real-valued function f are defined in the usual way.
We need another result about integrability.

Lemma 6
Let ϕ be an integrable function on a probability space (X, A, μ) taking values in an interval C ⊂ R. If f is a convex function on C such that f • ϕ is μ-integrable, then there exists a convex function g on C such that |f | ≤ g and g • ϕ is μ-integrable too.
Proof Along with the function f , the function f + is also convex.
The convexity of f on C shows that there is an affine function l : Since the function ϕ is a μ-integrable function, |l| • ϕ is also μ-integrable. Using that the function |l| is convex, |f | = f + + f -, and the sum of two convex functions is also convex, it follows from the above that g := f + + |l| can be chosen.
The proof is complete.
We shall use the following Fubini theorem for double series.
are absolutely convergent and both have the same sum.

Main results
We need the following hypotheses.
represent a positive probability distribution. For each j ∈ J, let π j be a permutation of the set I. (H 4 ) Suppose that we are given a sequence M I = (μ i ) i∈I of measures on A with μ i (X) > 0 for all i ∈ I and i∈I μ i = μ. (H 5 ) Suppose that we are given a sequence S I = (A i ) i∈I of pairwise disjoint sets A i ∈ A with μ(A i ) > 0 for all i ∈ I and i∈I A i = X.
Proof This can be obtained by an application of Proposition 5 to the parameters Really, (p i ) i∈I represents a positive probability distribution, and by the integral Jensen's inequality, v i ∈ C (i ∈ I).
Next we show that the series i∈I p i v i and i∈I p i f (v i ) are absolutely convergent.
Since ϕ is a μ-integrable function on X and i∈I μ i = μ, By Lemma 6, there exists a convex function g on C such that |f | ≤ g and g • ϕ is μ-integrable.
Another application of the integral Jensen's inequality and i∈I μ i = μ now show that We can see that the conditions of Proposition 5 hold, and therefore, by applying it, we obtain As a final step, we can apply the integral Jensen's inequality in (5). The proof is complete.
A useful consequence of the previous theorem is the next result. (H 1 -H 3 ) and (H 5 ). Let C ⊂ R be an interval and f : C → R be a convex function. Let ϕ be a μ-integrable function on X taking values in C such that f • ϕ is also μ-integrable on X. Then

Corollary 9 Assume
Proof Let the measure μ i (i ∈ I) be defined on A by and then apply Theorem 8. The proof is complete.

Discussion
First we study the relationship between Theorem 8 and Proposition 5. Assume (H 2 ) and (H 3 ), and let (p i ) i∈I represent a positive probability distribution. Define the measure μ on the power set P(I) of I by where ε i (i ∈ I) is the unit mass at i on P(I), and use the measure space (I, P(I), μ) in (H 1 ). Let C ⊂ R be an interval, f : C → R be a convex function, and (v i ) i∈I be a sequence in R such that the series i∈I p i v i and i∈I p i f (v i ) are absolutely convergent. Define the function ϕ on I by It is easy to check that under these conditions Corollary 9 is equivalent to Proposition 5. Now we compare our main result with some recent refinement of the integral Jensen's inequality.
Let (X, A, ν) be a measure space with ν(X) ∈ ]0, ∞]. For the ν-integrable positive ν-a.e. weight w, consider the Lebesgue space For the ν-integrable positive ν-a.e. weight w and given n ≥ 2, we consider the set B k (w) of all possible n-tuples of ν-integrable positive ν-a.e. weights w = (w 1 , . . . , w n ) with the property that n i=1 w j = w. The next result can be found in [5].
Remark 11 Let the measure μ i (i ∈ I) and μ be defined on A by and by Then μ i (X) > 0 (i ∈ I) and μ = n i=1 μ i . By choosing k = 1 (thus λ 1 = 1), we can see that Theorem 10 is a simple consequence of Theorem 8.
We say that the family of measurable sets F n (X) . . , n} with i = j and ν(A i ) > 0 for any i ∈ {1, . . . , n}. For given n ≥ 2, we denote by D n (X) the set of all n-divisions of X.
The following result appears in [6].
Remark 13 (a) Define the measure μ on A by (7). By choosing k = 1 (thus λ 1 = 1), we can see that Theorem 12 is a simple consequence of Corollary 9. Moreover, it follows that Theorem 12 is contained in Theorem 10.
(b) The main result Theorem 2.1 in [7] is the special case of Corollary 9 when n = 2 and k = 1. (a < b). The σ -algebra of Lebesgue-measurable subsets of R is denoted by L. λ means the Lebesgue measure on L. Assume that f : [a, b] → R is a convex function and g : [a, b] → [0, ∞[ is a Lebesgue-integrable function which is symmetric to a+b 2 . The classical Fejér inequality (see [8]) says

Applications
This is a weighted generalization of the Hermite-Hadamard inequality (see [9]) which has the form By applying our main result, we can obtain a refinement of the left-hand side of the Fejér inequality. There are refinements of the Fejér inequality (see e.g. [21]), but the next result provides a totally different refinement. (H 2 -H 3 ). Let

Proposition 14 Assume
Proof (a) This is a special case of Proposition 14 (a).
(b) Let the measure μ i (i ∈ I) be defined on L by and then apply (a). The proof is complete.
The second application concerns quasi-arithmetic means. Let C ⊂ R be an interval, and let q : C → R be a continuous and strictly monotone function. If (X, A, μ) is a probability space, and ϕ : X → C is a function such that q • ϕ is μ-integrable on X, then is called the quasi-arithmetic mean (integral q-mean) of ϕ. Now we introduce some new quasi-arithmetic means related to the formula C mes .

Definition 17
Assume (H 1 -H 4 ). Let C ⊂ R be an interval, let q, r : C → R be continuous and strictly monotone functions, and ϕ : X → C be a μ-integrable function for which q • ϕ and r • ϕ are also μ-integrable functions. Then we define the following quasi-arithmetic mean of ϕ with respect to C mes : M q,r (ϕ, μ, λ, π, M I ) .
In the next result the introduced means are compared.
Proof We consider only the case when q•r -1 is convex and q is strictly increasing. By applying Theorem 8 with f := q • r -1 and with r • ϕ instead of ϕ, we obtain and this implies the result since q -1 is strictly increasing. The proof is complete.
Finally, some applications to information theory are presented. Throughout the rest of the paper probability measures P and Q are defined on a fixed measurable space (X, A). It is also assumed that P and Q are absolutely continuous with respect to a σ -finite measure ν on A. The densities (or Radon-Nikodym derivatives) of P and Q with respect to ν are denoted by p and q, respectively. These densities are ν-almost everywhere uniquely determined.
Introduce the set of functions