Ricci curvature, isoperimetry and a non-additive entropy

Searching for the dynamical foundations of the Havrda-Charv\'{a}t/Dar\'{o}czy/Cressie-Read/Tsallis non-additive entropy, we come across a covariant quantity called, alternatively, a generalized Ricci curvature, an $N$-Ricci curvature or a Bakry-\'{E}mery-Ricci curvature in the configuration/phase space of a system. We explore some of the implications of this tensor and its associated curvature and present a connection with the non-additive entropy under investigation. We present an isoperimetric interpretation of the non-extensive parameter and comment on further features of the system that can be probed through this tensor.


Introduction
Havrda-Charvát [1] / Daróczy [2] / Cressie-Read [3,4] / Tsallis [5,6] entropy is single parameter family of functionals, which have attracted some interest in the Statistical Mechanics community over the last 25 years. To go straight to the point, assume that a discrete set of outcomes is labelled the set I. Assume that each outcome is labelled by i ∈ I and the corresponding probability of its occurrence is indicated by p i . Then the non-additive entropy S q that we will be interested us in this work, is defined by Here k B stands for the Boltzmann constant which will be set k B = 1 almost everywhere in the sequel. The naive extension to a continuous set of outcomes, is characterised by the probability density function ρ : Ω → R, and is assumed to be absolutely continuous everywhere on Ω with respect to the Lebesgue measure (volume) dvol Ω is In both of the above expressions q ∈ R, although a recent work has suggested [7] the possibility of q ∈ C. We called the definition of S q for continuous sets of outcomes (2) as "naive" since there has been some recent controversy about the validity of (2) [8][9][10][11][12][13][14][15] and some related skepticism on whether it is possible to extend S q to continuous sets of outcomes. We consider this criticism to be valid and the general controversy not yet settled. Nevertheless, in the absence of a viable alternative or a consensus, we will use (2) in the sequel as the version of S q for a continuous sets of outcomes, having the above possible caveat in mind. It is reassuring to notice that for q → 1 one recovers the Boltzmann/Gibbs/Shannon (BGS) entropic functional A question of fundamental importance for any entropic functional is to determine its dynamical foundations. More specifically, to determine the microscopic dynamical systems whose collective behaviour is encoded by the entropic functional under consideration. Moreover one can ask whether is it possible, even in principle for someone to predict which particular entropic functional, or less ambitiously, which are the common features of a class of entropic functionals that can effectively describe the collective behaviour given a microscopic dynamical system. In our considerations, despite the fact that such an assumption can be considered substantially, or even unnecessarily, restrictive, we will always have in mind Hamiltonian systems of many degrees of freedom. The dynamical behaviour of such systems is described by their evolution in their configuration or phase space. Such a space is a manifold M endowed with a Riemannian metric g. There may be additional structures present, such as the symplectic structure in phase space ω whose presence has unexpected and profound consequences [16]. However in this work, we will use exclusively the underlying Riemannian manifold (M, g) and ignore any further structures that may be present.
In Section 2, we discuss the basics of Ricci curvature on Riemannian manifolds and the generalized / N-/ Bakry-Émery Ricci curvature and the related tensor constructed via optimal transport. In Section 3, we discuss the connection of the generalised Ricci curvature to S q through a gradient flow and through the behaviour of the functions belonging to the displacement convexity classes DC N . In Section 4, we present an isoperimetric interpretation of the non-extensive parameter q. We also present an interpretation of q in terms of a projection arising from coupling the system to an external "thermostat". In Section 5, we present a brief assessment of the current situation.

Geometry in Mechanics
Consider an autonomous dynamical system of n interacting point particles, having mass m i , i = 1, . . . , n. Let its configuration space M be parametrized by the local coordinates {x i ,ẋ j }, i, j = 1, . . . n and let its Lagrangian be . , x n ,ẋ 1 , . . . ,ẋ n ) The dot stands for the derivative with respect to some parameter ("time"). To make the analysis more tractable, we simplify (4) by assuming generalized velocity-independent interactions, namely This potential form is quite restrictive as it does not cover the case of point particles interacting with Electromagnetic or Yang-Mills fields. However, it turns out that such interactions can be straightforwardly incorporated by following a similar approach for the tangent bundle T M or more general, model-dependent, principal or associated bundles related to M. However, it turns out that such interactions can be straightforwardly incorporated by following a similar approach.
If we wish to be slightly more general, by possibly incorporating the effect of constraints and reducing the system to truly independent variables (without constraints) even locally, we can use instead of (4) where M ij stands for a positive definite quadratic form (mass matrix) which can be used as a Riemannian metric on M. In the above case (6), when all masses of particles are taken as equal to each other we get back (4). Since the system is autonomous, with canonical momenta p i = δL/δẋ i , its Hamiltonian corresponding to total energy E of the system, is an integral of motion. Maupertuis' principle, recast in Hamiltonian terms, states that the motions of the system are the stationary paths in M of the adiabatic functional where c : [0, 1] → T * M, are rectifiable curves with fixed initial and final points having total energy H = E. Maupertuis' principle, as generalized by Hamilton, states that physical motions are extremals of the length functional (9) of such curves c. In geometric terms, such extrema are geodesics whose local characterisation in terms of the coordinate components of the metric are geodesics on the level sets of energy E of M. The geodesics equation in such local coordinates is where the Christoffel symbols (connection coefficients) Γ i jk are given, as usual, in terms of the metric tensor components g ij by where ∂ i = ∂/∂x i and the Einstein summation convention is assumed over repeated indices in (9)- (12) and henceforth, unless otherwise stated.
Here, and in the sequel, s represents the arc-length of a curve c which is assumed to be arc-length parametrised. An alternative to the M ij metric, is the Jacobi metric for which the geodesic equation (11) reduces to Newton's Second Law In classical Statistical Mechanics we consider dynamical systems having sets of "nearby" initial conditions. Phrased differently, we are not considering just one particular geodesic c 0 to describe the evolution of the system, but rather a set of them, all of which are initially close to c 0 . The behavior of nearby geodesics is encoded, in the linear approximation, via the Jacobi fields with coordinate components J i , i = 1, . . . , n. In their dynamical evolution, the infinitesimal separation between nearby geodesic is determined by the by the geodesic deviation (Jacobi) equation. This is a linearization of a variation of the equation of geodesics and, expressed in the local coordinates we are using, it is where R i jkl are the components of the Riemann tensor in the coordinate basis of T M.

Rudiments of Riemannian Curvature
The Riemann tensor R with components in a coordinate basis indicated by R i jkl is a fundamental object in Riemannian geometry. It should be noted that despite a century and a half of intense exploration, many of its properties still remain unknown [17]. A way to motivate its introduction is by searching for a local quantity that allows us to determine the distance between two points in a Riemannian manifold. Such a calculation is practically intractable, as can be seen by looking at the geodesic equation (11) in conjunction with (12). The Riemann tensor is a quantity that allows us to infinitesimally address such a question. This viewpoint, among several others, can be seen in [17].
The components of the Riemann tensor are explicitly given, in terms of the metric, by The Jacobi field with coordinate components J i , i = 1, . . . , n points from c 0 toward nearby geodesics, as noted above, but may also point along c 0 itself. The definition of the Riemann tensor is far more elegant and brings forth its linear and differential nature when expressed in terms of the Levi-Civita connection ∇ (the unique symmetric connection preserving the metric) as a multi-linear map R : where X, Y, Z ∈ T M and [·, ·] indicates the Lie Bracket on T M. By taking advantage of (15), J i can be chosen to be perpendicular to c 0 . We see from (16), that the behavior of nearby geodesics is controlled by the Riemann curvature tensor (17). Consideration of initial conditions near c 0 amounts to averaging in the n-1 perpendicular directions to c 0 , as expressed by J i , i = 2, . . . , n, by using a suitably chosen measure. The simplest case is to choose a uniform measure, meaning a measure which is a constant multiple of the Lebesgue measure ("Riemannian volume") dvol. Then evolution of a set of initial conditions along c 0 is controlled by the average of the Riemann tensor in the 2-planes spanned by the tangent to c 0 and one of the perpendicular directions to c 0 , an average that results in the Ricci tensor along c 0 . Therefore, the Ricci tensor is given by the following, essentially unique, contraction of the Riemann tensor For geometric purposes, one uses the following far more transparent, but essentially equivalent as can be proved via polarisation, scalar quantities. Their formulation is slightly easier on orthonormal bases. So, consider an orthonormal basis of T M with respect to the metric tensor g indicated by {e i }, i = 1, . . . , n. Let e 1 be the tangent to c. The sectional curvature in a 2-plane of T M at P ∈ M spanned by e p , e q , p = q, p, q = 1, . . . , n is defined by without a summation over p, q. In a slightly different notation, following the notation of (17), we can express the sectional curvature as where no summation over p or q takes place, once more. As defined, the sectional curvature is formally a function on the Grassmann manifold G 2,n (M) of 2-dimensional planes of T M. So, the sectional curvature is, at its core, an essentially 2-dimensional quantity. Its geometric meaning becomes evident through the following two theorems which are true in 2-dimensions. Let P be a point of the Riemannian manifold M and let r be the radius of a geodesic circle around P . Let c(r) be its circumference and A(r) be the area of the geodesic disk whose boundary has length c(r). Then [18] Bertrand-Puiseux (1848) Essentially equivalently [18] (Diquet 1848) According to these theorems one sees that the sectional curvature determines how much the volume of a sphere or ball exceeds that of the corresponding sphere or ball in flat space. There are several other, equivalent, formulations of this geometric fact [19,20].

About the Ricci Curvature
The Ricci curvature in the direction of e 1 , which as was stated above is assumed to be tangent to the geodesic c passing by P , is a symmetric bilinear form related to the Ricci tensor by In other words, it is given in terms of the sectional curvature at P ∈ M, This makes evident the fact that the Ricci curvature is the outcome of averaging in all directions perpendicular to e 1 at P as was previously mentioned. This averaging is also explicit in the contraction in the indices of the Riemann tensor resulting in the Ricci tensor (18). Hence the definition of the Ricci tensor or Ricci curvature uses explicitly two distinct facets of Riemannian manifolds: their metric (and connection which is uniquely defined through the metric) and a measure (in this case the volume). These two concepts are uniquely inter-related in Riemannian manifolds exactly because such manifolds M are locally isometric, to first order, to the Euclidean space R n [17]. This can be most easily seen via the expansion of the metric in geodesic normal coordinates around P ∈ M taken as the origin of the normal coordinate system One can define the volume in Riemannian spaces via the following two requirements [17] • For surjective, distance-decreasing maps f : • The volume of the unit cube [0, 1] n in R n is normalized so that vol ([0, 1] n ) = 1.
and then prove the more familiar formula in local coordinates There are several ways for someone to understand the geometric meaning of Ricci curvature [17,[19][20][21]. A very common one, which will be useful in the sequel, is through the Bishop-Gromov volume comparison theorem. The main idea behind comparison theorems [17,[19][20][21][22][23][24] is that even though it is quite hard to analyze geometric features of manifolds by directly solving the partial differential equations equations resulting from infinitesimal considerations, such as the geodesic (11) or the Jacobi equations (15) , one may still be able to extract useful geometric and topological information about such manifolds by comparing them to "simpler" spaces. As such "model" spaces one usually considers the simply-connected manifolds of constant sectional curvature ("space forms"). In this spirit, one tries to establish inequalities, bounding the quantities of the manifold of interest by the corresponding ones of the model space. Such ideas can also be extended to more general metric spaces, quite frequently also endowed with measures [24]. This "comparison" idea is not foreign in Physics, where occasionally one encounters such inequalities, especially in more rigorous treatments of Statistical Mechanics [25,26].
There are different versions of the Bishop-Gromov volume comparison theorem expressing the same idea slightly differently. For our purposes we use the following: Assume that (M, g) is a complete Riemannian manifold with Ric ≥ (n − 1)k, k ∈ R and P ∈ M an arbitrarily chosen point. Let B P (r) be the geodesic ball of radius r centered at P and B k (r) be the ball of radius r in the space form of constant sectional curvature k and dimension n. Then the function is a non-increasing function of r. This volume ratio approaches 1 as r → 0 due to (25). An immediate corollary of this is (Bishop's inequality) From a physical (General Relativistic) viewpoint the Bishop-Gromov volume comparison theorem (27) is "obvious". Let Scal indicate the scalar curvature, namely the unique contraction of the Ricci tensor Scal = R µν g µν . Einstein's equations determine the metric g µν in terms of the stress energy tensor T µν . Here c stands for the speed of light and G is the universal gravitational constant. Roughly speaking, in areas where there is a lot of matter, T µν will result in strong gravity, so the bound in the Bishop-Gromov inequality will be high. Consider a homogeneous and isotropic space-time, as in the case of the Friedmann-Robertson-Walker (FRW) cosmology, as a comparison space. Any space-time that has more attractive matter than that, will make the geodesics converge faster, as seen also in (21), hence the volume of a ball centered along one such member of the geodesic congruence will contract faster than in the model (FRW) space. The above analogy relies in the fact that the results we use in the Riemannian signature carry over to the Lorentzian signature case, something that is known, even though non-trivial to prove [27]. A second implication of Ricci curvature, still in the context or Comparison Geometry, and of potential interest in Physics is via Lichnerowicz's inequality [28]. This provides a lower bound on the lowest non-trivial eigenvalue λ 1 of the Laplacian ∆ in terms of a lower bound on the Ricci curvature. As a reminder, the Laplacian ∆ on functions f : M → R on a Riemannian manifold (M, g) is defined to be the trace of the Hessian ∆ = tr (Hess) (30) or expressed in coordinates {x i , i = 1, . . . , n} in terms of the components of the metric g ij where summation over repeated indices is assumed. The significance of λ 1 for Physics is substantial: in the particular context of Lagrangian field theories, it expresses the lowest excitation of the mass spectrum, or alternatively and depending on the interpretation, the mass of the lightest particle in the particle spectrum. Since it is not practically feasible to explicitly calculate λ 1 even for the simplest manifolds, with scant few exceptions, providing bounds to it is the best that someone can hope for. Lichnerowicz's theorem states that for (M, g) a compact Riemannian manifold without boundary with for R k > 0, then λ 1 satisfies It is worth observing that the right hand side is λ 1 of the sphere S n endowed with the round metric, having constant curvature k/(n − 1). A result by Obata [29] states that the equality is attained if and only if (M, g) is actually isometric to such a round sphere. This result can be interpreted as stating that knowing the mass of the lowest excitation, alongside the Ricci curvature lower bound (32), uniquely determines the whole spectrum of excitations in this space(-time). It actually determines much more than that: it completely determines the geometry and the topology of such a space(-time). In General Relativity, or diffeomorphism-invariant theories, the Ricci curvature bound (31) usually results from imposing a strong energy condition [27], namely a lower bound to where X, Y ∈ T M with some additional causal behaviour assumed. Such a requirement (33) constrains the properties of what a "reasonable" mass-energy distribution in space(-time) should be allowed to have. In the above paragraph, we use the word "space(-time)" loosely, as we always assume that g has a positive-definite signature.

Generalized Ricci Curvature
There are several ways to go about generalising the above concepts. There is not any really unique extension of the Ricci curvature in more general metric-measure spaces, but several proposals, slightly different from each other exist. There are several different motivations for such generalizations. The most immediately pertinent one for our purposes, is the claim that S q describes systems that are non-ergodic. As a result, in such cases one does not have available a uniqueness statement like the Krylov-Bogoliubov theorem [30], or a statement relating the asymptotic averages along a trajectory (or iterates of maps) and phase space (micro canonical) averages like Birkhoff's ergodic theorem and its implications [30]. On the other hand, things may not be as uncontrollable as they may appear at first sight: indeed according to the Ergodic Decomposition theorem [30], every invariant Borel probability measure of a continuous map on a metrisable compact space can be decomposed into a sum of ergodic invariant probability measures each of which is supported in disjoint subsets of the whole space. In practical terms though, concretely determining such a decomposition into sets having ergodic measures may not be tractable. So, to proceed along these lines, we assume that a non-ergodic measure is absolutely continuous with respect to the volume of the underlying configuration / phase space M whose projection on the total energy E hyper-surfaces M E gives rise to the micro-canonical measure. Hence it is of interest to determine a generalisation of the Ricci tensor that is defined with respect to the measure rather than with respect to the volume element, where f : M → R. So, the question is to define a Ricci-like tensor that will infinitesimally control the behaviour of such a dµ as in (34). The easiest approach, used extensively in Physics, is to construct, by hand, a simple two-index symmetric tensor from the scalar f and from the Ricci tensor R ij . Symmetric tensors constructed from f and involving two derivatives are (∂ i f )(∂ j f ) as well as the Hessian In local coordinates x i , i = 1, . . . , n the Hessian can be written as where the summation convention upon repeated indices is assumed. So the easiest choice would be to define a generalised Ricci tensor (R N ) ij by This definition is essentially due to Bakry and Émery [31] where N is a letter which is just part of the notation, at this stage. The realisation that the term (∂ i f )(∂ j f ) can also be included with an arbitrary undetermined coefficient N ∈ [1, ∞) and the exploration of some of its consequences was provided first by Qian [32]. Therefore, a more general definition of R N is provided by The exact normalisation in front of the last term is a matter of convention. What is not a matter of convention though, is that in this expression there is the undetermined coefficient N which has to be inserted by hand since it is not determined by any of the differential or algebraic (such as the Bianchi or the Palatini) identities that the Riemann tensor and its contractions obey. The geometric origin of N is straightforward to pinpoint: in the Riemannian case, the metric uniquely determines the volume element and the Hausdorff dimension of the manifold expresses this unique relation between the metric and the volume, as in the discussion preceding (26). In the case of a general metric-measure space, the measure is not related in any unique way to the metric. In the case of Finsler spaces for instance, which from a particular viewpoint are considered to be a "direct" generalisation of the Riemannian manifolds, no unique measure is preferred, so several have been proposed depending on one's goals [24,33]. Therefore, the effective dimension of the measure dµ (35) in a metric-measure space cannot be captured by the Hausdorff dimension n of M, since n is really expressed in terms of the volume of M. Hence the effective dimension of dµ is an additional piece of information that has to be provided, a priori.
One can put together the definitions (39) and (40) by observing that lim N →∞ of (40) reduces to (39). Putting all these elements together, one can state that the central differential quantity for characterising the non-ergodic behaviour of systems conjecturally described macroscopically by S q is the generalized-/ N-/ Bakry-Émery-Ricci tensor defined by where, by convention, ∞ · 0 = 0. The last two lines are put there for completeness and for having a unifying treatment of these extreme cases with the rest. The definition (40) is taken from the works of J. Lott and C. Villani, in particular [34,35]. We observe that the generalized Ricci tensor R N is, in reality, a one-parameter family of tensors giving non-trivial results when N ∈ [n, ∞]. It may not come as a total surprise that such an extension of the Ricci tensor and the related curvature is not unique, see [36][37][38] for some alternatives, for instance. We will mention the synthetic definitions which reduce to (41) in the case of smooth measure spaces in the following Sections. The definition of (41) is a matter of choice, which however has many desirable properties [34,35]. Our interest to this choice having some more widespread, or even fundamental, significance comes from that it has been "re-discovered" and used independently in different contexts. An example of the former occasion is via the work of Bakry-Émery [31] and Qian [32]. The first two authors are interested in properties of the heat semigroup in the presence of external conservative forces which provide an additional drift term to the heat equation. Their analytical approach lead them to a Bochner formula whose further analysis lead to the definition of the curvature-dimension CD(k, n) condition. Details can be found in [31,39]. This condition expresses properties of manifolds of sectional curvature at least k and dimension at most n in a way which is strongly reminiscent of the Gromov pre-compactness theorem [24]. A second occasion, is the case of the extensive use of the N = ∞ expression in (41), in particular, in the Ricci flow leading to the proof of the Poincaré, and consequently of Thurston's geometrization, conjecture by G. Perelman [40][41][42]. A third occasion was the work of Chang, Gursky and Yang [43] who asked on whether one could define conformally invariant analogues of the Ricci and scalar curvatures on smooth metric measure spaces. The comparison and interpolation between [31] and [43] was taken up by Case [44,45], who has also contributed to the investigation of the quasi-Einstein metrics (R N ) ij = λg ij resulting from the definition of (41) for N < ∞. In the Physics literature, the generalised Ricci curvature has been recently discussed in explorations of scalar-tensor (e.g., dilaton, Brans-Dicke etc.) theories of gravity [46][47][48].
The geometric meaning of (R N ) ij starts becoming clearer when one tries to check on whether, or under what conditions, standard results of Riemannian geometry can be extended to smooth metric measure spaces [34,35,49,50]. A theorem in the spirit of the Bishop-Gromov volume comparison goes as follows: Consider a compact, smooth metric measure space (M, g, µ) where following (35), . Then for all 0 < r ≤ R and P ∈ M one has The theorem actually holds for the class of measured length spaces, which are more general than Riemannian manifolds. Usually volume comparison theorems in this spirit also require some additional conditions (such as bounds, convexity properties etc.) on f . It is worth noticing that according to the Bishop-Gromov comparison theorem, the weighted volume of M does not expand faster than polynomially, with the exponent being exactly N . This justifies, to some extent, the statement made above that N can be seen a substitute for the Hausdorff, or as an effective dimension, of the measure µ on M. Regarding Lichnerowicz's inequality, the generalization goes as follows: Since the measure of integration changes from dvol to dµ the Laplacian has to be modified if we want to keep it a self-adjoint operator with respect to dµ. The new Laplacian is It is straightforward to check that this is indeed a self-adjoint operator with respect to the new measure dµ, namely that for ϕ 1 , ϕ 2 ∈ L 2 (M, µ), one has It should be noted at this point that, physically, this is the Laplacian on functions of M but in an external field whose potential is f . Therefore, maintaining the self-adjoint property of the Laplacian amounts to modifying its sub-leading symbol by adding a drift term. Generalizing Lichnerowicz's theorem one can prove that if (R N ) ij ≥ kg ij , with k > 0, then Comparing (46) with (33) we see, once more, the validity of the interpretation of N as a dimension of the measure µ. It is quite interesting to notice that in the case of N = ∞ the resulting inequality is λ 1 ≥ k which is independent of N . This has important implications in the sequel as the N = ∞ case will turn out to be related to S BGS when a finite N is related to S q . Having stated all the above, it appears, and correctly so, that the considerations are quite generic and could be satisfied by any functional form related to non-ergodicity. This is true, in part. It turns out that an infinity of entropic functionals could play a significant role, in this respect. The functional S q has some special significance even among them, although it is not unique. How these elements work together, will be discussed in the next Section.

Ricci Curvature via Optimal Transport
There are several threads of development leading to the relation between S q and the generalised Ricci curvature R N . For a comprehensive treatment of these topics, one can start by consulting [35].

Otto's View: the Porous Medium Equation and the Geometry of Space of Probability Distributions
An influential idea that connects S q with the above developments is due to Otto [51]. He studied the set of solutions of the porous medium equation in R n where ρ ≥ 0 is a density function on R n which explicitly depends on time t ∈ [0, ∞). In this equation m ≥ n−1 n and m > n n+2 for reasons that will be stated later. The question that arose was how to interpret the porous medium equation as a gradient flow equation. As a reminder, a gradient flow [40][41][42]52] of sufficient generality for our purposes, consists of a Riemannian manifold (M, g) and an "energy" functional E[ρ] on (M, g) obeying the autonomous differential equation A simple example of such a gradient flow is the heat/diffusion equation where E[ρ] is the kinetic energy (Dirichlet functional) of ρ. The interesting property of a gradient flow is that E[ρ] decreases along the actual trajectories ρ(t). The established practice until [51] for the porous medium equation (47), was the following: The manifold M of its solutions was taken to be where dvol indicates the infinitesimal volume element on the space we are integrating over (in this case R n ). Its tangent space at ρ ∈ M, T ρ M, is The tangent space T ρ M can also be seen as where ∼ identified φ differing by an additive constant and where φ were solutions of the Poisson equation The metric tensor g at ρ ∈ M was taken to be The "energy" functional was taken to be Otto's view was to keep (49)-(51) as they were, but change (52) to requiring φ to satisfy − ∇ · (ρ∇φ) = s (55) and modify the metric g on M from (53) to One can immediately see from (49), (53), (56) that (M, g) is an infinite dimensional Riemannian manifold, so appropriate care should be taken of convergence issues in a more careful, less formal, treatment. The "energy" functional was modified from (54), by Otto, to be We recognise right away that the "energy" functional in (57) has the exact same form as the entropy S q if q is identified with m. We also see that S BGS plays the role of the "energy" functional in the gradient flow characterisation of the ordinary heat/diffusion equation which corresponds to m = 1 in (47). Moreover we see the prominent role that the modified measure ρ dvol, which is exactly (35) with ρ = exp(−f ), plays in the viewpoint advocated in [51]. Otto went much further along these lines. He established a formal calculus on (M, g), dubbed "Otto calculus", without paying too much attention to the subtleties arising from the fact that (M, g) is infinite-dimensional. He discovered, for instance, that the manifold (M, g) has formally non-negative sectional curvature. Addressing more carefully some of these topics was undertaken by Lott [53], among many others [34,[54][55][56] and several aspects are still under intense investigation. The long-term asymptotic behaviour of the porous medium equation is expressed by the Barenblatt solution ρ B . To determine the scaling behaviour of a solution in the asymptotic regime close to ρ B , we re-express it as where One can prove thatρ satisfies the gradient flow equation, where τ = log t where the modified energy functionalẼ is given bỹ and where V is represents the second moment functional ofρ with |x| 2 being the Euclidean norm of {x i , i = 1, . . . n} ∈ R n . Otto proved [51] thatẼ is uniformly strictly convex on (M, g) since it satisfies and moreover that An interpretation of these results in the context of S q is as follows: First of all, the convergence to the Barenblatt solution, which we have not explicitly stated, but whose scaling is the same as that of (58) can be traced back to the origins of S q in the Physics literature [5,6]: fractals usually do have some form of self-similarity which is expressed by scaling properties. Second: the ad hoc constraints on the value of m stated above, can now be seen as requirements for the convexity of E and for the well-posedness and finiteness at ρ B of E and V . Third, we observe in (64) that the rate of convergence of ρ to ρ B is polynomial (power-law) in terms of t. This, conjecturally, is one of the important properties of systems described by S q : their power law rate of convergence to their long-time asymptotic limits [6]. It is worth mentioning that this rate of convergence is believed to also have the form of a q-exponential However, the value of the non-extensive parameter controlling the rate of approach to equilibrium in the q exponent distribution is not necessarily the same as that in the equilibrium distribution itself [6]. There is no a priori reason why these two distributions that share the same functional form should also have the same non-extensive parameter. In the absence of any physical arguments to the contrary, we would expect that different aspects of the systems under consideration are described by different values of the non-extensive parameter, if we can even assume that their macroscopic behaviour is described by S q in the first place. Moreover, if the renormalisation group lessons are taken at heart, it is also entirely possible that q, for a given system, may be different for the approach to different quasi-equilibrium states. Leaving these possibilities on the side, we can also see in (64) an aspect related to the existence of the observed, in numerical simulations, quasi-equilibrium states [57][58][59]. If the approach to equilibrium is polynomial, then it is too slow, with respect to an exponential time scale that we usually employ in analyzing the approach of systems to equilibrium. This way of thinking is similar to what we advocated in [60,61], for justifying why the largest Lyapunov exponent of systems described by S q is zero. In closing, we would like to refer to [62] for a comprehensive treatment of the porous medium equation, including the Barenblatt solution stated above, and [63] for properties of some of its possible fractional generalisations.

Optimal Transportation and Wasserstein Spaces
In a seemingly different line of development from the above, the definition of the generalized Ricci curvature and its relation to S q resulted from attempts to address aspects of the Monge optimal transport problem. This problem ascribed to Monge [64] tries to determine the least costly way of transporting a quantity of soil/earth ("déblais") from a pile to an excavation/embankment ("remblais"). Naturally, one has to specify before-hand what is the cost function c which is used to determine such a total cost. Although the problem was initially posed in R n , it can clearly be formulated for general complete separable metric (Polish) spaces endowed with Borel probably measures. It may be worth mentioning right away, that the general problem remains unsolved even for simple cost functions in R n , although substantial progress has been made in special cases, especially during the last two decades.
The most obvious cost function c with a geometric significance, is the Euclidean distance d(x, y) between of the points x in the pile and points y in the excavation. This choice of cost function c(x, y) turned out to be difficult to deal with, the root cause being its lack of uniform convexity properties. To get a simpler problem without deviating too much from the original, one uses instead as cost function which turns out to be more manageable as it is uniformly convex in R n × R n . Monge's transport problem can be mathematically formulated as follows [34,35,[65][66][67] (these are general references for this whole subsection): consider two Borel measures µ + , µ − on R n , or in more general Polish measure spaces mentioned above, such that both of which are finite. In addition, consider the class of Borel maps f : R n → R n pushing forward This can be translated as requiring for all continuous functions w : R n → R with A 1 , A 2 ⊂ R n indicating the support of µ + , µ − respectively. Let the set of such functions be indicated by F. Given a cost function c : R n ×R n → [0, ∞) consider the total cost functional The goal is to determine the existence and find the properties of an optimal transport f o ∈ F such that This problem has proved to be quite difficult to address. There are at least three reasons for this • The problem is highly non-linear. To see this more concretely, let's assume that both µ + , µ − are absolutely continuous with respect to the volume element of R n with corresponding Radon-Nikodym densities ρ + , ρ − . Then it turns out that the push forward condition (68) translates into the Monge-Ampére, non-linear, equation where det stands for the Jacobian determinant of the differential map ∇f .
• Such a solution may not exist: consider for instance µ + to be the Dirac delta function but not µ − .
• The transport condition (69) is not weakly sequentially closed and this creates the additional complication that the minimum (71) may not be realized, as subsequences of approximating maps may not converge in any reasonably weak topology.
Progress toward solving Monge's problem was slow until Kantorovich [68,69] substantially reformulated it in more amenable terms. The difficulties associated with the Monge problem are partly attributable to the fact that the optimal transport map the f o that we seek cannot "split" the measure dµ + . Heuristically, f ∈ F cannot move a part of dµ + in a location of R n and another part in some other location of R n , as such a map would not be well-defined. Such a constraint (68) is too strong to handle. Instead, Kantorovich made the following two modifications. First, he transformed the problem into a linear one, by recasting it as follows: Consider the space P(R n × R n ) of Borel probability measures µ whose push-forwards p 1 , p 2 on the first and second R n factors respectively are µ + and µ − P = {µ : Probability measures on R n × R n such that (p 1 The elements of P are called transference plans. Consider then, for µ ∈ P the modified cost functional This functional is linear in µ so if some assumptions on the cost function can be made, compactness can ascertain the existence of at least one minimizing measure, called optimal transference plan, µ o ∈ P It should be stressed at this point that there is no a priori guarantee that an optimal transference plan µ o of the Kantorovich functional I K (75) corresponds to a mapping f o sought after in Monge's functional I M (71). The second crucial observation allowing to make progress in solving the Kantorovich problem is the Kantorovich -Koopmans [68][69][70] duality: in the spirit of duality in convex analysis and geometry they reformulated the variational problem (74) so that instead of a minimisation, it became a maximisation problem. To formulate this dual variational problem, define the set Then, introduce the dual of the Kantorovich functional I K * as The Kantorovich-Koopmans duality is the statement that the minimisation problem (75) is equivalent to the maximisation This problem is amenable to the methods of linear programming and as such it is easier to solve than the original (70). What is still not clear though is whether a solution of (78) can be derived from a solution of (71). The converse is clearly true: suppose that f o is a minimizer of (71). Then Id × f o : R n → R n × R n and µ = (Id × f o ) µ + ∈ P minimizes I K in (74) so it is a solution to (75). Let's be more general in this paragraph and assume that M is a Polish space (complete and separable metric space) endowed with a distance function d and let µ ∈ P(M × M) be a transference plan with marginals µ + and µ − as above. Moreover, let's assume that M is compact, for simplicity. Let's choose as cost function c(x, y) = d p (x, y), p ≥ 1 The Kantorovich functional (74) with such a cost becomes Then (80) can be considered as a measure of the discrepancy between the marginals µ + , µ − . Naturally, there are many ways to define the discrepancy between measures [71], depending on one's goals. For our purposes, the optimal transference plan (75) with the cost function (79), in other words the minimizer of (80), enters naturally in the picture. It is interesting to notice that such an optimal transference plan always exists under the above assumptions [35] so the variational problem (75) does have an actual solution. If, in addition one sets or, in other words, if one defines for any two Borel measures µ 1 , µ 2 ∈ P(M × M) where the minimum is taken over all possible transference plans µ ∈ P with marginals µ 1 , µ 2 , then it can be immediately checked that W p (µ 1 , µ 2 ) satisfies all the properties of a distance function. As such, it is called the p-Monge-Kantorovich-Rubinstein-Vasherstein distance, or in short and even though it is a partial misnomer, the p-Wasserstein distance (metric). All these metrics, except W ∞ , give the same topology on a compact M which is the weak- * topology. For such a distance function to be non-trivial the integral in (82) must converge. This becomes quite important especially when M is non-compact. Hence, the definition of the p-Wasserstein distance restricts the elements of P to only the ones with finite p-moments. The number of converging moments required for each case of p is the only essential difference between the different W p metrics on P. The subspace P p (M×M) of P(M×M) of measures with converging p-moments, endowed with the Wasserstein distance then becomes a metric space with several desirable properties [35] and is called the p-Wasserstein space of M. We forego expanding on this topic of current research as further information is not needed in the subsequent discussion..

The Brenier Map and Its Extensions: the Role of Convexity
We go back to R n , initially at least, and to assuming that c(x, y) = 1 2 d 2 (x, y) where d denotes the Euclidean distance in R n . We would like to determine a solution to Monge's problem in this case, namely to minimize Skipping a detailed explanation of the reasons why [72,73], we just state the result [74] (Brenier map): Let µ + and µ − be probability measures on R n which are absolutely continuous with respect to the Lebesgue measure (volume) vol of R n . Then there exists a convex function ϕ : R n → R whose gradient f = ∇ϕ : R n → R n pushes forward µ + to µ − . This map is unique, almost everywhere, and it therefore provides a unique solution to Monge's problem (70). Immediate generalizations are due to [75]. In the case of the Brenier map, the Monge-Ampére equation (72) for the corresponding densities ρ + , ρ − takes the more familiar form as long as ∇ϕ(x) ∈ R n . The convexity of ϕ implies that Jacobian obeys det(∇ 2 ϕ) ≥ 0 and it also guarantees that f is almost everywhere differentiable. To prove this, one can use the dual Kantorovich formulation (77). The constraint in (76) becomes Now, changing variables to so, (78) reduces to minimizing the functional under the constraint (87). By standard results of convex conjugacy theory [76], we know that there is such a pair u o , v o minimizing (88) and moreover these two functions are Legendre-Fenchel transforms of each other, namely they obey We will skip the non-trivial topic of the regularity of solutions to these variational problems altogether, referring to [35,67] for some statements and an extensive list of references on such issues. The question that is raised now is whether the above results can be extended from R n to Riemannian manifolds (M, g). This was addressed in part by [77][78][79] and by [80] for metric measure spaces. One modification that needs to be made is to replace the Legendre-Fenchel transform (86) by the generalized convexity transform for h : M → R ∪ ∞ with the cost function c(x, y) having the usual form c(x, y) = 1 2 d 2 (x, y) as for R n . Then Brenier's theorem [78] is extended as follows. For a closed, connected Riemmanian manifold (M, g) let µ + be absolutely continuous with respect to vol M and µ − arbitrary. Then there is a map ϕ : M → R such that ϕ cc = ϕ (ϕ is a c-concave function) so that the map exp(−∇ϕ) pushes forward µ + to µ − . Such a map is a unique minimizer within the set P(M × M) for Monge's transportation functional (83). In the language of the 2-Wasserstein space W 2 (M), the Monge transport between µ + , µ − takes place along the unique Wasserstein geodesic joining them. The last statement is substantially non-trivial, because if M were a general length space, there could be an uncountably infinite number of geodesics joining the two measures.
A second modification addresses the issue of convexity on Riemannian manifolds. There is no unique extension of convexity from R n to a Riemannian manifold M. Even then, one can either work directly geometrically with the space at hand, utilising properties of the distance or the volume functions, or express convexity indirectly through functional inequalities such as the Brunn-Minkowski, the Brascamp-Lieb, the Prekopa-Leindler etc. In [79,80] the second approach was taken. Results in Comparison Geometry were derived and extensively used. We will state just one such result to pave the way to the synthetic definition of the generalised Ricci curvature below. To generalise the linear interpolation tx + (1 − t)y between two points x, y ∈ R n to the case of M with distance function d, they [79] start by considering the set where t ∈ [0, 1] and for z lying between x, y. This definition can be extended from a point y to the If P ∈ M, consider the open ball centered at P of radius r by B P (r), and introduce a volume distortion by the volume ratio for t = 0. Obviously w 1 (x, P ) = 1, and w t (x, P ) ≥ 1 if the curvature is non-negative. whereas the opposite is true if the curvature is non-positive. In R n clearly w t (x, P ) = 1. An interpretation of (93) from a relativistic viewpoint, pretending for a moment that we are working in a space of Lorentzian signature, since light does travel on null geodesics w 0 (x, P ) represents the magnification of the area of a small light source located near P as is seen by an observer at point x. We compare this distortion function (93) with that of the standard n-dimensional space forms having constant sectional curvatures k > 0 and k < 0 respectively. The Ricci curvature in all these case is Ric = (n − 1)k. Then for k ∈ R n we set Then, if (M, g) is such that Ric ≥ (n − 1)k along any geodesic of length d(x, y) one has from the Bishop-Gromov comparison theorem (27) w t (x, y) ≥ sn k (td(x, y)) sn k (d(x, y)) where equality holds when M has constant sectional curvature k. This result be used in the sequel in the synthetic definition of the generalised Ricci curvature. As in Otto's work for R n , it was noticed that providing lower bounds on the Hessian (63) which is related to the energy functional (57), could be useful in determining rates of convergence of gradient flows on M to their asymptotic configurations. For the "energy" functional (57) with m = 1 [77] calculated its formal Hessian of P(M) and found that it is bounded from below by kg ρ (56) as long as the generalized Ricci tensor (R N ) ij for N = ∞ of M is bounded from below by kg ij . Hence the k-convexity, as in (63), of an appropriate energy functional in P(M) is intimately related to the a lower bound of the generalized Ricci curvature of M. The converse statement, still using the m = 1 case of the "energy" functional in (57), was established by [81]. The obvious question is whether these considerations can be extended to the cases of the "energy" functional in (57) for m = 1 and how would this influence the definitions of the generalized Ricci curvature. An answer was provided by [34,[82][83][84][85][86].

Displacement Convexity and Synthetic Definition of the Generalized Ricci Curvature
In this subsection we follow [34,86] and can be even more general than before by considering, not necessarily smooth, metric measure spaces (X, d, ν). Here ν is Borel will be considered as a "reference measure". For Riemannian manifolds M the role of ν is played by the volume, but a generic metric measure space lacks such a natural choice. It turns out that without substantial loss of generality, the constructions remain largely the same if we only consider measures µ absolutely continuous with respect to ν with Radon-Nikodym density ρ, namely µ = ρν. The general case is discussed in [34,86]. The first step is to consider a function U : [0, ∞) → R which is continuous and convex. Second, given µ, ν ∈ P(X), define the functional U ν (µ) by The role of these definitions is to allow us to construct "energy functionals" for a gradient flow akin to (57). Actually, from a physical viewpoint these are the entropy functionals S q and S BGS in the first and second lines of (57) respectively. Hence the original assumptions on the function U . To recover (57), consider the single-parameter family of functions U N : [0, ∞) → R with N ∈ [1, ∞], which is given by Then Upon setting in (98), we recover (57), or equivalently (2), (3). This also fits nicely with a standing assumption of our prior work such as [87][88][89][90] in which we have examined properties of S q for values of the non-extensive parameter q ∈ [0, 1] which are equivalent to N ∈ [1, ∞].
At this point, one can ask what additional properties should the functions U obey so as to qualify as being the foundation of the functionals (96) which should be "reasonable". Since functionals such as (96) can be interpreted as entropies, one can search for desirable properties that thermodynamic potentials possess [6,25,26,91] and demand these to hold in (96) or equivalently for the functions U . This motivated the introduction of the displacement convexity classes DC N and DC ∞ by [92]. A continuous, convex function belongs to the displacement convexity class DC N , i.e., U ∈ DC N , for N ∈ [1, ∞), if is convex on (0, ∞). Moreover, a continuous, convex function belongs to DC ∞ , i.e., U ∈ DC ∞ , if is convex on (−∞, +∞). We can see that, if N ≤ N , then DC N ⊂ DC N . More importantly, for our purposes, consider the following two functions that can be derived from U if it is sufficiently smooth, which have the form of Legendre transforms which is motivated by the definition of the pressure, if U were to be the internal energy, at the level of functions rather than functionals. Moreover consider the "iterated pressure" p 2 (r) = r dp(r) dr − p(r) Then, with the above assumptions on U and also assuming that it is smooth, we have that U ∈ DC N if and only if the function r −→ p(r) is non-decreasing on (0, ∞). Equivalently U ∈ DC N if and only if Among them, (104) shows the role of the function (97) giving rise to (98) and equivalently (57), (2) in DC N . This function (97) is by no means unique. However it has the best possible asymptotic behaviour that preserves the convexity property (100) defining DC N . Something similar can be said about the class DC ∞ : if U ∈ DC ∞ , then either U is linear or there exist a, b > 0 so that U (r) ≥ ar log r − br. The latter functional form results to S BGS for a probability density ρ. From the present viewpoint therefore, the function (97) giving rise to (98) is not unique at all, but has a very desirable functional form for membership in DC N and DC ∞ that moreover gives rise to the entropic functionals (2), (3).
As was also hinted in the last paragraph of Section 3.4, the definition of lower Ricci curvature bounds for a metric measure space is expressed via bounds on the convexity properties of functionals arising from functions belonging to DC N or DC ∞ , such as (97). The "amount of convexity", an example of which is shown in (63), is encoded in the definition of k-convexity. Convexity of a smooth function f on the unit interval t ∈ [0, 1], which will be assumed to parametrize a geodesic in (X, d, ν), with x, y ∈ X can be determined either by using the local criterion d 2 f dt 2 ≥ 0 or by Jensen's inequality By the same token, for k-convex functions, one can either use the criterion d 2 f dt 2 ≥ k or the inequality Relying on these, and for k ∈ R, a functional U ν with background measure ν is called • k-displacement convex if for any two µ 0 , µ 1 ∈ P(X) and for all Wasserstein geodesics {µ t }, t ∈ [0, 1], we have for all t ∈ [0, 1] • weakly k-displacement convex, if for all µ 0 , µ 1 ∈ P(X) there is at least one Wasserstein geodesic along which (108) holds.
For the rest of this section we follow very closely [34,86]. For any k ∈ R, define Λ : We say that for such a k ∈ R, the metric measure space (X, d, ν) has N = ∞ generalized Ricci curvature bounded from below by k if the Wasserstein space W 2 (X) is weakly Λ(U )-displacement convex, namely for any two measures µ 0 , µ 1 ∈ P(X), U ∈ DC ∞ and all t ∈ [0, 1]. For the finite N case we need a few more definitions: given again k ∈ R and for N = 1, define In addition, define It should be noted that the last three entries of (113) are the same as the right-hand-side of (95) from which they originate. Now consider a transference plan η ∈ P(X × X) and decompose it in terms of its marginals µ 0 and µ 1 where dη(x 0 |x 1 ), dη(x 1 |x 0 ) indicate the corresponding "conditional" measures. Then, the metric measure space (X, d, ν) has N -Ricci curvature bounded below by k, if there is some optimal transference plan η from µ 0 to µ 1 with Wasserstein geodesic µ t , so that for all U ∈ DC N and for all t ∈ [0, 1] Even though this is a far from trivial statement, one can see it as a weight-averaged version of (107) with weights provided by measure ratios akin to the right hand side of (95) embedded into the entropy functionals whose convexity properties in W 2 (M) are used to reflect the Ricci curvature properties of M, all in a comparison sense. We observe that the definition (115) is synthetic: nowhere have we required the metric measure space to be smooth. Its apparent drawback for applications to Physics is that this definition of Ricci curvature is only defined in a comparison sense. However, it is directly related to the convexity properties of S q in the Wasserstein space W 2 (X). On the other hand, the definition (41) of the generalised Ricci tensor/curvature in smooth metric measure spaces is local, so it is easy to compute, in principle, but it appears to have nothing to do with S q . However (41) and (115) give the same result, in a comparison sense, for a measured length space, hence for a Riemannian manifold: The Riemannian manifold (M, g) has its generalised Ricci curvature bounded below by k (115) if and only if [86] its N -Ricci tensor (41) is also bounded below by k, namely (R N ) ij ≥ kg ij . Before closing this Section, it may be worth noticing that for non-branching spaces such as Riemannian manifolds, there is no real distinction between an element of the displacement convexity class DC N and (97), so in this sense the entropic functional S q itself is unique in the determination of the generalised Ricci curvature (115).

Isoperimetric Interpretation of the Non-extensive Parameter and Related Matters
We can look closer at (99) which provides a relation between the free parameter N in the definition of the generalized Ricci tensor (41) and the non-extensive parameter q in the functional (2) under the additional tacit restriction to q ∈ [0, 1]. Inverting (99) we get This points out to an interpretation of q through N : N can be interpreted as an effective isoperimetric dimension of the measure µ on the smooth metric measure space (M, g, µ). It should be noted that the isoperimetric inequalities [24,93] ("why is a soap bubble spherical"), may arguably be the most important and influential among all the geometric inequalities [94]. For this reason, such an interpretation of q allows us to use the very extensive set of techniques and results that are known on this topic, mostly in a comparison sense, in reaching conclusions that may be of physical interest. It may be worth mentioning at this point that the isoperimetric dimension is an asymptotic property of a space and can be larger than its Hausdorff dimension [93], even if such a space is a Riemannian manifold: consider for instance the hyperbolic n-space whose Hausdorff dimension is n, but whose isoperimetric dimension is infinite, as volumes and areas of the boundaries of subspaces of hyperbolic spaces are related by linear isoperimetric inequalities. However, it becomes immediately obvious, that such an interpretation has a problem: indeed, even if we confine our attention to q ∈ [0, 1], it seems through, mostly, data fittings for finding q that the corresponding N is quite small, and certainly finite, for most systems. After all q is a thermodynamic parameter, so it is to be expected that its value in actual systems is finite, or even "relatively" small. On the other hand, for Hamiltonian systems of many degrees of freedom, their configuration and phase spaces have dimensions of order of magnitude n ∼ 10 23 which represents a gross mismatch with the values of N computed through (116) and from data fittings of q. We have not been able to resolve this interpretational conundrum in an acceptably convincing way. The best that we can currently state, is to vaguely suggest that N should actually represent an effective isoperimetric dimension of the configuration or phase space per degree of freedom, something more akin to N n in our notation. Given this, it might be of some interest to attempt to explain the appearance of the escort distributions [6,95] used in the calculations of the thermodynamic parameters [6], under this light.
The Bishop-Gromov generalised comparison theorem (43) can also provide quantitative support for the relation between (41) and (2). It may be worth noticing, that according to [96,97], S q describes the cases of systems whose configuration or phase space volume increases in a power-law, rather than an exponential, manner. Obviously checking whether this is true explicitly, in specific examples (of Hamiltonian systems of many degrees of freedom) seems to be quite hard, if feasible at all. However the generalized Ricci curvature (41) provides a feasible, at least in principle, way of doing so by using the Bishop-Gromov comparison result (43): if R N > 0 then, according to (43), the corresponding measures on the configuration / phase space increase slower than a power-law fashion with exponent equal to N . This also provides another dimensional interpretation of N as the maximal Hausdorff dimension exponent bounding the expansion of measures in the configuration or phase space of the microscopic system.
There is a slightly different way of understanding the effect of lower bounds of (41); it is in a way, a geometric analogue to the construction of the canonical ensemble in equilibrium Statistical Mechanics. In this treatment, proposed in [49], one initially considers the system of interest, let's call its configuration / phase space by B coupled to a "thermostat" F which can be large but not infinite (F is compact). Let the combined system's configuration/phase space be indicated by M. Naturally, the evolution in M is Hamiltonian along geodesics chosen with respect to the metric of M. Project this evolution on M "down" to the system of interest B assuming that the tangents to the geodesic curves on M and B have equal lengths with respect to the respective metrics. This requirement essentially amounts to assuming that the average kinetic energy per degree of freedom of M, which might be called "temperature" but it is far from obvious that it would have a physical meaning for generic systems, is the same as that of B. Then this projection p : M → B is actually a Riemannian submersion [98]. Moreover, assume that the pushforward p of dvol M is just a multiple of dvol B and let N = dim F. Let the Ricci curvatures of M, B be indicated by superscripts with respect to their corresponding metrics. Then for any k ∈ R, [49] proves that if Ric M ≥ k, then Ric B N ≥ k. So, a way to understand the meaning of the generalized Ricci tensor is to see it as the Ricci tensor due to the submersion of a higher dimensional space preserving the measure of the base up to a multiplicative constant. This statement may also make the generalised Ricci tensor quite useful for applications in theories involving higher (greater than 4) dimensional space-times. The result and the general ideas are also close to the treatment of [99,100] who expressed the non-extensive parameter q in terms of the scaling properties of the Hamiltonians of the "thermostat" F and of the system under study B.
There is the also the obvious question of how would one go about dealing with cases of systems having q > 1. Whether Hamiltonian systems of many degrees of freedom can be described by S q , q ∈ R was taken for granted for a considerable amount of time. More recently though, some dissenting opinions have appeared (see for instance [13,15]). We will not go into this very important matter which deserves a thorough investigation, in the present work. Let's assume for argument's sake that the "conventional wisdom" is true and S q can describe such Hamiltonian systems. Then a possible way around the restriction q ∈ [0, 1] that we use in this work may come from "dualities" of the non-extensive parameter. It has been observed for a while [6], that systems described by S q seem to have particularly nice properties under the following transformations/"dualities" of the non-extensive parameter These are a set of generators of conformal/Möbius transformations on the complex plane, assuming that q ∈ C. Their existence may be allow us to extend the validity of elements of the formalism presented here outside the restricted range q ∈ [0, 1] that we have assumed. Whether or to what extent such a goal can actually be achieved, even formally, its implications and the origin of the dualities (117) is the subject of one of our current investigations [101].

Assessment and Omissions
In this work, we have attempted to bring to the attention of our audience the existence, construction and geometric significance of the generalised (Bakry-Émery-) Ricci tensor (41) as an analytic tool for probing the microscopic dynamics on the configuration or phase space for systems whose collective behaviour is described by the Havrda-Charvát/Daróczy/Cressie-Read/Tsallis entropy (2). Such a straightforwardly computable geometric quantity may be of some interest in performing concrete calculations in specific models, calulcations that may shed some light in the dynamical aspects [102] of the underlying systems of many degrees of freedom [103] described by such functionals. Our view is that this generalised Ricci tensor may do for systems described by S q what the ordinary Ricci tensor has done and can still do for S BGS [102,103]. The generalized Ricci curvature, which has a purely synthetic definition (115) without resorting to differential properties, may also be of some interest to the Quantum Gravity community for formulating the Einstein equations in a synthetic rather than in a differential way, since it is widely believed that differentiability and smoothness, of space-time should be an emergent, rather than an a priori assumed, property.
We would like to point out that there is very little, if any at all, new material in this work. Far more extensive and authoritative treatments can be found in the literature, such as [35]. Our goal was to help motivate and make somewhat more familiar, from a physical viewpoint, some ideas that lie behind the construction and properties of the generalised Ricci curvature. We think that bringing such an object to the attention of the practitioners of "non-extensive" entropy may create some interest which may eventually help elucidate, analytically, issues pertaining to the underlying dynamics of systems described by S q . In this spirit, we have omitted entirely from the present work any discussion whatsoever about the regularity and the convergence (usually in the measured Gromov-Hausdorff sense) of the underlying structures. Such important issues, and of potential physical relevance, can be found in [35], in the original papers or in recent reviews on these topics.