Skip to content
Publicly Available Published by De Gruyter May 26, 2017

The Exponentiated Generalized-G Poisson Family of Distributions

  • Gokarna R. Aryal EMAIL logo and Haitham M. Yousof

Abstract

In this article we propose and study a new family of distributions which is defined by using the genesis of the truncated Poisson distribution and the exponentiated generalized-G distribution. Some mathematical properties of the new family including ordinary and incomplete moments, quantile and generating functions, mean deviations, order statistics and their moments, reliability and Shannon entropy are derived. Estimation of the parameters using the method of maximum likelihood is discussed. Although this generalization technique can be used to generalize many other distributions, in this study we present only two special models. The importance and flexibility of the new family is exemplified using real world data.

MSC 2010: 60E05; 62P99

1 Introduction

In the last decades, several generalized distributions have been proposed based on different modification methods. These modification methods require the addition of one or more parameters to base model which could provide better adaptability in the modeling of real lifetime data. Modern computing technology has made many of these techniques accessible even if analytical solutions are very complicated. Several continuous univariate-G families have recently appeared. Some notable family includes Marshall–Olkin-G family by Marshall and Olkin [22], exponentiated-G class by R. C. Gupta, P. L. Gupta and R. D. Gupta [17], transmuted exponentiated generalized-G family by Yousof, Afify, Alizadeh, Butt, Hamedani and Ali [36], transmuted geometric-G by Afify, Alizadeh, Yousof, Aryal and Ahmad [1], Kumaraswamy transmuted-G by Afify, Cordeiro, Yousof, Alzaatreh and Nofal [2], Burr X-G by Yousof, Afify, Hamedani and Aryal [37], the odd Lindley-G family of distributions by Silva, Percontini, de Brito, Ramos, Venancio and Cordeiro [33], exponentiated transmuted-G family by Merovci, Alizadeh, Yousof and Hamedani [23], the odd-Burr generalized family by Alizadeh, Cordeiro, Nascimento, Lima and Ortega [5], the transmuted Weibull-G family by Alizadeh, Rasekhi, Yousof and Hamedani [6], the type I half-logistic family by Cordeiro, Alizadeh and Diniz Marinho [11], the complementary generalized transmuted Poisson family by Alizadeh, Yousof, Afify, Cordeiro and Mansoor [7], the Zografos–Balakrishnan odd log-logistic family of distributions by Cordeiro, Alizadeh, Ortega and Serrano [12], logistic-X by Tahir, Cordeiro, Alzaatreh, Mansoor and Zubair [34], a new Weibull-G by Tahir, Zubair, Mansoor, Cordeiro, Alizadeh and Hamedani [35], the generalized odd log-logistic family by Cordeiro, Alizadeh, Ozel, Hosseini, Ortega and Altun [13], the beta odd log-logistic generalized family of distributions by Cordeiro, Alizadeh, Tahir, Mansoor, Bourguignon and Hamedani [14], beta transmuted-H by Afify, Yousof and Nadarajah [3], generalized transmuted-G by Nofal, Afify, Yousof and Cordeiro [30] and beta Weibull-G family by Yousof, Rasekhi, Afify, Ghosh, Alizadeh and Hamedani [38] among others.

In this paper we propose and study a generalized family of distribution using the genesis of Poisson distribution with the following motivation. Suppose that a system has N subsystems functioning independently at a given time where N has zero truncated Poisson (ZTP) distribution with parameter λ. It is the conditional probability distribution of a Poisson-distributed random variable, given that the value of the random variable is not zero. The probability mass function (pmf) of N is given by

P(N=n)=1[1-exp(-λ)]exp(-λ)λnn!for n=1,2,.

Note that for ZTP variable the expected value and variance are respectively given by

E(N)=λ[1-exp(-λ)]

and

Var(N)=λ+λ2[1-exp(-λ)]-λ2[1-exp(-λ)]2.

Suppose that the failure time of each subsystem has the exponentiated Generalized-G (“EGG(a,b)” for short) distribution defined by the cumulative distribution function (cdf) and probability density function(pdf) given by

H(x;a,b,𝝍)={1-[1-G(x;𝝍)]a}b

and

h(x;a,b,𝝍)=abg(x;𝝍)[1-G(x;𝝍)]a-1{1-[1-G(x;𝝍)]a}b-1,

respectively, where a>0 and b>0 are two additional shape parameters. Let Yi denote the failure time of the ith subsystem and let X=min{Y1,Y2,,YN}. Then the conditional cdf of X given N is

F(xN)=1-P(X>xN)=1-[1-H(x;a,b,𝝍)]N.

Therefore, the unconditional cdf of X, as described in [31], can be expressed as

(1)F(x;a,b,λ,𝝍)=1-exp{-λ{1-[1-G(x;𝝍)]a}b}[1-exp(-λ)].

The cdf in (1) is called the exponentiated generalized G Poisson (“EGGP”) family of distributions. The corresponding pdf is

(2)f(x;a,b,λ,𝝍)=abλg(x)[1-G(x)]a-1{1-[1-G(x)]a}b-1[1-exp(-λ)]exp{λ{1-[1-G(x)]}ab}.

For b=1 we have EGP class of distribution and for a=1 we have GGP class of distribution both of which are embedded in EGGP class.

Using the power series expansion of exp(x), we express the pdf in (2) as

f(x)=abg(x)[1-G(x)]a-1[1-exp(-λ)]i=0(-1)i{1-[1-G(x)]a}b(i+1)-1i!λ-i-1.

Using the series expansion (1-z)b-1=j=0(-1)jΓ(b)j!Γ(b-j)zj the last equation can be expressed as

(3)f(x)=k=0tkπk+1(x),

where

tk=ab(-1)k[1-exp(-λ)](k+1)i,j=0(-1)i+j(b(i+1)-1j)(a(j+1)-1k)i!λ-i-1

and

πk+1(x)=(k+1)g(x)[G(x)]k.

This is the Exp-G pdf with power parameter (k+1). By integrating (3), we obtain the mixture representation of F(x) as

(4)F(x)=k=0tkΠk+1(x),

where Πk+1(x) is the cdf of the Exp-G family with power parameter (k+1). Equation (4) reveals that the EGGP density function is a linear combination of Exp-G densities. Thus, some structural properties of the new family such as the ordinary and incomplete moments and the generating function can be immediately obtained from well-established properties of the Exp-G distributions.

The properties of Exp-G distributions have been studied by many authors in recent years, see Mudholkar and Srivastava [25] and Mudholkar, Srivastava and Freimer [26] for exponentiated Weibull (EW) distributions, R. C. Gupta, P. L. Gupta and R. D. Gupta [17] for exponentiated Pareto distributions, Gupta and Kundu [18] for exponentiated exponential distributions, Nadarajah and Kotz [29] for the exponentiated-type distributions, Nadarajah [27] for exponentiated Gumbel distributions, Shirke and Kakade [32] for exponentiated log-normal distributions and Nadarajah and Gupta [28] for exponentiated gamma distributions (EGa), among others.

The rest of the paper is outlined as follows. In Section 2 we provide the formulation of EGGP models for two special distributions. Mathematical properties of the EGGP model are discussed in Section 3. In Section 4 we discuss stress-strength models. The order statistics is discussed in Section 5. Parameter estimation procedures using method of maximum likelihood are presented in Section 6. Section 7 provides the application of the two generalized distributions to model real world data. Some concluding remarks are given in Section 8.

2 Special Models

The formulation provided in Section 1 can be used to generalize any classical probability distribution. For illustration purpose we will generalize the following two popular and versatile distributions, namely: the Weibull (W) distribution and the Pareto (Pa) distribution. The parameters of these models are positive real numbers. The pdf and cdf of these distributions are provided in Table 1.

Table 1

The pdf and cdf of Weibull and Pareto distributions.

ModelPdf: g(x;𝝍)Cdf: G(x;𝝍)Support
Wβαβxβ-1exp[-(αx)β]1-exp[-(αx)β](0,)
Pa(αx)(θx)α1-(θx)α(θ,)

2.1 The EGWP Distribution

The cdf and pdf of the EG-Weibull Poisson (EGWP) distribution are given, respectively, by

F(x)=1-exp(-λ{1-exp(-a(αx)β)}b)[1-exp(-λ)]

and

f(x)=abλβαβxβ-1[1-exp(-λ)]exp(-a(αx)β){1-exp(-a(αx)β)}b-1exp(λ{1-exp(-a(αx)β)}b).

Plots of the pdf and cdf of the EGWP distribution are displayed in Figure 1 for some parameter values. As we shall see from the graphs EGWP distribution is more flexible compare to classical Weibull distribution.

Figure 1 Pdf (top) and cdf (bottom) of EGWP distribution.
Figure 1 Pdf (top) and cdf (bottom) of EGWP distribution.
Figure 1

Pdf (top) and cdf (bottom) of EGWP distribution.

2.2 The EGPaP Distribution

The cdf and pdf of the EG-Pareto Poisson(EGPaP) distribution (for x>θ) are, respectively, given by

F(x)=1-exp(-λ{1-(θx)aα}b)[1-exp(-λ)]

and

f(x)=abλαθα(θx)(a-1)α[1-(θx)aα]b-1xα+1[1-exp(-λ)]exp(λ[1-(θx)aα]b).

Plots of the pdf and cdf of the EGPaP distribution are displayed in Figure 2 for some parameter values. As we shall see from the graphs EGPaP distribution is more flexible compare to the Pareto distribution.

Figure 2 Pdf (top) and cdf (bottom) of EGPaP distribution
Figure 2 Pdf (top) and cdf (bottom) of EGPaP distribution
Figure 2

Pdf (top) and cdf (bottom) of EGPaP distribution

3 Mathematical Properties

In this section we provide some structural and mathematical properties of the EGGP distribution including the quantile function, moments, entropy measure, residual and revered residual life.

3.1 Quantile Function

The quantile function of a distribution is the real solution of F(xq)=q for 0q1. The quantile function is obtained by inverting equation (1) provided that closed form expression for QG(q)=G-1(q) is available. Setting

F(x)=1-exp{-λ{1-[1-G(x;𝝍)]a}b}[1-exp(-λ)]=q,

we have

G(x)=1-[1-{-λ-1log(1-(1-exp(-λ))q)}1/b]1/a.

Therefore

x=G-1(1-[1-{-λ-1log(1-(1-exp(-λ))q)}1/b]1/a).

We can use the inversion method to simulate random numbers from a given distribution. For example, we can simulate random numbers X from EGWP distribution by

1-exp(-(αx)β)=1-[1-{-λ-1log(1-(1-exp(-λ))U)}1/b]1/a,

which implies

x=1α{-1alog[1-{-λ-1log(1-(1-exp(-λ))U)}1/b]}1/β,

where U has uniform distribution on (0,1).

3.2 General Properties

The rth ordinary moment of X is given by μr=E(Xr)=-xrf(x)𝑑x. Using (2), we obtain

(5)μr=k=0tkE(Yk+1r).

Henceforth, Yk+1 denotes the Exp-G distribution with power parameter (k+1), where

E(Yk+1r)=(k+1)-xrg(x;𝝍)G(x;𝝍)k𝑑x,

which can be computed numerically in terms of the baseline quantile function (qf) QG(u;𝝍)=G-1(u;𝝍) as

E(Yk+1r)=(k+1)01QG(u;𝝍)ruk𝑑u.

Setting r=1 in (5), we have the mean of X. The last integration can be computed numerically for most parent distributions. The skewness and kurtosis measures can be calculated from the ordinary moments using well-known relationships:

Skewness(X)=E(X3)-3E(X)E(X2)+2[E(X)]3[Var(X)]32

and

Kurtosis(X)=E(X4)-4E(X)E(X3)+6E(X2)[E(X)]2-3[E(X)]4[Var(X)]2,

where E(X2)=j=0tkE(Yk2) and Var(X)=E(X2)-[E(X)]2. The nth central moment of X, say Mn, are given by

Mn=E(X-μ)n=h=0n(-1)h(nh)(μ1)nμn-h.

The cumulants (κn) of X follow recursively from

κn=μn-r=0n-1(n-1r-1)κrμn-r,

where κ1=μ1, κ2=μ2-μ12,κ3=μ3-3μ2μ1+μ13, etc. The skewness and kurtosis measures also can be calculated from the ordinary moments using well-known relationships. The moment generating function (mgf) of X, say MX(t)=E(etX), is given by

MX(t)=r=0trr!μr=k,r=0trtkr!E(Yk+1r).

The main application of the first incomplete moment refers to the Bonferroni and Lorenz curves. These curves are very useful in economics, reliability, demography, insurance and medicine. The answers to many important questions in economics require more than just knowing the mean of the distribution, but its shape as well. This is obvious not only in the study of econometrics but in other areas as well. The sth incomplete moments, say φs(t), is given by φs(t)=-txsf(x)𝑑x. Using equation (3), we obtain

(6)φs(t)=k=0tk-txsπk+1(x)𝑑x.

The first incomplete moment of the EGGP family, φ1(t), can be obtained by setting s=1 in (6). Another application of the first incomplete moment is related to mean residual life and mean waiting time given by m1(t;𝝍)=(1-φ1(t))/R(t;𝝍)-t and M1(t;𝝍)=t-(φ1(t)/F(t;𝝍)), respectively. The amount of scatteredness in a population is evidently measured to some extent by the totality of deviations from the mean and median. The mean deviations about the mean [δμ(X)=E(|X-μ1|)] and about the median [δμ(X)=E(|X-M|)] of X can be, used as measures of spread in a population, expressed by

δμ(X)=0|X-μ1|f(x)𝑑x=2μ1F(μ1)-2φ1(μ1)

and

δM(X)=0|X-M|f(x)𝑑x=μ1-2φ1(M),

respectively, where μ1=E(X) comes from (5), F(μ1) is simply calculated, φ1(μ1) is the first incomplete moment and M is the median of X. The mean deviations about the mean [δ1=E(|X-μ1|)] and about the median [δ2=E(|X-M|)] of X are given by δ1=2μ1F(μ1)-2φ1(μ)1 and δ2=μ1-2φ1(M), respectively, where μ1=E(X), M=Median(X)=Q(0.5) is the median, F(μ1) is easily calculated from (1) and φ1(t) is the first incomplete moment given by (6) with s=1.

Now, we provide two ways to determine δ1 and δ2. First, a general equation for φ1(t) can be derived from (3) as

φ1(t)=k=0tkJk+1(x),

where

Jk+1(x)=-txπk+1(x)𝑑x

is the first incomplete moment of the Exp-G distribution. A second general formula for φ1(t) is given by

φ1(t)=k=0tkVk+1(t),

where

Vk+1(t)=(k+1)0G(t)QG(u)uk𝑑u

can be computed numerically. These equations for φ1(t) can be applied to construct Bonferroni and Lorenz curves defined for a given probability π by B(π)=φ1(q)/(πμ1) and L(π)=φ1(q)/μ1, respectively, where μ1=E(X) and q=Q(π) is the qf of X at π. For the EGWP model we have the following results for r>-β and s>-β:

μr=k,h=0tk(k+1)(-1)hαr(h+1)(r+β)/β(kh)Γ(1+rβ)

and

φs(t)=k,h=0tk(k+1)(-1)hαs(h+1)(s+β)/β(kh)Γ(1+sβ,(αt)β).

3.3 Probability Weighted Moments

The PWMs are expectations of certain functions of a random variable and they can be defined for any random variable whose ordinary moments exist. The PWM method can generally be used for estimating parameters of a distribution whose inverse form cannot be expressed explicitly. The (s,r)th PWM of X following the EGGP family, say ρs,r, is formally defined by

ρs,r=E{XsF(X)r}=-xsF(x)rf(x)𝑑x.

Using equations (2) and (3), we can write

f(x)F(x)r=k=0wkπk+1(x),

where

wk=ab[1-exp(-λ)]r+1(k+1)w,i,j=0(-1)w+i+j+k(rw)(b(i+1)-1j)(a(1+j)-1k)i!λ-i-1(1+w)-i.

Then the (s,r)th PWM of X can be expressed as

ρs,r=k=0wkE(Yk+1s).

3.4 Entropy Measures

The Rényi entropy of a random variable X represents a measure of variation of the uncertainty. The Rényi entropy is defined by

Iδ(X)=(1-δ)-1log-f(x)δ𝑑x,δ>0 and δ1.

Using the power series expansion, the pdf in (2) can be expressed as

f(x)δ=k=0mkg(x)δ[G(x)]k,

where

mk=aδbδ[1-exp(-λ)]δi,j=0(-1)i+j+k(b(i+δ)-δj)(a(j+δ)-δk)i!λ-δ-iδ-i.

Therefore, the Rényi entropy of the EGGP family is given by

Iδ(X)=(1-δ)-1log{k=0mk-g(x)δ[G(x)]k𝑑x}.

The q-entropy, say Hq(X), can be obtained as

Hq(X)=(q-1)-1log{1-[k=0mk-g(x)q[G(x)]k𝑑x]},

where

mk=aqbq[1-exp(-λ)]qi,j=0(-1)i+j+k(b(i+q)-qj)(a(j+q)-qk)i!λ-q-iq-i,q>0,q1.

The Shannon entropy of a random variable X, say SI, is defined by

SI=E{-[logf(X)]}.

It is the special case of the Rényi entropy when δ1.

3.5 Residual Life and Reversed Residual Life

The nth moment of the residual life, say mn(t)=E[(X-t)nX>t], n=1,2,, uniquely determines F(x). The nth moment of the residual life of X is given by

mn(t)=11-F(t)t(x-t)n𝑑F(x).

Therefore

(7)mn(t)=11-F(t)k=0tktxrπk+1(x)𝑑x,

where tk=tkr=0n(nr)(-t)n-r. The nth moment of the reversed residual life, say Mn(t)=E[(t-X)nXt] for t>0 and n=1,2,, uniquely determines F(x). We obtain

Mn(t)=1F(t)0t(t-x)n𝑑F(x).

Then the nth moment of the reversed residual life of X becomes

(8)Mn(t)=1F(t)k=0tk0txrπk+1(x)𝑑x,

where tk=tkr=0n(-1)r(nr)tn-r. Another interesting function is the mean residual life (MRL) function or the life expectation at age t defined by m1(t)=E[(X-t)X>t], which represents the expected additional life length for a unit which is alive at age t. The MRL of X can be obtained by setting n=1 in equation (7). The mean inactivity time (MIT) or mean waiting time (MWT), also called the mean reversed residual life function, is given by M1(t)=E[(t-X)Xt], and it represents the waiting time elapsed since the failure of an item on the condition that this failure had occurred in (0,t). The MIT of the EGGP family of distributions can be obtained easily by setting n=1 in equation (8). For the EGWP model we have the following results: For n>-β

mn(t)=11-F(t)k,h=0tk(k+1)(-1)hαn(h+1)(n+β)/β(kh)Γ(1+nβ,(αt)β)

and

Mn(t)=1F(t)k,h=0tk(k+1)(-1)hαn(h+1)(n+β)/β(kh)Γ(1+nβ,(αt)β).

4 Stress-Strength Models

The stress-strength model is the most widely used approach for reliability estimation. This model is used in many applications of physics and engineering, such as strength failure and system collapse. In stress-strength modeling,

R=Pr(X2<X1)=0f(x1)F(x2)𝑑x

is a measure of reliability of the system when it is subjected to random stress X2 and has strength X1 (see [20]). The system fails if and only if the applied stress is greater than its strength and the component will function satisfactorily whenever X1>X2. Moreover, R can be considered as a measure of system performance and naturally arises in electrical and electronic systems. Another interpretation can be that the reliability, say R, of the system is the probability that the system is strong enough to overcome the stress imposed on it. Let X1 and X2 be two independent random variables have EGGP(x;a1,b1,λ1,𝝍) and EGGP(x;a2,b2,λ2,𝝍) distributions. The reliability R is given by

R=0f1(x;a1,b1,λ1,𝝍)F2(x;a2,b2,λ2,𝝍)𝑑x.

Then

R=k,w=0Ωk,w,

where

Ωk,w=a1a2b1b2(-1)k+w[1-exp(-λ1)][1-exp(-λ2)](w+1)(k+w+2)×i,j,m,h=0(-1)i+j+m+h(b1(i+1)-1j)(b2(m+1)-1h)(a1(j+1)-1k)(a2(h+1)-1w)i!m!λ1-i-1λ2-m-1.

5 Order Statistics

Let X1,,Xn be a random sample from the EGGP family of distributions and let X(1),,X(n) be the corresponding order statistics. The pdf of the ith order statistic, say Xi:n, can be written as

(9)fi:n(x)=f(x)B(i,n-i+1)j=0n-i(-1)j(n-ij)Fj+i-1(x),

where B(,) is the beta function. Substituting (1) and (2) into equation (9), we get

f(x)F(x)j+i-1=k=0dkπk+1(x),

where

dk=ab[1-exp(-λ)]j+i(k+1)w,m,h=0(-1)w+m+h+k(j+i-1w)(b(m+1)-1h)(a(h+1)-1k)m!λ-m-1(1+w)-m.

Moreover, the pdf of Xi:n can be expressed as

fi:n(x)=j=0n-i(-1)j(n-ij)B(i,n-i+1)k=0dkπ(x)k+1,

therefore, the density function of the EGGP order statistics is a mixture of EG densities. Based on the last equation, we note that the properties of Xi:n follow from those properties of Yk+1. For example, the moments of Xi:n can be expressed as

(10)E(Xi:nq)=j=0n-i(-1)j(n-ij)B(i,n-i+1)k=0dkE(Yk+1q).

For the EGWP model we have

E(Xi:nq)=k,h=0j=0n-i(k+1)(-1)j+h(n-ij)(kh)dkαqB(i,n-i+1)(h+1)(q+β)/βΓ(1+qβ).

The L-moments are analogous to the ordinary moments but can be estimated by linear combinations of order statistics. They exist whenever the mean of the distribution exists, even though some higher moments may not exist, and are relatively robust to the effects of outliers. Based upon the moments in equation (10), we can derive explicit expressions for the L-moments of X. They are linear functions of expected order statistics defined by

λr=1rd=0r-1(-1)d(r-1d)E(Xr-d:r),r1.

The first four L-moments are given by

λ1=E(X1:1),λ2=12E(X2:2-X1:2),λ3=13E(X3:3-2X2:3+X1:3),λ4=14E(X4:4-3X3:4+3X2:4-X1:4).

One can simply obtain the L-moments λr for X from (10) with q=1.

6 Estimation

Let X1,,Xn be a random sample from the EGGP distribution with parameters λ,a,b and 𝝍. Further, let 𝚿=(a,b,λ,𝝍) be a ((p+3)×1) parameter vector, where 𝝍 is a (p×1) baseline parameter vector. For determining the MLE of 𝚿, we have the log-likelihood function

=nloga+nlogb+nlogλ-nlog[1-exp(-λ)]+i=1nlogg(xi;𝝍)+(a-1)i=1nlogG¯(xi;𝝍)+(b-1)i=1nlogsi-λi=1nsib,

where G¯(xi;𝝍)=1-G(xi;𝝍),si=1-G¯(xi;𝝍)a. The components of the score vector

𝑼(𝚿)=𝚿=(a,b,λ,𝝍)

are given by

Ua=na+i=1nlogG¯(xi;𝝍)+(b-1)i=1npisi-bλi=1npisib-1,Ub=nb+i=1nlogsi-λi=1nlogsisi-b,Uλ=nλ+nexp(-λ)1-exp(-λ)-i=1nsib

and (for r=1,2,,p)

U𝝍r=i=1ng(xi;𝝍)g(xi;𝝍)-(a-1)i=1nG(xi;𝝍)G¯(xi;𝝍)+(b-1)i=1nqisi-bλi=1nqisi1-b,

where

g(xi;𝝍)=g(xi;𝝍)𝝍r,G(xi;𝝍)=G(xi;𝝍)𝝍r,pi=-logG¯(xi;𝝍)G¯(xi;𝝍)-a,qi=aG(xi;𝝍)G¯(xi;𝝍)1-a.

Setting the nonlinear system of equations Ua=Ub=Uλ=0 and U𝝍=𝟎 and solving them simultaneously yields the MLE 𝚿^=(a^,b^,λ^,𝝍^). To solve these equations, it is usually more convenient to use nonlinear optimization methods, such as the quasi-Newton algorithm, to numerically maximize . For interval estimation of the parameters, we obtain the p×p observed information matrix J(𝚿)={2rs} (for r,s=a,b,λ,𝝍), whose elements can be computed numerically. Under standard regularity conditions when n, the distribution of 𝚿^ can be approximated by a multivariate normal Np(0,J(𝚿^)-1) distribution to construct approximate confidence intervals for the parameters. Here, J(𝚿^) is the total observed information matrix evaluated at 𝚿^. The method of the re-sampling bootstrap can be used for correcting the biases of the MLEs of the model parameters. Good interval estimates may also be obtained using the bootstrap percentile method. The elements of J(𝚿) are given in Appendix A.

7 Applications

In many statistical applications, the interest is centered on estimating the parameters and evaluate the goodness-of-fit of the model to analyze the data on hand. In this section, we provide the effectiveness of the EGGP distribution by means of modeling two different data sets choosing two special models discussed in Section 2. These data sets have been used by several authors to show the applicability of other competing models. We also provide a formative evaluation of the goodness-of-fit of the models and make comparisons with other distributions. The measures of goodness-of-fit, including the Akaike information criterion (AIC), Bayesian information criterion (BIC), Anderson–Darling (A), Cramér–von Mises (W) and Kolmogrov–Smirnov (KS) statistics, are computed to compare the fitted models. The statistics A and W are described in detail by Chen and Balakrishnan [9]. In general, the smaller the values of these statistics, the better the fit to the data. One can employ the Likelihood Ratio Test (LRT) to contrast the adaptability of the EGGP distribution over the other distributions. The required computations are carried out in the R language.

Example 1: Cancer Patient Data.

This data set describes the remission times (in months) of a random sample of 128 bladder cancer patients studied by Lee and Wang [21]. For these data, we compare the fit of the EGWP with the other five parameter distributions which has been generalized using the Weibull genesis. We compare the fits of the EGWP with the generalized transmuted-W (GTW) distribution (Nofal, Afify, Yousof and Cordeiro [30]), the McDonald Weibull (McW) distribution (Cordeiro, Hashimoto and Ortega [15]), the modified beta Weibull (MBW) distribution (Khan [19]) and the transmuted additive Weibull (TAW) distribution (Elbatal and Aryal [16]) with the corresponding densities given by (for x>0)

McW:f(x)=βλαβB(a/λ,b)xβ-1e-(αx)β[1-e-(αx)β]a-1{1-(1-e-(αx)β)λ}b-1,
MBW:f(x)=βα-βλaB(a/λ,b)xβ-1e-b(xα)β[1-e-(xα)β]a-1{1-(1-λ)[1-e-(xα)β]λ}-a-b,
GTW:f(x)=βαβxβ-1e(αx)β[1-e-(αx)β]1-a{a(1+λ)-λ(a+b)[1-e-(αx)β]b},
TAW:f(x)=(αbxb-1+aβxβ-1)e(αxb+axβ){1-λ+2λe-(αxb+axβ)}.

The parameters of the above densities are all positive real numbers except |λ|1 for the GTW and TAW distributions. The statistics of the fitted models are presented in Table 2 and the MLEs and the corresponding standard errors are given in Table 3. We note from Table 2 that the EGWP gives the lowest values for the AIC, BIC, CAIC, HQIC, A and W statistics as compared to the other generalizations of the Weibull distribution. Therefore, we conclude that the EGWP distribution yields the best fit to model the remission times of bladder cancer patients.

Table 2

The AIC, CAIC, HQIC, BIC, W and A statistics for cancer patient data.

Goodness-of-fit criteria
ModelAICCAICHQICBICWA
EGWP829.448829.939835.242843.7080.02270.1505
GTW831.347831.839837.141845.6070.04690.3058
McW831.680832.172837.474845.940.05040.3299
MBW838.027838.519843.821852.2880.10680.7207
TAW838.478838.97844.272852.7390.11290.7033
Table 3

MLEs and their standard errors (in parenthesis) for the cancer patient data.

Estimates
Modela^b^α^β^λ^
EGWP0.42021.58570.15400.92373.7776
(2.3728)(0.7778)(0.9355)(0.3111)(2.4207)
GTW2.79650.01280.29910.65420.002
(1.117)(7.214)(0.151)(0.121)(1.769)
McW4.06332.60360.11920.55820.0393
(2.111)(2.452)(0.109)(0.178)(0.202)
MBW57.416719.385910.15020.16322.0043
(37.317)(13.490)(22.437)(0.044)(0.789)
TAW0.000031.00650.11390.9722-0.1630
(0.0061)(0.035)(0.032)(0.125)(0.280)

Example 2: Flood Data.

This data set describes the exceedances of flood peaks (in m3/s) of the Wheaton River near Carcross in Yukon Territory, Canada for the years 1958–1984. These data were analyzed by many authors including Choulakian and Stephenes [10], Akinsete, Famoye and Lee [4], Nadarajah [27], Merovci and Puka [24], Bourguignon, Silva, Zea and Cordeiro [8], among others. We compare the fits of EGPaP with the Kumarswamy Pareto (KwP) distribution, the beta Pareto (BP) distribution, the transmuted Pareto (TP) distribution, the exponentiated Pareto (EP) distribution and the Pareto(P) distribution whose pdf are given by

KwP:f(x;a,b,α,θ)=abαθαxα+1[1-(θx)α]a-1{1-[1-(θx)α]a}b-1,
BP:f(x;a,b,α,θ)=1B(a,b)αθαxα+1[1-(θx)α]a-1(θx)α(b-1),
TP:f(x;α,θ,λ)=αθαxα+1[1-λ+2λ(θx)α].

The parameters of the above densities are all positive real numbers except |λ|1 for TP distribution. The MLEs and corresponding standard errors are given in Table 4 and the statistics of the fitted models are presented in Table 5. We note from Table 5 that the EGPaP gives the lowest values for the AIC, CAIC, BIC, HQIC and KS statistics as compared to the other generalizations of the Pareto distribution. Therefore, the EGPaP distribution yields the best fit to model the exceedances of flood peaks.

Table 4

Estimated parameters and their standard errors for Wheaton river data.

Modela^b^λ^α^θ^
EGPaP6.51634.988020.41480.02640.1
(2.2125)(0.8487)(8.9005)(0.0088)
KwP2.855385.84680.05280.1
(0.3371)(60.4213)(0.0185)
BP3.147385.75080.00880.1
(0.4993)(0.0001)(0.0015)
TP11-0.9520.34900.1
(0.089)(0.072)
EP2.879710.42410.1
(0.4911)(0.0463)
P110.24380.1
(0.0287)
Table 5

The AIC, CAIC, BIC, HQIC and KS test statistics of Wheaton river data.

Statistics
Model-(,x)AICCAICBICHQICKS
EGPaP255.131520.262521.171531.645524.7940.1428
KwP271.200548.400548.753555.230551.1190.1700
BP283.700573.400573.753580.230576.1190.1747
TP286.201576.402576.575580.954578.2140.2870
EP287.300578.600578.774583.153580.4130.1987
P303.100608.200608.257610.477609.1060.3324

Fitted pdf, cdf and QQ-plots for both data are provided in Figure 3. It can be observed that the EGWP distribution is appropriate to model the cancer patient data and the EGPaP distribution is appropriate to model the flood peak exceedance data.

Figure 3 Top: Fitted pdf (left), cdf (center) and QQ-plots (right)of the EGWP distribution.
Bottom: Fitted pdf (left), cdf (center) and QQ-plots (right) of the EGPaP distribution.
Figure 3

Top: Fitted pdf (left), cdf (center) and QQ-plots (right)of the EGWP distribution. Bottom: Fitted pdf (left), cdf (center) and QQ-plots (right) of the EGPaP distribution.

8 Conclusions

In this study, we have introduced the so-called exponentiated generalized G-Poisson family of distribution. Some mathematical properties of the new family including ordinary and incomplete moments, quantile and generating functions, mean deviations, order statistics and their moments, reliability and Shannon entropy are derived. Although this generalization technique can be used to generalize many other distributions, for illustration purposes we have chosen the Weibull distribution and the Pareto distribution as base distributions. The importance and flexibility of the new family are illustrated by means of two different examples, one for each generalized family. We hope that this study will serve as a reference and help to advance future research in the subject area.

A Appendix

The elements of the observed matrix J(𝚿) are given below:

Uaa=-na2-(b-1)i=1n{G¯(xi;𝝍)asi[logG¯(xi;𝝍)]-2+pi2si2}-b(b-1)λi=1nsib-2pi-2+bλi=1nsib-1G¯(xi;𝝍)a[logG¯(xi;𝝍)]-2,
Uab=i=1npisi-λi=1npisib-1(1+blogsi),
Uaλ=-bi=1npisi1-b,
Ua𝝍=-i=1nG(xi;𝝍)G¯(xi;𝝍)+(b-1)i=1nsimi-piqisi2-bλi=1nmi+(b-1)piqisi-1si1-b,
Ubb=-nb2-λi=1n(logsi)2si-b,
Ubλ=-i=1nlogsisi-b

and

Uλ𝝍=-bi=1nqisi1-b,
Ub𝝍=i=1nqisi-λi=1n{1+blogsi}qi-1si1-1,
Uλλ=-nλ2+ne-λ(1-e-λ)+ne-2λ(1-e-λ)2,
U𝝍r𝝍l=i=1ng(xi;𝝍)g′′(xi;𝝍)-[h(xi;𝝍)]2g(xi;𝝍)2+(b-1)i=1nwisi-(b-1)i=1nqi2si2
-(a-1)i=1nG¯(xi;𝝍)G′′(xi;𝝍)+[G(xi;𝝍)]2G¯(xi;𝝍)2-bλi=1n[(b-1)qi2sib-2+wisib-1],

where

g′′(xi;𝝍)=[2g(xi;𝝍)𝝍r𝝍l],G′′(xi;𝝍)=[2G(xi;𝝍)𝝍r𝝍l]

and

mi=G¯(xi;𝝍)aG¯(xi;𝝍)-1G(xi;𝝍)+alogG¯(xi;𝝍)G(xi;𝝍)G¯(xi;𝝍)1-a,
wi=a{G¯(xi;𝝍)a-1G′′(xi;𝝍)-(a-1)[G(xi;𝝍)]2G¯(xi;𝝍)a-2}.

Acknowledgements

The authors are grateful to the editor and anonymous reviewer for their constructive comments and valuable suggestions which certainly improved the presentation and quality of the paper.

References

[1] A. Z. Afify, M. Alizadeh, H. M. Yousof, G. Aryal and M. Ahmad, The transmuted geometric-G family of distributions: Theory and applications, Pak. J. Statist. 32 (2016), 139–160. Search in Google Scholar

[2] A. Z. Afify, G. M. Cordeiro, H. M. Yousof, A. Alzaatreh and Z. M. Nofal, The Kumaraswamy transmuted-G family of distributions: Properties and applications, J. Data Sci. 14 (2016), 245–270. 10.6339/JDS.201604_14(2).0004Search in Google Scholar

[3] A. Z. Afify, H. M. Yousof and S. Nadarajah, The beta transmuted-H family for lifetime data, Stat. Interface 10 (2017), 505–520. 10.4310/SII.2017.v10.n3.a13Search in Google Scholar

[4] A. Akinsete, F. Famoye and C. Lee, The beta-Pareto distribution, Statistics 42 (2008), 547–563. 10.1080/02331880801983876Search in Google Scholar

[5] M. Alizadeh, G. M. Cordeiro, A. D. C. Nascimento, M. D. S. Lima and E. M. M. Ortega, Odd-Burr generalized family of distributions with some applications, J. Stat. Comput. Simul. 83 (2016), 326–339. 10.1080/00949655.2016.1209200Search in Google Scholar

[6] M. Alizadeh, M. Rasekhi, H. M. Yousof and G. G. Hamedani, The transmuted Weibull-G family of distributions, Hacet. J. Math. Stat. (2017), 10.15672/HJMS.2017.440. 10.15672/HJMS.2017.440Search in Google Scholar

[7] M. Alizadeh, H. M. Yousof, A. Z. Afify, G. M. Cordeiro and M. Mansoor, The complementary generalized transmuted Poisson-G family, Austrian J. Stat., to appear. 10.17713/ajs.v47i4.577Search in Google Scholar

[8] M. Bourguignon, M. B. Silva, L. M. Zea and G. M. Cordeiro, The Kumaraswamy Pareto distribution, J. Stat. Theory Appl. 12 (2013), 129–144. 10.2991/jsta.2013.12.2.1Search in Google Scholar

[9] G. Chen and N. A. Balakrishnan, A general purpose approximate goodness-of-fit test, J. Quality Technol. 27 (1995), 154–161. 10.1080/00224065.1995.11979578Search in Google Scholar

[10] V. Choulakian and M. A. Stephenes, Goodness-of-fit for the generalized Pareto distribution, Technometrics 43 (2001), 478–484. 10.1198/00401700152672573Search in Google Scholar

[11] G. M. Cordeiro, M. Alizadeh and P. R. Diniz Marinho, The type I half-logistic family of distributions, J. Stat. Comput. Simul. 86 (2016), 707–728. 10.1080/00949655.2015.1031233Search in Google Scholar

[12] G. M. Cordeiro, M. Alizadeh, E. M. Ortega and L. H. V. Serrano, The Zografos–Balakrishnan odd log-logistic family of distributions: Properties and ypplications, Hacet. J. Math. Stat. 45 (2016), 1781–1803. 10.15672/HJMS.20159714145Search in Google Scholar

[13] G. M. Cordeiro, M. Alizadeh, G. Ozel, B. Hosseini, E. M. M. Ortega and E. Altun, The generalized odd log-logistic family of distributions: Properties, regression models and applications, J. Stat. Comput. Simul. (2016), 10.1080/00949655.2016.1238088. 10.1080/00949655.2016.1238088Search in Google Scholar

[14] G. M. Cordeiro, M. Alizadeh, M. H. Tahir, M. Mansoor, M. Bourguignon and G. G. Hamedani, The beta odd log-logistic generalized family of distributions, Hacet. J. Math. Stat. 45 (2016), 1175–1202. 10.15672/HJMS.20157311545Search in Google Scholar

[15] G. M. Cordeiro, E. M. Hashimoto and E. M. Ortega, McDonald Weibull model, Statistics 48 (2014), 256–278. 10.1080/02331888.2012.748769Search in Google Scholar

[16] I. Elbatal and G. Aryal, On the transmuted additive Weibull distribution, Austrian J. Stat. 42 (2013), 117–132. 10.17713/ajs.v42i2.160Search in Google Scholar

[17] R. C. Gupta, P. L. Gupta and R. D. Gupta, Modeling failure time data by Lehmann alternatives, Comm. Statist. Theory Methods 27 (1998), 887–904. 10.1080/03610929808832134Search in Google Scholar

[18] R. D. Gupta and D. Kundu, Generalized exponential distributions, Aust. N. Z. J. Stat. 41 (1999), 173–188. 10.1111/1467-842X.00072Search in Google Scholar

[19] M. N. Khan, The modified beta Weibull distribution, Hacet. J. Math. Stat. 44 (2015), 1553–1568. 10.15672/HJMS.2014408152Search in Google Scholar

[20] S. Kotz, C. D. Lai and M. Xie, On the effect of redundancy for systems with dependent components, IIE Trans. 35 (2003), 1103–1110. 10.1080/714044440Search in Google Scholar

[21] E. T. Lee and J. W. Wang, Statistical Methods for Survival Data Analysis, 3rd ed., Wiley, New York, 2003. 10.1002/0471458546Search in Google Scholar

[22] A. W. Marshall and I. Olkin, A new methods for adding a parameter to a family of distributions with application to the exponential and Weibull families, Biometrika 84 (1997), 641–652. 10.1093/biomet/84.3.641Search in Google Scholar

[23] F. Merovci, M. Alizadeh, H. M. Yousof and G. G. Hamedani, The exponentiated transmuted-G family of distributions: Theory and applications, Comm. Statist. Theory Methods (2016), 10.1080/03610926.2016.1248782. 10.1080/03610926.2016.1248782Search in Google Scholar

[24] F. Merovci and L. Puka, Transmuted Pareto distribution, ProbStat Forum 7 (2014), 1–11. Search in Google Scholar

[25] G. S. Mudholkar and D. K. Srivastava, Exponentiated Weibull family for analysing bathtub failure rate data, IEEE Trans. Reliab. 42 (1993), 299–302. 10.1109/24.229504Search in Google Scholar

[26] G. S. Mudholkar, D. K. Srivastava and M. Freimer, The exponentiated Weibull family: A reanalysis of the bus-motor-failure data, Technometrics 37 (1995), 436–445. 10.1080/00401706.1995.10484376Search in Google Scholar

[27] S. Nadarajah, The exponentiated Gumbel distribution with climate application, Environmetrics 17 (2005), 13–23. 10.1002/env.739Search in Google Scholar

[28] S. Nadarajah and A. K. Gupta, The exponentiated gamma distribution with application to drought data, Calcutta Statist. Assoc. Bull. 59 (2007), 29–54. 10.1177/0008068320070103Search in Google Scholar

[29] S. Nadarajah and S. Kotz, The exponentiated-type distributions, Acta Appl. Math. 92 (2006), 97–111. 10.1007/s10440-006-9055-0Search in Google Scholar

[30] Z. M. Nofal, A. Z. Afify, H. M. Yousof and G. M. Cordeiro, The generalized transmuted-G family of distributions, Comm. Statist. Simulation Comput. 46 (2017), 4119–4136. 10.1080/03610926.2015.1078478Search in Google Scholar

[31] M. W. A. Ramos, P. R. D. Marinho, G. Cordeiro, R. V. da Silva and G. G. Hamedani, The Kumaraswamy-G Poisson family of distributions, J. Stat. Theory Appl. 14 (2015), 222–239. 10.2991/jsta.2015.14.3.1Search in Google Scholar

[32] D. T. Shirke and C. S. Kakade, On exponentiated lognormal distribution, Int. J. Agric. Statist. Sci. 2 (2006), 319–326. Search in Google Scholar

[33] F. G. Silva, A. Percontini, E. de Brito, M. W. Ramos, R. Venancio and G. M. Cordeiro, The odd Lindley-G family of distributions, Austrian J. Stat. 46 (2017), 65–87. 10.17713/ajs.v46i1.222Search in Google Scholar

[34] M. H. Tahir, G. M. Cordeiro, A. Alzaatreh, M. Mansoor and M. Zubair, The logistic-X family of distributions and its applications, Comm. Statist. Theory Methods 45 (2016), 7326–7349. 10.1080/03610926.2014.980516Search in Google Scholar

[35] M. H. Tahir, M. Zubair, M. Mansoor, G. M. Cordeiro, M. Alizadeh and G. G. Hamedani, A new Weibull-G family of distributions, Hacet. J. Math. Stat. 45 (2016), 629–647. 10.15672/HJMS.2015579686Search in Google Scholar

[36] H. M. Yousof, A. Z. Afify, M. Alizadeh, N. S. Butt, G. G. Hamedani and M. M. Ali, The transmuted exponentiated generalized-G family of distributions, Pak. J. Stat. Oper. Res. 11 (2015), 441–464. 10.18187/pjsor.v11i4.1164Search in Google Scholar

[37] H. M. Yousof, A. Z. Afify, G. G. Hamedani and G. Aryal, The Burr X generator of distributions for lifetime data, J. Stat. Theory Appl., to appear. 10.2991/jsta.2017.16.3.2Search in Google Scholar

[38] H. M. Yousof, M. Rasekhi, A. Z. Afify, I. Ghosh, M. Alizadeh and G. G. Hamedani, The beta Weibull-G family of distributions: Theory, characterizations and applications, Pakistan J. Statist., to appear. Search in Google Scholar

Received: 2017-1-20
Revised: 2017-3-27
Accepted: 2017-3-29
Published Online: 2017-5-26
Published in Print: 2017-6-1

© 2017 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 15.5.2024 from https://www.degruyter.com/document/doi/10.1515/eqc-2017-0004/html
Scroll to top button