Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A Gull Alpha Power Weibull distribution with applications to real and simulated data

Abstract

In this paper, we produced a new family of distribution called Gull Alpha Power Family of distributions (GAPF). A Special case of GAPF is derived by considering the Weibull distribution as a baseline distribution called Gull Alpha Power Weibull distribution (GAPW). The suitability of the proposed distribution derives from its ability to model both the monotonic and non-monotonic hazard rate functions which are a common practice in survival analysis and reliability engineering. Various statistical properties were derived in addition to their special cases. The unknown parameters of the model are estimated using the maximum likelihood method. Moreover, the usefulness of the proposed distribution is supported by using two real lifetime data sets as well as simulated data.

Introduction

From last few years, researchers made a contribution to the theory of probability so as to remove some of the limitations of the existing probability distributions. For example, the Exponential distribution fails to model the monotonic and non-monotonic hazard rate functions, it can only model the constant hazard rate of an object; Gamma distribution can only model the data with monotonically increasing failure rate. But in practice, various data exists which follows a non-monotonic hazard rate function, for example, the lifetime of an electronic device or the accident rate follows a non-monotonic hazard rate function.

It is usual practice to modify the existing probability models so as to model both the monotonic and non-monotonic hazard rate function and also to provide a suitable fit. One such modification is to produce a generator and then applied to the existing models so as to derive a new probability model. For example, Al-Aqtash et.al [1] produced a new family of distributions using the logit function and derived the special case named as Gumbel-Weibull distribution. Alzaatreh et.al [2] investigated the gamma-X family of distribution and explored the special case by employing the normal distribution as a baseline distribution. Abid, & Abdulrazak [3] Presented a truncated Frechet-G family of distribution. Korkmaz & Genç [4] defined a generalized two-sided class of probability distribution. Alzaghal et.al [5] worked on the T-X family of distributions. Aldeni et.al [6] explained a new family using the quantile functions of the generalized lambda distribution. For other generators, we refer to see [710]. The more recent modified Weibull distributions are [3], [4], [67], [10].

The scope of developing a new family of distribution is to produce a new probability distribution so as to remove some of the difficulties found in the existing probability distributions. The proposed distribution will not only model the monotonic and non-monotonic hazard rate function, but also increase flexibility and provided a better fit as compared to other probability distribution distributions provided in the literature.

In this paper, a new family of distribution is proposed called Gull Alpha Power Family of distribution. The special case of this family is derived by employing the Weibull distribution called Gull Alpha Power Weibull distribution (GAPW). GAPW is a modified form of the Weibull distribution which can model the non-monotonic hazard rate shapes. Various statistical properties have been derived such as hazard rate function, survival function, and moments. Two real data sets and simulated data are used to check the versatility of the proposed model. The paper detailed discussion is as follows.

Gull Alpha Power Family (GAPF) of distributions

This section illustrates a new family of probability distributions called Gull Alpha Power Family of distributions or in short GAPF. Let Y is a continuous random variable then the cumulative distribution function (CDF) of Gull Alpha Power Family of distribution is defined as (1)

Where α is being the shape parameter and not be zero. The probability density function related to the above CDF takes the form (2)

Gull Alpha Power Weibull distribution (GAPW)

This section illustrates the special form of GAPF by employing the CDF of the Weibull distribution. The CDF of the Weibull distribution [11] is given by (3) where β is the scale parameter and γ is the shape parameter.

The Weibull distribution [11] is one of the most important and has been widely used in many real-world problems. For example, in reliability applications, Keshavan et.al [12] used the Weibull distribution to analyze the fracture strength of glass data, Fok et.al [13] utilized to model the failure of brittle materials data, Li et.al [14] by working the failure probability of concrete components. In Geophysics, Al-Hasan & Nigmatullin [15] modeled the wind speed data, to model the data related to earthquake [16] considered the Weibull distribution, and the data concerned with environmental radioactivity was analyzed by Dahm et.al [17]. Tsumoto & Okiai [18] applied the Weibull distribution to the impulse breakdown of oil-filled cable data.

The Weibull distribution can model only increasing, decreasing or a constant failure rate. This distribution fails to model a non-monotonic hazard rate function, for example, a failure rate of the electronic device, accident rate, and infant mortality rate. To achieve this goal, researchers have been constructing different modified versions of the Weibull distribution, for example, Almalki & Yuan [19] presented the New modified Weibull distribution, A new extension of Weibull distribution is presented by [20], a four parameter Weibull distribution is studied by Lemonte et.al [21], Gumbel-Weibull distribution was explored by Al-Aqtash et.al [22], Almheidat et.al [23] investigated the generalized form of the Weibull distribution, Almalki & Nadarajah [24] by working with the discrete form of the Weibull distribution, and a flexible Weibull extension was introduced by Bebbington et.al [25].

Let Y is a continuous random variable which follows GAPW distribution then the CDF and PDF are respectively given by (4) (5)

The proposed distribution contains three parameters that is β>0 is the scale and α>0, γ>0 being the shape parameters.

The graphical representations of the probability density function and cumulative distribution function with different values of parameters are given in Fig 1.

The survival and hazard rate function

Let a random variable Y follows GAPW (α,β,γ). Then the survival function of GAPW is defined as (6)

Similarly, the failure or hazard rate function of GAPW is defined as , transforming Eqs (4) and (5), we obtain the result given below (7)

Fig 2 shows the behavior of the hazard rate function with different parameter values.

The quantile function and median

The quantile function is used to conduct a simulation study as well as to measure the median, quartile, octile, decile, and percentile. The quantile function is the real solution of a random variable Y to the expression given as (8)

Substituting (4), we obtain the result as

The simplified form is then given by

Using Mathematica software, we get the following result (9) where the product Log function W (Z) is defined as

For the median, put u = 0.5 in Eq (9).

The rth moments

Let a random variable Y has GAPW distribution with parameters α,β, and γ then the rth moments (about the origin), say is defined as (10)

Recalling (5) we get (11)

Solving the first part of the above integral form

The above integral takes the following form

Finally, we obtained the result as (12)

Now to solve the second part

Using the same transformation as earlier, the integral we may write as

We obtained the following result (13)

By combining (12) and (13), it’s observed that the rth moment does not exist in general.

Order statistics

Let Y1,Y2,Y3,…Yn be ordered random variables from GAPW, then the PDF of the ith order statistic is given by (14)

Using (4) and (5), the minimum and maximum order statistic of the Gull Alpha Power distribution are given by

Parameter estimation

Since the parameters of the probability model are unknown and it is to be estimated using information obtained from a sample. For a detailed discussion on maximum likelihood estimation, we refer to see [2628]. In this section, the usual method maximum likelihood estimates are used to find out the estimate of the parameters. Let suppose an independent random sample of size n that is Y1,2Y,Y3Yn is selected from GAPW (α,β,γ). The Likelihood function is defined as

Substituting (5) in the above expression, we get (15)

To derive the estimate of the parameters, we have to take the partial derivatives with respect to α,β,γ and then equate the result to zero (16) (17) (18)

Eqs (16)–(18) is not in closed form. Hence it is difficult to calculate the values of the parameters. However, one can use the iteration procedure used in mathematics that is the Bisection and Newton Raphson method to get the MLE.

Renyi entropy

By definition, the Renyi entropy of the random variable Y belong to GAPW (α,β,γ) is given by (19) (20)

Using the expression given below

Hence (20) takes the following form (21) by solving the above expression, we get (22)

Replace (22) in (19), we obtained the following result for the Renyi entropy (23)

Mode

The mode of the random variable Y is defined by the following equation.

The more simplified form of the above expression may be written as (24)

Using Mathematica software, it has been observed that the density function will be maximum only at y = 0

Skewness and kurtosis

The mathematical form of the Galton Skewness and Moors kurtosis of the GAPW distribution with three parameters are defined by the following relationship (25) (26)

Where Q describe different quartiles values. Table 1 illustrates the numerical description of the Skewness and Kurtosis for different values of parameters.

Mean Residual Life (MRL)

The mean residual life of the gull alpha power distribution with parameters α,β,γ is define as (27)

Plugging in the above two expressions in (27), we get (28)

By solving (28) finally, we obtained the following result (29)

Special cases

This section illustrates two special cases of the Gull Alpha Power Weibull distribution.

  1. Case. γ = 1
    When we put γ = 1 in (4) and (5), then it shall be referring to the CDF and PDF of the Gull Alpha Power Exponential distribution (GAPE). The mathematical forms are described as (30) (31)
  2. Case. γ = 2
    If we replace γ = 2 in the expressions (4) and (5), the derived probability function will stand for the Gull Alpha Power Rayleigh distribution (GAPR). The CDF and PDF of GAPR are respectively given by (32) (33)

Applications

In this section, we provide two real life applications of the proposed model in which one data set follows a non-monotonic hazard rate shape and the second data follows a monotonic hazard rate shape so as to achieve the objectives and delineate efficiency of the proposed model. The performance of the model is judged by commonly used goodness of fit measures including Cramer-von mises (W), Anderson darling (A), Akaike information criteria (AIC), Consistent Akaike information criteria (CAIC), Hannan and quin information criteria (HQIC), and Bayesian information criteria (BIC). The mathematical form of these criteria is defined by where, is the maximized likelihood function and yi is the given random sample, is the maximum likelihood estimator and p is the number of parameters in the model.

As a general rule, a probability model with fewer values of these criteria should be considered the best fitted model among other probability distributions.

Data set 1: Remission time of Bladders cancer patients

The data set consists of the remission time of 128 bladder cancer patients. The data set is taken from Aldeni and Famoye [29] with the values are as follows

0.080, 0.200, 0.400, 0.500, 0.510, 0.810, 0.900, 1.050, 1.190, 1.260, 1.350, 1.400, 1.460, 1.760, 2.020, 2.020, 2.070, 2.090, 2.230, 2.260, 2.460, 2.540, 2.620, 2.640, 2.690, 2.690, 2.750, 2.830, 2.870, 3.020, 3.250, 3.310, 3.360, 3.360, 3.480, 3.520, 3.570, 3.640, 3.700, 3.820, 3.880, 4.180, 4.230, 4.260, 4.330, 4.340, 4.400, 4.500, 4.510, 4.870, 4.980, 5.060, 5.090, 5.170, 5.320, 5.320, 5.340, 5.410, 5.410, 5.490, 5.620, 5.710, 5.850, 6.250, 6.540, 6.760, 6.930, 6.940, 6.970, 7.090, 7.260, 7.280, 7.320, 7.390, 7.590, 7.620, 7.630, 7.660, 7.870, 7.930, 8.260, 8.370, 8.530, 8.650, 8.660, 9.020, 9.220, 9.470, 9.740, 10.06, 10.34, 10.66, 10.75, 11.25, 11.64, 11.79, 11.98, 12.02, 12.03, 12.07, 12.63, 13.11, 13.29, 13.80, 14.24, 14.76, 14.77, 14.83, 15.96, 16.62, 17.12, 17.14, 17.36, 18.10, 19.13, 20.28, 21.73, 22.69, 23.63, 25.74, 25.82, 26.31, 32.15, 34.26, 36.66, 43.01, 46.12, 79.05.

Fig 3 shows the theoretical and empirical pdf and cdf of the GAPW distribution using the bladder cancer patient’s data and it is observed that the GAPW is the best-fitted line as compared to others. Fig 4 demonstrates the Q-Q and P-P plot of the bladder cancer patient data. The TTT plot in Fig 5 clearly shows that this data follows a non-monotonic hazard rate shapes. Table 2 reflects the maximum likelihood estimates, standard errors, and the log-likelihood values. Table 3 defines the goodness of fit measures for the bladder cancer data. It has been observed that the goodness of fit measures has fewer values for GAPW while analyzing the bladder cancer data. Hence the proposed distribution provides a better fit as compared Weibull exponential (W.E), exponential (E), Weibull (W), Rayleigh (R) and Algoharai inverse flexible Weibull (AIFW) distribution.

thumbnail
Fig 4. Theoretical and empirical Pdf and Cdf with Q-Q plot and P-P plot for GAPW.

https://doi.org/10.1371/journal.pone.0233080.g004

thumbnail
Table 2. Maximum likelihood estimates and their standard errors.

https://doi.org/10.1371/journal.pone.0233080.t002

thumbnail
Table 3. Goodness of fit measures of the GAPW for bladder cancer data.

https://doi.org/10.1371/journal.pone.0233080.t003

Data set 2: Bank customers Data

The data set waiting time of 100 bank customers is taken from Ghitany et al. [30]. The data set values are given below

0.8,0.8,1.3,1.5,1.8,1.9,1.9,2.1,2.6,2.7,2.9,3.1,3.2,3.3,3.5,3.6,4,4.1,4.2,4.2,4.3,4.3,4.4,4.4,4.6,4.7,4.7,4.8,4.9,4.9,5.0,5.3,5.5,5.7,5.7,6.1,6.2,6.2,6.2,6.3,6.7,6.9,7.1,7.1,7.1,7.1,7.4,7.6,7.7,8,8.2,8.6,8.6,8.6,8.8,8.8,8.9,8.9,9.5,9.6,9.7,9.8,10.7,10.9,11.0,11.0,11.1,11.2,11.2,11.5,11.9,12.4,12.5,12.9,13.0,13.1,13.3,13.6,13.7,13.9,14.1,15.4,15.4,17.3,17.3,18.1,18.2,18.4,18.9,19.0,19.9,20.6,21.3,21.4,21.9,23,27,31.6,33.1,38.5.

Fig 6 illustrates the theoretical and empirical Pdf and Cdf of the GAPW distribution using the 100 bank customer’s data. The graph clearly shows that the red line is the best-fitted line to theoretical data. Fig 7 displays the Q-Q and P-P plot of the bank customer’s data. The TTT plot in Fig 8 clearly shows that this data follows a monotonic hazard rate shapes. Table 4 gives the maximum likelihood estimates of the unknown parameters of GAPW including the standard errors, and the log-likelihood values. The values given in Table 5 declared that GAPW leads to a better fit than other versions of the Weibull distribution like Weibull exponential (W.E), exponential (E), Weibull (W), Rayleigh (R) and Algoharai inverse flexible Weibull (AIFW) distribution.

thumbnail
Fig 7. Theoretical and empirical Pdf and Cdf with Q-Q plot and P-P plot for GAPW.

https://doi.org/10.1371/journal.pone.0233080.g007

thumbnail
Table 4. Maximum likelihood estimates and their standard errors.

https://doi.org/10.1371/journal.pone.0233080.t004

thumbnail
Table 5. Goodness of fit measures of the GAPW for bank customers data.

https://doi.org/10.1371/journal.pone.0233080.t005

Simulations

To perform simulations an expression (9) was used to generate artificial data from the GAPW distribution. The simulations are performed 100 times with a different set of parameters with different sample of size n. The maximum likelihood estimates and their standard errors are given in Table 6. The tabulated values clearly show as we increase the sample size, both the ml estimates and the standard errors are decreases. The general formula for computing the mean square error and bias are as follows

thumbnail
Table 6. Maximum likelihood Estimates and their standard errors.

https://doi.org/10.1371/journal.pone.0233080.t006

Conclusion

In this paper, we produced a new generator called Gull Alpha Power Family of distributions or in short GAPF. A special case of this family was derived by employing the CDF of the Weibull distribution as a baseline distribution. The special case is known as Gull Alpha Power Weibull distribution (GAPW). Various statistical properties have been discussed in addition to the parameter estimation using the maximum likelihood method. In future, a researcher may conduct a study on estimation of the parameters of the proposed model under Bayesian paradigm by using informative and non-informative priors. For a detailed discussion on Bayesian estimation we refer to see [3134]. Furthermore, we have explored the special cases of GAPW. The significance of the proposed model is justified by using two real data sets as well as the simulated data. The TTT plot of the bladder cancer patient’s data clearly demonstrates that this data follows a non-monotonic hazard rate shape. While the TTT plot of the bank customers data follows a monotonic hazard rate shape. It has been observed that the proposed model performs well in both the non-monotonic and a monotonic hazard rate shapes as compared to the Weibull (W), Weibull Exponential (W.E), Exponential (E), Rayleigh (R), and Algoharai Inverse Flexible Weibull (AIFW) distribution.

Supporting information

S1 Table. Numerical values of skewness and kurtosis.

https://doi.org/10.1371/journal.pone.0233080.s001

(DOCX)

S2 Table. Maximum likelihood estimates and their standard errors.

https://doi.org/10.1371/journal.pone.0233080.s002

(DOCX)

S3 Table. Goodness of fit measures of the GAPW for bladder cancer data.

https://doi.org/10.1371/journal.pone.0233080.s003

(DOCX)

S4 Table. Maximum likelihood estimates and their standard errors.

https://doi.org/10.1371/journal.pone.0233080.s004

(DOCX)

S5 Table. Goodness of fit measures of the GAPW for bladder cancer data.

https://doi.org/10.1371/journal.pone.0233080.s005

(DOCX)

S6 Table. Maximum likelihood Estimates and their standard errors.

https://doi.org/10.1371/journal.pone.0233080.s006

(DOCX)

S1 Data. Remission time of Bladders cancer patients [26].

https://doi.org/10.1371/journal.pone.0233080.s007

(DOCX)

Acknowledgments

I would appreciate and thank the referees for their comments and further suggestions so that to improve the paper further.

References

  1. 1. Al-Aqtash R., Famoye F., & Lee C. (2015). On generating a new family of distributions using the logit function. Journal of Probability and Statistical Science, 13(1), 135–152.
  2. 2. Alzaatreh A., Famoye F., & Lee C. (2014). The gamma-normal distribution: Properties and applications. Computational Statistics & Data Analysis, 69, 67–80.
  3. 3. Abid S. H., & Abdulrazak R. K. [0, 1] (2017). Truncated Frechet-G Generator of Distributions. Applied Mathematics. 3, 51–66.
  4. 4. Korkmaz M. Ç., & Genç A. İ. (2017). A new generalized two-sided class of distributions with an emphasis on two-sided generalized normal distribution. Communications in Statistics-Simulation and Computation, 46(2), 1441–1460.
  5. 5. Alzaghal A., Famoye F., & Lee C. (2013). Exponentiated TX family of distributions with some applications. International Journal of Statistics and Probability, 2(3), 31.
  6. 6. Aldeni M., Lee C., & Famoye F. (2017). Families of distributions arising from the quantile of generalized lambda distribution. Journal of Statistical Distributions and Applications, 4(1), 25.
  7. 7. Cordeiro G. M., Alizadeh M., Ramires T. G., & Ortega E. M. (2017). The generalized odd half-Cauchy family of distributions: properties and applications. Communications in Statistics-Theory and Methods, 46(11), 5685–5705.
  8. 8. Alzaatreh A., Lee C., Famoye F., & Ghosh I. (2016). The generalized Cauchy family of distributions with applications. Journal of Statistical Distributions and Applications, 3(1), 12.
  9. 9. Alzaatreh A., Lee C., & Famoye F. (2014). T-normal family of distributions: A new approach to generalize the normal distribution. Journal of Statistical Distributions and Applications, 1(1), 16.
  10. 10. Nasir M. A., Aljarrah M., Jamal F., & Tahir M. H. (2017). A new generalized Burr family of distributions based on quantile function. Journal of Statistics Applications and Probability, 6(3), 1–14.
  11. 11. Weibull W. (1951). A statistical distribution function of wide applicability. Journal of applied mechanics, 18(3), 293–297.
  12. 12. Keshavan M. K., Sargent G. A., & Conrad H. (1980). Statistical analysis of the Hertzian fracture of pyrex glass using the Weibull distribution function. Journal of Materials Science, 15(4), 839–844.
  13. 13. Fok S. L., Mitchell B. C., Smart J., & Marsden B. J. (2001). A numerical study on the application of the Weibull theory to brittle materials. Engineering Fracture Mechanics, 68(10), 1171–1179.
  14. 14. Li Q. S., Fang J. Q., Liu D. K., & Tang J. (2003). Failure probability prediction of concrete components. Cement and concrete research, 33(10), 1631–1636.
  15. 15. Al-Hasan M., & Nigmatullin R. R. (2003). Identification of the generalized Weibull distribution in wind speed data by the Eigen-coordinates method. Renewable Energy, 28(1), 93–110.
  16. 16. Huillet T., & Raynaud H. F. (1999). Rare events in a log-Weibull scenario-Application to earthquake magnitude data. The European Physical Journal B-Condensed Matter and Complex Systems, 12(3), 457–469.
  17. 17. Dahm H., Niemeyer J., & Schröder D. (2002). Application of the Weibull distribution to describe the vertical distribution of cesium-137 on a slope under permanent pasture in Luxembourg. Journal of environmental radioactivity, 63(3), 207–219. pmid:12440510
  18. 18. Tsumoto M., & Okiai R. (1974). A new application of Weibull distribution to impulse breakdown of oil-filled cable. IEEE Transactions on Power Apparatus and Systems, (1), 360–366.
  19. 19. Almalki S. J., & Yuan J. (2013). A new modified Weibull distribution. Reliability Engineering & System Safety, 111, 164–170.
  20. 20. Dey S., Sharma V. K., & Mesfioui M. (2017). A new extension of Weibull distribution with application to lifetime data. Annals of Data Science, 4(1), 31–61.
  21. 21. Lemonte A. J., Cordeiro G. M., & Ortega E. M. (2014). On the additive Weibull distribution. Communications in Statistics-Theory and Methods, 43(10–12), 2066–2080.
  22. 22. Al-Aqtash R., Lee C., & Famoye F. (2014). Gumbel-weibull distribution: Properties and applications. Journal of Modern applied statistical methods, 13(2), 11.
  23. 23. Almheidat M., Lee C., & Famoye F. (2016). A generalization of the Weibull distribution with applications. Journal of Modern Applied Statistical Methods, 15(2), 47.
  24. 24. Almalki S. J., & Nadarajah S. (2014). A new discrete modified Weibull distribution. IEEE Transactions on Reliability, 63(1), 68–80.
  25. 25. Bebbington M., Lai C. D., & Zitikis R. (2007). A flexible Weibull extension. Reliability Engineering & System Safety, 92(6), 719–726.
  26. 26. Chen F., & Chen S. (2011). Injury severities of truck drivers in single-and multi-vehicle accidents on rural highways. Accident Analysis & Prevention, 43(5), 1677–1688.
  27. 27. Chen F., Chen S., & Ma X. (2018). Analysis of hourly crash likelihood using unbalanced panel data mixed logit model and real-time driving environmental big data. Journal of safety research, 65, 153–159. pmid:29776524
  28. 28. Dong B., Ma X., Chen F., & Chen S. (2018). Investigating the differences of single-vehicle and multivehicle accident probability using mixed logit model. Journal of Advanced Transportation, 2018.
  29. 29. Aldeni M., Lee C., & Famoye F. (2017). Families of distributions arising from the quantile of generalized lambda distribution. Journal of Statistical Distributions and Applications, 4(1), 25.
  30. 30. Ghitany M. E., Al-Awadhi F. A., & Alkhalfan L. (2007). Marshall–Olkin extended Lomax distribution and its application to censored data. Communications in Statistics—Theory and Methods, 36(10), 1855–1866.
  31. 31. Analysis of safety factors for urban expressways considering the effect of congestion in Shanghai, China”, Accident Analysis and Prevention, 2016, 95, 503–511. pmid:26721569
  32. 32. Canavos G. C., & Taokas C. P. (1973). Bayesian estimation of life parameters in the Weibull distribution. Operations Research, 21(3), 755–763.
  33. 33. Okasha H. M. (2014). E-Bayesian estimation for the Lomax distribution based on type-II censored data. Journal of the Egyptian Mathematical Society, 22(3), 489–495.
  34. 34. Jan U., & Ahmad S.P. (2017). Bayesian analysis of inverse Lomax distribution using approximation techniques. Bayesian Analysis,7(7).