Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Type-I heavy tailed family with applications in medicine, engineering and insurance

Abstract

In the present study, a new class of heavy tailed distributions using the T-X family approach is introduced. The proposed family is called type-I heavy tailed family. A special model of the proposed class, named Type-I Heavy Tailed Weibull (TI-HTW) model is studied in detail. We adopt the approach of maximum likelihood estimation for estimating its parameters, and assess the maximum likelihood performance based on biases and mean squared errors via a Monte Carlo simulation framework. Actuarial quantities such as value at risk and tail value at risk are derived. A simulation study for these actuarial measures is conducted, proving that the proposed TI-HTW is a heavy-tailed model. Finally, we provide a comparative study to illustrate the proposed method by analyzing three real data sets from different disciplines such as reliability engineering, bio-medical and financial sciences. The analytical results of the new TI-HTW model are compared with the Weibull and some other non-nested distributions. The Baysesian analysis is discussed to measure the model complexity based on the deviance information criterion.

Introduction

In many practical situations, such as financial sciences, reliability engineering and bio-medial sciences, data are usually positive, and their distribution is unimodal hump shaped and extreme values yielding heavier tails than the classical models. For example, in health science research, (a) the medical expenditures that exceed a given treshould [1] and (b) the lenght of stay in hospitals [2, 3], present highly skewed, heavy tailed data, for which the standard classical distributions and simple variable transformation are insufficient to provide an adequate fit to such data. In reliability engineering, the interest most often lies in the occurrence of rather exceptional events which are associated with the tail part of a statistical distribution. For example, the earthquakes, tsunamis, hurricanes, electrical or power massive failures etc., are some examples of such type of rare and extreme events [4]. All aforementioned events and the rate at which they happens, are associated with the heaviness of the tail and shape of distributions. In financial and risk management problems, one of the important tasks is to predict accurately the losses that occurs with a high fiscal value. Underestimation of the probability of these losses leads to severe operational risk like underestimating the bankruptcy and premium, among others. In such circumstances, the models with heavy-tailed are the best candidate that provides plausibly the best fit for the right tail, see [5].

According to [6], a distribution is said to be heavy-tailed, if the right tail probabilities are heavier than the exponential distribution, that is, its survival function (sf) satisfies for all p > 0; see [7]. The right tail of a model is an important issue in a number of contexts, particularly, pertaining to the insurance problems, where it shows the total impact of insurance losses, and in risk theory, where it is associated with the extreme value theory.

As we discussed above, on a number of occasions, real data sets show a behavior with extreme values producing tails which are heavier than those of classical well-known statistical distributions. In such cases, the utilization of the standard distributions may be not be a good choice to apply; see [8]. They performed an empirical study of distributions using exploratory data analysis and other empirical approaches to estimate the risk. They rejected the idea of using exponential, gamma and Weibull distributions due to their poor results and pointed out that one would need to use a model that is flexible enough in its structure.

These results motivated the researchers to look for more flexible models providing the best fit with greater accuracy in modeling data. In this regard, a number of approaches for extending and generalizing heavy-tailed distributions have been introduced. The new developments have been made through many different approaches such as (i) transformation of variables, (ii) composition of two or more models, (iii) compounding of models, and finally (iv) finite mixture of models.

Recent the study of [9] showed that skewed student t model and skewed-normal model are the best competitors as the skewed distributions adjust right-skewness and high kurtosis; for further detail see [10]. Financial risks and the insurance losses take positive values on the real line and consequently these skew models may not be suitable choice. In such situations, the transformation of variables, particularly the exponential transformation, has proven to be substantial because these distributions are defined on . Furthermore, [11] showed that the transformation approach is simple to use but most often the inference as well as derivation of the other statistical properties become complicated.

Another promising approach for obtaining new flexible heavy-tailed families of distributions, which might provides reasonably the best fit for heavy-tailed losses, is the composition approach; see [12]. However, it must be noted, the new distributions obtained via the composition approach usually involve three or more parameters causing difficulties in the estimation and computational processes.

Another prominent approach is compounding of distributions to cater data modeling with unimodality, right-skewness and heavy tails [13]. However, the density function obtained via this approach may not always have a closed form expression which makes the estimation more complicated as shown in [14].

Finite mixture models represent a further approach to define very flexible distributions which are also able to capture, for instance, multimodality of the underlying distribution [15]. The price to pay for this greater flexibility is a more complicated and computationally challenging inference.

We carry on this branch of distribution theory, and propose a new family of heavy-tailed distributions via T-X family technique. The proposed class is very flexible and provides the best fit for the considered heavy-tailed insurance data.

The rest of work done in study is arranged in the following sections: the proposed family is discussed in Section 2. A sub-case of the proposed class is introduced and the shapes of its density and hazard rate functions are sketched in Section 3. Statistical properties of the new family are obtained in Section 4. The expressions for the maximum likelihood estimators are derived in Section 5. In the same section, a Monte Carlo simulation study is presented. The actuarial measures are derived in Section 6. In the same section, a simulation study based on these measures is also provided. Practical applications are discussed in Section 7. Finally, the article is concluded in Section 8.

Proposed method

This section offers the genesis of the proposed method. Recently, [16] proposed the T-X family method that is specified by the cumulative distribution function (cdf) (1) where K[F(x; ξ)] fulfills certain conditions; see [16]. The probability density function (pdf) corresponding to Eq (1) is Deploying the T-X approach, a good deal of new families of statistical models have been proposed in the literature; see [1720] and [21]. Let T ∼ exp(1), then its cdf is given by (2) Corresponding to expression (), the density function is (3) If v(t) follows Eq (3) and setting in Eq (1), the cdf of the type-I heavy-tailed (TI-HT) family follows as (4) where F(x; ξ) is the baseline distribution function which may depend on Form Eq (4), we can see that G(x; θ, ξ) = F(x; ξ) for θ = 1.

Some key motivations of the proposed TI-HT method are the following: (i) An easy and convenient approach to modify the existing models, (ii) to improve the flexibility of the available models in the literature, (iii) to introduce a generalized form of existing models with closed expression for their distribution functions, (iv) to avail the best fit to real-world data as compared to other models with fewer parameters, same number of parameters and higher number of parameters and, (v) to provide an adequate fit to the heavy-tailed data in applied fields such as reliability engineering, medical and financial sciences and, other related fields.

The pdf associated to Eq (4) is (5) We concentrate our focus to a special sub-case of the new family, called type-I heavy-tailed Weibull (TI-HTW) distribution. Finally, we direct our attention to the results related to the TI-HTW model with real life data in three different disciplines. The first data set is taken from bio-medical field and the results of the TI-HTW model is compared to five other competitor distributions including (a) two-parameter Weibull distribution and (b) three-parameter models such as alpha power transformed Weibull (APTW), Marshall-Olkin Weibull (MOW), transmuted Weibull (TW) and modified Weibull (MW) distributions. The second data set is taken from reliability engineering and the comparison of the new model is made with three other well-known distributions such as (a) the three-parameter extended alpha power transformed Weibull (Ex-APTW) and (b) four-parameter Kumaraswamy Weibull (Ku-W) and beta Weibull (BW) distributions. The third data set is taken from financial sciences and the results of the proposed model is compared with Weibull and other heavy-tailed models including Lomax and Burr-XII distributions.

Sub-model description

In the following section, we introduce the genesis of the TI-HTW distribution and discuss its special cases.

Type-I heavy tailed Weibull distributionxs

Consider the cdf , and pdf , where ξ = (α, γ), of the two-parameter Weibull distribution with shape parameter α > 0 and scale parameter γ > 0. Then, cdf of the TI-HTW model is defined by (6) The pdf of the TI-HTW model is (7) Plots for the pdf of the TI-HTW are sketched in Fig 1, whereas the hrf plots of TI-HTW are showed in Fig 2.

Special cases of the TI-HTW distribution

Let X follows the TI-HTW model with parameters (α, θ, γ). Then X reduces to

  1. Weibull model with parameters α and γ, with θ = 1.
  2. One parameter Weibull model with parameter α, with θ = γ = 1.
  3. Exponential with parameter γ, with θ = α = 1.
  4. Rayleigh distribution with parameter γ, with θ = 1 and α = 2.
  5. One parameter TI-HTW distribution with parameters α and θ, with γ = 1. (New)
  6. Type-I Heavy Tailed Exponential (TI-HTE) with parameters θ and γ, with α = 1. (New)
  7. Type-I Heavy Tailed Rayleigh (TI-HTR) with parameters θ and γ, with α = 2. (New)

Statistical properties

In the following subsections, we study some statistical properties of the TI-HT distributions including the quantile function (qf), rth moment and moment generating function.

Quantile function

The qf of the TI-HT distributions is (8) where u ∈ (0, 1). From expression Eq (8), we can see that the proposed model has closed form solution of the qf which makes it easier to generate random numbers for any sub-case of the TI-HT family.

Moments

This sub-section deals with the derivation of rth moment of the TI-HT distributions. The rth moment of the TI-HT distributions is derived as (9) Using Eq (5) in Eq (9), we have (10) Using the expansion (https://math.stackexchange.com/questions/1624974/series-expansion-1-1-xn) (11) Using x = (1 − θ)F(x; ξ) and n = θ + 1 in Eq (11), we get (12) Also using the series representation (13) Using y = F(x; ξ) and m = θ − 1 in Eq (13), we get (14) Using Eqs (12) and (14) in Eq (10), we have (15) where

For some pre-defined parameters values, numerical results for the descriptive measures (mean, variance, skewness and kurtosis of the TI-HTW mode are given in Tables 1 and 2.

thumbnail
Table 1. Descriptive measures of TI-HTW distribution for α = 0.9, γ = 1 and different values of θ.

https://doi.org/10.1371/journal.pone.0237462.t001

thumbnail
Table 2. Descriptive measures of TI-HTW distribution for θ = 0.5, γ = 1 and different values of α.

https://doi.org/10.1371/journal.pone.0237462.t002

For γ = 1.5 and different values α and θ, plots for the mean, variance, skewness and kurtosis of the TI-HTW distribution are displayed in Figs 3 and 4.

thumbnail
Fig 3. Plots for the mean, variance, skewness and kurtosis of the TI-HTW distribution.

https://doi.org/10.1371/journal.pone.0237462.g003

thumbnail
Fig 4. Plots for the mean, variance, skewness and kurtosis of the TI-HTW distribution.

https://doi.org/10.1371/journal.pone.0237462.g004

The moment generating function (mgf) of the TI-HT random variable X, say MX (t), is derived as (16) Using Eq (15) in Eq (16), we get the mgf of the TI-HT distributions.

Estimation and simulation study

In the following section, we obtain the maximum likelihood estimators (MLEs) of the parameters of the proposed family. Furthermore, we conduct a Monte Carlo simulation study to assess the behavior of these estimators.

Maximum likelihood estimation

Numerous approaches for estimating the un-known parameters have been suggested to obtain the estimates of the parameters. Among them, the maximum likelihood (ML) approach is the most prominent and frequently used method. The estimators obtained via this approach possess useful properties and can be utilized for constructing the confidence interval and other statistical tests. The normal approximation of the MLEs can easily be treated either numerically or analytically. For more details about maximum likelihood estimation, we refer to [22, 23]. In this sub-section, we adopt the ML approach for estimating the parameters of TI-HT family. Suppose X1, X2,…, Xn form an observed sample taken randomly from the TI-HT family with pdf (5). The corresponding log-likelihood function to (5) is (17) where Θ = (α, γ, θ)T. The computer software such as ASS (PROC UNMIXED) can be used to maximize the log-likelihood function directly or via differentiating Eq (17). The partial derivatives of Eq (17) are given by (18) and (19) Equating the nonlinear system of equations and to zero, and simultaneously solving these expressions, yields the MLEs and , respectively.

Monte Carlo simulation study

In this sub-section, we investigate the performance of the MLEs. For the simulation purposes, the special sub-model TI-HTW distribution is considered. The simulation process is conducted based on the following steps:

  • N = 1000 samples of size n = 25, 50, 75, …, 1000 are generated from TI-HTW model with parameters α, γ and θ. The inversion procedure of generating random number is used.
  • Compute MLEs of (α, γ, θ).
  • Compute biases and mean square error (MSE) of the model parameters.
  • Coverage probabilities (CPs) are calculated at the 95% confidence interval (C.I).
  • Steps (i)-(iii) are repeated for n.

The simulation results are provided in Tables 3 and 4. The results in these tables indicate that the behavior of the estimates of the TI-HTW parameters are good, showing small bias and creditable MSEs in all studied cases; that is, these estimates are quite reliable and very close to the actual values. Further, the biases are approaching to 0 as the sample size increases, proving that the estimates are behaved asymptotically unbiased estimators. Moreover, the MSEs decrease as the sample size increases, showing that these estimators are consistent for the TI-HTW parameters.

thumbnail
Table 3. Simulation results for different combination of the parameters of TI-HTW distribution.

https://doi.org/10.1371/journal.pone.0237462.t003

thumbnail
Table 4. Simulation results for different combination of the parameters of TI-HTW distribution.

https://doi.org/10.1371/journal.pone.0237462.t004

Actuarial measures

In actuarial sciences and management institutions, one of the key tasks of the actuaries is to evaluate the exposure of market risk in a portfolio of instruments. In this section, we calculate some important risk measures including value at risk (VaR) and tail value at risk (TVaR) for the TI-HTW, which play a crucial role in portfolio optimization under uncertainty.

VaR measure

Let X follow the TI-HTW model with pdf (7), then the VaR of X denoted by VaRq (q is a specified level of significance) is given by (20)

TVaR measure

The TVaR is one of the most important risk measures that quantifies the expected loss provided that an event outside a specified level of probability has occurred. Let X has the TI-HTW model, then the TVaR of X is computed as (21) Inserting (12) and (14) in (21), we get (22) where

On solving we get (23) where

Numerical study of the risk measures

In the current sub-section, we conduct numerical study of the VaR and TVaR measures for the TI-HTW distribution. The VaR and TVaR of the TI-HTW distribution are compared with the Weibull distribution as a nested model and the exponentiated Weibull (EW) distribution [24] as a non-nested model, which is one the most prominent generalization of the Weibull model. The numerical results are obtained as follows.

  1. We generated a sample of size n = 100 from the Weibull, EW and TI-HTW distributions and their parameters have been estimated via ML method.
  2. 1000 repetitions are made to calculate the VaR and TVaR for these distributions.
  3. The numerical results of the risk measures are provided in Tables 5 and 6. Further, these results are displayed graphically in Figs 5 and 6, respectively.
thumbnail
Table 5. Simulation results for VaR and TVaR of the Weibull, EW and TI-HLW distributions.

https://doi.org/10.1371/journal.pone.0237462.t005

thumbnail
Table 6. Simulation results for VaR and TVaR of the fWeibull, EW and TI-HLW distributions.

https://doi.org/10.1371/journal.pone.0237462.t006

The simulation is performed for the Weibull, EW and proposed models for selected values of their parameters. A model with higher values for VaR and TVaR is said to have a heavier tail. The simulated results provided in Tables 5 and 6 shows that the proposed TI-HTW model has higher values of the risk measures than the Weibull and EW distributions. Figs 5 and 6 also show that the proposed model has a heavier tail than the Weibull and EW distributions.

Comparative study

This section, we consider three heavy-tailed data from applied areas such as medical, engineering and financial sciences to study the flexibility of the proposed family. The key motivations of considering the heavy-tailed distributions are that they adequately provide the best fit to the heavy-tailed data. For each data set, the TI-HTW distribution is compared with different well-known distributions and we observed that the proposed distribution outclass other competitors.

To decide about the goodness of fit among the applied distributions, we consider certain analytical measures. In this regard, we consider two discrimination measures such as the Akaike information criterion (AIC) and Bayesian information criterion (BIC); see [25].

In addition to the discrimination measures, other goodness of fit measures such as Cramer-Von-Mises (CM) test statistic, Anderson Darling (AD) test statistic and Kolmogorov–Smirnov (KS) test along with its p-values are also considered. The formulae for these measures can be found in [26].

A distribution with lower values of these analytical measures is considered to be a good candidate model among the applied distributions for the underlying data sets. By considering these statistical tools, we observed that the TI-HTW model is the best competitor compared to other models because the values of all selected criteria are significantly small for it.

A real life application from bio-medical sciences

The first data set is reported in [27], and it refers to the remission times of bladder cancer patients. For the first data set, the TI-HTW distribution is compared with the Weibull, MOW [28], MW [29], TW [30] and APTW [31] distributions. A number of authors have been used these distributions to model bio-medical data sets. For example, [32] used the Weibull and MOW distributions to model the survival times of the cancer patients. These data were analyzed by [33] and [34].

The maximum likelihood estimates of the models for cancer data are presented in Table 7. The analytical measures of the competitive models are provided in Table 8. Form Table 8, it is clear that the proposed distribution has lower values of these measures than the other models. The fitted cdf and Kaplan-Meier survival plots of the proposed model for cancer data are plotted in Fig 7. The PP plot of the TI-HTW model and box plot of the cancer data are sketched in Fig 8. From Fig 7, we can see that the proposed model fits the estimated cdf and Kaplan Meier survival plots very closely. From Fig 8, we can easily detect that the data set is skewed to the right (see box plot) and proposed model is closely followed the PP-plot.

thumbnail
Fig 7. Estimated cdf and Kaplan Meier survival plots of the TI-HTW distribution for data 1.

https://doi.org/10.1371/journal.pone.0237462.g007

thumbnail
Fig 8. PP plot of the TI-HTW distribution and the box plot for data 1.

https://doi.org/10.1371/journal.pone.0237462.g008

thumbnail
Table 7. Estimated values with standard error (in parenthesis) of the competitive models for data 1.

https://doi.org/10.1371/journal.pone.0237462.t007

thumbnail
Table 8. Discrimination and goodness of fit measures of the TI-HTW and other competitive models for data 1.

https://doi.org/10.1371/journal.pone.0237462.t008

A real life application from reliability engineering

Here, we investigate the TI-HTW distribution via analyzing a heavy-tailed reliability engineering data which are reported in [35], and they refer to failure time of coating machine. To show the potentiality of the proposed method, the TI-HTW distribution is applied in comparison with the Ex-APTW, Ku-W and BW distributions. The Ku-W [36]and Ex-APTW [37] have been used to model failure times data. Al-Malki [38] showed that the BW distribution is one of the most prominent extensions of the Weibull distribution that can be used quite effectively in failure rate time data.

Corresponding to data set 2, the values of the model parameters are reported in Table 9. The analytical measures of the proposed and other competitive models are provided in Table 10. The estimated cdf and Kaplan-Meier survival plots are sketched in Fig 9, which show that proposed distribution fits the estimated cdf and Kaplan-Meier survival plots very closely. The PP and box plots are sketched in Fig 10.

thumbnail
Fig 9. Estimated cdf and Kaplan Meier survival plots of the TI-HTW distribution for data 2.

https://doi.org/10.1371/journal.pone.0237462.g009

thumbnail
Fig 10. PP plot of the TI-HTW distribution and the box plot for data 2.

https://doi.org/10.1371/journal.pone.0237462.g010

thumbnail
Table 9. Estimated values of the model parameters with standard error (in parenthesis) of the fitted models for data 2.

https://doi.org/10.1371/journal.pone.0237462.t009

thumbnail
Table 10. Analytical measures of of the TI-HTW and competitive models for data 2.

https://doi.org/10.1371/journal.pone.0237462.t010

A real life application from insurance sciences

The third data set from the insurance sciences and represents the vehicle insurance losses which are available at: http://www.businessandeconomics.mq.edu.au. For the third data, the TI-HTW distribution is compared with the Weibull, Lomax and Burr-XII distributions which are widely used in modeling financial and financial risk management problems. The Weibull distribution is one of the best competitors for modeling actuarial data up to a specified threshold; see [39]. Further, the Lomax [40] and Burr [41] distributions have been widely used in data modeling with tail beyond the threshold.

The parameter values are reported in Table 11 for the insurance data, and the analytical measures are presented in Table 12. The estimated cdf and Kaplan-Meier survival plots are sketched in Fig 11. The PP and box plots are sketched in Fig 12. From Figs 11 and 12, it is clear that the proposed model fits the estimated cdf, Kaplan-Meier survival and PP plots very well.

thumbnail
Fig 11. Estimated cdf and Kaplan Meier survival plots of the TI-HTW distribution for data 3.

https://doi.org/10.1371/journal.pone.0237462.g011

thumbnail
Fig 12. PP plot of the TI-HTW distribution and the box plot for data 3.

https://doi.org/10.1371/journal.pone.0237462.g012

thumbnail
Table 11. Maximum likelihood estimates of the fitted models with standard error (in parenthesis) for data 3.

https://doi.org/10.1371/journal.pone.0237462.t011

thumbnail
Table 12. Analytical measures of the TI-HTW and other competing models for data 3.

https://doi.org/10.1371/journal.pone.0237462.t012

Baysesian analysis

We adopt a Bayesian formulation for our proposed model and drive posterior inference using Markov chain Monte Carlo (MCMC) algorithm. To generate MCMC samples from posterior distribution of the parameters of our joint model for Bayesian inference, we have used WinBUGS software [42]. Note that to specify a likelihood contribution for a distribution that is not listed in WinBUGS, we have used the “zeros trick” [43]. In particular, a Poisson (λ) observation of zero has likelihood exp(−λ), so if our observed data is set of 0’s, and λ[i] is set to −log(L[i]), we obtain the correct likelihood contribution. Since, λ[i] should always be >0 as it is a Poisson mean, we may need to add suitable constant to ensure that is positive. This is equivalent to multiplying each likelihood term by ec. This process does not influence the likelihood since it is equivalent to multiplying the resulting posterior distribution by a constant term equal to enc. Thus, the likelihood takes the form (24) The choice of a good prior distribution plays a key role in Bayesian inference. In practice, no information is precise enough to lead to the exact determination of the prior distribution. However, non-informative prior that allows the data to dominate to determine the posterior distribution are suggested for the Bayes-MCMC methods. We consider standard distribution for priors, such as gamma priors for α, γ and θ, as these are positive-valued random variables. Note that gamma priors are widely used in Bayesian literature for positive-valued random variables. For assessing convergence, a simple (informal) method of assessing chain convergence is to look at some graphical diagnostics such as trace plot, autocorrelation plot and density plots to determine the mixing of chains. If the chains show a reasonable degree of randomness between iterations, it signifies that the Markov chain has found an area of high likelihood and is integrating over the target density and hence indicating that it has converged. Moreover, we also use the Gelman-Rubin statistic R, another popular technique for diagnosing convergence. It is based on comparison of with in chain and between chain variances. Values of R substantially above 1 indicate lack of convergence. However, some authors suggests that R <1.2 is acceptable. To examine the empirical performance of the proposed methodology for model adequacy, deviance information criterion (DIC) is the most widely used criterion for model comparison in Bayesian analysis [44] and [45]. It is derived based on two principles: (i) goodness of fit measured via the deviance statistic, and (ii) model complexity measured by an estimate of the effective number of parameters, denoted by pD. When comparing two or more models, it is suggested that DICMDICmin > 10 or if the difference lies between 5 and 10, then there is considerably less support for Model M compared to the model with minimum DIC. However, DICMDICmin < 5 shows that no support for a model with the lowest DIC and may lead to misleading inference.

A real life application to AIDS data

[46] described a study involving 467 human immunodeficiency virus (HIV) infected patients who had failed or were intolerant to zidovudine therapy (ZT). The main objective was to compare two antiretroviral drugs to prevent the progression of HIV infections: didanosine (ddI) and zalcitabine (ddC). To analyze the data, We construct two Morkov chains each of 100,000 iteration to approximate posterior density, each following a 10,000 iteration as a burn-in period. we consider here only the TI-HTW and Weibull distributions to model the time-to-event process. For sake of simplicity, we summarize only DIC values and distributional parameters values in Table 13.

thumbnail
Table 13. Bayesian analysis of the TI-HTW and Weibull models.

https://doi.org/10.1371/journal.pone.0237462.t013

DIC values of the TI-HTW distribution and Weibull fits are 7314.9 and 7328.83, respectively, suggesting that the TI-HTW distribution has a superior fit over the Weibull distribution.

Conclusions

The importance of the extended distributions first realized in financial sciences and later in other applied fields such as engineering and medical sciences. To cater data in those fields, a number of methods have been introduced. In this context, we have studied a versatile three parameters heavy-tailed model, called type-1 heavy tailed Weibull distribution as a special case of a new approach allowing closed form expressions for some basic mathematical and other related properties. The proposed class is called type-I heavy-tailed family. The usefulness of the proposed family of heavy-tailed distributions has been proved via three data sets from medical, engineering and financial sciences and the model performs reasonably good than the well-known competing heavy-tailed distributions. The developed family in this work is a promising method for modeling data in the distribution theory, may be useful for the researchers who deal with such data sets. Thus, the new model can be served as a good competitor alternative to other existing models.

Future work includes (i) bivariate extension of the actuarial measures and the Monte Carlo simulation study of these measures, (ii) modeling heavy-tailed data with bivariate extension, (iii) regression problems with covariates and (iv) parameter reduction.

References

  1. 1. Venturini S, Dominici F, Parmigiani G. Gamma shape mixtures for heavy-tailed distributions. Ann Appl Stat, 2012; 2(2): 756–776.
  2. 2. Ickowicz A, Sparks R. Modelling hospital length of stay using convolutive mixtures distributions. Stat Med, 2017; 36(1): 122–135. pmid:27704639
  3. 3. Harini S, Subbiah M, Srinivasan MR. Fitting length of stay in hospitals using transformed distributions. Comm Stat Case Stud Data Anal Appl, 2017; 4(1): 1–8.
  4. 4. Karagrigoriou A, Vonta I. Statistical Inference for Heavy-Tailed Distributions in Technical Systems. In 2014 Ninth International Conference on Availability, Reliability and Security, 2014; 412-419.
  5. 5. Bhati D, Ravi S. Evolution by gene duplication. On generalized log-Moyal distribution: A new heavy tailed size distribution. Insur Math Econ, 2018; 79: 247–259.
  6. 6. Beirlant J, Matthys G, Dierckx G. Heavy-tailed distributions and rating. Astin Bull, 2001; 31(1): 37–58.
  7. 7. Resnick SI. Discussion of the Danish data on large fire insurance losses. Astin Bull, 1997; 27(1), 139–151.
  8. 8. Dutta K, Perry J. Evolution by gene duplication. A tale of tails: an empirical analysis of loss distribution models for estimating operational risk capital, 2006.
  9. 9. Adcock C, Eling M, Loperfido N. Skewed distributions in finance and actuarial science: a review. Eur J Financ, 2015; 21: 1253–1281.
  10. 10. Punzo A. A new look at the inverse Gaussian distribution with applications to insurance and economic data. J Appl Stat, 2019; 46(7): 1260–1287.
  11. 11. Bagnato L, Punzo A. Finite mixtures of unimodal beta and gamma densities and the k-bumps algorithms. Computation Stat, 2013; 28(4): 1571–1597.
  12. 12. Bakar SA, Hamzah N, Maghsoudi M, Nadarajah S. Modeling loss data using composite models Insur Math Econ, 2015; 76: 146–154.
  13. 13. Mazza A, Punzo A. Modeling Household Income with Contaminated Unimodal Distributions. In: Petrucci A., Racioppi F., Verde R. (Eds.), New Statistical Developments in Data Science, Springer Proceedings in Mathematics & Statistics, 2019; 288: 373-S391.
  14. 14. Punzo A, Mazza A, Maruotti A. Fitting insurance and economic data with outliers: a flexible approach based on finite mixtures of contaminated gamma distributions. J Appl Stat, 2018; 45(14): 2563–2584.
  15. 15. Punzo A, Bagnato L, Maruotti A. Compound unimodal distributions for insurance losses. Insur Math Econ, 2018; 81: 95–107.
  16. 16. Alzaatreh A, Lee C, Famoye F. A new method for generating families of continuous distributions. Metron, 2013; 71(1): 63–79.
  17. 17. Ahmad Z, Hamedani GG, Butt NS. Recent developments in distribution theory: a brief survey and some new generalized classes of distributions. Pak J Stat Oper Res, 2019; 15(1): 87–110.
  18. 18. Cordeiro GM, Afify AZ, Ortega EM, Suzuki AK, Mead ME. The odd Lomax generator of distributions: properties, estimation and applications. J Comput Appl Math, 2019; 347: 222–237.
  19. 19. Nassar M, Kumar D, Dey S, Cordieor GM, Afify AZ. The Marshall–Olkin alpha power family of distributions with applications. J Comput Appl Math, 2019; 351: 41–53.
  20. 20. Afify AZ, Alizadeh M. The odd Dagum family of distributions: properties and applications. J Appl Prop Stat, 2020; 15(1): 45–72.
  21. 21. He W, Ahmad Z, Afify AZ, Goual H. The arcsine exponentiated-X family: validation and insurance application. Complexity, 2020; 2020: 1–18.
  22. 22. Dong B, Ma X, Chen F, Chen S. Investigating the differences of single-vehicle and multivehicle accident probability using mixed logit model. Journal of Advanced Transportation, 2018.
  23. 23. Chen F, Song M, Ma X. Investigation on the injury severity of drivers in rear-end collisions between cars using a random parameters bivariate ordered probit model. International journal of environmental research and public health, 2019; 16(14): 2632.
  24. 24. Mudholkar GS, Srivastava DK. Exponentiated Weibull family for analyzing bathtub failure-rate data. IEEE T Reliab, 1993; 42(2): 299–302.
  25. 25. Sun J, Li T, Li F, Chen F. Analysis of safety factors for urban expressways considering the effect of congestion in Shanghai. China. Accident Analysis & Prevention; 95: 503–511.
  26. 26. Afify AZ, Cordeiro GM, Yousof HM, Saboor A., Ortega EM. The Marshall-Olkin additive Weibull distribution with variable shapes for the hazard rate. CHacettepe J Math Stat, 2018; 47(2): 365–381.
  27. 27. Lee ET, Wang J. Statistical methods for survival data analysis (Vol. 476). John Wiley & Sons. 2003.
  28. 28. Marshall AW, Olkin I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika, 1997; 84(3): 641–652.
  29. 29. Sarhan AM, Zaindin M. Modified Weibull distribution. Applied Sciences, 2009; 11: 123–136.
  30. 30. Aryal GR, Tsokos CP. Transmuted weibull distribution: a generalization of theweibull probability distribution. Euro J. of Pure and Appl Mathe, 2011; 4(2): 89–102.
  31. 31. Dey S, Sharma VK, Mesfioui M. A new extension of Weibull distribution with application to lifetime data. Annals of Data Sci, 2017; 4(1): 31–61.
  32. 32. Alizadeh M, Tahir MH, Cordeiro GM, Mansoor M, Zubair M, Hamedani GG. The Kumaraswamy marshal-Olkin family of distributions. Egypt Math Soc, 2015; 23(3): 546–557.
  33. 33. Afify AZ, Mohamed OA. A new three-parameter exponential distribution with variable shapes for the hazard rate: estimation and applications. Mathematics, 2020; 8: 1–17.
  34. 34. Sen S, Afify AZ, Al-Mofleh H, Ahsanullahd M. The quasi xgamma-geometric distribution with application in medicine. Filomat, 2019; 33(16): 5291–5330.
  35. 35. Algamal ZY. Exponentiated exponential distribution as a failure time distribution. Iaqi J Stat Sci, 2008; 14: 63–75.
  36. 36. Cordeiro GM, Ortega EM, adarajah S. The Kumaraswamy Weibull distribution with application to failure data. J Franklin Inst, 2010; 347(8): 1399–1429.
  37. 37. Ahmad Z, Ilyas M, Hamedani GG. The Extended Alpha Power Transformed Family of Distributions: Properties and Applications. J Data Sci, 2019; 17(4): 726–741.
  38. 38. Al-Malki SJ. Statistical analysis of lifetime data using new modified Weibull distributions (Doctoral dissertation, The University of Manchester (United Kingdom)). 2014.
  39. 39. Scollnik DP, Sun C. Modeling with weibull-pareto models. N Amer Actu J, 2012; 16(2): 260–272.
  40. 40. Scollnik DP. On composite Log-normal-Pareto models. Scand Actuar J, 2007; 1: 20–33.
  41. 41. Nadarajah S., Bakar S. A. New composite models for the Danish fre insurance data. Scand. Actuar. J, 2014; 2: 180–187.
  42. 42. Lunn DJ, Thomas A, Best N, Spiegelhalter D. WinBUGS-a Bayesian modelling framework: concepts, structure, and extensibility STAT COMPUT, 2000; 10(4): 325–337.
  43. 43. Ntzoufras I. Bayesian modeling using WinBUGS John Wiley & Sons, 2011; 698.
  44. 44. Zeng Q, Guo Q, Wong SC, Wen H, Huang H, Pei X. Jointly modeling area-level crash rates by severity: a Bayesian multivariate random-parameters spatio-temporal Tobit regression. Transportmetrica A: Transport Science; 2019, 15(2): 1867–1884.
  45. 45. Zeng Q, Hao W, Lee J, Chen F. Investigating the impacts of real-time weather conditions on freeway crash severity: a Bayesian spatial analysis. International journal of environmental research and public health; 2020, 17(8): 27–68.
  46. 46. Rizopoulos D. Joint models for longitudinal and time-to-event data: With applications in R. CRC press, 2012.