A new generalized family of distributions: Properties and applications

1 Department of Statistics, Govt. Postgraduate College B. R. Bahawalpur, Bahawalpur 63100, Pakistan 2 Mathematics Department, Umm-Al-Qura University, Makkah, Saudi Arabia 3 Department of Mathematics, College of Science and Humanities in Al-Kharj, Prince Sattam bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia 4 Department of Mathematics, Faculty of Science, Mansoura University, Mansoura 35516, Egypt 5 Department of Statistics, Mathematics and Insurance, Benha University, Benha 13511, Egypt


Introduction
The classical probability distributions were generalized through the induction of location, scale and shape parameters. Recently, appreciable attempts have been made in the development of the new probability distributions which have big privilege of more flexibility, fitting specific and several real world sequence of events.
The improvement in the G-Classes revolution began with the fundamental article of Alzaatreh et al. [9] in which they proposed transformed (T)-transformer (X) (T-X) family. (1.1) Many authors constructed extended generalized families by using T-X approach. Some examples of generalized classes are, beta-G [19], Kw-G type-1 [17], log-gamma-G type-2 [10], gamma-X [40], exponentiated T-X [9], Weibull-G [11], exponentiated-Weibull-H [13] and generalized odd Lindley-G [3]. Motivated by the new prospect in term of accuracy and exibility of a new distribution, we propose a new propose a new flexible family called, log-logistic tan generalized (LLT-G) family which provides greater accuracy and flexibility in fitting real-life data. Some general properties of the LLT-G class will be provided here. We provide two applications for one special sub-model of the proposed class, called log-logistic tan-Weibull (LLT-W) distribution which has decreasing, increasing, bathtub and unimodal hazard rate functions.
Let r(t) = c s t s c−1 1 + ( t s ) C −2 be the pdf of a rv 0 < t < ∞ and W[G(x)] = tan π 2 G α (x) which satisfies the conditions of T-X family. The cdf and pdf of the new LLT-G family take the forms x > 0, α, c, s > 0 (1.2) and f (x) = π α c 2 s c g(x) [G(x)] α−1 sec 2 π 2 G α (x) tan π 2 G α (x) x > 0, α, c, s > 0. (1.3) The LLT-G hazard rate function (hrf) has the form iii) The LLT-Weibull as a special sub-model of LLT-G class provides adequate fits than other modified models generated by other existing families under same baseline model.
This paper is outlined as follows. Five special models of the LLT-G family are provided in Section 2. In Section 3, we derived some mathematical properties of LLT-G family. Estimation of the LLT-G parameters using maximum likelihood is discussed in Section 4. Further, we present simulations for the LLT-W model to address the performance of the proposed estimators in Section 4. In Section 5, we show the importance and applicability of the LLT-G family via two real-life data applications. In Section 6, the paper is summarized.

Five special distributions
We study five sub-models of the LLT-G class using the baseline Weibull, normal, Rayleigh, exponential and Burr XII distributions (1.3) and provide some plots for their pdfs and hrfs. Figures 1-5 reveals that the special sub-models of the LLT-G class can provide left skewed, symmetrical, right skewed, unimodal, bimodal and reversed-J densities, and increasing, modified bathtub, decreasing, bathtub, unimodal, reversed-J shaped, and J-shaped failure rates.

LLT-Weibull (LLT-W) distribution
Using the Weibull pdf, g( The pdf associated with (2.1) reduces to The plots in Figure 1 depict pdf and hrf shapes of the LLT-W distribution for different parametric values.

LLT-normal (LLT-N) distribution
The pdf of normal distribution is g( The cdf of the LLT-N reduces to 3) The LLT-N pdf takes the form

LLT-Rayleigh (LLT-R) distribution
Using the Rayleigh rv with pdf, g(x) = 2 a x e −a x 2 , x > 0, a > 0, the cdf of the LLT-R distribution becomes The pdf of the LLT-R reduces to (2.6)

LLT-exponential (LLT-E) distribution
Consider the exponential distribution with pdf g(x) = b e −b x , x > 0, b > 0. Hence, the cdf and pdf of LLT-E distribution are and (2.8)

LLT-Burr XII (LLT-BXII) distribution
Let X be a Burr XII rv with pdf g( The pdf of the LLT-BXII reduces to (2.10) Figure 5 displays density and hazard rate plots of the LLT-BXII distribution.

Properties
In this section, the LLT-G properties such as quantile function (qf), useful expansion, moments, generating function, and order statistics are derived. The expressions derived for the LLT-G family can be handled using symbolic computation software, such as, Maple, Matlab, Mathematica, Mathcad, and R because of their ability to deal with complex and formidable size mathematical expressions. Established explicit formulae to evaluate statistical and mathematical measures can be more efficient than computing them directly by numerical integration. It is noted that the infinity limit in the sums of these expressions can be substituted by a large positive integer such as 40 or 50 for most practical purposes.

Quantile function
The qf of X is calculated directly by inverting (2.1) as Eq (3.1) can be used to simulate any baseline model and to obtain the median=Q(1/2), Bowley's skewness and Moors kurtosis.
The cdf of the LLT-G family can be written as Using the binomial expansion and the convergent series of tan function which can be calculated using the Mathematica software. tan Applying the last two equations to (3.2), the cdf of LLT-G reduces to We can rewrite the Eq (3.2) as where . (3.5) The below power series can be calculated using the Mathematica software where h α(2 j+ic) (x) denotes the exp-G pdf with parameter α(2 j + ic). Hereafter, a rv having the pdf delivers up that the LLT-G pdf is a linear combination of exp-G densities. Thus, a few properties of LLT-G class can be calculated someplace precisely from exp-G properties.

Moments
Assuming that Z is a rv with a baseline G(x). The moments of X can be derived from the (r, k)th probability weighted moment (PWM) of Z defined by [22] as (3.8) Using Eq (3.7), one can write where τ r,α(2 j+ic) = 1 0 Q G (u) r u α(2 j+ic) du which is computed numerically for any baseline qf. The measures of skewness, β 1 , and kurtosis, β 2 , can be defined by the following two formulae and The plots of skewness and kurtosis for the LLT-W distribution are visualized in Figure 6.  Figure 6. Skewness and kurtosis Plots for the LLT-W model.

Moment generating function
We present two formulae for the mgf, M(s) = E(e s X ), of the rv X. The first one comes from Eq (3.7) as where M α(2 j+ic) (s) is the exp-G mgf with power parameter α(2 j + ic). Further, Eq (3.12) can also take the form where the quantity ρ 2 j+iβ (s) = 1 0 exp [s Q G (u)] u α(2 j+ic) du is calculated numerically.

Incomplete moments
Incomplete moments are beneficial in calculating some inequality measures and mean deviations. The nth incomplete moments of the LLT-G family has the form (3.14) For most baseline distributions, the above (3.14) can be obtained numerically. The first incomplete moment, m 1 (z), can be calculated from Eq (3.7) as where Eq (3.15) is the primary quantity to obtain the mean deviations. We apply (3.15) to the LLT-W model. The LLT-W pdf with power parameter α(2 j + ic), reduces to and then Another formula for m 1 (z) follows, Eq (3.7) with u = G(x), as

Order statistics
Consider a random sample from the LLT-G class, X 1 , . . . , X n . The pdf of the ith order statistic, X i:n , has the form After some algebra, the pdf of X i:n takes the form where the exp-G density, h α(2(r+t)+c(m+1)) (x), has a power parameter α(2(r + t) + c(m + 1)) where b r [c(m + 1) − 1] and c t (2) are the coefficients of the power series expansion of tan and sec functions by using Mathematica. Eq (3.21) can be used to calculate some quantities of the LLT-G order statistics from the exp-G quantities.

Estimation and simulation
We will discuss the estimation of the LLT-G parameters using the maximum likelihood (MLE) and study the performance of these estimators via simulations.

Maximum likelihood estimation
Consider a random sample from the LLT-G class, x 1 , . . . , x n , with Θ = (α, s, c, ξ) . Then, the log-likelihood of Θ reduces to n (Θ) = n log ( where A i = π 2 G α (x i ; ξ). The above log-likelihood (4.1) is maximized simply by statistical programs namely, R, Mathcad, SAS, Mathematica and Ox programs. It also is maximized by solving the following likelihood equations.

Simulations
This section deals with checking the MLEs performance in estimating the LLT-W parameters using a simulation study. For n = 50, 100, 300 and 500, we generated 1,000 samples from the LLT-W model for various parametric values using the inversion method via the qf of the LLT-W given by (4.6) The numerical results of the mean square error (MSE), bias, coverage probability (C.P) and average width (AW) of the MLEs of the model parameters are obtained using R software. Further, the graphical representation of bias, MSE and C.P on several parametric values are also provided. The numerical results for bias, MSE, C.P and AW are listed in Tables 1 and 2

Modeling real-life data
This section deals with checking the flexibility of the LLT-W distribution using two real-life data applications.
The first real data contain 63 observations about strengths of 1.5 cm glass fibers which are reported in Smith and Naylor [38]. The second data set from Choulakian and Stephens (2001) [12], is the exceedances of flood peaks (in m3/s) of Wheaton River, Yukon Territory, Canada. The data consists of 72 exceedances for the years 1958-1984, rounded to one decimal place. Tables 3 and 4 report the MLEs (with their associated (standard errors)) of the parameters of competing models, and the statistics K-S, p-value, A * and W * .
The LLT-W model is compared with the ODW, OLLEW, KW, WBXII, KEBXII, BBXII, WL distributions in Tables 3 and 4. We note that the LLT-W model gives the lowest values for all discrimination measures and largest p-value among all fitted models. Hence,the LLT-W model could be chosen as a good alternative to explain glass fibers and Wheaton river data. The results in Tables 3  and 4 indicate that LLT-W provides better fits for glass fibers and Wheaton river data sets as compared to other competing models. More visual comparison of the four best competing distributions are provided in Figures 10 and 11. The fitted densities of the four best models are shown in Figure 10 for glass fibers and Wheaton river data, whereas the estimated distribution functions for four best models are depicted in Figure 11. Further, the hrf plots of the LLT-W distribution for glass fibers and Wheaton river data are illustrated in Figure 12. Based on visual comparison, we can conclude that the LLT-W distribution provides a close fit for glass fibers and Wheaton river data and it can be utilized in fitting data with increasing and modified bathtub hazard rates.

Conclusions
This paper provides a new log-logistic tan generalized (LLT-G) class with three additional shape parameters to capture skewness and kurtosis behavior. Five special models of the LLT-G family are presented by choosing Weibull, normal, Rayleigh, exponential and Burr XII, as baseline distributions in the proposed family, to obtain the LLT-Weibull, LLT-normal, LLT-Rayleigh, LLT-exponential and LLT-Burr XII. The general mathematical properties are obtained for the LLT-G class. The LLT-G parameters estimation is discussed by maximum-likelihood approach and simulation results are obtained to check the performance of these estimators. The importance and flexibility of the LLT-Weibull are checked empirically using two sets of real-life data, proving that it can provide better fit as compared with other competing models, such as odd log-logistic modified-Weibull, odd Dagum-Weibull, odd log logistic exponentiated-Weibull, Weibull-Lomax, Kumaraswamy-Weibull, Kumaraswamy-exponentiated-Burr XII, and beta-Burr XII distributions.