Statistical inference for the Nadarajah-Haghighi distribution based on ranked set sampling with applications

: In this article, the maximum likelihood and Bayes inference methods are discussed for determining the two unknown parameters and speciﬁc lifetime parameters of the Nadarajah-Haghighi distribution, such as the survival and hazard rate functions, with the inclusion of ranked set sampling and simple random sampling. The estimated conﬁdence intervals for the two parameters and any function of them are developed based on the Fisher-information matrix. Metropolis-Hastings algorithm and Lindley-approximation are used for generating the Bayes estimates and related highest posterior density credible ranges for the unknown parameters and reliability parameters under the presumption of conjugate gamma priors. A Monte-Carlo simulation study and a real-life data set have been used to assess the e ﬃ cacy of the proposed methods.


Introduction
McIntyre [12] developed a sampling strategy to ascertain the mean pasture yield in Australia, which was later known as ranked set sampling (RSS).This sampling technique is useful when it is more convenient and affordable to rank the sample units without utilizing their precise values.In order to generate RSS with set size n, we first draw n simple random samples (SRSs) of size n from the population of interest.Then, we arrange each sample of size n in ascending order of magnitude.When ranking is done at this step, it is not required to actually measure the sample units because it can be done using a subjective method like eye inspection or personal judgement.The sample unit from the ith sample with judgement rank i, i = 1, 2, ..., n, has been chosen for actual quantification.It is important to note that a limited set size is suggested to facilitate informal rankings.RSS-based methods are frequently more effective than their SRS equivalents; this issue has been investigated for a number of common issues in the RSS literature, including, Dell and Clutter [2], Huang [5], Kvam and Samaniego [8], MacEachern et al. [11], Mohie El-Din et al. [14,15], Stokes [18], Stokes and Sager [19], Takahasi and Wakimoto [22], Wang et al. [23] and Zamanzade and Mahdizadeh [24].
Statistical inference for NHD unknown lifetime parameters have not yet been studied using RSS.This study's main objective was to obtain MLEs and BEs based on lifetime data collected under RSS to produce point and interval estimates of the unknown NHD parameters as well as some lifetimeparameters, such as the SF and HRF.Independent conjugate gamma priors of the unknown parameters are taken into account under the squared error loss function (SEL).Additionally, we construct ACIs and HPD for θ and µ, as well as any function of them.In order to compute the BEs and produce the associated HPD, we recommend using Lindley-approximation and Metropolis-Hastings (MH) algorithm.A Monte-Carlo simulation (MC-simulation) analysis was carried out to assess the outcomes of various estimations through their mean-square error (MSE) and absolute-relative bias (RA).The following portions of the study are organized as follows: We derive the MLEs and associated twosided ACIs in Section 2. In Section 3, we develop the SEL-based Bayesian inference.The results of MC-simulation are presented in Section 4. Section 5 describes inferences based on life-time data analysis as an illustration.Section 6 presents a final conclusion.

Bayesian inference
We use the assumption that the unknown parameters θ and µ are both independent random variables, and have conjugate gamma priors, i.e., θ ∼ Gamma(c 1 , d 1 ) and µ ∼ Gamma(c 2 , d 2 ), respectively, in order to derive the BEs.In order to reflect prior information regarding the two unknown parameters θ and µ, the hyper parameters c j , d j , j = 1, 2 were chosen.As a result, π(θ, µ) ∝ θ c 1 −1 µ c 2 −1 e −(d 1 θ+d 2 µ) , θ, µ > 0 and c j , d j > 0, j = 1, 2 can be applied to express the joint prior density of θ and µ.The joint posterior PDF of θ and µ in the continuous Bayes' theorem can be defined by π(θ, µ|x) ∝ L(θ, µ|x)π(θ, µ).The joint posterior distribution is As a consequence, the BEs of δ(ϑ), ϑ = (θ, µ) under SEL, is the posterior expectation of δ(ϑ) and is provided by Nevertheless, the BEs are computed as two-dimensional integrals based on Eq (3.3), for which there is no closed-form solution.As a result, we advise adopting Lindley's approximation and MCMC methodology as two techniques for approximating Eq (3.3).

MH-algorithm based on RSS
To implement MCMC approach and produce samples from the joint posterior distribution, we utilize MH sampler algorithm (Metropolis et al. [13] and Hastings [4]).According to Eq (3.1), the conditional posterior distributions of θ and µ are provided by and (3.28) Analytical reduction to well-known distributions of the conditional posterior distributions of Eqs (3.27) and (3.28) of unknown parameters θ and µ is not possible.Hence, in order to get the BEs as well as accompanying HPD credible interval, MH algorithm with normal proposed distribution has been utilized to get random samples from such conditional posterior distributions.MH algorithm's stages are carried out as follows: Step 1: Select initial values θ (0) and µ (0) .
Step 4: At a specific time t, BEs of the SF and HRF are provided by and Step 5: Put J = J + 1.
Step 6: Steps 3-5 should be repeated M-times to obtain
The 100(1 − γ)% HPD of ρ is given by (ρ (J * ) , ρ (J * +(1−γ)M) ), where J * is selected such that So, we can obtain the shortest length of HPD interval for ρ.The initial simulated iterations of the algorithm, M 0 , are frequently discarded at the start of the analytical implementation (burn-in period), to guarantee convergence.The drawn samples, ρ ( j) , j = M 0 +1, ..., M, are then large enough to construct the BEs.As a result, any function of θ and µ under SEL has MCMC BEs of ρ(θ, µ) that are provided by

MC-simulation analysis
This section compares the estimated values for θ, µ, κ(t) and ξ(t) using MC-simulation analysis.We calculated MLEs as well as BEs for the reliability characteristics and unknown-parameters.For Bayesian computing, Lindley's approximation as well as MCMC techniques using SEL were utilized.With regard to their MSE and RA values, various estimators have been compared.A 95% ACI/HPD intervals' performances have also been compared using average lengths (ALs).We determined our mean estimates, or MSEs in addition RAs, which are provided, respectively, by θ , where ς = 1000 and θ denotes the estimates of the parametric function.We have used the informative priors of θ and µ with c 1 = c 2 = 1 and d 1 = 0.2, d 2 = 0.3 when (θ, µ) = (0.2, 0.3), such that the hyper-parameter values selected to satisfy the prior-mean are transformed into the predicted value of the associated parameter.For specified time t = 5, the true values of κ(t) and ξ(t) were further approximated to be κ(5) = 0.8178 and ξ(5) = 0.0288.
Based on true values of θ and µ, we generated n sets of random samples, each one of size n from NHD(θ, µ) distribution, by using the transformation , n, U i was taken from U(0, 1).Then, by using an RSS scheme (SCH-I) and SRS scheme (SCH-II), samples of size n = 4, 7, 10 were obtained.
The various BEs were produced by using MH sampler technique suggested in the theoretical section, with 12, 000 MCMC samples; we dropped the first 2000 samples as burn-in.We calculated ALs of the 95% ACI/HPD of θ, µ, κ(t) and ξ(t).Statistical computer language Mathematica was used to carry out extensive computations.Table 1 displays mean of MLEs and BEs for θ, µ, κ(t) and ξ(t).Additionally, Table 2 includes the ALs of θ, µ, κ(t) and ξ(t).
Table 1 displays that the reliability features of the NHD in terms of the MSEs and RAs based on MLEs and BEs are extremely superb.MSEs and RAs of all estimates decline as n increases, as predicted.Additionally, BEs utilizing gamma informative priors are superior to MLEs in terms of the MSEs and RAs because they involve prior knowledge.Based on the MSEs and RAs, MCMC technique employing the MH algorithm is superior to Lindley's approximation method.HPD outperform ACIs in regard to ALs for interval estimation.Additionally, by increasing n, the ALs of ACIs and HPD credible intervals decrease.We recommend applying point and interval BEs using the MH method.Furthermore, MSEs and RAs of MLEs and BEs for parameters were higher for SCH-II than SCH-I in most cases when comparing SCH-I and SCH-II.Given that the experiment's observations for SCH-I were more random than for SCH-II, therefore, it would be expected that the data collected by SCH-I would reveal more details regarding reliability-parameters than the sample collected by SCH-II.

Real-life data analysis
In order to show how the suggested technique may be used to study real-life events, a real data set was examined.The analyzed data-set, which depicts failure times for 33 small electronic applications with known failure causes in an autonomous life test, is shown in [9].There were totals of 17 and 16 observed failures attributed to causes 1 and 2, respectively, from the entire failure times.For computational ease, each failure time point in the original data set was split by a thousand.Salem et al. [17] analyzed and addressed transformed failure times of electronic applications.Before sampling, we fitted the NHD to the whole set of data and compared it with the fitting of three-lifetime distributions, specifically, the gamma distribution (G-D), Weibull distribution (W-D) and generalizedexponential distribution (GE-D).
The data's compatibility with the G-D, W-D, GE-D and NHD is one issue that has to be addressed.The reliability of the proposed model has been evaluated using the goodness of fit test (GFT) statistics with related pvalues from the Kolmogorov Smirnov (KS) and Anderson Darling (AD) tests.Additionally, we evaluated the following: • Hannan-Quinn-criteria, HQC = −2 l + 2w log(log(n)), n is sample size and w is the number of model parameters are taken into account, see Ashour et al. [1].
The optimal distribution often has the greatest pvalues and lowest values for the − l, AC, ACc, CAC, BC, HQC, KS, and AD statistics.Table 3 lists the values for MLEs of the distribution parameters and the accompanying GFT metrics.Table 4 provides the statistical results of the KS and AD GFTs, and their pvalues.Additionally, we assessed the distributions' goodness of fit by using a graphing method.For the G-D, W-D, GE-D and NHD, we created quantile-quantile (Q-Q) plots, which are shown in Figure 1.The points F −1 (i−0.5)n ; θ , x (i) , i = 1, 2, ..., n are shown on a Q-Q plot; θ is the MLE of θ and x (i) is ordered data.We have also included two graphs that were produced using the estimated parameters for a more precise explanation.The SFs of the G-D, W-D, GE-D and NHD have been fitted by using the empirical CDF plot, which is the first plot.As shown in Figure 2, the subsequent plot represents the histogram of the real-data together with PDFs for the G-D, W-D, GE-D and NHD.The fact that it is frequently impossible to demonstrate the existence and uniqueness of the generated estimators is one of the main problems with ML inference.The contour plot of the log-LF based on NHD parameters is presented in Figure 3 to help address this issue using the observed failure data.It shows the existence and uniqueness of the MLEs θ and µ.Therefore, to begin the computational iteration, we suggest using these estimates as initial hypotheses.
The NHD is the best option among the comparable G-D, W-D and GE-D that have been studied to fit lifetime data as it had the lowest GFTs and greatest pvalues, as can be observed from Tables 3 and  4. The Q-Q plots also lend credence to the aforementioned conclusions.We can obtain at random six sets with n = 6 items in each set to develop RSS to demonstrate the inferential techniques used in the previous sections (see Table 5).The reliable features κ(t) and ξ(t) at the specified time t = 2, in addition to MLEs and BEs of θ and µ, were calculated and are listed in Table 6.Under a non-informative prior, Lindley-approximation and MCMC method were used to develop BEs.The first 10,000 iterations were removed from the sequence after we generated 50,000 MCMC samples.The MLEs for θ, µ, κ(t) and ξ(t) were used as initial-values for the MCMC sampler procedure.Additionally, Table 6 lists a 95% ACIs and HPD intervals.Forty-thousand outcome of θ, µ, κ(t) and ξ(t) are shown in Figure 4 together with their sample means (solid line (-)) and 95% two sided intervals (dashed lines (---)).Additionally, this plot demonstrates that MCMC algorithm has achieved good convergence and burn-in-period sample size is appropriate to eliminate initial proposals effects.

Conclusions
The issue of determining the NHD's life-time parameters when the observed data are drawn by RSS is addressed in this article based on MLEs and BEs.These estimations can be computed numerically but cannot be achieved in closed forms.The BEs were got by utilizing the Lindley-approximation and MCMC techniques with gamma-priors under SEL.Asymptotic-normality MLEs as well as the delta technique were used to construct 95%-ACIs of life-time parameters.We used MH technique to get point estimations and related HPD credible intervals because Lindley approximation approach is unable to develop the HPD credible intervals.

Figure 1 .
Figure 1.Q-Q plots of G-D, W-D, GE-D and NHD models.

Figure 2 .
Figure 2. SF and PDF results for G-D, W-D, GE-D and NHD based on electronic data.

Figure 3 .
Figure 3. Log-LFs of MLEs and contour-plots of log-LFs of θ and µ for electronic data.

Table 2 .
ALs of 95% ACIs and HPDs for the parameters and reliability characteristics.

Table 3 .
Fitting summary based on lifetime data.
Note: The best model is represented by values in bold font.
Note: The best model is represented by values in bold font.

Table 5 .
An RSS with sample size n = 6.