Abstract

We present a Bayesian approach for analysis of competing risks survival data with masked causes of failure. This approach is often used to assess the impact of covariates on the hazard functions when the failure time is exactly observed for some subjects but only known to lie in an interval of time for the remaining subjects. Such data, known as partly interval-censored data, usually result from periodic inspection in production engineering. In this study, Dirichlet and Gamma processes are assumed as priors for masking probabilities and baseline hazards. Markov chain Monte Carlo (MCMC) technique is employed for the implementation of the Bayesian approach. The effectiveness of the proposed approach is illustrated with simulated and production engineering applications.

1. Introduction

In survival analysis, competing risks models are often used when multiple potential causes of failure are observed. Often, the causes of failure are unidentified [1, 2]. Consider the example shown in Reiser et al. [1]. Out of the 682 IBM PS/2 computer system units tested (a system is made of three components: motherboard, disc drives, and power supply), eight failed and the remaining 674 were right censored at various times [1]. The eight systems failed due to malfunction in at least one of the three components {1, 2, 3} and often the cause is narrowed down to several possibilities, e.g., {1, 3}, indicating that the cause of failure is either the motherboard or power supply [1]. This type of incomplete data is generally referred to as masked data, where the cause of failure can only be identified up to a minimum random subset (MRS) .

Let T be the time until a unit experiences a failure due to one of the possible causes of failure. If the precise cause of failure is identified as , then is a singleton. If the cause of failure is not identified, then , resulting in full masking of the cause. Thus, every observed failure time , , is accompanied by a observed MRS denoted by . However, another reason for incomplete observations could be due to the unavailability of knowledge about the exact failure time T.

There is abundant literature of masked data [19]. Miyakawa [8] provided maximum likelihood estimates (MLEs) with two causes of failure and independent exponential failure times when the data is masked and uncensored. Dinse [9] suggested nonparametric maximum likelihood estimators of prevalence and mortality when the MRS is a known cause or a full masked cause.

In production engineering, researchers were often interested in the assessment of components reliability. When the impact of the risk factors on the hazard function is of interest, Bayesian analysis under cause-specific hazard framework is often considered; see, for example, [17]. Bayesian approach is preferred as it is flexible in allowing a general pattern of masking as well as relaxing assumptions [2]. In their work, Reiser et al. [1] presented a Bayesian analysis assuming exponentially distributed component lifetimes, and Guttman et al. [7] took this work further to the case where the masking probability depends on the actual cause of failure. Considering the partial masking cases, Mukhopadhyay and Basu [4] explored a Bayesian approach with independent Weibull distributions and made the assumption of identical shape structures for all the risks. On the other hand, Basu et al. [6] used a Bayesian analysis for masked data from a general component system with nonidentical Weibull distributions. In their follow-up study, Basu et al. [5] explored a Bayesian approach based on a general flexible parametric framework for complex forms of censoring. Without the symmetry assumption, Kuo and Yang [10] developed a Bayesian analysis with independent exponential as well as Weibull distributions, while Mukhopadhyay and Basu [3] studied the case of a series system, the components of which follow independent log-normal life distribution. Xu and Tang [11] considered a nonparametric Bayesian approach for masked data which extended the findings of Neath and Samaniego [12] to series systems with partially masked competing risks. In contrast and based on a cause-specific formulation, Flehinger et al. [13] proposed an approach of completely parametric cause-specific hazards using stage 1 and stage 2 information when the failure times for the competing risks have a Weibull distribution. In their work, Craiu and Reiser [14] developed an expectation-maximization- (EM-) based method that allowed dependent competing risks and produced estimators for the subdistribution functions. Similarly, Lu and Tsiatis [15] presented parametric models to estimate the regression coefficients whereby the cause-specific hazard for the cause of interest is associated with the covariates through a proportional hazards relationship. Sen et al. [2] introduced a semiparametric Bayesian approach using variety in priors.

Most of the works mentioned above were developed for masked data based on the series of system formulation considering cases where the failure time was complete (no censored units), right-censored (RC), or interval-censored. In this paper, we investigated the case where the data is masked and the failure time is partly interval-censored (PIC) using the Bayesian approach. We employed the Cox proportional hazards model, which is used extensively but mostly for public health studies; see, for example, Han et al. [16] and Liu et al. [17].

The rest of the paper is organized as follows. Section 2 introduces the model formulation and Bayesian computation techniques. Section 3.1 provides some results from modest simulation to evaluate the model performance. The illustration of our approach using an actual data set were presented in Section 3.2 Section 4 concludes the paper.

2. Materials and Methods

2.1. Model Structure

In masked data, we do not only observe the failure time for every unit but also a set of causes that contains the true cause of failure. Assume that we observe units each with causes of failure acting on it. Let denote the observed collection of covariates, then for any unit , we observe the vector , where denotes the failure time and denotes the minimum random subset (MRS) of causes that are possibly responsible for the unit failure. Following Kuo and Yang [10], for the unit, the likelihood contribution from the data consists in can be expressed aswhere denotes the actual cause of failure of the unit. Note that . When the observation of C is incomplete (see Crowder [12]), the likelihood contribution for an observed failure can be modified to

In the case of partly interval-censored data where the time of failure is incomplete, but observe the exact failure time for some units, however only the interval of time that includes the true failure time for the remaining units is observed. This type of data often arises in production engineering when units are inspected periodically. Let (], where , denotes the observed interval including the true failure time of the unit (). If the unit failure occurs before the first inspection time, then we have a left-censored observation (); if the unit does not fail until the last inspection time, then we have a right-censored observation (). Define as censoring indicators taking the value of one if the failure time is left-censored or interval-censored and taking a value of zero otherwise. Then, the likelihood contribution of the unit when the observation of is incomplete can be expressed aswhere and () are the numbers of the units whose failure time is exact and interval-censored (including left- and right-censored), respectively.

In this study, we used semiparametric Bayesian approach to estimate the regression coefficients for the set of covariates. Further, we adopted the cause-specific formulation utilizing the popular proportional hazards (PH) model that is of the formwhere and are the baseline hazard and the regression coefficient of the cause of failure and represents the vector of the covariates, respectively.

In this study, both failure time and cause of failure are incomplete; therefore, we need to consider the two cases discussed above to formulate the likelihood function. Let () denote the numbers of the units whose failure times are exact, right-censored, and interval-censored (including left-censored), respectively. Then, the full likelihood of masked and partly interval-censored data can be expressed as

Substituting equation (4) in (5), the full-likelihood function emerges as

When dealing with the masked data, the unrealistic symmetry assumption that involves an equal chance of observing a similarly masked subset of causes regardless of the actual cause is often adopted, i.e.,

Assumption (7) makes the analysis proceed with a reduced likelihood function that is not reliant on the masking probabilities. In contrast, in this paper, we applied a Bayesian analysis, which does not rely on the symmetry assumption [1], using the full likelihood function of equation (6). We model the masking probabilities to be independent of the failure time and dependent on the cause of failure. Moreover, we allow them to depend on subject-level covariates.

It is often of interest to determine the cause that is responsible for the unit failure when it is masked. For this purpose, we need to compute the diagnostic probability, which is the probability of the risk, causing the unit to fail given the observed masking set and the unit’s failure time. According to our full likelihood, we have two different ways to compute the diagnostic probabilities depending on whether the failure time of the unit is exact or interval-censored (including left-censored). First, when the unit is exact, the diagnostic probability can be defined as

Second, when the unit is interval-censored, the diagnostic probability can be defined as

2.2. Bayesian Framework

For semiparametric approaches to Bayesian inference involving hazard regression models, Cox’s PH model is one of the most commonly adopted models. Suppose there are N units under observation with K competing risks acting on them, and the observed data , i = 1, …, N, j = 1, …, K, where is the process counting the failures due to cause j occurring up to time t and is the at risk indicator for cause j. Let be a small increment of over the interval [t, t + dt), then and equal 1 if the event occurs in [0, t) and [t, t + dt), respectively, and 0 otherwise. Under noninformative censoring, the likelihood (specific to the cause of failure) of the data is proportional to

This is basically as if the counting process increments over the time interval [t, t + dt) are independent Poisson random variables with means . That is,where is the immediate probability that the unit at risk at time t has the event j in the next time interval [t, t + dt). After specifying the likelihood form, the interest turns to the prior distributions. We need to define prior distributions to the unknown parameters which are assumed to be stochastically independent. The prior specifications follow standard practice in semiparametric Bayesian analysis where popular priors that conjugate for the likelihood function are utilized. Following Sen et al. [2] and Xu and Tang [11], we assigned independent Dirichlet priors to the masking probabilities. Let denote the number of sets that include the cause and let denote the collection of potential MRS’s that contain cause , then the random Dirichlet variables can be defined as where and are the Dirichlet parameters.

Using an independent gamma process for the prior for cause-specific baseline hazards, , where is the cumulative baseline hazard specific to cause of failure. Here, can be regarded as a prior guess at unknown hazard function specific to cause of failure while c represents the degree of confidence in this guess.

The regression coefficients are assumed to be independently normal distributed, i.e.,where , and are the regression coefficient, the mean, and the variance, respectively, specific to the cause of failure.

After defining the prior distributions, our interest turns to the joint posterior distribution:where denotes the likelihood function, denotes the prior distribution, and denotes the observed data. Since (14) has a complicated form, we utilize the MCMC technique to generate random draws from the related full conditional posterior distributions, namely, , , and . These distributions need to be identified for the construction of an effective simulation method.

In order to assess the performance of our approach, we followed the cause-specific hazards-based simulation design of Beyersmann et al. [18]. All analyses were implemented in WinBUGS software [19] and R statistical software version 2.6.2 [20].

3. Results and Discussion

3.1. Simulation Study

We considered a competing risks model with two causes of failure, where each has a Weibull distributed lifetime with parameters , , and set , , , . First, we simulate the failure times and the censored times. Then, we simulate the causes of failure and mask them randomly with equal chances to be masked or unmasked, which results in masked right-censored data. Last, we create inspection times so that the data becomes partly interval-censored data that includes exact, left-censored, right-censored, and interval-censored failure times. The obtained data consists of exact failure times, as well as right-, left-, and interval-censored failure times. Furthermore, of the observations are masked while and of the observations fail due to causes 1 and 2, respectively.

A comparison of our approach on simulated partly interval-censored (PIC) data with the ones obtained using Sen’s approach was done [2]. The reported results are based on five chains each of iterations with burn-in of iterations. We present the results from the simulated study in Table 1 and Figure 1. The results show that the estimates from the two approaches are comparable. Although our model deals with left-, right-, and interval-censored failure times with considerable missing information, Table 1 shows that its posterior estimations of regression coefficients are reasonably close to those estimated by the model with only right-censored failure times. On the other hand, Figure 1 shows a comparison between the cumulative baseline hazards. The cumulative baseline hazards obtained from the two models are noticeably close with slight fluctuation for both causes (Figure 1).

3.2. Application to Electric Motors Data

We applied our approach to the data on failure times of insulation systems for electric motors (with their corresponding causes of failure), reported in Klein and Basu [21]. There are three common types of insulation system failures: turn, phase, and ground. The experiment was conducted at three different stress levels , , and , where units were tested at each level. To illustrate our approach, we converted this data into a PIC data with masked causes of failure. The obtained results are based on five chains where each is run with a burn-in of iterations, retained draws, and a thinning to every draw. Convergence is monitored and achieved for all parameters. Table 2 describes the number of units across the masking sets and the failure/censored times. Table 3 summarizes the posteriors estimates (mean, median, standard error (SE), and posterior credible interval (PCI)) of the regression coefficients while Table 4 shows the posterior mean of the diagnostic probabilities of the full masked units which are computed through equations (8) and (9). The results indicate that 50%, 25%, and 25% of the units fail due to turn, phase, and ground causes, respectively. Figure 2 depicts the cumulative baseline hazard functions of the three causes of failure. It is obvious from the figure that the three causes have almost the same cumulative hazard. In addition, Figures 35 demonstrate that the hazard increases with growing stress, irrespective of the cause. This is exactly the purpose of such experiments as it is run at high levels of stress to accelerate the failure as well as to reduce the cost and the experiment period.

4. Conclusion

In this study, we have presented a Bayesian approach for the analysis of competing risks data with masked causes of failure. The Bayesian approach for competing risks models was derived where the cause of failure is masked and failure time is incomplete. This method offers some flexibility in modelling as it is not built on assumptions with questionable validity such as the symmetry assumption or independence of the competing risks. Furthermore, it provides an assessment of the risk factors’ (covariates) effect on the hazard function. The results of the simulation support the feasibility of our modelling approach.

Data Availability

The data used to support the findings of this study are available within the article.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The authors would like to thank Mrs. Lynn Mason for editing the manuscript language. The publication of this article was funded by the Qatar National Library.