Abstract
Survival analysis studies time to event data, also called survival data in biomedical research. The main challenge in the analysis of survival data is to develop inferential methods that take into account the incomplete information contained in censored observations. The seminal paper of Kaplan and Meier (J Am Stat Assoc 53:457–481,1958) gave a boost to the development of statistical methods for time to event data subject to right censoring; methods that have been applied in a broad variety of scientific fields including health, engineering and economy. A basic quantity in survival analysis is the survival distribution: \(S(t) = P(T > t)\), with T the time to event or, in case of a bivariate vector of lifetimes \((T_1,T_2)\), \(S(t_1,t_2) = P(T_1> t_1,T_2 > t_2\)). Nonparametric estimation of these basic quantities received, since Kaplan and Meier (J Am Stat Assoc 53:457–481,1958), considerable attention resulting in many publications scattered over a large period of time and a large field of applications. The purpose of this paper is to review, in a unified way, nonparametric estimation of S(t) and \(S(t_1,t_2)\) for time to event data subject to right censoring. Interesting to realize is that, in the multivariate setting, the form of the nonparametric estimator for \(S(t_1,t_2)\) is determined by the actual censoring scheme. In this survey we focus, for the proposed (implicitly) existing or new nonparametric estimators, on the asymptotic normality. By doing so we fill some gaps in the literature by introducing some new estimators and by providing explicit expressions for the asymptotic variances often not yet available for some of the existing estimators.
Similar content being viewed by others
References
Abrams S, Janssen P, Veraverbeke N (2021) Quantiles of the conditional residual lifetime. Statistics 55:1271–1290
Abrams S, Janssen P, Veraverbeke N (2023) Nonparametric estimation of the quantiles of the conditional lifetime distribution. Technical Report (submitted)
Akritas MG (1994) Nearest neighbor estimation of a bivariate distribution under random censoring. Ann Statist 22:1299–1327
Akritas MG, Van Keilegom I (2003) Estimation of bivariate and marginal distributions with censored data. J Royal Statist Soc B 65:457–471
Amico M, Van Keilegom I (2018) Cure models in survival analysis. Annu Rev Statist Appl 5:311–342
Beran R (1981) Nonparametric regression with randomly censored survival data. Technical Report, University of California, Berkeley
Braekers R, Veraverbeke N (2005) A copula-graphic estimator for the conditional survival function under dependent censoring. Can J Statist 33:429–447
Breslow N, Crowley J (1974) A large sample study of the life table and product limit estimates under random censorship. Ann Statist 2:437–453
Burke M (1988) Estimation of a bivariate distribution under random censorship. Biometrika 75:379–382
Conde-Amboage M, Van Keilegom I, González-Manteiga W (2021) A new lack-of-fit test for quantile regression with censored data. Scand J Statistics 48:665–688
Dabrowksa DM (1988) Kaplan-Meier estimate on the plane. Ann Statist 16:1475–1489
Ebrahimi N, Molefe D, Ying Z (2003) Identifiability and censored data. Biometrika 90:724–727
Földes A, Rejtő L (1981) A LIL type result for the product limit estimator. Z Wahrsch Verw Gebiete 56:75–86
Geerdens C, Janssen P, Van Keilegom I (2020) Goodness-of-fit test for a parametric survival function with cure fraction. TEST 29:768–792
Geerdens C, Janssen P, Veraverbeke N (2016) Large sample properties of nonparametric copula estimators under bivariate censoring. Statistics 50:1036–1055
Gijbels I, Veraverbeke N (1988) Weak asymptotic representations for quantiles of the product limit estimator. J Statist Plan Inference 18:151–160
Gill RD (1992) Multivariate survival analysis. Theory Prob Appl 37:18–31
Gill RD (1994) Glivenko-Cantelli for Kaplan-Meier. Math Methods Statist 3:76–78
González-Manteiga W, Cadarso Suarez C (1994) Asymptotic properties of a generalized Kaplan-Meier estimator with some applications. J Nonparametric Stat 4:65–78
González-Manteiga W, Heuchenne C, Sanchez-Sellero C, Beretta A (2020) Goodness-of-fit tests for censored regression based on artificial data points. TEST 29:599–615
Gribkova S, Lopez O (2015) Nonparametric copula estimation under bivariate censoring. Scand J Stat 42:925–946
Hougaard P (2000) Analysis of Multivariate Survival Data. Springer-Verlag, New York
Johansen S (1978) The product limit estimator as maximum likelihood estimator. Scand J Stat 5:195–199
Kaplan EL, Meier P (1958) Nonparametric estimation from incomplete observations. J Am Stat Assoc 53:457–481
Langberg NA, Shaked M (1982) On the identifiability of multivariate life distributions. Ann Prob 10:773–779
Lin DY, Ying ZA (1993) A simple nonparametric estimator of the bivariate survival function under univariate censoring. Biometrika 80:573–581
Lo S-H, Singh K (1986) The product-limit estimator and the bootstrap: some asymptotic representations. Prob Theory Rel Fields 71:455–465
Lopez O (2012) A generalization of the Kaplan-Meier estimator for analyzing bivariate mortality under right-censoring with applications in model checking for survival copula models. Insur Math Econ 51:505–516
Lopéz-Cheda A, Cao R, Jácome MA (2017a) Nonparametric latency estimation for mixture cure models. TEST 26:353–376
Lopéz-Cheda A, Cao R, Jácome MA, Van Keilegom I (2017b) Nonparametric incidence estimation and bootstrap bandwidth selection in mixture cure models. Comput Stat Data Anal 105:144–165
Major P, Rejtő L (1988) Strong embedding of the estimator of the distribution function under random censorship. Ann Stat 16:1113–1132
Maller R, Zhou S (1996) Survival analysis with long term survivors. Wiley, London
Nelsen RB (2006) An introduction to copulas, 2nd edn. Springer, New York
Peláez-Suárez, R. (2022). Nonparametric estimation of the probability of default in credit risk. Doctoral Thesis, Universidade da Coruña, Spain
Prentice RL, Cai J (1992) Covariance and survivor function estimation using censored multivariate failure time data. Biometrika 79:495–512
Prentice RL, Zhao S (2018) Nonparametric estimation of the multivariate survivor function: the multivariate Kaplan-Meier estimator. Lifetime Data An 24:3–27
Pruitt R (1991) On negative mass assigned by the bivariate Kaplan-Meier estimator. Ann Stat 19:443–453
Pruitt R (1993) Identifiability of bivariate survival curves from censored data. J Am Stat Assoc 88:573–579
Rivest L, Wells MT (2001) A martingale approach to the copula-graphic estimator for the survival function under dependent censoring. J Multiv Anal 79:138–155
Robins JM, Rotnitzky A (1992) Recovery of information and adjustment for dependent censoring using surrogate markers. In: Jewell N, Dietz K Jr, Farewell VT (eds) AIDS epidemiology methodological issues. Birkhauser, Boston, pp 297–331
Satten GA, Datta S (2001) The Kaplan-Meier estimator as an inverse-probability-of-censoring weighted average. Am Stat 55:207–210
Serfling R (1980) Approximations theorems of mathematical statistics. Wiley, New York
Stute W (1993a) Consistent estimation under random censorship when covariables are present. J Multiv Anal 45:89–103
Stute W, Wang J-L (1993b) The strong law under random censorship. Ann Stat 21:1591–1607
Stute W (1995) The central limit theorem under random censorship. Ann Stat 23:422–439
Stute W (1996) Distributional convergence under random censorship when covariables are present. Scand J Stat 23:461–471
Tsiatis AA (1975) A nonidentifiability aspect of the problem of competing risks. Proc Nat Acad Sci United States of America 72:20–22
Van Keilegom I, Veraverbeke N (1997) Estimation and bootstrap with censored data in fixed design nonparamteric regression. Ann Inst Statist Math 49:467–491
Van Keilegom I, Veraverbeke N (1998) Bootstrapping quantiles in a fixed design regression model with censored data. J Stat Plan Inference 69:115–131
Van Keilegom I (2004) A note on the nonparametric estimation of the bivariate distribution under dependent censoring. J Nonparametric Stat 16:659–670
van der Laan MJ (1996) Efficient estimation in the bivariate censoring model and repairing NPMLE. Ann Stat 24:596–627
Wang W, Wells MT (1997) Nonparametric estimators of the bivariate survival function under simplified censoring conditions. Biometrika 84:863–880
Zheng M, Klein JP (1995) Estimates of the marginal survival for dependent competing risks based on an assumed copula. Biometrika 82:127–138
Acknowledgements
The authors thank the editor and three referees for their valuable comments that led to an improved version of the manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Janssen, P., Veraverbeke, N. Nonparametric estimation of univariate and bivariate survival functions under right censoring: a survey. Metrika 87, 211–245 (2024). https://doi.org/10.1007/s00184-023-00911-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-023-00911-7