Abstract
Using the so-called martingale difference correlation (MDC), we propose a novel censored-conditional-quantile screening approach for ultrahigh-dimensional survival data with heterogeneity (which is often present in such data). By incorporating a weighting scheme, this method is a natural extension of MDC-based conditional quantile screening, as considered in Shao and Zhang (2014), to handle ultrahigh-dimensional survival data. The proposed screening procedure has a sure-screening property under certain technical conditions and an excellent capability of detecting the nonlinear relationship between independent and censored dependent variables. Both simulation results and an analysis of real data demonstrate the effectiveness of the new censored conditional quantile-screening procedure.
Similar content being viewed by others
References
Bair E, Tibshirani R. Semi-supervised methods to predict patient survival from gene expression data. PLoS Biol, 2004, 2: 511–522
Chu W, Li R, Reimherr M. Feature screening for time-varying coEfficient models with ultrahigh dimensional longitudinal data. Ann Appl Stat, 2016, 10: 596–617
Cui H, Li R, Zhong W. Model free feature screening for ultrahigh dimensional discriminant analysis. J Amer Statist Assoc, 2015, 110: 630–641
Fan J, Feng Y, Wu Y. Ultrahigh dimensional variable selection for Cox’s proportional hazards model. Inst Math Stat Collect, 2010, 6: 70–86
Fan J, Lv J. Sure independence screening for ultrahigh dimensional feature space (with discussion). J R Stat Soc Ser B Stat Methodol, 2008, 70: 849–911
He X, Wang L, Hong H. Quantile-adaptive model-free variable screening for high-dimensional heterogeneous data. Ann Statist, 2013, 41: 342–369
Kong Y, Li D, Fan Y, et al. Interaction pursuit in high-dimensional multi-response regression via distance correlation. Ann Statist, 2016, 45: 897–922
Li G, Peng H, Zhang J, et al. Robust rank correlation based screening. Ann Statist, 2012, 40: 1846–1877
Li R, Zhong W, Zhu L. Feature screening via distance correlation learning. J Amer Statist Assoc, 2012, 107: 1129–1139
Ma S, Li R, Tsai C-L. Variable screening via partial quantile correlation. J Amer Statist Assoc, 2017, 112: 650–663
Pan R, Wang H, Li R. Ultrahigh dimensional multi-class linear discriminant analysis by pairwise sure independence screening. J Amer Statist Assoc, 2016, 111: 169–179
Rosenwald A, Wright G, Chan W C, et al. The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. New Engl J Med, 2002, 346: 1937–1947
Serfling R L. Approximation Theorems in Mathematical Statistics. New York: Wiley, 1980
Shao X, Zhang J. Martingale difference correlation and its use in high dimensional variable screening. J Amer Statist Assoc, 2014, 109: 1302–1318
Song R, Lu W, Ma S, et al. Censored rank independence screening for high-dimensional survival data. Biometrika, 2014, 101: 799–814
Szekely G J, Rizzo M L, Bakirov N K. Measuring and testing dependence by correlation of distances. Ann Statist, 2007, 35: 2769–2794
Wang L, Liu J, Li Y, et al. Model-free conditional independence feature screening for ultrahigh dimensional data. Sci China Math, 2017, 60: 551–568
Wang J, Wang L. Locally weighted censored quantile regression. J Amer Statist Assoc, 2009, 104: 1117–1128
Wu Y, Yin G. Conditional quantile screening in ultrahigh-dimensional heterogeneous data. Biometrika, 2015, 102: 65–76
Yang G R, Yu Y, Li R Z, et al. Feature screening in ultrahigh dimensional Cox’s model. Statist Sinica, 2016, 26: 881–901
Zhao S, Li Y. Principled sure independence screening for Cox models with ultrahigh-dimensional covariates. J Multi-variate Anal, 2012, 105: 397–411
Zhou T, Zhu L. Model-free feature screening for ultrahigh dimensional censored regression. Stat Comput, 2017, 27: 947–961
Zhu L, Li L, Li R, et al. Model-free feature screening for ultrahigh dimensional data. J Amer Statist Assoc, 2011, 106: 1464–1475
Acknowledgements
This work was supported by the National Statistical Scientific Research Projects (Grant No. 2015LZ54). The authors thank the two anonymous reviewers for their constructive comments, which have led to a dramatic improvement of the earlier version of this article.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Xu, K., Huang, X. Conditional-quantile screening for ultrahigh-dimensional survival data via martingale difference correlation. Sci. China Math. 61, 1907–1922 (2018). https://doi.org/10.1007/s11425-016-9208-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11425-016-9208-6
Keywords
- ultrahigh-dimensional survival data
- martingale difference correlation
- censored-conditional-quantile screening
- sure-screening property