Conditional-quantile screening for ultrahigh-dimensional survival data via martingale difference correlation

Science China Mathematics

Abstract

Using the martingale difference correlation (MDC), we propose a novel censored conditional quantile screening approach for ultrahigh-dimensional survival data, in which heterogeneity is often present. By incorporating a weighting scheme, the method naturally extends the MDC-based conditional quantile screening of Shao and Zhang (2014) to ultrahigh-dimensional survival data. The proposed screening procedure has the sure-screening property under certain technical conditions and is well suited to detecting nonlinear relationships between the covariates and a censored response. Both simulation results and an analysis of real data demonstrate the effectiveness of the new censored conditional quantile screening procedure.
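
To make the screening idea concrete, the following is a minimal Python sketch of MDC-based conditional quantile screening in the uncensored setting of Shao and Zhang (2014). It is an illustration under stated assumptions, not the procedure proposed in this article: the censoring weights are omitted because the abstract does not specify them, the estimators are simple plug-in versions of MDD(V|U)^2 = -E[(V - EV)(V' - EV')|U - U'|], and the normalisation of the squared MDC is assumed to be var(V) times the square root of the squared sample distance variance of U. The function names `mdc_squared` and `quantile_screen` and the parameters `tau` and `keep` are hypothetical.

```python
import numpy as np


def _double_center(d):
    """Double-center a square distance matrix (rows, columns, grand mean)."""
    return d - d.mean(axis=0, keepdims=True) - d.mean(axis=1, keepdims=True) + d.mean()


def mdc_squared(v, u):
    """Plug-in estimate of the squared MDC of v given a scalar covariate u.

    Assumption: normalised by var(v) * sqrt(squared distance variance of u).
    """
    v = np.asarray(v, dtype=float)
    u = np.asarray(u, dtype=float)
    dist = np.abs(u[:, None] - u[None, :])      # pairwise |u_k - u_l|
    vc = v - v.mean()                           # centred response
    mdd2 = -np.mean(np.outer(vc, vc) * dist)    # squared martingale difference divergence
    dvar2 = np.mean(_double_center(dist) ** 2)  # squared sample distance variance of u
    denom = vc.var() * np.sqrt(dvar2)
    return mdd2 / denom if denom > 0 else 0.0


def quantile_screen(x, y, tau=0.5, keep=10):
    """Rank covariates by the squared MDC of the quantile indicator given each column.

    Uncensored sketch only: y is treated as fully observed.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    q = np.quantile(y, tau)                     # marginal tau-th sample quantile
    w = tau - (y <= q).astype(float)            # quantile indicator tau - 1(y <= q)
    scores = np.array([mdc_squared(w, x[:, j]) for j in range(x.shape[1])])
    order = np.argsort(-scores)                 # largest score first
    return order[:keep], scores
```

For example, with a covariate matrix `x` of shape (n, p) and a fully observed response `y`, `quantile_screen(x, y, tau=0.5, keep=10)` returns the indices of the ten highest-scoring covariates together with all p scores. Handling censored responses would require the article's weighting scheme, which this sketch does not attempt to reproduce.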

References

  1. Bair E, Tibshirani R. Semi-supervised methods to predict patient survival from gene expression data. PLoS Biol, 2004, 2: 511–522

  2. Chu W, Li R, Reimherr M. Feature screening for time-varying coefficient models with ultrahigh dimensional longitudinal data. Ann Appl Stat, 2016, 10: 596–617

  3. Cui H, Li R, Zhong W. Model free feature screening for ultrahigh dimensional discriminant analysis. J Amer Statist Assoc, 2015, 110: 630–641

  4. Fan J, Feng Y, Wu Y. Ultrahigh dimensional variable selection for Cox’s proportional hazards model. Inst Math Stat Collect, 2010, 6: 70–86

  5. Fan J, Lv J. Sure independence screening for ultrahigh dimensional feature space (with discussion). J R Stat Soc Ser B Stat Methodol, 2008, 70: 849–911

  6. He X, Wang L, Hong H. Quantile-adaptive model-free variable screening for high-dimensional heterogeneous data. Ann Statist, 2013, 41: 342–369

  7. Kong Y, Li D, Fan Y, et al. Interaction pursuit in high-dimensional multi-response regression via distance correlation. Ann Statist, 2017, 45: 897–922

  8. Li G, Peng H, Zhang J, et al. Robust rank correlation based screening. Ann Statist, 2012, 40: 1846–1877

  9. Li R, Zhong W, Zhu L. Feature screening via distance correlation learning. J Amer Statist Assoc, 2012, 107: 1129–1139

  10. Ma S, Li R, Tsai C-L. Variable screening via quantile partial correlation. J Amer Statist Assoc, 2017, 112: 650–663

  11. Pan R, Wang H, Li R. Ultrahigh dimensional multi-class linear discriminant analysis by pairwise sure independence screening. J Amer Statist Assoc, 2016, 111: 169–179

  12. Rosenwald A, Wright G, Chan W C, et al. The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. New Engl J Med, 2002, 346: 1937–1947

  13. Serfling R J. Approximation Theorems of Mathematical Statistics. New York: Wiley, 1980

  14. Shao X, Zhang J. Martingale difference correlation and its use in high dimensional variable screening. J Amer Statist Assoc, 2014, 109: 1302–1318

  15. Song R, Lu W, Ma S, et al. Censored rank independence screening for high-dimensional survival data. Biometrika, 2014, 101: 799–814

  16. Szekely G J, Rizzo M L, Bakirov N K. Measuring and testing dependence by correlation of distances. Ann Statist, 2007, 35: 2769–2794

  17. Wang L, Liu J, Li Y, et al. Model-free conditional independence feature screening for ultrahigh dimensional data. Sci China Math, 2017, 60: 551–568

  18. Wang J, Wang L. Locally weighted censored quantile regression. J Amer Statist Assoc, 2009, 104: 1117–1128

  19. Wu Y, Yin G. Conditional quantile screening in ultrahigh-dimensional heterogeneous data. Biometrika, 2015, 102: 65–76

  20. Yang G R, Yu Y, Li R Z, et al. Feature screening in ultrahigh dimensional Cox’s model. Statist Sinica, 2016, 26: 881–901

  21. Zhao S, Li Y. Principled sure independence screening for Cox models with ultrahigh-dimensional covariates. J Multivariate Anal, 2012, 105: 397–411

  22. Zhou T, Zhu L. Model-free feature screening for ultrahigh dimensional censored regression. Stat Comput, 2017, 27: 947–961

  23. Zhu L, Li L, Li R, et al. Model-free feature screening for ultrahigh dimensional data. J Amer Statist Assoc, 2011, 106: 1464–1475

Acknowledgements

This work was supported by the National Statistical Scientific Research Projects (Grant No. 2015LZ54). The authors thank the two anonymous reviewers for their constructive comments, which led to a substantial improvement over the earlier version of this article.

Author information

Corresponding author

Correspondence to Kai Xu.

About this article

Cite this article

Xu, K., Huang, X. Conditional-quantile screening for ultrahigh-dimensional survival data via martingale difference correlation. Sci. China Math. 61, 1907–1922 (2018). https://doi.org/10.1007/s11425-016-9208-6

  • DOI: https://doi.org/10.1007/s11425-016-9208-6
