Abstract
With the widespread application of high-throughput technologies, multi-omics data are playing an increasingly important role in cancer subtyping. However, the heterogeneity and high-dimensionality of different omics data make it a challenging task to integrate them into a consistent model. In this paper, we propose a novel multi-omics cancer subtyping method based on Multi-Kernel Partition Alignment Subspace clustering (MKPAS). Given multiple omics datasets, MKPAS first uses multiple kernel functions to generate kernel matrices as the input of multi-view subspace learning model. Second, it uses subspace learning and rank constraint for each omics data learning an ideal graph structure. Third, to make the clustering results consistent from all omics datasets, a common clustering indication matrix is learned by using the multi-view partition alignment method. Finally, MKPAS integrates multi-kernel learning, subspace learning, graph learning, weights correction and partition alignment into a unified framework, making the process of multi-omics data clustering more intuitive and the clustering results more reliable. To validate the effectiveness of MKPAS, we conduct cancer subtyping experiments on multiple TCGA datasets. The experimental results show that MKPAS is effective for cancer subtype prediction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Burrell, R.A., McGranahan, N., Bartek, J., et al.: The causes and consequences of genetic heterogeneity in cancer evolution. Nature 501(7467), 338–345 (2013)
Bedard, P.L., Hansen, A.R., Ratain, M.J., et al.: Tumour heterogeneity in the clinic. Nature 501(7467), 355–364 (2013)
Schuster, S.C.: Next-generation sequencing transforms today’s biology. Nat. Methods 5(1), 16–18 (2008)
Shen, R., Olshen, A.B., Ladanyi, M.J.B.: Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis. Bioinformatics 25(22), 2906–2912 (2009)
Mo, Q., Wang, S., Seshan, V.E., et al.: Pattern discovery and cancer gene recognition in integrated cancer genomic data. Proc. Natl. Acad. Sci. 110, 4245–4250 (2013)
Lee, D., Sebastian, H.S.: Algorithms for non-negative matrix factorization. Adv. Neural Inf. Process. Syst. 13 (2000)
Shi, Q., Zhang, C., Peng, M., et al.: Pattern fusion analysis by adaptive alignment of multiple heterogeneous omics data. Bioinformatics 33(17), 2706–2714 (2017)
Wang, B., Mezlini, A.M., Demir, F., et al.: Similarity network fusion for aggregating data types on a genomic scale. Nat. Methods 11(3), 333–337 (2014)
Ma, T., Zhang, A.: Integrate multi-omic data using affinity network fusion (ANF) for cancer patient clustering. In: Proceedings of 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 398–403 (2017)
Guo, Y., Li, H., Cai, M., et al.: Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification. BMC Med. Genomics 12(9), 191 (2019)
Lin, Y.Y., Liu, T.L., Fuh, C.S.: Multiple kernel learning for dimensionality reduction. IEEE Trans. Pattern Anal. Mach. Intell. 33(6), 1147–1160 (2011)
Kang, Z., Peng, C., Cheng, Q., et al.: Unified spectral clustering with optimal graph. Proc. AAAI Conf. Artif. Intell. 32(1), 1–9 (2018)
Kang, Z., Guo, Z., Huang, S., et al.: Multiple partitions aligned clustering. arXiv preprint arXiv:1909.06008 (2019)
Gao, H., Nie, F., Li, X., et al.: Multi-view subspace clustering. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4238–4246 (2015)
Tian, G.: Generalizations of the FKKM theorem and the Ky Fan minimax inequality, with applications to maximal elements, price equilibrium, and complementarity. J. Math. Anal. Appl. 170(2), 457–471 (1992)
Tzortzis, G., Likas, A.: Kernel-based weighted multi-view clustering. In: 2012 IEEE 12th International Conference on Data Mining, pp. 675–684 (2012)
Wen, Z., Yin, W.: A feasible method for optimization with orthogonality constraints. Math. Program. 142(1–2), 397–434 (2012). https://doi.org/10.1007/s10107-012-0584-1
Schönemann, P.H.: A generalized solution of the orthogonal procrustes problem. Psychometrika 31(1), 1–10 (1966)
Rousseeuw, P.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20(1), 53–65 (1987)
Mantel, N.: Evaluation of survival data and two new rank order statistics arising in its consideration. Cancer Chemother. Rep. 50(3), 163–170 (1966)
Goel, M.K., Khanna, P., Kishore, J.: Understanding survival analysis: Kaplan-Meier estimate. Int. J. Ayurveda Res. 1(4), 274–278 (2010)
Chaudhuri, K., Kakade, S.M., Livescu, K., et al.: Multi-view clustering via canonical correlation analysis. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 129–136 (2009)
Rappoport, N., Shamir, R.: NEMO: cancer subtyping by integration of partial multi-omic data. Bioinformatics 35(18), 3348–3356 (2019)
Nguyen, H., Shrestha, S., Draghici, S., et al.: PINSPlus: a tool for tumor subtype discovery in integrated genomic data. Bioinformatics 35(16), 2843–2846 (2019)
Acknowledgements
This research was supported by the National Natural Science Foundation of China (Grant No. 61906198) and the Natural Science Foundation of Jiangsu Province (Grant No. BK20190622).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Liu, J., Hou, L., Ge, S. (2023). Multi-omics Cancer Subtype Recognition Based on Multi-kernel Partition Aligned Subspace Clustering. In: Huang, DS., Premaratne, P., Jin, B., Qu, B., Jo, KH., Hussain, A. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2023. Lecture Notes in Computer Science, vol 14088. Springer, Singapore. https://doi.org/10.1007/978-981-99-4749-2_34
Download citation
DOI: https://doi.org/10.1007/978-981-99-4749-2_34
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-4748-5
Online ISBN: 978-981-99-4749-2
eBook Packages: Computer ScienceComputer Science (R0)