Abstract
Topological Data Analysis (TDA) aims to extract relevant information from the underlying topology of data projections. In the healthcare domain, TDA has been successfully used to infer structural phenotypes from complex data by linking patients who display demographic, clinical, and biomarker similarities.
In this paper we propose pheTDA, a TDA-based framework to assist the computational definition of novel phenotypes. More in details, the pheTDA (i) guides the application of the Topological Mapper algorithm to derive a robust data representation as a topological graph; (ii) identifies relevant subgroups of patients from the topology; (iii) assess discriminative features for each subgroup of patients via predictive models.
We applied the proposed tool on a population of 725 patients with suspected coronary artery disease (CAD). pheTDA identified five novel subgroups, one of which is characterized by the presence of diabetic patients showing high cardiovascular risk score. In addition, we compare the results obtained with existing clustering algorithms, showing that pheTDA obtains better performance when compared to spectral decomposition followed by k-means.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Lum, P., et al.: Extracting insights from the shape of complex data using topology. Sci. Rep. 3, 1236 (2013)
Singh, G., et al.: Topological methods for the analysis of high dimensional data sets and 3D object recognition. EPBG@Eurographics (2007)
Nicolau, M., et al.: Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. PNAS, USA. 108, 7265–7270 (2011)
Ge, Y., Wang, T.: Circulating, imaging, and genetic biomarkers in cardiovascular risk prediction. Trends Cardiovasc. Med. 21, 105–112 (2011)
Veen, H., et al.: Kepler Mapper: a flexible Python implementation of the Mapper algorithm. JOSS 4, 1315 (2019)
Gower, J.: A general coefficient of similarity and some of its properties. Biometrics 27, 857–871 (1971)
Stekhoven, D., Bühlmann, P.: MissForest-non-parametric missing value imputation for mixed-type data. Bioinformatics 28, 112–118 (2011)
Palmieri, L., et al.: CUORE project: implementation of the 10-year risk score. Euro. J. Cardiovasc. Prev. Rehabil. 18, 642–649 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Albi, G., Gerbasi, A., Chiesa, M., Colombo, G.I., Bellazzi, R., Dagliati, A. (2023). A Topological Data Analysis Framework for Computational Phenotyping. In: Juarez, J.M., Marcos, M., Stiglic, G., Tucker, A. (eds) Artificial Intelligence in Medicine. AIME 2023. Lecture Notes in Computer Science(), vol 13897. Springer, Cham. https://doi.org/10.1007/978-3-031-34344-5_38
Download citation
DOI: https://doi.org/10.1007/978-3-031-34344-5_38
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34343-8
Online ISBN: 978-3-031-34344-5
eBook Packages: Computer ScienceComputer Science (R0)