Abstract
A rich biomedical knowledge graph can support the multi-domain data integration necessary for the application of Artificial Intelligence models in personalised medicine. Constructing such a knowledge graph from already available biomedical ontologies relies on ontology matching, however, current ontology matching systems are geared towards the alignment of pairs of ontologies of the same domain one at a time. This approach, when applied to a multi-domain problem such as personalised medicine in an all vs. all fashion, poses scalability issues while also ignoring the particularities of the multi-domain aspect.
In this work we evaluate a state-of-the-art ontology matching system, AgreementMakerLight, in the task of building a network of 28 integrated ontologies to construct a knowledge graph for Explainable AI in personalised oncology, highlighting its shortcomings. To address them, we have developed a novel holistic ontology alignment strategy building on AgreementMakerLight that clusters ontologies based on their semantic overlap measured by fast matching techniques with a high degree of confidence, and then applies more sophisticated matching techniques within each cluster. We implemented two within cluster alignment strategies, one based on pairwise alignment and another on incremental alignment.
The within-cluster incremental alignment reduced alignment time by 80% when compared with within-cluster pairwise alignment, achieving 88% coverage of its mappings. Compared to an all vs. all pairwise approach, holistic approaches reduce total running time by up to 60%.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
We use KG to denote the integrated network of ontologies which constitute the semantic backbone of the full fledged KG.
- 4.
Although the UMLS provides mappings between some of our ontologies, its usage license does not allow public reuse.
- 5.
Experiments were run in a machine with 100Gb of available RAM.
- 6.
- 7.
Individual statistics available in the supplementary materials.
References
Babalou, S., Grygorova, E., König-Ries, B.: \(\cal{C}\)o\(\cal{M}\)erger: a customizable online tool for building a consistent quality-assured merged ontology. In: Harth, A., et al. (eds.) ESWC 2020. LNCS, vol. 12124, pp. 19–24. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-62327-2_4
Caldarola, E.G., Rinaldi, A.M.: An approach to ontology integration for ontology reuse. In: 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI), pp. 384–393. IEEE (2016)
Chari, S., Gruen, D.M., Seneviratne, O., McGuinness, D.L.: Directions for explainable knowledge-enabled systems. arXiv preprint arXiv:2003.07523 (2020)
Chari, S., Gruen, D.M., Seneviratne, O., McGuinness, D.L.: Foundations of explainable knowledge-enabled systems. arXiv preprint arXiv:2003.07520 (2020)
Chatterjee, N., Kaushik, N., Gupta, D., Bhatia, R.: Ontology merging: a practical perspective. In: Satapathy, S.C., Joshi, A. (eds.) ICTIS 2017. SIST, vol. 84, pp. 136–145. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-63645-0_15
Cruz, I.F., Stroe, C., Palmonari, M.: Interactive user feedback in ontology matching using signature vectors. In: ICDE 2012, pp. 1321–1324. IEEE (2012)
Euzenat, J., Shvaiko, P.: Ontology Matching, 2nd edn. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38721-0
Faria, D., Pesquita, C., Mott, I., Martins, C., Couto, F.M., Cruz, I.F.: Tackling the challenges of matching biomedical ontologies. J. Biomed. Semant. 9(1), 1–19 (2018)
Faria, D., Pesquita, C., Santos, E., Palmonari, M., Cruz, I.F., Couto, F.M.: The AgreementMakerLight ontology matching system. In: Meersman, R., et al. (eds.) OTM 2013. LNCS, vol. 8185, pp. 527–541. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41030-7_38
Ferreira, J.D., Teixeira, D.C., Pesquita, C.: Biomedical ontologies: coverage, access and use. In: Wolkenhauer, O. (ed.) Systems Medicine Integrative, Qualitative and Computational Approaches, pp. 382–395. Academic Press, Elsevier (2020). https://doi.org/10.1016/B978-0-12-801238-3.11664-2, http://www.sciencedirect.com/science/article/pii/B9780128012383116642
Gruetze, T., Böhm, C., Naumann, F.: Holistic and scalable ontology alignment for linked open data. LDOW 937, 1–10 (2012)
Harrow, I., et al.: Matching disease and phenotype ontologies in the ontology alignment evaluation initiative. J. Biomed. Semant. 8(1), 1–13 (2017)
Hertling, S., Paulheim, H.: Order matters: matching multiple knowledge graphs. arXiv preprint arXiv:2111.02239 (2021)
Jiménez-Ruiz, E.: Logmap family participation in the OAEI 2020. In: Proceedings of the 15th International Workshop on Ontology Matching (OM 2020), vol. 2788, pp. 201–203. CEUR-WS (2020)
Köhler, S.: Improving ontologies by automatic reasoning and evaluation of logical definitions. BMC Bioinf. 12, 418 (2011)
Lecue, F.: On the role of knowledge graphs in explainable AI. Semantic Web 11(1), 41–51 (2020)
Lima, B., Faria, D., Couto, F.M., Cruz, I.F., Pesquita, C.: Oaei 2020 results for aml and amlc. (2020)
Megdiche, I., Teste, O., Trojahn, C.: An extensible linear approach for holistic ontology matching. In: Groth, P., Simperl, E., Gray, A., Sabou, M., Krötzsch, M., Lecue, F., Flöck, F., Gil, Y. (eds.) ISWC 2016. LNCS, vol. 9981, pp. 393–410. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46523-4_24
Noy, N.F., Shah, N.H., Whetzel, P.L., et al.: Bioportal: ontologies and integrated data resources at the click of a mouse. Nucleic Acids Res. 37(2), W170–W173 (2009)
Oliveira, D., Pesquita, C.: Improving the interoperability of biomedical ontologies with compound alignments. J. Biomed. Semant. 9(1), 1–13 (2018)
Osman, I., Ben Yahia, S., Diallo, G.: Ontology integration: approaches and challenging issues. Inf. Fusion 71, 38–63 (2021)
Otero-Cerdeira, L., Rodríguez-Martínez, F.J., Gómez-Rodríguez, A.: Ontology matching: a literature review. Expert Syst. Appl. 42(2), 949–971 (2015)
Pesquita, C.: Towards semantic integration for explainable artificial intelligence in the biomedical domain. In: BIOSTEC 2021, vol. 5, pp. 747–753 (2020)
Pesquita, C., Faria, D., Santos, E., Couto, F.M.: To repair or not to repair: reconciling correctness and coherence in ontology reference alignments. In: Ontology Matching (2013)
Pesquita, C., Faria, D., Stroe, C., Santos, E., Cruz, I.F., Couto, F.M.: What’s in a ‘nym’? synonyms in biomedical ontology matching. In: Alani, H., et al. (eds.) ISWC 2013. LNCS, vol. 8218, pp. 526–541. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41335-3_33
Pour, N., Algergawy, A., Amini, R., Faria, D., et al.: Results of the ontology alignment evaluation initiative 2020. In: OM 2020, vol. 2788, pp. 92–138. CEUR-WS (2020)
Rahm, E.: Towards large-scale schema and ontology matching. In: Schema Matching and Mapping, pp. 3–27. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-16518-4_1
Rahm, E.: The case for holistic data integration. In: Pokorný, J., Ivanović, M., Thalheim, B., Šaloun, P. (eds.) ADBIS 2016. LNCS, vol. 9809, pp. 11–27. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-44039-2_2
Roussille, P., Megdiche, I., Teste, O., Trojahn, C.: Boosting holistic ontology matching: generating graph clique-based relaxed reference alignments for holistic evaluation. In: Faron Zucker, C., Ghidini, C., Napoli, A., Toussaint, Y. (eds.) EKAW 2018. LNCS (LNAI), vol. 11313, pp. 355–369. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-03667-6_23
Saleem, K., Bellahsene, Z., Hunt, E.: Porsche: performance oriented schema mediation. Inf. Syst. 33(7–8), 637–657 (2008)
Silva, M.C., Faria, D., Pesquita, C.: Integrating knowledge graphs for explainable artificial intelligence in biomedicine? In: Ontology Matching Workshop at the International Semantic Web Conference (2021)
Stoilos, G., Geleta, D., Shamdasani, J., Khodadadi, M.: A novel approach and practical algorithms for ontology integration. In: Vrandečić, D., et al. (eds.) ISWC 2018. LNCS, vol. 11136, pp. 458–476. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_27
Acknowledgments
This work was supported by FCT through the LASIGE Research Unit (UIDB/00408/2020 and UIDP/00408/2020). It was also partially supported by the KATY project which has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No. 101017453.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Silva, M.C., Faria, D., Pesquita, C. (2022). Matching Multiple Ontologies to Build a Knowledge Graph for Personalized Medicine. In: Groth, P., et al. The Semantic Web. ESWC 2022. Lecture Notes in Computer Science, vol 13261. Springer, Cham. https://doi.org/10.1007/978-3-031-06981-9_27
Download citation
DOI: https://doi.org/10.1007/978-3-031-06981-9_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06980-2
Online ISBN: 978-3-031-06981-9
eBook Packages: Computer ScienceComputer Science (R0)