Merging data curation and machine learning to improve nanomedicines

https://doi.org/10.1016/j.addr.2022.114172Get rights and content

Abstract

Nanomedicine design is often a trial-and-error process, and the optimization of formulations and in vivo properties requires tremendous benchwork. To expedite the nanomedicine research progress, data science is steadily gaining importance in the field of nanomedicine. Recently, efforts have explored the potential to predict nanomaterials synthesis and biological behaviors via advanced data analytics. Machine learning algorithms process large datasets to understand and predict various material properties in nanomedicine synthesis, pharmacologic parameters, and efficacy. “Big data” approaches may enable even larger advances, especially if researchers capitalize on data curation methods. However, the concomitant use of data curation processes needed to facilitate the acquisition and standardization of large, heterogeneous data sets, to support advanced data analytics methods such as machine learning has yet to be leveraged. Currently, data curation and data analytics areas of nanotechnology-focused data science, or 'nanoinformatics', have been proceeding largely independently. This review highlights the current efforts in both areas and the potential opportunities for coordination to advance the capabilities of data analytics in nanomedicine.

Introduction

The medical use of nanomaterials has shown promising advancement in the field of drug delivery. From the first FDA-approved nanodrug Doxil to the clinical use of lipid nanoparticles for mRNA vaccines, numerous nanoparticles can efficiently and safely deliver small molecules, proteins, nucleic acids, or other active pharmaceutical ingredients (APIs) [1], [2], [3]. Encapsulation in nanoparticles improves drug solubility, stability, and potentially facilitates the crossing of biological barriers and the selective targeting to disease sites, when these pharmacologic advantages have not or cannot be engineered into APIs [4]. However, designing nanocarriers and optimizing the delivery strategy can be time-intensive, and typically involves a substantial amount of trial and error. Clinical translation of nanomedicines often does not follow pre-clinical development, often due to the complexity and heterogeneity of nanoparticles[5] To address these issues, advanced data analytics methods promise to accelerate and improve the nanomedicine development processes.

While the prediction of nanomaterial behavior in vivo is challenging due to a lack of systematic studies across material types, artificial intelligence (AI) platforms promise to improve and streamline nanomedicine development. Since 2007, when the term ‘nanoinformatics’ was coined [6], [7], there has been an explosion of nanomaterial-related data science efforts, which has generated large datasets with nanomaterial characterizations [7]. However, there is a disconnect between the scientists curating the databases and those utilizing AI. As a result, some AI platforms are developed using small-scale, project-specific datasets that lack translational values. In addition, non-standardized reporting metrics make it difficult to compare different material entities, and non-centralized databases are inaccessible to research groups who are not equipped with data mining skills. Herein, the aims of this review are to (1) review the most recent AI studies that capture the epitome of nanomedicine platform development, (2) introduce public databases focused on nanomaterial characterizations, and, most importantly, (3) advocate for a collaboration between the research groups developing AI platforms and nano-informaticians in order to increase the clinical utility of nanomedicine.

Section snippets

Introduction to ‘Nanoinformatics’ and Machine learning

Many science and engineering fields today are becoming more data-driven (ie. data analysis is increasingly powering experimental decision-making). Data science is a multidisciplinary field that combines mathematics and statistics to extrapolate meaningful insights from data [8], [9]. As a result of the increasing interest in medical applications of nanotechnology, a new field of data science has emerged. ‘Nanoinformatics’ bridges computer science, information technology, nanotechnology, and

Recent advances in nanomedicine using machine learning

Over the past decade, machine learning applications in biomedical science and engineering has gained popularity and changed the landscape of nanomedicine research. According to Web of Science, the number of publications with the keywords “nanomedicine” and “machine learning” or “AI” has increased 10 times within the past 10 years. Nanomedicine research has benefited from various machine learning studies, as many traditional methodologies are unable to delineate the complexity and heterogeneity of

Data curation for nanomedicine

Recognizing the need for a centralized nanomedicine database to strengthen analyses, nano-informaticians create platforms that accommodate the storage and sharing of heterogeneous nanomedicine data [6], [7], [10]. This process falls within a major area of data science - data curation. Historically originated in library science, data curation consolidates data from various sources into one database [96]. In addition to categorizing data into a useful presentation, data curation also involves

Conclusions and outlook

In the past decades, the field of nanomedicine has rapidly expanded preclinically and clinically. However, many practical obstacles have delayed the translation of most pre-clinical research into the clinic, including formulation difficulties/scale-up, unsatisfactory PK/PD, and costs [132]. We believe these issues can be ameliorated if the field can learn from the massive amounts of data already collected by the field. Recent advances in “big data” analytic approaches, including machine

Declaration of Competing Interest

D.A.H. is an inventor of the patent “Fucoidan nanogels and methods of their use and manufacture” US patent #9,737,614 issued July 7, 2016, a cofounder and officer with equity interest of Lime Therapeutics Inc., cofounder with equity interest of Goldilocks Therapeutics Inc. and Resident Diagnostics, Inc., and a member of the scientific advisory board of Concarlo Holdings LLC Nanorobotics Inc., and Mediphage Bioceuticals Inc.

Acknowledgements

This work was supported in part by the NCI (R01-CA215719, P30-CA008748), NINDS (R01-NS116353), the American Cancer Society Research Scholar Grant (GC230452), the Expect Miracles Foundation - Financial Services Against Cancer, Emerson Collective, the Louis and Rachel Rudin Foundation, the Alan and Sandra Gerry Metastasis Research Initiative, the Center for Metastasis Research Scholars Fellowship Program, Mr. William H. Goodwin and Mrs. Alice Goodwin and the Commonwealth Foundation for Cancer

References (133)

  • X. Hou et al.

    Lipid nanoparticles for mRNA delivery

    Nat. Rev. Mater.

    (2021)
  • A.C. Anselmo et al.

    Nanoparticles in the clinic: An update post COVID-19 vaccines

    Bioeng. Transl. Med.

    (2021)
  • M.T. Manzari et al.

    Targeted drug delivery strategies for precision medicines

    Nat. Rev. Mater.

    (2021)
  • D. Sun et al.

    What Went Wrong with Anticancer Nanomedicine Design and How to Make It Right

    ACS Nano

    (2020)
  • V. Maojo et al.

    Nanoinformatics and DNA-Based Computing: Catalyzing Nanomedicine

    Pediatr. Res.

    (2010)
  • Maojo, V.; Fritts; De La Iglesia, D.; Cachau; Garcia-Remesal; Mitchell; Kulikowski, Nanoinformatics: a new area of...
  • R.A. Irizarry

    The Role of Academia in Data Science Education.

    (2020)
  • F. Provost et al.

    Data Science and its Relationship to Big Data and Data-Driven Decision Making

    Big Data

    (2013)
  • S. Panneerselvam et al.

    Nanoinformatics: Emerging Databases and Available Tools

    Int. J. Mol. Sci.

    (2014)
  • M. Lijowski

    caNanoLab – A Tool To Benefit Biomedical Nanomaterials Research

    Nature Precedings

    (2010)
  • T. Sterling et al.

    ZINC 15 – Ligand Discovery for Everyone

    J. Chem. Inf. Model.

    (2015)
  • N. Jeliazkova et al.

    The eNanoMapper database for nanomaterial safety information

    Beilstein J. Nanotechnol.

    (2015)
  • Z. Ban et al.

    Machine learning predicts the functional composition of the protein corona and the cellular recognition of nanoparticles

    Proc. Natl. Acad. Sci.

    (2020)
  • D.P. Russo et al.

    Virtual Molecular Projections and Convolutional Neural Networks for the End-to-End Modeling of Nanoparticle Activities and Properties

    Anal. Chem.

    (2020)
  • F. Pellegrino et al.

    Machine learning approach for elucidating and predicting the role of synthesis parameters on the shape and size of TiO2 nanoparticles

    Sci. Rep.

    (2020)
  • T. Panch et al.

    Artificial intelligence, machine learning and health systems. Journal of

    Global Health

    (2018)
  • K.T. Butler et al.

    Machine learning for molecular and materials science

    Nature

    (2018)
  • M.I. Jordan et al.

    Machine learning: Trends, perspectives, and prospects

    Science

    (2015)
  • Z. Ghahramani

    Probabilistic machine learning and artificial intelligence

    Nature

    (2015)
  • E.J. Topol

    High-performance medicine: the convergence of human and artificial intelligence

    Nat. Med.

    (2019)
  • H. Zhu

    Big Data and Artificial Intelligence Modeling for Drug Discovery

    Annu. Rev. Pharmacol. Toxicol.

    (2020)
  • P. Schneider et al.

    Rethinking drug design in the artificial intelligence era

    Nat. Rev. Drug Discovery

    (2020)
  • C. Curtis et al.

    Predicting in situ nanoparticle behavior using multiple particle tracking and artificial neural networks

    Nanoscale

    (2019)
  • A. Auer et al.

    nanoTRON: a Picasso module for MLP-based classification of super-resolution data

    Bioinformatics

    (2020)
  • J. Pottel et al.

    The activities of drug inactive ingredients on biological targets

    Science

    (2020)
  • D. Reker et al.

    Computationally guided high-throughput design of self-assembling drug nanoparticles

    Nat. Nanotechnol.

    (2021)
  • R. Kumar et al.

    Efficient Polymer-Mediated Delivery of Gene-Editing Ribonucleoprotein Payloads through Combinatorial Design, Parallelized Experimentation, and Machine Learning

    ACS Nano

    (2020)
  • G. Yamankurt et al.

    Exploration of the nanomedicine-design space with high-throughput screening and machine learning

    Nat. Biomed. Eng.

    (2019)
  • X. Yan et al.

    Prediction of Nano-Bio Interactions through Convolutional Neural Network Analysis of Nanostructure Images

    ACS Sustainable Chem. Eng.

    (2020)
  • A.V. Singh et al.

    Machine-Learning-Based Approach to Decode the Influence of Nanomaterial Properties on Their Interaction with Cells

    Acs Appl Mater Inter

    (2021)
  • N.R. Stillman et al.

    In silico modelling of cancer nanomedicine, across scales and transport barriers

    npj Comput. Mater.

    (2020)
  • V. Francia et al.

    Interactions at the cell membrane and pathways of internalization of nano-sized materials for nanomedicine

    Beilstein J. Nanotechnol.

    (2020)
  • M.I. Kotzabasaki et al.

    QSAR modeling of the toxicity classification of superparamagnetic iron oxide nanoparticles (SPIONs) in stem-cell monitoring applications: an integrated study from data curation to model development

    RSC Adv.

    (2020)
  • A. Rybińska-Fryca et al.

    Structure–activity prediction networks (SAPNets): a step beyond Nano-QSAR for effective implementation of the safe-by-design concept

    Nanoscale

    (2020)
  • A.V. Singh et al.

    Artificial Intelligence and Machine Learning Empower Advanced Biomedical Material Design to Toxicity Prediction

    Advanced Intelligent Systems

    (2020)
  • W. Poon et al.

    A framework for designing delivery systems

    Nat. Nanotechnol.

    (2020)
  • R. Pollice et al.

    Data-Driven Strategies for Accelerated Materials Design

    Acc. Chem. Res.

    (2021)
  • O. Adir et al.

    Integrating Artificial Intelligence and Nanotechnology for Precision Cancer Medicine

    Adv. Mater.

    (2020)
  • A.V. Singh et al.

    Artificial Intelligence and Machine Learning in Computational Nanotoxicology: Unlocking and Empowering Nanomedicine

    Adv Healthc Mater

    (2020)
  • H. Tao et al.

    Nanoparticle synthesis assisted by machine learning

    Nat. Rev. Mater.

    (2021)
  • Cited by (34)

    View all citing articles on Scopus
    1

    These authors contributed equally.

    View full text