Abstract
The emergence of SARS-CoV-2 variants diminished the efficacy of current antiviral drugs and vaccines. Hence, identifying highly conserved sequences and potentially druggable pockets for drug development was a promising strategy against SARS-CoV-2 variants. In viral infection, the receptor-binding domain (RBD) proteins are essential in binding to the host receptor. Others, Heparan sulfate (HS), widely distributed on the surface of host cells, is thought to play a central role in the viral infection cycle of SARS-CoV-2. Therefore, it might be a reasonable strategy for antiviral drug design to interfere with the RBD in the HS binding site. In this study, we used computational approaches to analyze multiple sequences of coronaviruses and reveal important information about the binding of HS to RBD in the SARS-CoV-2 spike protein. Our results showed that the potential hot-spots, including R454 and E471, in RBD, exhibited strong interactions in the HS-RBD binding region. Therefore, we screened different compounds in the natural product database towards these hot-spots to find potential antiviral candidates using LibDock, Autodock vina and furthermore applying the MD simulation in AMBER20. The results showed three potential natural compounds, including Acetoside (ACE), Hyperoside (HYP), and Isoquercitrin (ISO), had a strong affinity to the RBD. Our results demonstrate a feasible approach to identify potential antiviral agents by evaluating the binding interaction between viral glycoproteins and host receptors. The present study provided the applications of the structure-based computational approach for designing and developing of new antiviral drugs against SARS-CoV-2 variants.
Similar content being viewed by others
Introduction
Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), a type II coronavirus, caused a pandemic of acute respiratory illness starting in late 2019, leading to a severe public health crisis and economic losses worldwide1. Notably, the emergence of new SARS-CoV-2 variants due to genetic mutations has raised concerns about the effectiveness of COVID-19 antiviral drugs and vaccines2,3. SARS-CoV-2 variants can lead to changes in binding sites that may affect the ability of the virus to bind to specific drug targets, reducing the efficacy of antiviral drugs or rendering them completely ineffective against the mutant strains4. As new variants of the virus evolve, it becomes increasingly important to understand the disease and drug targets5. For example, the Omicron variant has a remarkably high transmission rate, detected in 155 countries5. There is an urgent need to explore antiviral therapeutic targets and effective clinical drugs6,7. Therefore, the most severe outbreaks of SARS-CoV-2 highlight the importance of coronaviruses as human pathogens and emphasize the need to develop new antiviral strategies to combat acute respiratory illness caused by SARS-CoV-2 variants.
The particles of SARS-CoV-2 have an irregular shape containing an outer envelope with distinctive, 'club-shaped' peplomers that give the virus a crown-shaped appearance (corona)8. The viral genome of SARS-CoV-2 consists of a single-stranded positive-sense RNA of about 30 kb. It contains several genes coding for various structural and non-structural proteins required for the production of virion progeny9. The virion of the SARS-CoV-2 envelope surrounding the nucleocapsid contains the following structural proteins: spike (S), membrane (M), envelope (E), and nucleocapsid (N) protein10. In particular, the trimeric spike glycoprotein (S protein) of SARS-CoV-2 is a crucial target for virus neutralization11. Several therapeutic targets are repurposed against COVID-19 and viral elements used in COVID-19 vaccine candidates12. The S protein covering the surface of SARS-CoV-2 binds to the host receptor angiotensin-converting enzyme 2 (ACE2), leading to the entry of the virus into the cell13. COVID-19 variants often alter critical viral proteins, particularly the S protein, which is responsible for viral entry into host cells. Many mutations in the emerging variants are located in the receptor-binding domain (RBD) of the S protein, which is targeted by most monoclonal antibodies from COVID-19 patients14.
Heparan sulfate (HS) is a linear polysaccharide found on the surface of cells and plays a critical role in several biological processes, including cell signaling, adhesion, and viral attachment15. Viruses use HS interactions to enhance attachment to the surface of host cells and improve their chances of interacting with specific entry receptors16,17,18. In COVID-19 infection, HS serves as a co-receptor and facilitates the binding of the viral S protein to the host cell19. The S protein of the SARS-CoV-2 virus recognizes and attaches to HS molecules present on the surface of host cells. This interaction triggers a conformational change in the S protein, exposing the RBD that allows the virus to attach to the ACE2 receptor on the host cell membrane. This binding event is essential for the subsequent internalization of the virus into the host cell and initiating the infection process. Furthermore, HS has been shown to be essential for the entry of viruses into host cells through its interaction with ACE2 and the RBD of the S protein of SARS-CoV-220. Hence, the development of compounds that effectively interfere with the binding of HS to the RBD in the viral S protein presents a good strategy against SARS-CoV-2 infection.
In the filed of drug discovery against SARS-CoV-2 infection, many research articles have reported on multiple computational techniques21,22,23,24,25. These computational approaches were applied, such as molecular docking, molecular dynamics, and digital genetic sequencing. These methods were incorporated into drug-targeting strategies in bioinformatics, genomics, and proteomics26,27,28,29. Previously, there has been a predominant focus on targeting particular proteins, such as Mpro30, papain-like protease31, RBD32, or host-ACE233. Recntly, several natural products showed excellent antiviral activities by targeting structural and non-structural proteins of SARS-CoV-234,35,36. Therefore, the use of natural products can be a good option for drug development.
NRICM101 a traditional Chinese medicine formula developed by the National Research Institute of Chinese Medicine (NRICM) in Taiwan. This formula had successfully shown antiviral activities to prevent SARS-CoV-2 infection37. This natural herbal-based formula was proposed by NRICM to target viral respiratory infection and immunomodulation during the SARS CoV outbreak in 200338,39. One of the pharmaceutical functions of NRICM101 was proposed to interfere with host cell invasion and viral replication by binding the viral spike protein40. However, little is known about the active compounds in NRICM101 in suppressing SARS-CoV-2.
In this report, we used computational approach to analyze multiple sequences of coronaviruses and reveal important information about the binding of HS to RBD in the SARS-CoV-2 S protein. First, we identified the conserved 'hot-spots' on the RBD as potential targets for the development of new antiviral drugs. A total of 1382 herbal compounds from the natural chemical formulation of NRICM101 were docked to the identified hot-spots as potential antiviral drug candidates using in silico methods. Our results indicate the possible mechanism of NRICM101 regarding the active ingredients in suppressing SARS-CoV-2 and may provide useful information for the design and development of new antiviral agents against SARS-CoV-2 and other pandemic diseases.
Materials and methods
Structure and sequence preparation of RBD in SARS-CoV-2 spike protein
The X-ray crystal structures and sequences of RBD in different species different species (Guangxi pangolin, PDB ID: 7DDP; Guangdong pangolin, PDB ID: 7DDO; RaTG13, PDB ID: 7TTX; BANAL-236, PDB ID: 7PKI) and different variants of SARS-CoV-2 S protein (wild type, PDB ID: 7DDD; Beta, PDB ID: 7PRY; Gamma, PDB ID: 7M8K; Delta, PDB ID: 7W92; Kappa, PDB ID: 7VXI; Epsilon, PDB ID: 7N8H; Omicron, PDB ID: 7TB4) were retrieved from the Research Collaboratory for Structural Bioinformatics (RCSB) protein data bank (PDB) (https://www.rcsb.org/). Protein preparation was carried out in PyMOL (The PyMOL Molecular Graphics System, Version 2.3.2, Schrödinger, LLC.) by removing water molecules, any co-crystallized compounds, and other unnecessary regions. The amino acid region from 336 to 515 was selected for further in silico computational studies41. Multiple structure and sequence alignments of the RBDs of all selected multi-species coronavirus S protein and variants of the SARS-CoV-2 S protein were performed using PyMOL, ConSurf online server (https://consurf.tau.ac.il/)42. The Weblogo online tool was used to display the consensus sequences from the multiple sequence alignments of all RBDs of SARS-CoV-2 S protein (https://weblogo.berkeley.edu/logo.cgi).
Identifying the potential binding sites (hot-spots) involved in the interaction is an important step in docking to test new chemical substances. The residues in the active site form a pocket containing a variety of hydrogen acceptors, donors, hydrophilic regions, and hydrophobic regions. To determine the binding cavity in the RBD of the SARS-CoV-2 spike protein, protein preparation was performed in PyMOL by removing water molecules, any co-crystallized compounds, and other unnecessary regions. The highly conserved region Y453 to G476 of the RBD of the wild-type SARS-CoV-2 S protein was selected and docked to HS using the LibDock program in Discovery studio client v20.1.0.19225 (Discovery studio 2019) to predict the key active site residues in the RBD of the wild type SARS-CoV-2 protein for binding with HS in terms of hydrogen bonding and hydrophobic interaction37,38.
Preparation of natural compounds from NRICM101
Thousands of important compounds from ten medicinal plants in NRICM101 (Scutellaria baicalensis, Houttuynia cordata, Morus alba, Saposhnikovia divaricata, Trichosanthes kirilowii, Isatis indigotica, Glycyrrhiza glabra, Magnolia officinalis, Mentha haplocalyx and Nepeta tenuifolia)39 were used for a virtual screening and molecular docking study against RBD of SARS-CoV-2 target proteins. The structures of compounds were retrieved from PubChem40 and ZINC1543 databases, and verified the protonation state of compounds, then finally merged with Open Babel software for molecular docking43.
Molecular docking
The binding pocket with X: 245.6069; Y: 216.0802; Z: 197.8738 in a radius 11.49 Å containing residues R454, F456, R457, S459 and E471 of the RBD of the wild-type SARS-CoV-2 S protein (PDB ID: 7DDD) was selected as the binding site for screening compounds that could potentially inhibit the RBD. Otherwise after molecular docking, the binding energies for compound-receptor interactions were calculated by ‘Binding Energy’ protocol44 of Discovery studio 2019 Virtual screening was performed using the LibDock45 and binding energies module of Discovery studio 2019. LibDock is a rigid-based docking module. It calculates hot-spots for the protein using a grid placed in the binding site and polar and apolar probes. The hot-spots are then used to align the compounds to form favorable interactions. Virtual screening was performed by docking all prepared compounds to the defined active site using LibDock. Based on the LibDock scores, all docked poses were ranked and grouped, and all compounds were also ranked according to the LibDock scores. Subsequently, the compounds were filtered out if the LibDock score > 80 and met the criteria of Lipinski's rule of 5 (logP: − 2–5). The second platform for molecular docking was applied Autodock vina as the second platform for molecular docking. All targets were loaded in the AutoDockTools-1.5.7 including RBD, Acetoside (ACE), Isoquercitrin (ISO), Hyperoside (HYP), Chrysin 6-C-arabinoside 8-C-glucoside (CAG), Oroxyloside (ORO), and Chrysin 6-C-glucoside 8-C-arabinoside (CGA) to convert into pdbqt format; a prerequisite for Vina Autodock software. The binding pocket that X: 245.6344; Y: 217.076; Z: 197.1578 with grid box sized 20 × 20 × 20 Å surrounding the binding site of RBD. Docking procedure was repeated over ten times with exhaustiveness set at 846. Finally, the six best potential compounds with Libdock binding energies < − 200 kcal/mol and more than 3 hydrogen bonds with the active site residues determined by LigPlot+ 47 were selected for the molecular dynamics (MD) simulation48.
MD simulation
The best binding complex conformations of RBD with top six compounds (Table S1) and HS were selected for molecular dynamics simulations (MD) to determine and compare the stability of the complex. To generate the initial structure for MD simulation of RBD complexes with top six compounds and HS, we establish the force field for top six compounds and HS by the antechamber of AmberTools2049. In addition, these RBD complexes were solvated in a cubic box of TIP3P water model. The size of the simulation system was enough to ensure at least 10 Å between the protein and the edge of the simulation box. The force field of ff14SB50, GAFF2, and the water model TIP3P were used for the tLEaP of AmberTools2049. Each MD simulation for RBD complexes were done by AMBER20 with three independent repeats49. The minimization is calculated of 2500 steps of steepest descent and then 2500 steps of conjugate gradient in all 3 sections. The first section constrains all atoms of protein. In the second section, the atoms of backbone were constrained, the water, lipid and the side chains of protein will be minimized. In the third section, the entire system was optimized without any constraint. Then, the simulation systems were heated from 0 to 300 K, at 100 ps and equilibrated by NVT ensemble calculation method, in which the total number of atoms, volume and temperature are fixed, at 100 ps. Finally, the production of each MD simulations of 100 ns were performed from the NPT ensemble calculation method, in which the total number of atoms are fixed and at 300 K and 1 atm. Trajectory analysis was performed using the analysis tools by Cpptraj of the AmberTools2049 MD package. The root mean square deviation (RMSD), the root mean square fluctuations (RMSF), the radius of gyration (Rg) and the number of hydrogen bonds of the complex of compound and protein were calculated to determine the stability between each compound and the protein51.
Calculation of binding free energies
The Molecular Mechanics Generalized Born Surface Area (MM-GBSA) method to estimate the binding free energy of RBD with top six compounds and HS by with MMPBSA.py program in AmberTools2049. The MM-GBSA calculations were applied to 100 snapshots extracted, in the interval of 1 ns, from the 100 ns of the MD simulation of RBD complexes. The MM-GBSA method has the four terms to calculate the free energy, the Van der Waals interaction energy (vdW), the electrostatic energy (ele), the polar solvation energy (pol) and the non-polar solvation energy (nonpol). The free energy difference of binding is composed of the following terms:
To calculated the binding free energy for each system of RBD complex. The average free energy of complex, receptor and compound were obtained to estimate of the binding free energy as following formula.
The parameters of MM-GBSA calculations are following the standard MM-PB/GBSA method implemented in the AmberTools2049.
In our approach, we have chosen to neglect entropic contributions, aligning with the recommendation to avoid entropic calculations in normal mode analysis (NMA) due to their suggested high uncertainties52,53,54. Yang et al. suggested that including entropic contributions calculated by NMA could potentially worsen predictions in both MM-GBSA and MM-PBSA55. Hou et al. indicated the inclusion of the entropy term did not consistently improve prediction accuracy, as observed in cases such as α-thrombin, avidin, and cytochrome C peroxidase53. They suggested that, even without considering conformational entropy, MM-PBSA could still achieve satisfactory accuracy in ranking ligand affinities, aligning with conclusions from previous studies.
Principal component analysis (PCA)
Principal component analysis (PCA) is a standard tool in statistical mechanics used in order to determine the correlated motions of the residues to a set of linearly uncorrelated variables call principal components. The PCA of RBD complexes with top six compounds and HS were done using the ProDy56. Each RBD complex generated 100 snapshots at 1 ns intervals over the course of a 100 ns MD simulation. Only the Ca positions were used for the analysis. PC1, PC2, and PC3 which represent the first three principal components.
Binding free energy landscape (BFEL) analysis
In this study, the Binding Free energy landscape (BFEL) is constructed in a 3-dimensional coordinate system57: The x axis and the y axis are the first (PC1) and second (PC2) principal components with the highest eigen values calculated from PCA analysis for HS–RBD and compound–RBD complexes and the z axis is the calculated binding free energy of each binding conformation from MM-GBSA. The xy plane was divided into 100 ×100 mesh grids, and the scatter data of binding free energy were fitted to the grids by using Origin8.
ADME and toxicity prediction
Absorption, distribution, metabolism, excretion, and toxicity (ADMET) prediction was done to determine the drug-likeness properties of selected compounds. The 3D structures of compounds (ACE, HYP, ISO, CAG, CGA, and ORO) were saved in the smiles format and were uploaded on SWISSADME (Molecular Modeling Group of the SIB (Swiss Institute of Bioinformatics), Lausanne, Switzerland) for the prediction. SWISSADME is a web tool used for the prediction of ADME and the pharmacokinetic properties of a molecule. The predicted outcome includes lipophilicity, water solubility, physicochemical properties, and pharmacokinetics58. PROTOX-II is a server that predicts the carcinogenicity, cytotoxicity, and the LD50 value and toxicity class of a question molecule in rodents59.
Results
The conservation analysis of RBD in the Coronavirus Sarbecovirus family
The coronaviruses spike (S) protein plays a key role in receptor recognition and cell membrane fusion. The structure of the S protein is trimeric and consists of a signal peptide (residues 1–13) at the N-terminus, the S1 domain (residues 14–685), and the S2 domain (residues 686–1273). The receptor binding domain (RBD) is located at the bottom of the S1 domain (residues 319–541) and is responsible for receptor binding and membrane fusion (Fig. 1A). The RBD adopted two distinct interconverted conformations, Up-RBD (green) and Down-RBD (red). The Up-RBD configuration is necessary for efficient binding to the human ACE2 receptor60.
To analyze the conservation of RBD in the Sarbecovirius family, a multiple sequence alignment was performed in the following Sarbecovirus RBD, the host including pangolin (GX-pangolin and GD-pangolin), bat (RaTG13 and BANAL-236), and human (SARS-CoV-2). As shown in Fig. 1B, the region of RBD (residues 336–515) in the Sarbecovirus has found to be a highly conserved regions among most of the species. Moreover, the superposition of the five RBDs in the Sarbecovirus family showed a root-mean-square deviation (RMSD) of the backbone atoms of about 0.78 to 0.94 Å (Fig. 1C). Interestingly, the similar RBD structures consisted of a five-stranded antiparallel β-sheet sandwiched between α-helices and loops arranged in a right-handed, fist-shaped structure conserved among the Sarbecovirus family.
The highly conserved region of RBD in the SARS-CoV-2 variants
Sequence analysis is the process of examining amino acid sequences to understand their features, function, structure, or evolution61. To identify the highly-conserved region of the RBD, multiple sequence alignment was performed and analyzed among the wild type (WT; Wuhan-Hu-1) and several SARS-CoV-2 variants, including alpha (B.1.1.7), beta (B.1.351), gamma (P.1), epsilon (B.1.429), kappa (B.1.617.1), delta (B.1.617.2), and omicron (BA.1, BA.1.1, B.1.1.529, BA.2, BA.2.12.1, BA.2.13, BA.2.75, BA.3, BA.4). Interestingly, twenty-four amino acid residues (Y453 to G476) were completely identical in all variants of SARS-CoV-2 RBDs (Fig. 2A). Multi-sequence alignments using the conservation function of PyMOL showed that the region (Y453 to G476) was highly conserved between different SARS virus strains (Fig. 2B).
To compare the differences between the shapes in the SARS-CoV-2 S RBD, the structure coordinates of the original and the six RBD variants were obtained from the protein data bank (RCSB PDB) (Fig. 2C). As shown in the Fig. 2D, the superposition of the RBD variants of SARS-CoV-2 showed the RMSD of the backbone atoms of about 0.7 to 1.3 Å, indicating that the structures of the RBD variants of SARS-CoV-2 have a high degree of similarity. The RMSD of the common prevalent delta (B.1.617.2) and omicron (B.1.1.529) variants are 0.9 and 1.1 Å, respectively (Table 1). These results indicated the structural features of the RBD variants in SARS-CoV-2 are highly similar and conserved within the region spanning Y453 to G476 residues.
The druggable pocket prediction of RBD in SARS-CoV-2 variants
Previous research has shown that the region between Y453 to G476 was the heparan sulfate (HS) binding region in the RBD (HS-RBD binding region)20, which was bound by polysaccharide and helped the virus to rapidly enter cells. Blocking this region can affect the life cycle of the virus, and its infection efficiency is greatly reduced62. To better understand the details of the binding between HS and RBD, molecular docking with HS and RBD (PDB ID: 7DDD) was performed using Libdock module and Autodock vina, then further analyzed by LigPlot+ software. As shown in Fig. 3A, HS interacted with SARS-CoV-2 RBD through hydrogen bonding at residues S454, F456, R457, S459, and E471. The hydrophobic interactions with residues K458, S469, I472, Y473, Q474, and P491 indicate that HS interacted closely with RBD (Fig. 3B). To verify the stability of the HS-RBD complex compared to RBD, MD simulation was performed for 100 ns. The average of RMSD was 2.1 ± 0.2 Å for RBD and 2.0 ± 0.3 Å for HS-RBD complex, indicating the HS-RBD complex was more stable than RBD (Fig. 3C) To investigate the importance of the identified residues to HS-RBD binding site, the simulation of point mutations was performed on five residues within hydrogen bonding (R454, F456, R457, S459, and E471) (Fig. 3D). The average RMSD values obtained from the point mutations simulations were 2.0 ± 0.3 Å, 2.6 ± 0.2 Å, 2.3 ± 0.2 Å, 2.0 ± 0.1 Å, 2.0 ± 0.1 Å and 2.0 ± 0.2 Å for the wild type (black), R454A (red), E471A (purple), F456A (orange), R457A (dark yellow) and S459A (dark green), respectively (Fig. S1). The RMSD showed a significantly increasing in the residues R454, and E471 compared to the wild-type, indicated that these two residues play a critical role in the formation of the HS-RBD complex. In addition, binding energy calculations revealed a significant increase in the binding energy of residues R454 and E471 after point mutations, while the other three residues were similar to the wild type (data not shown). These results suggest R454 and E471 play an important role in RBD stability. Consequently, the druggable pocket, including R454, and E471, were considered potential ‘hot-spots’ for further drug development in this study.
Molecular docking for RBD inhibitors from natural compounds
To detect the possible RBD inhibitors via druggable pocket identified in this study, a molecular docking was performed with the natural compounds database. LibDock and AutoDock vina were used to screen RBD inhibitors from the NRICM101's herbal ingredients. LibDock is a high-throughput algorithm for docking compounds against a receptor's active sites, which in our case were residues of R454, and E471. The compound conformations were docked to polar and apolar receptor interaction sites (hot-spots), and the best-scoring poses were reported. All of the natural products from NRICM101's herbal ingredients (1382 compounds) were docked to the active sites of RBD (PBD ID: 7DDD) with LibDock. The resulting 6 compounds most frequently identified in affinity screening (Libdock score > 80) were selected and presented in Table 2. Subsequently, top six candidates were identified according to the Lipinski's rule: (1) the molecular weight of the compounds ranges from 250 to 500 Da, and (2) the ideal LogP ranges from − 2 to 5. (3) more than 3 hydrogen bonds to the receptor SARS-CoV-2 S RBD. In addition, the similar docking pose were also revealed by Autodock vina (Fig. S1).
The molecular interactions and binding energy analysis in the binding pocket of the RBD with the candidate compounds were shown in Fig. 4 and Table 3. The active site of the SARS-CoV-2 RBD exhibited hydrogen bonding with the druggable pocket residues. By looking into the hydrophobic interactions located in residues S469, Y473, and P491, the site of action in the ACE-RBD complex was found to the maximum hydrogen bonds to RBD (R454, F456, R457, K458, D467, I468, and S469) with the lowest binding energy of − 234.56 kcal/mol in Libdock. Besides, natural compounds, ISO and HYP, with the same binding energy of − 220.32 kcal/mol in Libdock., also illustrated the binding with active site in H-bond and hydrophobic interaction through R456, F457, K458, S469, E471 and Y473. These results revealing the natural compounds, ACE, HYP and ISO can be considered as a potential RBD inhibitor.
MD simulation analysis with candidate natural compounds
To gain insight into the dynamic behavior of the compounds at the active site of the RBD protein, MD simulations were performed for the RBD protein structure and complexed with six candidate compounds (Fig. 5A). To quantify the structural stability of the protein-compound complexes, the root-mean-square deviation (RMSD) of the backbone Cα atoms was measured. The average of RMSD was reported 2.3 ± 0.3 Å for HS-RBD. However, the average of RMSD for RBD complexed with ACE, HYP, ISO, CAG, ORO, and CGA were reported as 1.9 ± 0.2 Å, 2.1 ± 0.2 Å, 2.1 ± 0.3 Å, 2.4 ± 0.3 Å, 2.2 ± 0.5 Å and 2.7 ± 0.5 Å, respectively (Fig. S2. These results indicated that the docked poses of candidates in the MD simulation were reliable under conditions closed to well-equilibrated systems.
Radius of gyration (Rg) is a parameter linked to the tertiary structural volume of a protein-compound complex and has been applied to obtain insight into the stability of a protein-compound complex in a biological system. A complex is supposed to have a higher radius of gyration due to less tight packing. The average Rg values for HS, ACE, HYP, ISO, CAG, ORO, CGA were found to be 18.0 ± 0.1 Å, 17.7 ± 0.1 Å, 17.7 ± 0.2 Å, 17.8 ± 0.1 Å, 17.8 ± 0.1 Å, 17.8 ± 0.1 Å, and 17.8 ± 0.1 Å, respectively (Fig. S3). Based on the plots of Rg (Fig. 5B), the curve of HS was positioned above those of three complexes, ACE, HYP, and ISO. These results suggested that the overall stability of the protein-compound complex in these three complexes were better than HS.
Analyzing the communication between RBD and candidate natural compounds
The interaction between the top six potential compounds was further preformed the hydrogen bonding analysis and PCA. The average number of hydrogen bonds of RBD complexed with ACE, HYP, ISO, CAG, ORO, and, CGA were calculated as 18.2 ± 1.6, 17.0 ± 1.7, 15.4 ± 1.6, 16.2 ± 1.8, 14.7 ± 1.8 and 16.5 ± 1.7 Å respectively (Fig. 6). Compared to the reference natural HS complexed RBD were calculated as 15.5 ± 1.4 hydrogen bonds formation (Fig. 6A). These results were consistent with the RMSD of RBD complexed with candidates for MD simulation (Fig. 5). In the Fig. 6B, four compounds, including ACE, ISO, HYP, and CGA, revealed strong hydrogen bonding with the protein through the course of simulation. It can be seen that ACE has the highest number of hydrogen bonds, down to which the second-highest is compound ISO and then HYP. Hydrogen bonding in the case of CAG, and ORO is lower than the rest of the compounds.
Next, we compared the root mean square fluctuations (RMSF) of C-α atoms during the MD for each compound-RBD complex. While the RMSF values in the case of all six compounds were very similar, the fluctuations in the CGA were considerably higher. Notably, the RMSF values at the active sites around (R454-E471) remain consistently low as well as the results shown in triplicate of simulation. These findings were confirmed by residue-based free energy decomposition analysis (Fig. 7). Generally, if the ΔG(Binding) interaction energy calculated by MM-GBSA, between the residue and the substrate is lower than − 1 kcal mol−1, comprised of ΔE (vdW), ΔE (Elec), ΔE (Polar Solv) and ΔE (Non-polar Solv), those residues are considered to be important in potential hot-spots (Table 4).
Notably, HYP exhibited the most hydrogen bonding interacting with RBD. However, the overall stability of the compound-protein complex was affected by various factors containing electrostatic, hydrogen bonding, van der Waals, and solvation forces, etc. As shown in Fig. S4, the binding free energy by solvation for HYP-RBD was the most unstable. Considering all contributing factors, the stability of ACE-RBD still outperformed that of HYP-RBD.
Essential dynamics PCA analysis for candidate natural compounds
In order to understand the principal structural dynamics revealed by each compound-RBD complex, we plotted the PCA analysis. The given figures showed three eigenvectors or PCA for the RBD being docked with the top six potential compounds based on their extracted trajectories and exposed in clusters. Examining these eigenvectors supports the solid and clustered motions in the target’s corresponding complexes during the simulation. All the compounds exhibited a distinct transition at various points, indicating the alteration in conformation caused by the attachment of the compound. Each dot represents an individual frame. Importantly, ACE, ISO, and HYP in Fig. 8 exhibited distinct dynamic trends upon binding to RBD in comparison to HS. These results suggested that these compounds had the potential to block the interaction between RBD and HS without being utilized by RBD in infecting host cells. Conversely, the PCA distributions of the other compounds, CAG, ORO, and CGA, complexed to RBD are similar to that of HS, suggesting the possibility of downstream effects similar to HS-RBD interactions. Furthermore, local hydrogen bonding analysis (Fig. 6B) revealed CAG, ORO, and CGA compounds exhibited few hydrogen bonds in the MD, indicating a reduced capacity to stabilize complexes. These findings suggested ACE, HYP, and ISO may have more potential in combating SARS-CoV-2.
Identification of key residues involved in compound binding to RBD from binding free energy landscape analysis
Then, the lowest energy conformation, calculated by MM-GBSA, was selected in the free energy surface from the most abundant clusters as a part of stable low-energy conformation for the analysis object. Binding Free-energy landscape (BFEL) was used to obtain several low-energy conformations, which are the most suitable analytical subjects according to RMSD-equalized trajectory (Fig. 9). The ACE-RBD docked complex has the most stable dynamics structure since its relative stability of binding free energy was − 43.9 kcal/mol while having six hydrogen bonds with eight hydrophobic interactions (Fig. 9A). The HYP-RBD relative stability of binding free energy was − 38.1 kcal/mol, thus ranking it the second most stable complex with fourteen hydrogen bonds and four hydrophobic interactions (Fig. 9B). The third most stable docking complex is ISO-RBD, with relative stability of binding free energy of − 33.2 kcal/mol, forming five hydrogen bonds with eight hydrophobic interactions (Fig. 9C). Importantly, the residues such as R454 and E471 were all observed to participate in forming stable conformations.
ADME and toxicity prediction of candidate natural compounds
The ADME properties of the selected natural compounds predicted by SwissADME are summarized in Table 5. All the natural compounds exhibit Log Po/w values below 5, indicating favorable lipophilicity characteristics that may confer protection against ROS degradation. The TPSA (topological polar surface area) ranges from 176.12 to 245.29 Å2, with consensus log Po/w values falling within the optimal range of − 1.26 to 0.55. Compounds satisfying the TPSA ≤ 250 Å2 parameter are expected to display adequate solubility, oral bioavailability, and cell permeability. With regards to intestinal absorption (human), absorbance of less than 30% is considered to be poorly absorbed. HYP and ISO were predicted to have a poor absorption. P-glycoprotein is a member of the ATP-binding transmembrane glycoprotein family [ATP-binding cassette (ABC)], which can excrete drugs or other exogenous chemicals from cells. The results suggested that ACE, CAG, CGA, and Oroxyloside are substrates of P-glycoprotein—they may be actively exuded from cells by P-glycoprotein. Cytochrome P450s is an important enzyme system for drug metabolism in liver. The two main subtypes of cytochrome P450 are CYP2D6 and CYP3A4. The results showed that all compounds were not substrates for the two subtypes. The ProTox-II Web server was employed to assess organ toxicity profiles, toxicological endpoints, and LD50 values of the compounds. As shown in Table 6, all selected natural compounds are non-carcinogenic and non-cytotoxic, indicating their safety for use.
Discussion
SARS-CoV-2 is a pandemic respiratory infectious disease with serious public health and economic implications63. In view of the serious situation, many antiviral drugs and vaccines have been developed against COVID-19 targeting the SARS-CoV-2 spike (S) protein64,65. However, the efficacy of drugs and vaccines is usually limited by multiple spike mutations of the SARS-CoV-2 variants1,66,67. To address the challenge of treating SARS-CoV-2 variants, identifying highly conserved sequences and potentially druggable pockets for drug development represents a promising strategy68. In SARS-CoV-2 infection, heparan sulfate (HS) from host molecules is essential for recognition of the receptor binding domain (RBD) protein in the SARS-CoV-2 spike by ACE2, regardless of the high variability and therefore prevents the virus from easily acquiring drug resistance69. On the other hand, specific viral structural proteins with highly conserved protein sequences, such as the spike RBD are expressed during viral infection.
The spike RBD is located at the bottom of the S1 domain and plays an important role in host ACE2 receptor binding within membrane fusion through HS-assisted70,71,72. In the current study, we observed that RBD is associated with the recognition of ACE2 ability and is also high-conversed in all SARS-CoV-2 variants. The results of sequence alignments showed that the wild type SARS-CoV-2 has highly conserved sequences from Y453 to G476 among other variants such as beta, gamma, delta, kappa, epsilon and omicron. In addition, structural alignments of all SARS-CoV-2 variants revealed that the highly conserved protein sequences were also located in the HS binding region of RBD.
The detail interactions were found that the cellular HS interacted with RBD by forming hydrogen bonds at residues R454, F456, R457, S459 and, E471 and by hydrophobic interactions with residues K458, S469, I472, Y473, Q474 and, P491. Therefore, to block this region as a reasonable target and feasible strategy for the design and development of anti-COVID-19. Notably, the development of antiviral drugs obtained from this approach will be broad-spectrum agents targeting viruses that use the interaction between RBD and HS to facilitate their life cycle, including COVID-19. In drug design, the identification of key residues is important. In the current study, binding positions from HS served as clues, and the simulation of point mutations applied to probe potentially significant sites. The RMSD changes for each residue post-mutation. The analysis of RMSD yielded insights into the impact of point mutations on complex stability. In Fig. 3, significant RMSD elevation was observed for R454A and E471A mutations after a 100 ns simulation, underscoring the pivotal roles of these two residues in complex stability. To avoid the possibility of a high mutation rate in the region where HS binds to RBD, the active sites including R454 and E471 as essential ‘hot-spots’ were further selected for the development of a potent antiviral drug in this study.
Natural products were demonstrated remarkable efficacy against SARS-CoV-2 infection73,74,75,76. For example, Crocin, a terpenoid compound, was showed a promising binding affinity with the major protease of SARS-CoV-2 in the docking study77. Broussochalcone A, a flavonoid isolated from Broussonetia papyrifera (L.), has higher affinity and stability in the Mpro of SARS-CoV-2 than lopinavir78. Berberine isolated from Hydrastis canadensis L. was shown to have a much lower binding energy to chymotrypsin-like protease (3CLpro). The MD simulation of berberine in a complex with 3Clpro showed high stability and indicated a strong effect against SARS-CoV-2 by decreasing the activity of 3Clpro79. In this study, the active natural compounds in a traditional herbal formula, NRICM101, were investigated for the treatment of SARS-CoV-2 variants. The molecular docking and MD simulations were performed for several natural compounds from NRICM101 with the highly conserved region of RBD spike protein. These results in the study revealed three potential natural compounds, including Acetoside (ACE), Hyperoside (HYP), and Isoquercitrin (ISO), with a strong affinity to the RBD. These natural compounds were studied for antiviral or pharmacological activities, however, it is the first time to reveal how these natural compounds bind to RBD, leading to their potential as RBD inhibitors against the SARS-CoV-2 variants. In detail, ACE, extracted from Scutellaria baicalensis R., exhibited the highest binding affinity to RBD. ACE was also proposed for anti-SARS-CoV-2 effects by inhibiting 3Clpro80. HYP, a prominent component of Abelmoschus odelli (L.) medik, was also identified as having anti-SARS-CoV-2 effects by inhibiting 3Clpro and ACE2 activity81. Besides, HYP also showed the anti-cancer effects by mediation of the NF-κB signaling pathway82. ISO, derived from Hypericum perforatum L., demonstrated anti-inflammatory activity effects by reducing prostaglandin E2 levels83.
The PCA analysis revealed the dynamic trends of compound complexed to protein.
The PCA results demonstrated distinctions in the binding of natural HS to the RBD protein compared to three potential compounds, ACE, HYP, and ISO. These distinctions implied potential inhibitory behavior for the three compounds, as the movement trends of the complexes significantly differed from the natural HS-RBD complex. Combining with the observed stability of the complex by BFEL, indicating that there these three compounds have the potential as RBD inhibitors to form a stable complex with RBD. While further in vitro and in vivo testing is essential, our initial studies have uncovered the possibility of ACE, HYP, and ISO in treating the Sarbecovirus family, including SARS-CoV-2 variants.
Conclusion
In summary, the bioinformatics and structural information were analyzed and identified the highly conserved regions of RBD in the Coronavirus Sarbecovirus family and SARS-CoV-2 variants. Further, the structure-based computational approach with molecular docking and MD simulation was performed to screen and characterize the potential inhibitors of SARS-CoV-2 spike protein RBD. With the establishment of the ‘hot-spots’ model, 1382 natural products from NRICM101 were comprehensively screened, and the compounds ACE, ISO and HYP were identified that apparently interfere with the RBD activities in this study. We demonstrated that ACE blocked the active site of RBD by interacting with residues R454, F456, R457, K458, D467, I468, and S469 via hydrogen bond interactions and hydrophobic contacts with E654, R466, E471, Y473, Q474 and P491, which are key residues for the structure-based lead optimization against RBD protein. The discovery of the RBD inhibitor ACE from Scutellaria baicalensis R. holds great potential for the development of new and promising therapeutics for the treatment of SARS-CoV-2 variants.
Data availability
All data will be available upon request to the corresponding author.
Abbreviations
- S protein:
-
Spike protein
- HS:
-
Heparan sulfate
- RBD:
-
Receptor binding domain
- RMSD:
-
The root mean square deviation
- RMSF:
-
The root mean square fluctuations
- ACE:
-
Acetoside
- ISO:
-
Isoquercitrin
- HYP:
-
Hyperoside
- CAG:
-
Chrysin 6-C-arabinoside 8-C-glucoside
- ORO:
-
Oroxyloside
- CGA:
-
Chrysin 6-C-glucoside 8-C-arabinoside
- MD:
-
Molecular dynamics
- PCA:
-
Principal component analysis
- FEL:
-
Free energy landscape
References
Krammer, F. SARS-CoV-2 vaccines in development. Nature 586(7830), 516–527 (2020).
McLean, G. et al. The impact of evolving SARS-CoV-2 mutations and variants on COVID-19 vaccines. mBio 13(2), e0297921 (2022).
Hsu, J.-N. et al. Targeting the N-terminus domain of the coronavirus nucleocapsid protein induces abnormal oligomerization via allosteric modulation. Front. Mol. Biosci. https://doi.org/10.3389/fmolb.2022.871499 (2022).
Gil, C. et al. COVID-19: Drug targets and potential treatments. J. Med. Chem. 63(21), 12359–12386 (2020).
Samy, A. et al. SARS-CoV-2 potential drugs, drug targets, and biomarkers: A viral–host interaction network-based analysis. Sci. Rep. 12(1), 11934 (2022).
Yin, J. et al. Advances in the development of therapeutic strategies against COVID-19 and perspectives in the drug design for emerging SARS-CoV-2 variants. Comput. Struct. Biotechnol. J. 20, 824–837 (2022).
Chang, C.-K. et al. Targeting protein–protein interaction interfaces in COVID-19 drug discovery. Comput. Struct. Biotechnol. J. 19, 2246–2255 (2021).
Trojánek, M. et al. A novel coronavirus (SARS-CoV-2) and COVID-19. Cas Lek Cesk 159(2), 55–66 (2020).
Yang, H. & Rao, Z. Structural biology of SARS-CoV-2 and implications for therapeutic development. Nat. Rev. Microbiol. 19(11), 685–700 (2021).
Dai, L. & Gao, G. F. Viral targets for vaccines against COVID-19. Nat. Rev. Immunol. 21(2), 73–82 (2021).
Shamsi, A. et al. Potential drug targets of SARS-CoV-2: From genomics to therapeutics. Int. J. Biol. Macromol. 177, 1–9 (2021).
Ayele, A. G., Enyew, E. F. & Kifle, Z. D. Roles of existing drug and drug targets for COVID-19 management. Metabol. Open 11, 100103 (2021).
Shang, J. et al. Cell entry mechanisms of SARS-CoV-2. Proc. Natl. Acad. Sci. 117(21), 11727–11734 (2020).
Cheng, M. H. et al. Impact of new variants on SARS-CoV-2 infectivity and neutralization: A molecular assessment of the alterations in the spike-host protein interactions. iScience 25(3), 103939 (2022).
Stewart, M. D. & Sanderson, R. D. Heparan sulfate in the nucleus and its control of cellular functions. Matrix Biol. 35, 56–59 (2014).
Rusnati, M. et al. Sulfated K5 Escherichia coli polysaccharide derivatives: A novel class of candidate antiviral microbicides. Pharmacol. Ther. 123(3), 310–322 (2009).
Aquino, R. S. & Park, P. W. Glycosaminoglycans and infection. Front. Biosci. 21(6), 1260–1277 (2016).
Cagno, V. et al. Heparan sulfate proteoglycans and viral attachment: True receptors or adaptation bias?. Viruses 11(7), 596 (2019).
Yue, J. et al. Heparan sulfate facilitates spike protein-mediated SARS-CoV-2 host cell invasion and contributes to increased infection of SARS-CoV-2 G614 mutant and in lung cancer. Front. Mol. Biosci. 8, 649575 (2021).
Clausen, T. M. et al. SARS-CoV-2 infection depends on cellular heparan sulfate and ACE2. Cell 183(4), 1043-1057.e15 (2020).
Ahmad, S. et al. Mobile technology solution for COVID-19: Surveillance and prevention. In Computational Intelligence Methods in COVID-19: Surveillance, Prevention, Prediction and Diagnosis (ed. Raza, K.) 79–108 (Springer Singapore, 2021).
Wadood, A. et al. Epitopes based drug design for dengue virus envelope protein: A computational approach. Comput. Biol. Chem. 71, 152–160 (2017).
Ahmad, S. et al. Reporting dinaciclib and theodrenaline as a multitargeted inhibitor against SARS-CoV-2: An in-silico study. J. Biomol. Struct. Dyn. 41(9), 4013–4023 (2023).
Ahmad, S. et al. Multitargeted molecular dynamic understanding of butoxypheser against SARS-CoV-2: An in silico study. Nat. Prod. Commun. 17(7), 1934 (2022).
Alturki, N. A. et al. In-silico screening and molecular dynamics simulation of drug bank experimental compounds against SARS-CoV-2. Molecules https://doi.org/10.3390/molecules27144391 (2022).
Tarique, M. et al. Novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV2) and other coronaviruses: A genome-wide comparative annotation and analysis. Mol. Cell. Biochem. 476(5), 2203–2217 (2021).
Kaushik, A. C. et al. CoronaPep: An anti-coronavirus peptide generation tool. IEEE/ACM Trans. Comput. Biol. Bioinform. 18(4), 1299–1304 (2021).
Barh, D. et al. SARS-CoV-2 variants show a gradual declining pathogenicity and pro-inflammatory cytokine stimulation, an increasing antigenic and anti-inflammatory cytokine induction, and rising structural protein instability: A minimal number genome-based approach. Inflammation 46(1), 297–312 (2023).
Jawarkar, R. D. et al. QSAR based virtual screening derived identification of a novel hit as a SARS CoV-229E 3Clpro inhibitor: GA-MLR QSAR modelling supported by molecular docking, molecular dynamics simulation and MMGBSA calculation approaches. Arab. J. Chem. 15(1), 103499 (2022).
Mehmood, A. et al. Discovering potent inhibitors against the Mpro of the SARS-CoV-2. A medicinal chemistry approach. Comput. Biol. Med. 143, 105235 (2022).
Alzamami, A. et al. Hemi-Babim and fenoterol as potential inhibitors of Mpro and Papain-like protease against SARS-CoV-2: An in-silico study. Medicina https://doi.org/10.3390/medicina58040515 (2022).
Yadav, M. K. et al. Predictive modelling and therapeutic repurposing of natural compounds against the receptor-binding domain of SARS-CoV-2. J. Biomol. Struct. Dyn. 41(5), 1527–1539 (2023).
Li, D. et al. HD5 and LL-37 inhibit SARS-CoV and SARS-CoV-2 binding to human ACE2 by molecular simulation. Interdiscip. Sci. 13(4), 766–777 (2021).
Frediansyah, A. et al. Microbial natural products with antiviral activities, including anti-SARS-CoV-2: A review. Molecules 27(13), 4305 (2022).
Chakravarti, R. et al. A review on potential of natural products in the management of COVID-19. RSC Adv. 11(27), 16711–16735 (2021).
Li, D. et al. Discovery of a natural product with potent efficacy against SARS-CoV-2 by drug screening. Interdiscip. Sci. Comput. Life Sci. 14(1), 55–63 (2022).
Kalyaanamoorthy, S. & Chen, Y. P. Structure-based drug design to augment hit discovery. Drug Discov. Today 16(17–18), 831–839 (2011).
Debaisieux, S. et al. The ins and outs of HIV-1 tat. Traffic 13(3), 355–363 (2012).
Tsai, K. C. et al. A traditional Chinese medicine formula NRICM101 to target COVID-19 through multiple pathways: A bedside-to-bench study. Biomed. Pharmacother. 133, 111037 (2021).
Kim, S. et al. PubChem in 2021: New data content and improved web interfaces. Nucleic Acids Res. 49(D1), D1388-d1395 (2021).
Rolta, R. et al. In silico screening of hundred phytocompounds of ten medicinal plants as potential inhibitors of nucleocapsid phosphoprotein of COVID-19: An approach to prevent virus assembly. J. Biomol. Struct. Dyn. 39(18), 7017–7034 (2021).
Ashkenazy, H. et al. ConSurf 2010: Calculating evolutionary conservation in sequence and structure of proteins and nucleic acids. Nucleic Acids Res. 38(suppl_2), W529–W533 (2010).
Sterling, T. & Irwin, J. J. ZINC 15—ligand discovery for everyone. J. Chem. Inf. Model. 55(11), 2324–2337 (2015).
Gogoi, B. et al. Identification of potential plant-based inhibitor against viral proteases of SARS-CoV-2 through molecular docking, MM-PBSA binding energy calculations and molecular dynamics simulation. Mol. Divers. 25(3), 1963–1977 (2021).
Rao, S. N. et al. Validation studies of the site-directed docking program LibDock. J. Chem. Inf. Model. 47(6), 2159–2171 (2007).
El Hassab, M. A. et al. In silico identification of potential SARS COV-2 2′-O-methyltransferase inhibitor: Fragment-based screening approach and MM-PBSA calculations. RSC Adv. 11(26), 16026–16033 (2021).
Laskowski, R. A. & Swindells, M. B. LigPlot+: Multiple ligand-protein interaction diagrams for drug discovery. J. Chem. Inf. Model. 51(10), 2778–2786 (2011).
Yang, L. et al. Computational study of novel natural inhibitors targeting O6-methylguanine-DNA methyltransferase. World Neurosurg. 130, e294–e306 (2019).
Case, D. A. et al. AMBER 2020. (2020).
Maier, J. A. et al. ff14SB: Improving the accuracy of protein side chain and backbone parameters from ff99SB. J. Chem. Theory Comput. 11(8), 3696–3713 (2015).
Shanmugarajan, D. et al. Curcumin to inhibit binding of spike glycoprotein to ACE2 receptors: Computational modelling, simulations, and ADMET studies to explore curcuminoids against novel SARS-CoV-2 targets. RSC Adv. 10(52), 31385–31399 (2020).
Greene, D. et al. Modeling membrane protein-ligand binding interactions: The human purinergic platelet receptor. J. Phys. Chem. B 120(48), 12293–12304 (2016).
Hou, T. et al. Assessing the performance of the MM/PBSA and MM/GBSA methods. 1. The accuracy of binding free energy calculations based on molecular dynamics simulations. J. Chem. Inf. Model 51(1), 69–82 (2011).
Yang, T. Y. et al. Virtual screening using molecular simulations. Proteins-Struct. Funct. Bioinform. 79(6), 1940–1951 (2011).
Yang, T. et al. Virtual screening using molecular simulations. Proteins 79(6), 1940–1951 (2011).
Bakan, A., Meireles, L. M. & Bahar, I. ProDy: Protein dynamics inferred from theory and experiments. Bioinformatics 27(11), 1575–1577 (2011).
Londhe, A. M. et al. Investigation of molecular details of Keap1-Nrf2 inhibitors using molecular dynamics and umbrella sampling techniques. Molecules 24(22), 4085 (2019).
Daina, A., Michielin, O. & Zoete, V. SwissADME: A free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules. Sci. Rep. 7(1), 42717 (2017).
Banerjee, P. et al. ProTox-II: A webserver for the prediction of toxicity of chemicals. Nucleic Acids Res. 46(W1), W257-w263 (2018).
Pang, Y. T. et al. SARS-CoV-2 spike opening dynamics and energetics reveal the individual roles of glycans and their collective impact. Commun. Biol. 5(1), 1170 (2022).
Rives, A. et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl. Acad. Sci. 118(15), e2016239118 (2021).
Muralidar, S., Gopal, G. & Visaga Ambi, S. Targeting the viral-entry facilitators of SARS-CoV-2 as a therapeutic strategy in COVID-19. J. Med. Virol. 93(9), 5260–5276 (2021).
Gorain, B. et al. Fighting strategies against the novel coronavirus pandemic: Impact on global economy. Front. Public Health https://doi.org/10.3389/fpubh.2020.606129 (2020).
Papageorgiou, A. C. & Mohsin, I. The SARS-CoV-2 spike glycoprotein as a drug and vaccine target: Structural insights into its complexes with ACE2 and antibodies. Cells 9(11), 2343 (2020).
Rodríguez, Y. et al. Small molecules targeting SARS-CoV-2 spike glycoprotein receptor-binding domain. ACS Omega 7(33), 28779–28789 (2022).
Harvey, W. T. et al. SARS-CoV-2 variants, spike mutations and immune escape. Nat. Rev. Microbiol. 19(7), 409–424 (2021).
Malik, J. A. et al. The SARS-CoV-2 mutations versus vaccine effectiveness: New opportunities to new challenges. J. Infect. Public Health 15(2), 228–240 (2022).
Lin, S.-M. et al. Structure-based stabilization of non-native protein–protein interactions of coronavirus nucleocapsid proteins in antiviral drug design. J. Med. Chem. 63(6), 3131–3141 (2020).
Naqvi, A. A. T. et al. Insights into SARS-CoV-2 genome, structure, evolution, pathogenesis and therapies: Structural genomics approach. Biochim. Biophys. Acta Mol. Basis Dis. 1866(10), 165878 (2020).
Lan, J. et al. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor. Nature 581(7807), 215–220 (2020).
Ali, A. & Vijayan, R. Dynamics of the ACE2–SARS-CoV-2/SARS-CoV spike protein interface reveal unique mechanisms. Sci. Rep. 10(1), 14214 (2020).
Sztain, T. et al. A glycan gate controls opening of the SARS-CoV-2 spike protein. Nat. Chem. 13(10), 963–968 (2021).
Hengphasatporn, K. et al. Halogenated baicalein as a promising antiviral agent toward SARS-CoV-2 main protease. J. Chem. Inf. Model. 62(6), 1498–1509 (2022).
Deetanya, P. et al. Interaction of 8-anilinonaphthalene-1-sulfonate with SARS-CoV-2 main protease and its application as a fluorescent probe for inhibitor identification. Comput. Struct. Biotechnol. J. 19, 3364–3371 (2021).
Hengphasatporn, K. et al. Promising SARS-CoV-2 main protease inhibitor ligand-binding modes evaluated using LB-PaCS-MD/FMO. Sci. Rep. 12(1), 17984 (2022).
Nutho, B. et al. Why are lopinavir and ritonavir effective against the newly emerged coronavirus 2019? Atomistic insights into the inhibitory mechanisms. Biochemistry 59(18), 1769–1779 (2020).
Aanouz, I. et al. Moroccan medicinal plants as inhibitors against SARS-CoV-2 main protease: Computational investigations. J. Biomol. Struct. Dyn. 39(8), 2971–2979 (2021).
Ghosh, R. et al. Identification of polyphenols from Broussonetia papyrifera as SARS CoV-2 main protease inhibitors using in silico docking and molecular dynamics simulation approaches. J. Biomol. Struct. Dyn. 39(17), 6747–6760 (2021).
Chowdhury, P. In silico investigation of phytoconstituents from Indian medicinal herb ‘Tinospora cordifolia (giloy)’ against SARS-CoV-2 (COVID-19) by molecular dynamics approach. J. Biomol. Struct. Dyn. 39(17), 6792–6809 (2021).
Vincent, S. et al. Molecular docking studies on the anti-viral effects of compounds from Kabasura Kudineer on SARS-CoV-2 3CL(pro). Front. Mol. Biosci. 7, 613401 (2020).
Kaul, R. et al. Promising antiviral activities of natural flavonoids against SARS-CoV-2 targets: Systematic review. Int. J. Mol. Sci. 22(20), 11069 (2021).
Qiu, J. et al. Hyperoside induces breast cancer cells apoptosis via ROS-mediated NF-κB signaling pathway. Int. J. Mol. Sci. 21(1), 131 (2020).
Valentová, K. et al. Isoquercitrin: Pharmacology, toxicology, and metabolism. Food Chem. Toxicol. 68, 267–282 (2014).
Acknowledgements
The authors gratefully acknowledge the Ministry of Science and Technology, Taiwan (NSTC 111-2311-B-309-001-MY3 to C.-M.C.) for the funding. We also acknowledge Prof. Ming-Hon Hou form National Chung Hsing University, Taiwan for the Discovery Studio software and Prof. Yuh-Ju Sun form National Tsing Hua University, Taiwan for AMBER20.
Funding
NSTC 111-2311-B-309-001-MY3 to C.-M.C.
Author information
Authors and Affiliations
Contributions
Conceptualization, Z.S.Y., and C.M.C.; methodology and software, T.S.L. and Y.S.H.; investigation, Z.S.Y., and C.M.C.; data curation, C.M.C.; writing—original draft preparation, Z.S.Y. and C.M.C.; writing—review and editing, C.M.C., C.C.C., Z.S.Y. and Y.S.H.; funding acquisition, C.M.C. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yang, ZS., Li, TS., Huang, YS. et al. Targeting the receptor binding domain and heparan sulfate binding for antiviral drug development against SARS-CoV-2 variants. Sci Rep 14, 2753 (2024). https://doi.org/10.1038/s41598-024-53111-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-53111-2
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.