Building HMM and molecular docking analysis for the sensitive detection of anti-viral pneumonia antimicrobial peptides (AMPs)

Bakare, Olalekan Olanrewaju; Keyster, Marshall; Pretorius, Ashley

doi:10.1038/s41598-021-00223-8

Published: 18 October 2021

Olalekan Olanrewaju Bakare^1,2,
Marshall Keyster² &
Ashley Pretorius¹

Scientific Reports volume 11, Article number: 20621 (2021)

Abstract

Pneumonia is the leading cause of mortality among children under five years of age, causing 1.6 million deaths every year; recent research has shown that mortality is also increasing in the elderly. The few biomarkers used for its diagnosis lack specificity and precision, as they are also associated with other infections, for example, pulmonary tuberculosis and Human Immunodeficiency Virus. New biomarkers are being sought worldwide to overcome these limitations. Antimicrobial peptides (AMPs) are promising diagnostic agents against infection. This work used AMPs as biomarkers to detect viral pneumonia pathogens, for example, Respiratory syncytial virus and Influenza A and B viruses, using in silico technologies such as the Hidden Markov Model tool HMMER. HMMER was used to identify putative anti-viral pneumonia AMPs against the identified receptor proteins of Respiratory syncytial virus and Influenza A and B viruses. The physicochemical parameters of these putative AMPs were analyzed, and their 3-D structures were predicted using I-TASSER. Molecular docking of these AMPs against the identified viral pneumonia proteins was carried out using the PatchDock and HDock servers. The results yielded 27 anti-viral AMPs, ranked by their E-values, with physicochemical parameters similar to those of known experimentally validated AMPs. The AMPs also had a high predicted binding potential to the pneumonia receptors of these pathogens. The tendency of the putative anti-viral AMPs to bind pneumonia proteins suggests that they are promising candidate biomarkers for identifying these viral pathogens in point-of-care (POC) pneumonia diagnostics. The high precision observed for the AMPs supports the use of HMM in the discovery process for disease diagnostics.

Introduction

The high mortality from pneumonia calls for new diagnostic strategies and biomarkers that can determine patients’ status before the onset of signs and symptoms¹. Mortality from pneumonia is partly due to delayed therapy, because there is no sensitive approach to identifying the viral cause of the disease. The few biomarkers used so far to detect the disease are unreliable because they are also involved in other illnesses, and consequently give false-positive or false-negative results². Antibodies, long regarded as the gold standard for disease detection because of their high sensitivity, have recently been linked to weaknesses such as cross-reactivity³. These problems contribute to poor diagnosis of the disease, resulting in drug abuse and drug resistance in microorganisms. Hence, the search for sensitive biomarkers for pneumonia diagnosis is essential.

Several diagnostic biomarkers have been linked to pneumonia, among which are C-reactive protein (CRP), procalcitonin (PCT), soluble triggering receptor expressed on myeloid cells-1 (sTREM-1), CD163, and High Mobility Group Box-1 (HMGB-1). CRP and PCT have proven valuable in the detection of the disease because they are produced in considerably high amounts. Yet there is uncertainty about their sensitivity towards pneumonia, as they can also be produced in response to other inflammatory stimuli in neurons, atherosclerotic plaques, myocytes, and lymphocytes⁴, while the mechanism controlling their production at these sites is not established⁵. Research is ongoing to discover more sensitive biomarkers for pneumonia and so fully resolve the problems of its diagnosis and treatment⁶. This includes assessing other biomarkers for possible use in pneumonia diagnosis.

Additionally, several methods exist to detect pneumonia pathogens, including blood cultures, polymerase chain reaction, matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF), immunofiltration, and turbidimetric immunoassay based on latex agglutination, to mention a few¹. With these methods it is possible to detect locally confined bacteria/viruses causing pneumonia. However, the techniques have shortcomings, for example, the lack of sensitivity of blood cultures⁶, the failure of X-ray to identify the causative microorganism⁷, the lack of precision of the polymerase chain reaction⁸, the failure of MALDI-TOF to detect viral pneumonia⁹, and the high cost and lack of sensitivity of immunofiltration and turbidimetric immunoassay¹⁰. Thus, there is a need to discover new methods or improve the existing ones for sensitive and accurate disease detection that eliminates false-positive/false-negative results.

Antimicrobial peptides (AMPs) are low molecular weight oligopeptides with a broad range of antimicrobial activity against bacteria, viruses, and fungi. These peptides have been conserved throughout evolution, with hydrophobic and hydrophilic side chains that help them traverse both the aqueous environment and lipid-rich biological membranes¹¹. Likewise, recent research has demonstrated that AMPs enhance host immunity through receptor-dependent interactions, which are significant in functions such as angiogenesis, wound healing, and chemotaxis¹². These recently described roles suggest that AMPs are significant and previously undervalued molecules. In the work of Tincho, Gabere (13), for instance, a list of experimentally validated anti-HIV AMPs was used with HMM to develop a few sensitive models for their prediction. That research led to the discovery of a few AMPs that bound the HIV p24 protein, enabling the sensitive diagnosis of both HIV 1 and 2 through the construction of a lateral flow device (LFD)¹⁴.

Many in silico tools exist to identify AMPs, among which is HMMER, which uses a profile strategy for AMP prediction. Each sequence is represented as a set of similarities (probabilities) to a family of sequence models¹⁵. The construction of a profile is typically restricted to the use of positive data (functional sequences) without discrimination. HMM is used to compute statistical analyses of DNA alignments, to identify genomic features such as insertions, deletions, and substitutions, and to identify protein domains for homology modeling of protein families¹⁶. Clusters built by HMM also permit a minimum level of similarity between all peptides. Another important feature of HMMER is its capacity to capture information by preserving the content of a sequence alignment¹⁷. For this reason, high-level molecular structures, for example protein domains, are routinely characterized using HMMER¹⁸. This research aims to discover novel AMPs as biomarkers for detecting viral pneumonia, given the high mortality associated with the illness, with the aid of in silico tools such as HMMER to accelerate the discovery process.

Materials and methods

Data retrieval (literature mining)

The experimentally validated anti-pneumonia AMPs for the viral pathogens (Respiratory syncytial virus, Influenza A and B viruses) were retrieved from antimicrobial peptide databases, for example, the Antimicrobial Peptide Database (APD3)¹⁹,²⁰, the Collection of Antimicrobial Peptides (CAMP)²¹, and the Anti-viral peptides database (AVPDB)²². Curation was carried out through literature mining to confirm whether each retrieved AMP was experimentally validated or predicted. Duplicate experimentally validated AMPs were then removed from the retrieved list using the Cluster Database at High Identity with Tolerance (CD-HIT)²³.

Training and testing datasets (data mining)

The final list of experimentally validated AMPs was sorted by target pathogen: INFA (anti-Influenza A), INFB (anti-Influenza B), and RSV (anti-Respiratory Syncytial Virus)²⁴,²⁵,²⁶. Each pathogen-specific dataset was randomly divided into two subsets: seventy-five percent was used as the training set (to build each profile), and the remaining one-quarter was used as the testing dataset.
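The random 75/25 division described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name, the fixed seed, and the placeholder peptide identifiers are hypothetical and are not taken from the study, which does not state how the randomization was performed.

```python
import random


def split_training_testing(sequences, training_fraction=0.75, seed=0):
    """Randomly split sequences into a training and a testing subset.

    The seed is fixed only so that this illustration is reproducible.
    """
    shuffled = list(sequences)
    random.Random(seed).shuffle(shuffled)
    n_training = round(len(shuffled) * training_fraction)
    return shuffled[:n_training], shuffled[n_training:]


# Hypothetical placeholder identifiers standing in for one pathogen-specific dataset.
peptides = [f"peptide_{i}" for i in range(1, 53)]
training, testing = split_training_testing(peptides)
print(len(training), len(testing))
```

With 52 input sequences the split gives 39 training and 13 testing sequences, that is, three-quarters and one-quarter of the dataset.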

Construction of AMPs profiles (text mining)

HMMER version 2.3.2²⁷ was used to build pathogen-targeted models/profiles from the training datasets. All the HMM profiles were constructed on the Ubuntu 12.04 LTS operating system. The work was carried out in a terminal, and the command lines used to build each profile, together with the steps involved, were as follows:

For the first step, the training datasets of each target class were aligned using the ClustalW alignment tool²⁸. The task was carried out using the command line:

The command line essentially states: “align the sequences, in upper case, found in the input file ‘target class.fasta’ in FASTA format, using ClustalW as the multiple alignment tool and GCG PostScript output for graphical printing”. The command outputs the aligned sequences in a file called “target class.msf”. The aligned sequences were used as input to the next step.

The next step builds the profiles of the target class sequences by capturing the common motifs/signatures within them. To accomplish this, the “Build profiles” step was run using the following command line:

To improve the sensitivity of the profiles, the file created by the profile building step (target class.hmm) was calibrated using the command line:

The resulting profile “target class.hmm” was used to assess profile performance by testing the built profiles on an independent AMP dataset.

Independent profile testing

The independent testing of each constructed profile was carried out in a step called “Query profiles”. The testing datasets were queried against the constructed profiles using the following command line, with an E-value of 0.05:

Performance measurement of each profile

Statistical performance was then assessed using sensitivity, specificity, accuracy, and the Matthews Correlation Coefficient (MCC) as indicators. The measures are defined as follows. Sensitivity is the percentage of anti-pneumonia AMPs against a specific microbe (testing sets) correctly predicted as anti-pneumonia AMPs (positive). It is defined in Eq. (1):

$${\text{Sensitivity}} = \left( {\frac{TP}{{TP + FN}}} \right) \times 100$$

(1)

Specificity is the percentage of non-anti-pneumonia AMPs (negative datasets) correctly predicted as non-anti-pneumonia AMPs (negative). It is defined in Eq. (2):

$${\text{Specificity}} = \left( {\frac{TN}{{TN + FP}}} \right) \times 100$$

(2)

Accuracy is the percentage of correctly predicted peptides (anti-pneumonia AMPs and non-anti-pneumonia AMPs). It is defined in Eq. (3):

$${\text{Accuracy}} = \left( {\frac{TP + TN}{{TP + FP + TN + FN}}} \right) \times 100$$

(3)

The Matthews correlation coefficient (MCC) is a measure that combines both sensitivity and specificity. MCC = 0 indicates random prediction, while MCC = 1 indicates perfect prediction. It is defined in Eq. (4):

$${\text{MCC}} = \left( {\frac{(TP \times TN) - (FN \times FP)}{{\sqrt {(TP + FN) \times (TN + FP) \times (TP + FP) \times (TN + FN)} }}} \right)$$

(4)
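Equations (1) to (4) translate directly into code. The sketch below is a minimal illustration, not the authors' implementation: the function name and the confusion-matrix counts are hypothetical and chosen only to show the arithmetic. Returning MCC = 0 when the denominator is zero is a common convention and an assumption here.

```python
import math


def performance_measures(tp, fp, tn, fn):
    """Return sensitivity, specificity, accuracy (in percent) and MCC.

    Assumes at least one positive and one negative sequence were tested.
    """
    sensitivity = 100.0 * tp / (tp + fn)                     # Eq. (1)
    specificity = 100.0 * tn / (tn + fp)                     # Eq. (2)
    accuracy = 100.0 * (tp + tn) / (tp + fp + tn + fn)       # Eq. (3)
    denominator = math.sqrt((tp + fn) * (tn + fp) * (tp + fp) * (tn + fn))
    # Eq. (4); a zero denominator is mapped to 0 by convention (assumption).
    mcc = ((tp * tn) - (fn * fp)) / denominator if denominator else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


# Hypothetical counts, not values from Table 2.
measures = performance_measures(tp=8, fp=0, tn=90, fn=2)
print(measures)
```

For these illustrative counts the sensitivity is 80%, the specificity 100%, the accuracy 98%, and the MCC approximately 0.88.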

Identification of novel putative anti-Pneumonia AMPs from proteome sequences

The profiles were used to query proteome sequences (in FASTA format) retrieved from the Ensembl database (http://www.ensembl.org/index.html) and the UniProt database (http://www.uniprot.org/). A cut-off E-value of 0.05 was set for the retrieval of putative anti-pneumonia AMPs. This was accomplished using the “hmmsearch” module of the HMMER software, with the command line stated below:

where target class.hmm is one of the three profiles, target class query.txt represents the species scanned against the profile, and result file.txt is the output file obtained from querying the species against a specific microbe profile.

Identification of receptors

Viral receptors, for example cell surface receptors and nucleoproteins, were identified for the viral pathogens involved in pneumonia (Respiratory syncytial virus, Influenza A, and Influenza B) to serve as targets for the identified AMPs, using several in silico strategies. Viral pneumonia proteins were gathered through literature mining from different sources, for example, the National Center for Biotechnology Information (NCBI), UniProt, Google Scholar, and Ensembl. Curation was performed to determine whether each retrieved viral pneumonia protein was complete or partial. Partial proteins were removed, and complete proteins were retained for further analysis. BLAST analysis was performed through the UniProt interface to further confirm specificity, such that the viral pneumonia proteins retrieved were absent from other microorganisms and viruses.

Physicochemical properties of the putative anti-Pneumonia AMPs and the pneumonia proteins

Physicochemical properties of the putative anti-pneumonia AMPs and pneumonia receptor proteins were determined using the calculation interfaces of Bactibase (http://bactibase.pfba-lab-tun.org/physicochem)²⁸,²⁹ and APD3 (https://wangapd3.com/main.php)¹⁸,¹⁹, with the amino acid sequences of the putative peptides as input.

De novo structure predictions of the putative anti-Pneumonia AMPs and Pneumonia proteins (receptors) using I-TASSER

The 3-D structures of the anti-pneumonia AMPs and the viral pneumonia receptor proteins were predicted by submitting each sequence to the I-TASSER (Iterative Threading ASSEmbly Refinement) server²⁹. The 3-D structures of the AMPs and their protein receptors were visualized using PyMOL version 1.3³⁰. This was accomplished by downloading the most recent version of PyMOL on Ubuntu Linux, using the terminal command line.

Docking analysis of the putative anti-pneumonia AMPs and Pneumonia Proteins 3-D structures using PatchDock

The PDB files of the 3-D structures of the putative anti-viral pneumonia AMPs and the viral pneumonia protein receptors from I-TASSER were uploaded to the PatchDock server, using the default RMSD of 4 and selecting “protein-small ligand”³¹. HDock, an efficient molecular docking algorithm with an accurate scoring function for biomolecular interactions, was also used as an alternative docking tool to check the consistency of the PatchDock results³². This tool combines physics-based and bioinformatics-based methods to predict structures and interactions.

Interaction analysis between the anti-viral Pneumonia putative AMPs and their respective viral pneumonia protein receptors was carried out utilizing the PyMol software³⁰.

Results

Retrieval of anti-viral AMPs (VAP-AMPs) and profile creation using HMM

Literature mining of the CAMP, APD3, and AVPDB databases uncovered a total of 176 experimentally validated anti-viral pneumonia antimicrobial peptides (VAP-AMPs) against Respiratory Syncytial Virus, Influenza A, and Influenza B: 112, 52, and 12, respectively. The first step in the profile construction pipeline was the random division of each class into ¾ and ¼ of the experimentally validated AMPs (Table 1). The ¾ portion is the training dataset, intended to train the HMM software and to test whether the functionally significant amino acid consensus is captured. After this, multiple alignments were produced using ClustalW. Three AMP profiles were produced in total, one for each of the following classes: anti-Respiratory syncytial virus (RSVM), anti-Influenza A (INFA), and anti-Influenza B (INFB).

Table 1 Profile creation by HMM.

Independent testing of the created profiles and evaluation of the independent testing results

Each constructed profile was tested against a positive dataset (the testing dataset), which was about 25% of each dataset (Table 1). Since experimentally validated AMPs were used, the assumption is that the profiles should be able to recognize other sequences with the same activity and to separate out those that have no anti-pneumonia activity against the same microorganism. The constructed profiles were also tested against a negative control dataset, comprised of random fragments of 17,236 neuropeptides with no recorded anti-pneumonia activity. This independent testing with the negative dataset (neuropeptides) was carried out to confirm whether the trained profiles would reject non-anti-pneumonia peptides.

The independent testing of the profiles was evaluated using the numbers of true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN). A cut-off E-value of 0.05 was applied in the HMM tool to strengthen each profile’s capacity to separate true-positive anti-pneumonia AMPs from false-negative anti-pneumonia AMPs. TP is the number of correctly predicted positive sequences (anti-pneumonia AMPs), TN is the number of correctly predicted negative sequences (non-anti-pneumonia AMPs), FP is the number of non-anti-pneumonia AMPs wrongly predicted as anti-pneumonia AMPs (AP-AMPs), and FN is the number of anti-pneumonia AMPs wrongly predicted as non-anti-pneumonia AMPs. The number of TP AMPs could be determined from the total number of input sequences, and the FP number could then be derived; the results are shown in Table 2 and reflect the capacity of each profile to distinguish true anti-pneumonia AMPs from false ones. In Table 2, INFB had all its testing sequences as TP, while RSVM had 22 of its 28 testing sequences as TP. INFA, however, had 6 of its 13 testing sequences as TP, which could be due to an overlap of homologous relationships in the AMPs used in its profile.

Table 2 Independent testing of profiles against test and negative datasets.

Performance measurement of the target-specific profiles

After the independent testing, the performance of each profile was calculated using specificity, sensitivity, accuracy, and MCC, the last of which was introduced by the biochemist Brian W. Matthews in 1975³³. The resulting values are detailed in Table 3.

Table 3 Summary of performance measurement of the profiles.

From the results in Table 3, sensitivity was high for the Anti-Influenza B virus (INFB) and Anti-Respiratory Syncytial Virus (RSVM) profiles, indicating correct prediction. The moderate sensitivity of INFA could be attributed to the large overlap in the conserved regions of the AMPs used to build its profile¹⁷. The specificity of all profiles was 100%, indicating correct prediction. The accuracy results of the profiles also showed correct prediction, with errors eliminated by excluding misclassified AMPs from both positive and negative datasets. MCC values for all the profiles were high, with the lowest value recorded for the Anti-Influenza A virus (INFA) profile (0.50). An MCC value of 0.5 to 1 corresponds to ideal prediction, while ‘0’ indicates random prediction. Hence all profiles showed correct prediction (INFB > RSVM > INFA). The MCC is considered to give the best estimate of model performance since it combines sensitivity, specificity, and accuracy³³.

Proteome sequence databases query and discovery of putative anti-pneumonia AMPs

The discovery stage (Table 4) searched for novel anti-viral pneumonia AMPs for the pneumonia pathogens (Influenza A and B as well as Respiratory Syncytial Virus), in order to identify peptides with signatures/motifs and properties similar to those of the input sequences used to build the profiles RSVM, INFA, and INFB. The matches of the respective profiles to the proteome sequences were reported with their E-values (Table 4), using a cut-off of 0.05 to select putative AMPs. The final list of anti-viral AMPs was ordered by E-value, with those having the smallest E-values regarded as the most probable putative anti-viral pneumonia AMPs.
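The selection and ranking step can be illustrated with a short Python sketch. It assumes the search hits have already been parsed into (identifier, E-value) pairs; the function name, the hit identifiers, the E-values, and the inclusive treatment of the 0.05 cut-off are all hypothetical choices made for illustration, not details taken from the study.

```python
def rank_putative_hits(hits, e_value_cutoff=0.05):
    """Keep hits at or below the E-value cut-off, smallest E-value first."""
    kept = [hit for hit in hits if hit[1] <= e_value_cutoff]
    return sorted(kept, key=lambda hit: hit[1])


# Hypothetical (identifier, E-value) pairs, not results from Table 4.
example_hits = [
    ("hit_A", 0.2),
    ("hit_B", 1e-5),
    ("hit_C", 0.01),
]
ranked = rank_putative_hits(example_hits)
print(ranked)
```

Here hit_A is discarded because its E-value exceeds the cut-off, and hit_B is ranked ahead of hit_C because a smaller E-value indicates a more probable match.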

Table 4 Final list of the Anti-bacteria and Anti-viral pneumonia AMPs with their sources.

Physicochemical properties of the AMPs

The physicochemical parameters of the putative AMPs were determined using APD3 and BACTIBASE to confirm that the AMP sequences conform to other known AMPs. Physicochemical parameters, for example molecular weight, amino acid composition, hydrophobicity, Boman index, net charge, isoelectric point, and half-life, were used to assess the anti-viral AMPs (Table 5). The amino acid composition of the AMPs contributes to the molecular weight, since the AMPs are made up of amino acids, and can be a distinguishing feature between two classes of proteins/peptides³⁴. Apart from this, the anti-viral pneumonia AMPs also have common amino acids that could distinguish them from each other. BOPAM-INFA1, 6, and 8 had proline; BOPAM-INFA2 had proline, valine, isoleucine, and leucine; BOPAM-INFA3 had threonine; BOPAM-INFA4-5 had proline and glutamine; while BOPAM-INFA7 had proline, isoleucine, serine, and valine. BOPAM-INFB1 had valine and proline; BOPAM-INFB2-3 had asparagine and leucine; BOPAM-INFB4-5 had leucine; while BOPAM-INFB6 had asparagine and leucine. BOPAM-RSV1 had isoleucine and lysine; BOPAM-RSV2 had serine; BOPAM-RSV3 and 12 had isoleucine and asparagine; BOPAM-RSV4, 5, 7, 8, 9, 10, 11 had asparagine; BOPAM-RSV6 had isoleucine; while BOPAM-RSV13 had aspartate.

Anti-pneumonia AMPs such as BOPAM-INFA1, 3, 4, 5, 6, and 8 and BOPAM-RSV9 had hydrophobicity below 30% due to the presence of more polar amino acid residues. Anti-viral peptides such as BOPAM-INFB3 and 6, BOPAM-INFA1, 2, 3, 4, 6, 7, and 8, and BOPAM-RSV2, 3, 4, 6, 7, 8, 10, 11, 12, and 13 were predominantly neutral or negatively charged. Cationic AMPs are said to be positively associated with increased antimicrobial activity³⁵. Nonetheless, the absence of a net positive charge in the anti-viral AMPs does not imply an absence of antimicrobial activity, since some negatively charged AMPs have been discovered, for example, the surfactant-associated anionic peptides in the APD3 database (AP00528), which have a net charge of − 5 and antibacterial activity, and maximin H5, which has a charge ranging between − 1 and − 7 and inhibits the growth of Listeria monocytogenes³³.

The pI of the anti-viral pneumonia AMPs ranged from 3.85 to 12.50, indicating the solubility properties of the AMPs in acidic and alkaline media regardless of the difference in charges³⁶. The isoelectric point (pI) of a peptide is a function of the individual amino acids in its backbone. At a pH below the pI, AMPs carry a net positive charge, and vice versa.

The Boman index was negative for BOPAM-INFA2 and 7 and for BOPAM-INFB4 and 5. A negative Boman index is said to be positively correlated with a more hydrophobic peptide, showing a high protein binding potential, while a more hydrophilic peptide will, in general, have a more positive index³⁷. Nevertheless, peptides with a positive Boman index have been reported to be able to detect HIV in a lateral flow device¹³.

Among the anti-viral pneumonia AMPs, BOPAM-INFA1, 3, 4, 5, 6, and 8 had a half-life of 7.2 h, and BOPAM-INFA2 and 7 had a half-life of 1.2 h. BOPAM-INFB1 and 2 had a half-life of 30 h, and BOPAM-INFB3-6 had a half-life of 5.5 h. BOPAM-RSV1 had a half-life of 20 h, and BOPAM-RSV10 had a half-life of 100 h; all other BOPAM-RSV peptides had half-lives between 1 and 4.5 h. AMPs are generally said to have a short half-life because they are not stable. Half-life values as low as 1 h have been reported for AMP molecules used for HIV diagnosis¹³.

Table 5 Physicochemical parameter of the antibacterial and antiviral pneumonia putative AMPs.

Retrieval of protein receptors of pneumonia pathogens

This stage was carried out to assess the diagnostic potential of some immunogenic proteins of viral pneumonia that could serve as targets for the putative antimicrobial peptides in detecting these microbes. A few pneumonia proteins, for example cell surface receptors and nucleoproteins, were analyzed for Influenza A virus, Influenza B virus, and Respiratory Syncytial virus. The retrieved protein receptors were projected to be potentially applicable in the diagnosis of viral pneumonia associated with these viruses. The Respiratory syncytial virus has some immunogenic receptors with potential diagnostic relevance, such as membrane fusion core protein chains³⁸. The virus has the Human RSV fusion protein core chain A, with molecular weight 4869.38 Da, isoelectric point 4.38, hydrophobicity 39.53%, charge − 4, instability index 49.41, and half-life of 1 h in mammals. Influenza A virus has some protein receptors of potential importance in its detection. It has the 416a monomeric nucleoprotein, with molecular weight 56,297.78 Da, isoelectric point 9.45, hydrophobicity 29.52%, charge + 12, instability index 36.35, and half-life of 30 h in mammals. Influenza B virus receptor proteins of diagnostic potential were also identified and investigated. Influenza B virus has a nucleoprotein with molecular weight 61,644.09 Da, isoelectric point 9.43, hydrophobicity 31.61%, charge + 18, instability index 39.98, and half-life of 30 h in mammals (Table 6). Instability index, molecular weight, and half-life reflect how stable a protein is, and any protein with an instability index lower than or equal to 40 is said to be stable; hydrophobicity enhances protein binding to ligands; and the net charge determines the behavior of the protein in acidic or alkaline solution, with all proteins having a net charge of zero at the isoelectric point³⁹.

Table 6 Physicochemical properties of the retrieved pneumonia receptor proteins.

Structure prediction of the putative anti-pneumonia AMPs and Pneumonia protein receptors

Representative figures from the I-TASSER server after predicting the 3-D structures of the anti-pneumonia AMPs (ligands) and the protein receptors are shown in Fig. 1. The results demonstrate that all AMPs predicted showed different secondary structures, including α-helices, parallel β-sheet, anti-parallel β-sheet, extended, and loop conformational structures.

For structure prediction assessment using I-TASSER (Table 7), several parameters, namely the confidence score (C-score), template modeling score (TM-score), and root mean square deviation (RMSD), were used to assess the predicted 3-D structures of the putative AMPs and pneumonia protein receptors. The C-scores of all the predicted 3-D structures for the anti-viral pneumonia AMPs and the pneumonia receptor proteins were between − 5 and 2 (see Table 7), which suggests that I-TASSER had an existing template for their structure prediction⁴⁰. The C-score of BOPAM-RSV11 was lower than that of the other AMPs, which could indicate that this molecule had no available template for prediction by I-TASSER, although the prediction was not random²⁹. The TM-score has recently been proposed for measuring the structural similarity between two structures⁴¹. A TM-score > 0.5 indicates a model of correct topology, and a TM-score < 0.17 implies random similarity. The TM-scores of the predicted structures of the AMPs and protein receptors were higher than the cut-off value of 0.5. This signifies that these structures had a correct topology, with structural similarity to the templates used to predict them²⁹,⁴¹. Although there is no defined RMSD threshold for 3-D structure prediction, an RMSD of 2–4 Å is considered good, and an RMSD ≤ 1 Å is considered ideal. Thus, all anti-viral pneumonia AMPs and receptor proteins with an RMSD within the accepted range (Table 7) had small distances and atomic deviations between the peptides and the templates used for their 3-D structure prediction⁴²,⁴³.

Table 7 Quality assessment scores of the predicted 3-D structures of the pneumonia receptors and the putative anti-pneumonia AMPs.

RMSD is sensitive to local errors since it is an average distance over all residue pairs in two structures, hence the rationale for proposing the TM-score. For example, a local misorientation in the structure will increase the RMSD value even though the global topology of the structure is right. The TM-score is not sensitive to such local misorientation of residues, which makes it insensitive to local modelling errors and, therefore, a more reliable measure.
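The sensitivity of RMSD to a local error can be seen in a small numerical sketch. The function below computes the root of the mean squared distance between paired atoms and assumes the two structures are already superimposed; it performs no optimal alignment, and the coordinates are hypothetical values chosen for illustration rather than data from Table 7.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between paired 3-D coordinates (in Å).

    Assumes both structures are already superimposed and list the same
    atoms in the same order.
    """
    if not coords_a or len(coords_a) != len(coords_b):
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared_sum = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared_sum / len(coords_a))


# Hypothetical three-atom structures: two atoms coincide, one is displaced by 3 Å.
reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
model = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 3.0, 0.0)]
print(rmsd(reference, model))
```

Although two of the three atoms match exactly, the single displaced atom raises the RMSD to about 1.73 Å, which illustrates how one local deviation can dominate the value.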

Docking interaction analysis of the putative anti-pneumonia AMPs with viral pneumonia receptors

The output figures from the PatchDock and HDock servers after predicting the docking interaction between the anti-pneumonia AMPs (ligands) and the protein receptors were analyzed (Fig. 2). The spatial docking interaction analysis indicated that all the AMPs bound firmly to their proteins. Computational analysis was also done to identify the AMPs with the highest binding potential, the amino acid residues that took part in complex formation, and the terminal of the proteins towards which binding occurred. Among the anti-Influenza A AMPs, only BOPAM-INFA1 bound in a different orientation to the nucleoprotein receptor. Similarly, BOPAM-INFB4 bound differently to the influenza B nucleoprotein when compared to the other anti-Influenza B AMPs. All anti-Respiratory syncytial virus AMPs bound in the same orientation on the chain A fusion protein except BOPAM-RSV2, 6, and 9.

The BOPAM-RSVs bound more firmly to the chain A protein, with the highest binding geometry score observed for BOPAM-RSV4. Similarly, the BOPAM-INFAs bound more firmly to the nucleoprotein, with the highest binding geometry score observed for BOPAM-INFA4. Also, in Table 8, BOPAM-INFA5, BOPAM-INFB4, and BOPAM-RSV4 have the highest area scores of 1601.80, 1740.90, and 1244.20, respectively, which denote the approximate interface area of their complexes with their respective receptors. BOPAM-INFA7, BOPAM-INFB6, and BOPAM-RSV8 have the lowest atomic contact energy (ACE) scores of − 474.03, − 259.94, and − 368.59, respectively; ACE is the desolvation free energy needed for the ligand to shift atoms from water to the interior of the protein receptors⁴⁴.

Table 8 Quality assessment scores of the docking analysis for the anti-pneumonia putative AMPs and the pneumonia receptors.

The putative anti-influenza A AMPs displayed high docking energy scores in HDock, with BOPAM-INFA8 showing the highest energy, − 199 kJ/mol. Similarly, all anti-influenza B AMPs displayed high binding energy to their receptors, with BOPAM-INFB2 having the highest docking energy score. The anti-respiratory syncytial virus AMPs also showed high docking energy scores, with BOPAM-RSV4 and 3 having the highest docking energy scores to the receptor protein. The root-mean-square values generated by the HDock server are given in Table 9, alongside the hotspot interacting residues of the anti-viral pneumonia AMPs and their respective receptor proteins. The results from the HDock server are consistent with those from the PatchDock server.

Table 9 Quality assessment scores of the docking analysis from HDock for the anti-pneumonia putative AMPs and the pneumonia receptors with the hotspot interacting residues.

Discussion

Experimentally validated AMPs were used for model construction in this research because their activity against the target pneumonia viruses has been established, with the minimum inhibitory concentration (MIC) as an indicator, using agar dilution or broth micro-dilution methods, as indicated in the databases⁴⁵. The list of anti-pneumonia AMPs was retained in separate pathogenic target groups, as recovered from the different databases, to allow the creation of species/microbe-specific profiles. The profile creation step used the training dataset to train the HMMER software, and the discriminatory capacity and quality of the AMP profiles were then assessed with both positive (test) and negative (neuropeptides) datasets. This technique of utilizing random sequences as positive and negative datasets is a regularly used method. It depends on the presumption that the probability of discovering random sequences with a discriminative propensity is exceptionally low²⁹. Assessment of the profiles’ performance showed high specificity, accuracy, and sensitivity, with excellent MCC⁴³. The relatively low sensitivity of INFA suggests there was an overlap in the conserved domains of its AMPs [45]. This outcome is in line with the work of Bhadra, Yan (48), where performance was compared using sensitivity, specificity, accuracy, and MCC, employing benchmark datasets as inputs. When the profiles were used to scan for novel anti-viral AMPs, profile INFA yielded eight anti-Influenza A AMPs, profile INFB yielded six anti-Influenza B AMPs, and profile RSVM yielded 13 anti-Respiratory syncytial virus AMPs (Table 4). HMMER reported credible E-values for the AMPs, capturing the sequences’ diversity, since the input AMPs were derived from various life forms⁴⁶. There was therefore an exceptionally low probability that these peptides were wrongly predicted to be anti-pneumonia AMPs.
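The four performance measures named in this paragraph (sensitivity, specificity, accuracy, and MCC) can all be derived from the counts of true and false positives and negatives obtained when a profile is run against the positive and negative datasets. The following is a minimal sketch of those standard formulas; the function name `profile_metrics` and the example counts are illustrative assumptions and are not taken from the paper's results.

```python
import math


def profile_metrics(tp, fp, tn, fn):
    """Compute sensitivity, specificity, accuracy and MCC from confusion-matrix counts."""
    total = tp + fp + tn + fn
    if total == 0:
        raise ValueError("at least one prediction is required")

    # Fraction of true AMPs that the profile recovers.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Fraction of negative sequences (e.g. neuropeptides) that are rejected.
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    # Fraction of all sequences classified correctly.
    accuracy = (tp + tn) / total

    # Matthews correlation coefficient; defined as 0 when any marginal is empty.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


# Hypothetical counts for illustration only (not values from the paper).
example = profile_metrics(tp=8, fp=0, tn=10, fn=2)
print(example)
```

Because MCC uses all four counts, it remains informative when the positive and negative datasets differ in size, which is one reason it is often reported alongside accuracy.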

In addition, some protein chains, for example, fusion protein core A, which are integral RNA proteins of Respiratory syncytial virus, mediate passage into the transmembrane glycoproteins of the host cell to elicit apoptosis³⁸. They also play a pivotal role in virus assembly and interact with the RNA complex and the viral membrane. Detection of these proteins in body fluid has indicated only slight, non-progressive antigenic variance, a significant factor for their use in detecting the virus⁴⁷. Influenza A and B nucleoproteins play significant structural and functional roles that could be exploited for diagnostics. They are bi-functional membrane/RNA-binding proteins that participate in the encapsulation of the RNA-nucleoprotein core of the membrane envelope [56]. These nucleoproteins have been used in the diagnosis of pneumonia [56]. The use of candidate receptor proteins, for example, Respiratory syncytial virus fusion protein chain A⁴⁸, Influenza A virus nucleoprotein⁴⁹,⁵⁰, and Influenza B virus nucleoprotein⁵¹, in the diagnosis of pneumonia is justified because they are synthesized in generally high concentrations in body fluid across all strains and subtypes of these microorganisms, do not change with time, are abundantly available as cell surface receptors, and are moderately stable under gentle in vitro handling.

Moreover, the presence of charged, polar, and non-polar amino acids in the putative anti-viral AMPs and the viral receptors confers charge, improved hydrophobicity, and increased binding potential on them. A hydrophobicity below 30% is not an ideal physicochemical parameter for an AMP because decreased hydrophobicity results in poor peptide helicity, diminished self-associating capacity in aqueous conditions, and poor antimicrobial activity⁵². The decreased hydrophobicity observed for BOPAM-INFA1, 3, 4, 6, and 8 is a consequence of their polar amino acids, which give them antimicrobial activity. Recently, AMPs from sugar‐functionalized phosphonium polymers have been reported to require the hydrophilic part of their molecular structure to exert antibacterial activity against Gram‐negative Escherichia coli and Gram‐positive Staphylococcus aureus⁵³. All the AMPs had significant physicochemical parameters, in terms of charge and the Boman index, that made them bona fide AMPs. The use of physicochemical parameters as indices to assess AMPs agrees with the work by Hollmann, Martinez (53), where a re-assessment of the physicochemical properties of antimicrobial peptides produced a characteristic thermal change profile in model vesicles, which was used to rank novel molecules with unknown biological action.
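The hydrophobicity and charge criteria discussed above can be illustrated with a simple residue count. The sketch below is not the calculation performed by APD3 or BACTIBASE; it assumes one common convention in which A, I, L, M, F, V, W, and C count as hydrophobic, K and R carry +1, and D and E carry −1. The function names and the example sequence are hypothetical and were made up for illustration.

```python
# Assumed residue classes (a common convention, not confirmed by the paper).
HYDROPHOBIC = set("AILMFVWC")
POSITIVE = set("KR")
NEGATIVE = set("DE")


def hydrophobic_ratio(sequence):
    """Return the percentage of residues that fall in the assumed hydrophobic set."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return 100.0 * sum(res in HYDROPHOBIC for res in seq) / len(seq)


def net_charge(sequence):
    """Return the integer net charge from counts of K/R (+1) and D/E (-1)."""
    seq = sequence.upper()
    return sum(res in POSITIVE for res in seq) - sum(res in NEGATIVE for res in seq)


# Hypothetical peptide used only to exercise the functions.
peptide = "GIKKFLKSAKEW"
ratio = hydrophobic_ratio(peptide)
print(f"hydrophobic ratio: {ratio:.1f}%")
print(f"net charge: {net_charge(peptide):+d}")
print("below 30% threshold:", ratio < 30.0)
```

A peptide flagged as below the 30% threshold would correspond to the low-hydrophobicity cases discussed in the text, although dedicated tools may use a different residue set and should be preferred for reported values.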

The structure prediction results generated for the AMPs and the receptors are in accordance with the different structural conformations displayed by known AMPs and proteins. Examples of known AMPs and their structures are tachyplesin from horseshoe crabs and bovine lactoferricin, which have beta-sheet conformations, and magainin simple and melittin, which have alpha-helical conformations. The C-score from I-TASSER is a measure of confidence in the modeling templates used for the prediction and is used to anticipate the quality of the structure, that is, the distance between the predicted model and the native structure⁴¹. Both TM-score and RMSD are known standards for estimating the structural closeness between two structures and are used to assess the accuracy of a structural model when the native structure is known³⁰. The peptides’ structures were predicted, and the outcomes demonstrated that these peptides conformed to known AMPs. Nevertheless, the AMPs are regarded as putative anti-pneumonia peptides because wet laboratory experiments have not yet been performed on these molecules. This outcome relates to the work of Tincho, Gabere (12), where binding geometry scoring was used as the criterion for selecting candidate AMPs for HIV diagnostics. Those observations were additionally confirmed using an in-house lateral flow device in which the putative AMPs were used to detect HIV in patient samples¹³.
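As a reminder of what the RMSD mentioned above measures, the following minimal sketch computes the root-mean-square deviation between two equally sized sets of 3-D coordinates. It assumes the two structures have already been superimposed and that atoms are listed in matching order; it does not perform the optimal superposition that structure-comparison tools normally apply first, and it is not the I-TASSER implementation. The function name and coordinates are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two pre-aligned lists of (x, y, z) tuples."""
    if len(coords_a) != len(coords_b):
        raise ValueError("coordinate lists must have the same length")
    if not coords_a:
        raise ValueError("coordinate lists must not be empty")

    # Sum of squared distances between corresponding atoms.
    squared = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2

    return math.sqrt(squared / len(coords_a))


# Hypothetical two-atom example: the second atom is displaced by 2 units along z.
model = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 2.0)]
print(round(rmsd(model, reference), 3))
```

A lower RMSD indicates closer agreement between model and reference, but because it averages over all atoms, a few badly placed residues can dominate the value, which is why a length-normalized measure such as TM-score is usually reported with it.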

The anti-viral AMPs also displayed high binding energy scores with the viral pneumonia receptors using the PatchDock and HDock servers. Both servers use scoring functions to evaluate ligand conformations on protein receptors. The HDock server utilizes the classical force-field-based scoring function to estimate and assess the non-bonded interactions (electrostatic and van der Waals). The docking interaction analysis revealed that all AMPs bound their respective viral receptors with a high binding capacity, with BOPAM-INFA8, BOPAM-INFB2, and BOPAM-RSV4 having the highest binding potential and area with the lowest atomic contact energy (Tables 8 and 9). Comparing these results with the physicochemical results in Table 5, BOPAM-INFA1, 2, 7, BOPAM-INFB3, 6, BOPAM-RSV3, 7, 12, and 13, which had zero net charge, gave the lowest binding affinities (Boman index values) with the pneumonia receptor proteins. The results from this research showed BOPAM-INFA8, BOPAM-INFB4, and BOPAM-RSV4 to be the best candidate agents for the detection of the respective viral pneumonia pathogens. This binding affinity and other parameters, for example, area and atomic contact energy², are significant in selecting novel anti-viral AMPs for potential use in pneumonia diagnosis through the development of a lateral flow device (LFD).

Designing and modeling novel AMPs for diagnostics is an active area of research aimed at reducing the abuse of conventional antibiotic agents and mitigating the non-specificity of current diagnostic and prognostic biomarkers. One limitation of HMMER is that correlations among the amino acid residues of AMPs are hard for the software to capture because of the linear nature of the HMM profile. Examples of such correlations are the actual distances between residues in folded and unfolded proteins and the relationship between electrical and chemical connectivity. Another constraint is the low sensitivity of HMMER when small datasets are used, which arises from the limited number of AMPs available in the databases for specific targets. Also, AMPs are not advisable for use where proteolytic degradation is possible, because of the L-amino acids present in them⁵⁴. All these limitations were taken into consideration during the design of this work to ensure that the sensitive detection of viral pneumonia utilizing anti-viral AMPs was not compromised. The putative AMPs from this analysis, predicted with HMMER, could greatly benefit the diagnosis of viral pneumonia. One of this work’s strengths is that it offers insight into the modular architecture of AMPs using in silico technologies for potential pneumonia diagnosis. This effort offers promising perspectives for patients living with these conditions to develop accommodating lifestyles through sensitive detection of the viral pneumonia pathogens and would guide medical practitioners towards correct treatment plans.

Conclusion

This research work identified novel AMPs for the potential detection of viral pneumonia utilizing the HMMER in silico technology, which generated 27 anti-viral peptides. The putative anti-pneumonia AMPs conformed to other known AMPs in their physicochemical properties as estimated by APD3 and BACTIBASE. The fundamental goal of this diagnostic framework is to facilitate the search for specific biomarkers for the early detection of viral pneumonia. The AMPs have thus shown great potential for overcoming the drawbacks of the current diagnostic frameworks. This research could be followed by molecular validation through binding tests of these AMPs with the individual viral proteins, utilizing an “on/off” binding test in an LFD setting to build a prototype with these AMPs.

Future work

Future work will incorporate site-directed mutagenesis of the putative AMPs to upgrade them into more potent candidate diagnostic molecules. This analysis would be followed by an in vitro investigation of the anti-pneumonia activity of the modified peptides. Furthermore, the EC50 and the selective index will be evaluated for the optimized AMPs. The anti-pneumonia potential of these AMPs will be tested on various pseudotypes of the pneumonia microbes to determine their diagnostic potential. Finally, the complexes formed between the microbe receptors and the putative AMPs will be resolved using structural biology to validate the observations made by the in silico binding analysis.

References

Marik, P. E. & Kaplan, D. Aspiration pneumonia and dysphagia in the elderly. Chest 124(1), 328–336 (2003).
Ngari, C. G., Malonza, D. M. & Muthuri, G. G. A model for childhood pneumonia dynamics. J. Life Sci. 1, 31–40 (2014).
Wrammert, J. et al. Broadly cross-reactive antibodies dominate the human B cell response against 2009 pandemic H1N1 influenza virus infection. J. Exp. Med. 208(1), 181–193 (2011).
Murdoch, D. R. et al. Breathing new life into pneumonia diagnostics. J. Clin. Microbiol. 47(11), 3405–3408 (2009).
Naughton, M., Mulrooney, J. B. & Leonard, B. E. A review of the role of serotonin receptors in psychiatric disorders. Hum. Psychopharmacol. Clin. Exp. 15(6), 397–415 (2000).
O’Brien, K. L. et al. Burden of disease caused by Streptococcus pneumoniae in children younger than 5 years: Global estimates. The Lancet. 374(9693), 893–902 (2009).
Rajpurkar, P. et al. Chexnet: Radiologist-Level Pneumonia Detection on Chest X-rays with deep learning. arXiv preprint. 2017.
Ahn, I. E. et al. Atypical Pneumocystis jirovecii pneumonia in previously untreated patients with CLL on single-agent ibrutinib. Blood 128(15), 1940–1943 (2016).
Kriegsmann, M. et al. Reliable entity subtyping in non-small cell lung cancer by matrix-assisted laser desorption/ionization imaging mass spectrometry on formalin-fixed paraffin-embedded tissue specimens. Mol. Cell. Proteom. 15(10), 3081–3089 (2016).
Long, L., Zhao, H.-T., Zhang, Z.-Y., Wang, G.-Y. & Zhao, H.-L. Lung ultrasound for the diagnosis of pneumonia in adults: A meta-analysis. Medicine 96(3), e5713 (2017).
Demaria, M. et al. Cellular senescence promotes adverse effects of chemotherapy and cancer relapse. Cancer Discov. 7(2), 165–176 (2017).
Beisswenger, C. & Bals, R. Functions of antimicrobial peptides in host defense and immunity. Curr. Protein Pept. Sci. 6(3), 255–264 (2005).
Tincho, M., Gabere, M. & Pretorius, A. In silico identification and molecular validation of putative antimicrobial peptides for HIV therapy. J. AIDS and Clin. Res. 7, 9 (2016).
Williams, M. et al. Molecular validation of putative antimicrobial peptides for improved Human Immunodeficiency Virus diagnostics via HIV protein p24. J AIDS Clin Res. 7, 571 (2016).
Porto, W., Pires, A. & Franco, O. Computational tools for exploring sequence databases as a resource for antimicrobial peptides. Biotechnol. Adv. 35(3), 337–349 (2017).
Madera, M. Profile Comparer: A program for scoring and aligning profile hidden Markov models. Bioinformatics 24(22), 2630–2631 (2008).
Liu, S., Fan, L., Sun, J., Lao, X. & Zheng, H. Computational resources and tools for antimicrobial peptides. J. Pept. Sci. 23(1), 4–12 (2017).
Waghu, F. H., Barai, R. S., Gurung, P. & Idicula-Thomas, S. CAMPR3: A database on sequences, structures and signatures of antimicrobial peptides. Nucleic Acids Res. 44(D1), D1094–D1097 (2015).
Wang, Z. & Wang, G. APD: The antimicrobial peptide database. Nucleic Acids Res. 32(suppl_1), D590–D592 (2004).
Wang, G., Li, X. & Wang, Z. APD2: The updated antimicrobial peptide database and its application in peptide design. Nucleic acids Res. 37(suppl_1), D933–D937 (2008).
Thomas, S., Karnik, S., Barai, R. S., Jayaraman, V. K. & Idicula-Thomas, S. CAMP: A useful resource for research on antimicrobial peptides. Nucleic Acids Res. 38(suppl_1), D774–D780 (2009).
Sencanski, M. et al. Natural products as promising therapeutics for treatment of influenza disease. Curr. Pharm. Des. 21(38), 5573–5588 (2015).
Li, W. & Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22(13), 1658–1659 (2006).
Mulvenna, J. P., Wang, C. & Craik, D. J. CyBase: A database of cyclic protein sequence and structure. Nucleic Acids Res. 34(suppl_1), D192–D194 (2006).
Gennaro, R. & Zanetti, M. Structural features and biological activities of the cathelicidin-derived antimicrobial peptides. Pept. Sci. 55(1), 31–49 (2000).
Vizioli, J. & Salzet, M. Antimicrobial peptides from animals: Focus on invertebrates. Trends Pharmacol. Sci. 23(11), 494–496 (2002).
Eddy, S. R. Profile hidden Markov models. Bioinformatics (Oxford, England). 14(9), 755–763 (1998).
Sievers, F. & Higgins, D. G. Clustal Omega, Accurate Alignment of Very Large Numbers of Sequences 105–116 (Springer, 2014).
Roy, A., Kucukural, A. & Zhang, Y. I-TASSER: A unified platform for automated protein structure and function prediction. Nat. Protoc. 5(4), 725–738 (2010).
DeLano, W. L. & Bromberg, S. PyMOL User’s Guide 629 (DeLano Scientific LLC, 2004).
Schneidman-Duhovny, D., Inbar, Y., Nussinov, R. & Wolfson, H. J. PatchDock and SymmDock: Servers for rigid and symmetric docking. Nucleic Acids Res. 33(Suppl_2), W363–W367 (2005).
Yan, Y., Tao, H., He, J. & Huang, S.-Y. The HDOCK server for integrated protein–protein docking. Nat. Protoc. 15(5), 1829–1852 (2020).
Liu, Y., Cheng, J., Yan, C., Wu, X. & Chen, F. Research on the Matthews correlation coefficients metrics of personalized recommendation algorithm evaluation. Int. J. Hybrid Inf. Technol. 8(1), 163–172 (2015).
Kato, H., Rhue, M. R. & Nishimura, T. Role of Free Amino Acids and Peptides in Food Taste (American Chemical Society, 1989).
Dathe, M., Nikolenko, H., Meyer, J., Beyermann, M. & Bienert, M. Optimization of the antimicrobial activity of magainin peptides by modification of charge. FEBS Lett. 501(2–3), 146–150 (2001).
Bakare, O. O. Identification and Molecular Validation of Biomarkers for the Accurate and Sensitive Diagnosis of Bacterial and Viral Pneumonia. (2019).
Gómez, E. A., Giraldo, P. & Orduz, S. InverPep: A database of invertebrate antimicrobial peptides. J. Glob. Antimicrob. Resist. 8, 13–17 (2017).
Prendergast, C. & Papenburg, J. Rapid antigen-based testing for respiratory syncytial virus: Moving diagnostics from bench to bedside?. Future Microbiol. 8(4), 435–444 (2013).
Garg, V. K. et al. MFPPI–multi FASTA ProtParam interface. Bioinformation 12(2), 74 (2016).
Yang, J. et al. The I-TASSER Suite: Protein structure and function prediction. Nat. Methods 12(1), 7–8 (2015).
Zhang, Y. I-TASSER server for protein 3D structure prediction. BMC Bioinform. 9(1), 40 (2008).
Park, C. H., Valore, E. V., Waring, A. J. & Ganz, T. Hepcidin, a urinary antimicrobial peptide synthesized in the liver. J. Biol. Chem. 276(11), 7806–7810 (2001).
Wei, D. S. et al. Mach-Zehnder interferometry using spin-and valley-polarized quantum Hall edge states in graphene. Sci. Adv. 3(8), e1700600 (2017).
Zhang, C., Vasmatzis, G., Cornette, J. L. & DeLisi, C. Determination of atomic desolvation energies from the structures of crystallized proteins. J. Mol. Biol. 267(3), 707–726 (1997).
Kim, I.-W. et al. Characterization and cDNA cloning of a defensin-like peptide, harmoniasin, from Harmonia axyridis. J Microbiol Biotechnol. 22(11), 1588–1590 (2012).
Madera, M. & Gough, J. A comparison of profile hidden Markov model procedures for remote homology detection. Nucleic Acids Res. 30(19), 4321–4328 (2002).
Falsey, A. R., Formica, M. A. & Walsh, E. E. Diagnosis of respiratory syncytial virus infection: Comparison of reverse transcription-PCR to viral culture and serology in adults with respiratory illness. J. Clin. Microbiol. 40(3), 817–820 (2002).
Jha, D. A., Jarvis, H., Fraser, C. & Openshaw, P. J. Respiratory Syncytial Virus (European Respiratory Society, 2016).
Suarez, D. L. Influenza A Virus. Animal Influenza 1–30 (Wiley, 2016).
Vemula, S. et al. Current approaches for diagnosis of influenza virus infections in humans. Viruses 8(4), 96 (2016).
Hoffmann, J. et al. Viral and bacterial co-infection in severe pneumonia triggers innate immune responses and specifically enhances IP-10: A translational study. Sci. Rep. 6, 38532 (2016).
Chen, Y. et al. Role of peptide hydrophobicity in the mechanism of action of α-helical antimicrobial peptides. Antimicrob. Agents Chemother. 51(4), 1398–1406 (2007).
Cuthbert, T. J. et al. Surprising antibacterial activity and selectivity of hydrophilic polyphosphoniums featuring sugar and hydroxy substituents. Angew. Chem. 130(39), 12889–12892 (2018).
Bakare, O. O., Gokul, A. & Keyster, M. PR-1-like protein as a potential target for the identification of Fusarium oxysporum: An in silico approach. Biotech 10(2), 8 (2021).

Funding

Funding was available for the research work (National Research Foundation, South Africa (120712)).

Author information

Authors and Affiliations

Bioinformatics Research Group, University of the Western Cape, Cape Town, 7535, South Africa
Olalekan Olanrewaju Bakare & Ashley Pretorius
Environmental Biotechnology Laboratory, Biotechnology Department, University of the Western Cape, Cape Town, 7535, South Africa
Olalekan Olanrewaju Bakare & Marshall Keyster

Contributions

The authors conceived, designed, analyzed, and wrote the paper for the experiments.

Corresponding author

Correspondence to Olalekan Olanrewaju Bakare.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Bakare, O.O., Keyster, M. & Pretorius, A. Building HMM and molecular docking analysis for the sensitive detection of anti-viral pneumonia antimicrobial peptides (AMPs). Sci Rep 11, 20621 (2021). https://doi.org/10.1038/s41598-021-00223-8

Received: 31 May 2021
Accepted: 06 October 2021
Published: 18 October 2021
DOI: https://doi.org/10.1038/s41598-021-00223-8
