Introduction

Klebsiella pneumoniae is a Gram-negative, encapsulated bacterium often found in hospital settings, especially in intensive care, pediatric and surgical wards (Piperaki et al. 2017). This pathogen is primarily an opportunistic pathogen that colonizes human mucous membranes, including the gastrointestinal tract and oropharynx. From these sites, it can invade other tissues and cause pneumonia, sepsis, liver abscesses, meningitis, bacteremia, urinary tract infections, and many other diseases (Piperaki et al. 2017).

In recent years, due to the long-term use of antibacterial drugs, the incidence of multidrug-resistant K. pneumoniae, especially carbapenem-resistant K. pneumoniae, has been increasing (Bassetti et al. 2018; Dai and Hu 2022; Uzairue et al. 2022). The development process of new antibiotics is very slow and cannot reach the development of drug-resistant K. pneumoniae. Therefore, antibacterial programs that bypass the mechanisms of bacterial drug resistance are of particular importance. Vaccines are effective measures to protect against drug-resistant bacterial infections. The vaccines can significantly reduce antibiotic use, slowing down bacterial drug resistance growth (Piperaki et al. 2017; Malachowa et al. 2019; Assoni et al. 2021).

Despite much research to produce an effective vaccine for K. pneumoniae, no vaccine for human use has yet been approved, and efforts are ongoing to find an effective antigen as a vaccine (Pletz et al. 2016; Opoku-Temeng et al. 2022).

Outer membrane proteins (OMPs) are surface trimmers proteins acting as water-filled channels that allow hydrophilic molecules across the membrane. K. pneumoniae OMPs are considered safe subunit vaccines that have been shown to induce host-specific antibodies and have been effective in animal models (Pletz et al. 2016; Zhu et al. 2021).

Outer membrane protein-K17 (OMPK17, also called OMPX) is a member of the OMP family that is involved in K. pneumoniae pathogenesis (Climent et al. 1997). It has been shown that K. pneumoniae OMPK17 can be detected by immune serum from patients with acute K. pneumoniae infection (Kurupati et al. 2006). It has been also shown that this protein can evoke protection against K. pneumoniae in mice, mainly by inducing T helper type 2 immune response, although its real effect on protecting animal models needs to be confirmed (Hussein et al. 2018). Despite these attempts, no information has been found about OMPK17 dominant immunogenic regions (epitopes) in the literature. Immunoinformatics methods are useful in predicting immunogenic epitopes as vaccine candidates and are widely accepted in many scientific fields (De Groot et al. 2001; He et al. 2010; Rahmat Ullah et al. 2021).

This study aimed to predict both T cell and B cell epitopes of K. pneumoniae OMPK17 via immunoinformatics approaches. Also, applying different analyses an antigenic 25-mer peptide containing both T cell and B cell epitopes was introduced for possible use as a vaccine against K. pneumoniae.

Methods

Retrieving of OMPK17 sequence

The one-letter amino acid sequence of K. pneumoniae OMPK17 was obtained from the NCBI protein database (https://www.ncbi.nlm.nih.gov/protein), with the accession number AAA97932.1. The sequence was as follows: “MNKIARLSALAVVLAASVGTTAFAATSTVTGGYAQSDMQGKANKAGGFNLKYRYEQDNNPLGVIGSFTYTEKDNNSNGTYNKGQYYGITAGPAYRLNDWASIYGVVGVGYGKFQNNNYPHKSDMSDYGFSYGAGLQFNPIENVALDFSYEQSRIRNVDVGTWIAGVGYRF”. Domain search using InterPro (https://www.ebi.ac.uk/interpro/) showed that this protein contained one domain (from amino acid no. 11 to no.170), belonging to OMP proteins namely OMP_b-brl (InterPro code: IPR027385).

BLASTing of this sequence by the BLASTP (https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins) server showed that it is highly conserved among K. pneumoniae serotypes as well as other Klebsiella species such as K. oxytoca, K. quasipneumoniae, K. michiganensis, K. grimontii, K. variicola, and K. aerogenes. The NCBI conserved domain search (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) showed that our used sequence is the conserved domain of outer membrane protein OmpX belonging to the OM-Channels superfamily.

Prediction of T cell epitopes

Class-I (CD8 + T cell-specific) and class-II (CD4 + T cell-specific) T cell epitopes of OMPK17 were predicted using highly frequent human leukocyte antigen (HLA) alleles (HLA-I for class-I and HLA-II for class-II epitope prediction). These HLA alleles are highly frequent in the world population and have been previously used by us for epitope prediction of other K. pneumoniae antigens (Rostamian et al. 2020; Zargaran et al. 2021; Nemati Zargaran et al., 2021) (Table S1).

The T cell epitopes prediction was done as in our previous works (Akya et al. 2019; Rostamian et al. 2020) using three prediction tools including IEDB (http://tools.iedb.org, (Fleri et al. 2017), SYFPEITHI (http://www.syfpeithi.de/bin/MHCServer.dll/EpitopePrediction.htm, (Schuler et al. 2007)), and ProPred (ProPred-I for class-I and ProPred-II class-II epitope prediction) (http://crdd.osdd.net/raghava/propred/, (Singh and Raghava 2003)). The score of ≥ 10 for SYFPEITHI and ProPred-I, was applied as the thresholds. In ProPred-II, the predicted epitopes were listed without reporting their scores, and all of these epitopes were chosen. In IEDB, the percentile rank of ≤ 1 for class-I and ≤ 10 for class-II epitopes were used as the thresholds. Only epitopes with 9-mer amino acids for class-I and 15-mer length for class-II epitopes were selected to be predicted. Of the predicted epitopes, the sequence similarity was determined and the epitopes were clustered by the IEDB Epitope Cluster Analysis tool (http://tools.iedb.org/cluster/). One epitope in each cluster was chosen for further analyses, which was the epitope with the highest prediction score. For class-I epitopes, the immunogenicity was checked by IEDB Class I immunogenicity tool (http://tools.iedb.org/immunogenicity/) (Calis et al. 2013). For class-II epitopes, the ability in interferon-gamma (IFN-γ) production was checked by the IFNepitope server (http://crdd.osdd.net/raghava/ifnepitope/).

Finding class II/class I windows and selecting final T cell epitopes

Regarding that 9-mer class-I may be placed inside 15-mer class-II epitopes, class II/class I windows were found using the Epitope Cluster Analysis tool. The cut-off was set at ≥ 70% which allows the tool to put the epitopes with ≥ 70% similarity in a unique cluster. Those class-II T cell epitopes that contained at least one class-I T cell epitope were selected.

For selecting the final T cell epitopes, the following criteria were applied: 1- The T cell epitopes containing both class-I and class-II epitopes, 2- The epitopes with higher prediction scores, 3- Being positive in IFN-γ production (for class-II epitopes) or have more immunogenicity score (for class-I epitopes), and 4- The epitopes which were predicted by more prediction tools.

Prediction of B cell epitopes

For predicting B cell linear epitopes seven tools of the IEDB server (http://tools.iedb.org/main/bcell/) provided for B cell linear epitope prediction as well as the ABCpred server (http://crdd.osdd.net/raghava/abcpred/, (Saha and Raghava 2006)) was applied. The linear B cell epitope prediction was done as in our previous work (Zargaran et al. 2021). Briefly, by using the tools’ defaults parameters, the provided charts by each IEDB tool were compared and the epitopes that were predicted by at least three (out of seven) tools were selected. The threshold scores of the tools were as follow: Chou & Fasman Beta-Turn: 1.058, Emini Surface Accessibility: 1.000, Karplus & Schulz Flexibility: 0.998, Kolaskar & Tongaonkar Antigenicity: 1.010, Parker Hydrophilicity: 2.014, Bepipred 1.0: 0.321, Bepipred 2.0: 0.500, and ABCpred: 0.510.

Using the Epitope Cluster Analysis tool, the selected epitopes were then compared and clustered with those predicted by the ABCpred server. The epitopes that were predicted by more tools and/or had higher prediction scores were selected.

For the prediction of discontinuous B cell epitopes, since the crystallographic structure of OMPK17 was not available, it was modeled using the SWISS-MODEL server (https://swissmodel.expasy.org/, (Waterhouse et al. 2018)). The structure of E.coli OMPX (PDB ID: 1QJ8) with 85.62% sequence identity with OMPK17 was used as a template. The quality of the created model was checked by qualitative model energy analysis (QMEAN) scoring and Ramachandran plot generating using the MolProbity server (http://molprobity.biochem.duke.edu/, (Chen et al. 2010)).

The discontinuous B cell epitopes were predicted by two prediction tools provided by IEDB including ElliPro (Ponomarenko et al. 2008) and DiscoTope 2.0 (Kringelum et al. 2012). Using the default parameters, the created model of OMPK17 was presented to the tools and the predicted discontinuous epitopes by both tools were taken as more probable discontinuous B cell epitopes.

Allergenicity, toxicity, and human similarity

All selected T cell-, linear B cell-, and major segments of discontinuous B cell epitopes were evaluated for allergenicity toxicity and human similarity. The allergenicity was checked by the structural database of allergenic proteins (SDAP) (Ivanciuc 2003) and AllerCatPro (Maurer-Stroh et al. 2019) using the tools’ defaults.

The toxicity of the epitopes was checked by the ToxinPred server (https://webs.iiitd.edu.in/raghava/toxinpred/algo.php) (Gupta et al. 2013) remaining with the server parameters as defaults.

The possible human similarity was evaluated by putting the FASTA sequence of the epitopes in the BLASTP server and using the human proteome (taxid 9606) as the organism. Epitopes with more than 90% similarity (both coverage and identity indices) with human peptides were omitted from further analyses.

Detecting the epitopic regions comprising both T cell and linear B cell epitopes

The final selected T cell epitopes (both class-I and class-II T cell epitopes) were compared with linear B cell epitopes by the Epitope Cluster Analysis tool and those containing both T cell and B cell epitopes were selected.

The characteristics of the final epitopic region

The physicochemical characteristics of the final peptide were estimated by ProtParam from Expasy (https://web.expasy.org/protparam/) (Adhikari et al. 2018). The physicochemical characteristics studied included hydrophobicity, molecular weight, grand average of hydropathicity index (GRAVY), net charge, instability index, isoelectric pH (pI), and the estimated half-life. The secondary structure of the final 25-mer peptide was calculated using the Stride Web interface (http://webclu.bio.wzw.tum.de/cgi-bin/stride/stridecgi.py).

The conservancy of the 25-mer epitopic peptide was assessed by NCBI conserved domain search and BLASTP server. The Web3DMol server (http://web3dmol.net/) was applied to draw the structure of OMPK17 and demonstrate the location of the final epitope.

Molecular docking

For further study of HLA-T cell epitope binding, molecular docking studies were also done on T cell epitopes (both class-I and II) of the final 25-mer epitopic peptide using HADDOCK 2.2 (http://haddock.science.uu.nl/services/HADDOCK2.2). The 3D structure of the epitopes was modeled by the PEP-FOLD server (Shen et al. 2014) and validated by generating a Ramachandran plot using the MolProbity server. The 3D structure of HLA alleles was obtained from the PDB databank as follows: HLA-B*35:01 (ID: 2CIK) and HLA-DRB1*15:01 (ID: 1BX2). To exact prediction of residues incorporated in the HLA-epitope binding, consensus prediction of interface residues in transient complexes (CPORT) (https://milou.science.uu.nl/services/CPORT/) was used (de Vries and Bonvin 2011). Following these residues collection, the HADDOCK server ran the docking and delivered the results as HADDOCK scores (as the main result), alongside some other scores (van Zundert et al. 2016). The original ligands of HLA PDB files, previously characterized by crystallography, were used as control.

Results

T cell epitope prediction

Using highly frequent HLA alleles, potential T cell epitopes were predicted by three different tools. The predicted epitopes were evaluated with other analyses to choose the best ones. These analyses include 1- Omitting the duplicate epitopes and selecting one from several similar epitopes, 2- Selecting those with high prediction scores or those predicted by more tools, 3- Considering higher immunogenicity score (for class-I epitopes) or being positive for IFN-γ production (for class-II epitopes) as a favored criterion, and 4- Omitting toxic epitopes, allergens, and those with > 90% human similarity. By applying these criteria, the T cell epitopes were selected and their class II/class I windows were found using the cluster analysis tool. The final three class II/class I windows of T cell epitopes and their analysis results are presented in Table 1.

Table 1 The top three predicted T cell epitopes of OMPK17

Linear B epitope prediction

Linear B cell epitope prediction by IEDB tools provided charts indicating regions with scores either above (positive) or below the threshold (negative) (Fig. S1). The positive regions were compared and the regions that had been positive in more tools were chosen as more potential epitopes. Also, the linear B cell epitopes were found by ABCpred and its results were compared and clustered with the epitopes found by IEDB tools. The epitope that was predicted by more tools or had the highest prediction score was chosen. Similar to that was mentioned for T cell epitopes, the linear B cell epitopes were also evaluated for possible toxicity, allergenicity, and human similarity. The top seven linear B cell epitopes and their analysis results are presented in Table 2.

Table 2 List of selected B cell epitopes of OMPK17

Discontinuous B cell epitope prediction

To predict the discontinuous B cell epitopes, initially, OMPK17 was modeled. The model quality was checked by generating Ramachandran and scoring functions provided by the SWISS-MODEL server. Both methods indicated the high quality of the modeling (Fig. 1).

Fig. 1
figure 1

The quality assessment of OMPK17 modeling. The quality of OMPK17 modeling was assessed by scoring functions provided by the SWISS-MODEL server including Global Quality Estimate (GQE) (A), Local Quality Estimate (LQE) (B), and Comparison with non-redundant PDB structures (C) as well as by generating the Ramachandran plot (D). In the GQE plot, the Z-score of several terms (as indicated in the picture) for the model was around zero which shows the better agreement between the model and the experimental structures. In the LQE plot, the majority of the model residues have a score of > 0.6 which means an acceptable quality of modeling. In the comparison plot, the model (indicated by a red star) was close to the grey areas, suggesting a high quality of modeling. Generating the Ramachandran plot (summarized in a table) showed that a high percentage of the model residues are in favored and allowed regions, indicating a high quality of the modeling

DiscoTope 2.0 predicted a discontinuous B cell epitope composed of nine segments of amino acid residues (Table S2). ElliPro tool predicted three discontinuous B cell epitopes, each containing several segments of amino acid residues (Table S2). The epitopic segments were compared and the most dominant segments (those that were longer than four amino acid residues) were presented in Table 2. These segments were also checked for potential toxicity, allergenicity, and human similarity.

The final B/T cell epitopic region

Comparing and clustering T cell and linear B cell epitopes indicated one specific 25-mer region that contained both class-I and class-II T cell epitopes as well as a linear B cell epitope (Table 3). The docking score (HADDOCK score) of the class I epitope-HLA (DMSDYGFSY-HLA B*35:01) was − 63.8±-14.8, which is more negative than the score of the control (class I HLA-original peptide) (-57.3±-10.0).

The HADDOCK score of class II epitope-HLA (HKSDMSDYGFSYGAG-HLA DRB*15:01) was − 27.9±-13.9, which is higher than what was observed for the control (class II HLA-original peptide) (-89.6±-13.9).

The allergenicity and toxicity prediction showed that the peptide is not toxic and not an allergen. The peptide had no ≥ 90% similarity with the human proteome.

The physicochemical characteristics of the final epitopic region (25-mer peptide) are presented in Table 3.

Table 3 The 25-mer epitopic peptide of OMPK17

The peptide had a pI equal to 6.74 and a molecular weight of 2848.01 Dalton. Its estimated half-life was 2 min in E. coli, in vivo, 2.8 h in mammalian reticulocytes, in vitro, and 10 min in yeast, in vivo. The peptide is hydrophilic and stable. It is composed of a mixture of strands, helixes, and turns (Table 3) and is located on the lateral surface of the OMPK17 tertiary structure (Fig. 2). The BLASTP results showed that the final 25-mer peptide has a significant sequence identity among Klebsiella species (Table S3). The conservancy assessment showed that the 25-mer peptide is a member of the highly conserved OM-channels (porin) superfamily in Gram-negative bacteria (Fig. S2).

Fig. 2
figure 2

The final 25-mer peptide location in OMPK17 3D structure. The location of the final 25-mer epitopic peptide is indicated by the green color in the 3D structure of OMPK17 (grey)

Discussion

The lack of an approved vaccine for K. pneumoniae, along with the therapeutic limitations of this pathogen, led us to find new candidates to be used as vaccines. OMPs are among the most prominent vaccine candidates for Gram-negative bacteria including K. pneumoniae. Previous studies on OMPs, mainly on OMPA, have shown that these proteins can induce protection against K. pneumoniae infections mediated by both types of immune responses including humoral (IgA, IgG1, and IgG2) and cell-mediated immunity (TNFa, nitric oxide, IFNγ, and several interleukins) (Malachowa et al. 2019; Assoni et al. 2021).

One member of the OMPs family that has been less studied but whose immunogenic potential has been previously reported (Hussein et al. 2018) is the OMPK17 protein. This protein was able to induce a Th2-type immune response and a significant IgG1 production which led to protection against K. pneumoniae lethal challenge (Hussein et al. 2018). However, the epitopes of this protein have not been identified clearly so far, so in the current design, we decided to find the B cell and T cell epitopes of the OMPK17 protein using various immunoinformatics methods. The results showed that many regions may play the role of T cell and/or B cell epitopes. However, some of these regions have more potential as epitopes. That’s why we did several analyses to reduce the number of these predicted regions and select the best ones. These analyses included the selection of higher-scoring epitopes, the epitopes predicted with more tools, more immunogenic epitopes, the epitopes capable of producing interferon-gamma, the epitopes with more dissimilarity to human peptides, and non-toxic and non-allergenic epitopes. We also used several tools (instead of one tool) to make sure that the selected regions were epitopes, as it has previously been reported that the use of multiple prediction tools increases the likelihood of selecting epitopes correctly (Trost et al. 2007).

Previous reports have shown that both cellular and humoral immunity are necessary for the complete control of K. pneumoniae (Campbell et al. 1996; Ten Hagen et al. 1998; Pletz et al. 2016). Therefore, if we can design a peptide that contains both type T cell epitope and linear B cell epitope, we can expect this peptide to activate both types of immune responses (Lee et al. 2015). Therefore, in the present study, by comparing the predicted T cell and B cell epitopes, we reached a 25-mer peptide that appears to be able to elicit both types of immune responses. It should be noted that although we predicted the discontinuous B cell epitopes of OMPK17, they were not used for the selection of the final 25-mer epitopic region, because the discontinuous B cell epitopes are dependent on the three-dimensional shape of the protein (Chyau Liang 1998), hence, they are not easily comparable to linear B cell or T cell epitopes.

For a vaccine candidate peptide to be able to pass the in vitro and in vivo experiments successfully and, more importantly, to function properly and produce a proper immune response in humans, it must have a stable structure and a series of appropriate physicochemical properties in addition to being immunogenic. The bioinformatics studies showed that the 25-mer peptide found in the present study is very stable, has a suitable pI, and has an acceptable half-life (about 3 h) in the human body. It is noteworthy that if its immunogenicity is confirmed via studies in animal models and humans, the stability and half-life of this peptide can be increased via various chemical/biochemical methods. The docking studies showed that HLA bind to the class-I epitope of our final 25-mer peptide with higher affinity than the original peptide, demonstrating its suitability to bind the HLA and subsequently elicit immune responses. In contrast, the class-II epitope of our final 25-mer peptide had a low docking score, which could be considered a negative point for eliciting immune responses. However, it should be noted that the control original peptide used here was a peptide of 14 amino acids in length while our class-II peptide had 15 amino acids. This little difference in size may cause the difference between the docking scores of our peptide compared to the original peptide. More docking studies with more appropriate controls seem required in this case.

The index pI is an important factor for the selection of a peptide vaccine and should not be in the range of the body tissues’ pH (pH 7.2 to 7.6) (Wang et al. 2018; Campos-Pinto et al. 2019).

In our 25-mer peptide, the pI was 6.74, which is far from the pH range of body tissues.

The hydrophobicity ratio analysis and the GRAVY index both showed that the 25-mer peptide is more hydrophilic. The negative GRAVY, as seen for our peptide, means being more hydrophilic (Kyte and Doolittle 1982; Adhikari et al. 2018).

Moreover, the instability index of the 25-mer peptide estimated that our epitopic peptide was stable. All these parameters also suggested the peptide as an appropriate vaccine candidate for more in vitro and in vivo experiments.

The conservancy assessment of the 25-mer peptide indicated that it belongs to OM_channels porin) superfamily that is highly conserved among Gram-negative bacteria. Therefore, if this 25-mer peptide is developed as a vaccine, it may elicit immune responses against other Gram-negative bacteria as well, although its real effect required much more studies.

Epitope-based vaccines, such as our proposed 25-mer epitopic peptide, are associated with limited immunity. This limitation may be due to several reasons including the easy degradation of these small-size molecules by proteases in the body, and the difficulty of their recognition by the receptors on the surface of immune cells. One effective way to overcome this limitation is conjugating them to traditional adjuvants and/or new built-in adjuvants such as new biomaterials or carriers. Some of the built-in adjuvants that are proposed for conjugation with epitope-based vaccines include virus-like particle carriers, ligands of pattern recognition receptors, bacterial toxins, nanoparticles, etc. (Lei et al. 2019). Another limitation of our proposed peptide as K. pneumoniae vaccine is that the target population of this vaccine will be mainly elderly patients suffering from K. pneumoniae infections with multiple comorbidities. Accordingly, this population may not be able to evoke strong immune responses (Weinberger et al. 2008). In addition, another issue that is neglected and hence should be assumed as a limitation of our study is that K. pneumoniae vaccines should not cross-react with the intestinal normal flora (Ahmad et al. 2012). Finally, an important issue that was neglected in the present study is that K. pneumoniae has several serotypes, so vaccines against one serotype may be ineffective against others. This issue is recommended to be considered in future research.

Altogether, here we found the high immunogenic class-I/class-II T cell and linear/discontinuous B cell epitopes of K. pneumoniae OMPK17 that can be used in future research. Moreover, the present study introduced a high immunogenic 25-mer peptide containing both B cell and T cell epitopes and comprising appropriate physicochemical characteristics that are required for K. pneumoniae vaccine development. The in vitro/in vivo study of this peptide is recommended to clarify its actual efficiency and efficacy.