1 Introduction

Salmonella enterica serovar Typhi (S. Typhi) is a foodborne pathogen responsible for causing typhoid fever, a disease that constitutes a major global health concern [3]. A major feature of S. Typhi infection is its host restriction to humans which means that the infection cycle begins and ends within the human host [16]. Typhoid fever has persisted in the human population through the development of carrier status by some infected individuals who continue to transmit the causative bacterium even after recovering from the clinical symptoms of the disease. Formation of biofilm in the epithelial cells of the gall bladder as well as gallstones, where present, have been implicated in the carrier status and antimicrobial resistance in typhoid fever [18, 19].

A biofilm is a community (colony) of microorganisms that adhere to each other and to a solid (inert or live) surface that is encased in a self-initiated protective extracellular matrix, containing polysaccharides, proteins, extracellular microbial DNA and enzymes, ions and nutrients from the environment [4, 7, 9, 19]. Biofilm contributes to the failure of anti-infective therapy by serving as a reservoir of resistant microorganisms as bacteria in biofilms are up to 103 more resistant to antibiotics than their planktonic counterparts [36]. Due to the heterogeneous nature of biofilms, several mechanisms have been reported for increased antimicrobial resistance in biofilm structures. These include (1) Low diffusion of antibiotics across the extracellular matrix resulting in low intracellular concentration of antibiotic(s). This is due to the presence of glycocalyx, an integral part of bacterial biofilm that accumulates antimicrobial substances and limits the transportation to the organisms within the colony [38], (2) phenotypic and physiological changes due to slow growth rate and starvation responses (oxygen, nutrient deprivation or environmental stress- antibiotics are not active against dormant cells; (3) the expression of efflux pumps (EP) that decrease intracellular antimicrobial concentration; (4) the emergence of persister cells, a biofilm-specific phenotype which are multi drug-tolerant cells that have not acquired genetic resistance [4, 7, 13, 14], Soto et al. [39]; (5) alteration in gene expression and (6) increase in horizontal transmission of resistance elements (Koopman et al., [25]). S. enterica is known to form biofilms on both living and non-living surfaces [19]. This characteristic has been reported to be very important for the survival of pathogenic bacteria within the human body as both innate and adaptive immune responses might not be able to eliminate pathogens within the well-established biofilm [9]. Enteric pathogens, salmonella inclusive, are resistant to the bile's detergent-like property and therefore can survive in the gall bladder. In addition, they respond to the bile environment by regulating other resistance-related genes [19]. An important mechanism for the formation of bacterial biofilm is quorum sensing [37].

Quorum sensing or signalling (QS) is a cell population density-dependent phenomenon that defines intra- and inter-specie cellular communications between bacteria which enables them to respond and adjust to their environment by regulation of certain gene expressions [31,32, 37]. In Salmonella spp, QS is responsible for bile adaptation, enhancement of persister cells and biofilm (Walawalkar et al., [43]), expression of flagella genes [11] and normal expression of salmonella pathogenicity island-1 (SPI-1), a virulence factor), [10]. QS is also involved in the production of secondary metabolites and stress adaptation mechanisms such as bacterial competition systems including the secretion systems (SS) [32]. QS in bacteria is mediated by the production of signalling protein molecules referred to as auto-inducers (AI). AIs are normally produced at an elemental level during the exponential growth phase of bacteria. However, at a cell density-dependent threshold concentration (i.e. the quorum level), they accumulate intra-cellularly via ATP-binding cassette (ABC) transporter and activate receptor proteins (transcription regulators) leading to a cascade of processes that results in the regulation of certain gene expression [46].

The QS system of S. Typhi comprises LuxS/AI-2/luxR homolog SdiA. Salmonella spp produces AI-2 during the exponential phase of growth in a reaction catalyzed by AI-2 synthase (LuxS protein). LuxS catalyzes the final step in the biosynthesis of AI-2 from S-adenosyl methionine yielding 4,5-dihydroxy-2,3-pentanedione (DPD). AI-2 is then obtained from (2R,4S)-2-methyl-2,3,3,4-tetrahydroxytetrahydrofuran (Walawalkar et al., [43]). Accumulation of AI-2 occasioned by a high-density cell population up-regulates the expression of an ATP-binding cassette (ABC) transporter (comprising of LsrA, LsrB, LsrC and LsrD), which transports AI-2 back into the cell [42]. In the cell, AI-2 is phosphorylated by a kinase (LsrK), and further modified by LsrF and LsrG (Vendeville et al., [41]). The resulting molecule binds transcription factors (SdiA) and regulates several genes pertaining to the formation of biofilm and other virulence factors of the bacterium.

Due to the integral involvement of QS in the formation and maintenance of biofilm, interruption of the QS network has been put forward as one of the important anti-virulence strategies [36] used to target biofilm-forming bacterial by any of the following means: (1.) inhibition of synthesis of QS signalling molecules,(2.) blocking the transport of AI into the cell; (3.) inhibition of intracellular processing of AI and (4.) inhibition of receptor proteins.

There abounds, in the literature, evidence of anti-bacterial and anti-biofilm properties of plant materials that can be harnessed for combating antimicrobial resistance of pathogenic organisms by interrupting the biofilm formation system. For instance, various parts of P. guajava have been reported for anti-biofilm activities against Staphylococcus aureus (250 μg/mL of pulp extract) [12], Streptococcus mutans and Streptococcus gordonii (0.78 mg/mL of leave extract) [28]. Fu et al. also reported that aqueous extract of Herba patriniae exhibited an inhibitory effect on almost all the studied biofilm-related genes of Pseudomonas aeruginosa with a significant reduction in the formation of biofilm as well as an alteration in the structure of the mature biofilms. Exopolysaccharide production and swarming motility were also impaired in the disfavour of biofilm formation [15].

Psidium guajava L (family- Myrtaceae; common name- guava), a fruit-bearing plant of the tropics, has been variously reported for traditional treatment of typhoid fever [30] as well as other antibacterial activities. Methanol extract of P. guajava leaves has also been confirmed to possess antibacterial and anti-biofilm activities against S. Typhi [1]. Previously, we identified the phyto-compounds in the methanol extract of P. guajava leaves and their interactions with a protein involved in replication of S. Typhi [2]. In this study, we investigate the interaction between those compounds and key proteins involved in QS in the typhoid bacteria to identify molecules that could be investigated further in the quest for a new drug for tackling typhoid fever.

2 Methods

An overview of the stages involved in this study is presented in Fig. 1. The final docking scores of these phytochemicals are compared to those of ciprofloxacin which is currently used for the treatment of typhoid fever.

Fig. 1
figure 1

The flowchart of the step-by-step methodology of the experiment conducted

2.1 Ligands preparation

The ligands are compounds previously identified in the methanol extract of P. guajava leaves [2]. All seventy-two identified compounds were considered and coded 172. The Simplified Molecular-Input Line-Entry Specification System (SMILES) formats of the ligands were obtained from PubChem [22] and translated to PDB format using the CACTUS online translator [8]. Ligands were converted to PDBQT formats on OpenbabelUI software.

2.2 Drug-likeness and ADMET profiling of ligands

The drug-likeness or otherwise of each compound was assessed via the QED (Quantitative Estimate of Drug-likeness). ADMET (Absorption, Distribution, Metabolism, Excretion and Toxicity) profiles of compounds with desirable QED values were also predicted. Both QED and ADMET were evaluated using ADMETlab 2.0 online webserver [47]. Only compounds with desirable QED and no predicted toxicity were selected for molecular docking.

2.3 Preparation of proteins

2.3.1 Retrieval from Protein Data Bank

The three-dimensional crystal structures (Fig. 2) of two QS proteins of S. Typhi were obtained from PDB (Protein Data Bank) (LuxS, PDB Id: 5v2w; LsrB, PDB Id: 5GTA).

Fig. 2
figure 2

3-D crystal structure of (a) AI-2 synthase, LuxS, PDB Id: 5v2w and (b) AI-2 binding protein, LsrB, PDB Id: 5GTA from S. Typhi retrieved from PDB

2.4 Homology modelling

Two proteins (LsrK and SdiA) whose 3-D structures were not available in PDB at the time of retrieval, were modelled using the SWISS-MODEL homology webserver [44]. Amino acid FASTA sequences of LsrK (530aa, Accession: QKX96401.1) and SdiA (240aa, Accession: QKX97843.1) were retrieved from the website of the National Centre for Biotechnology Information for modelling. SWISS-MODEL performs all the stages of modelling viz: (i.) template identification and selection,(ii.) alignment target-template sequence; (iii.) model building and refinement and (iv.) model evaluation [34].

2.5 Molecular docking

To study the interaction between selected compounds (ligands) and the selected QS proteins of S. Typhi (protein), molecular docking was carried out using AutoDock Vina 1.5.6 software. The proteins were first prepared by adding polar hydrogen atoms, calculating Kollman charges (Table 1) and saving them in PDBQT format on the AutoDock tool [40]. The grid box was set to cover the entire surface of the protein to allow for blind docking. The prepared and optimized ligands were blindly docked in the grid box of the protein to allow ligands to find any favourable binding site on the protein. Ligands were ranked based on their binding affinities with higher negative values signifying better affinity. Ligand–protein interactions were visualized using Discovery Studio Visualizer 2020 software.

Table 1 Properties of the proteins prepared for docking

3 Results

3.1 QED of compounds

The QED of compounds, calculated based on eight physicochemical properties (molecular weight, n-octanol/water distribution coefficient, number of hydrogen bond acceptors, number of hydrogen bond donors, total polar surface area, number of rotatable bonds, the number of aromatic rings and the number of alerts for undesirable functional groups) of each compound are presented in Table 2. The molecular weight of the compounds ranges between 78.05 and 518.13 g/mol. The minimum value of the logarithm of the n-octanol/water distribution coefficient (log P) is 0 while the maximum is 11.202. The numbers of hydrogen bond acceptors and donors are maximum of 8 and 3 respectively. Only eight compounds (4-[5-(4-pyridinyl)-1,2,4-oxadiazol-3-yl]-1,2,5-Oxadiazol-3-amine; 2-(2-Hydroxyphenoxy)-1-phenylethanol; 6-Methyl-4-propan-2-on-3-propyl-2, 6-dioxo-4,5,6,7-tetrahydro-1,2,3-triazolo[4,5-d]pyrimidine; Furazan-3-amine,4-(4-methyl-1-piperidylcarbonyl)-; Benzeneethanamine, 4-methoxy-; Decamethyl-Tetrasiloxane; Arsenous acid, tris(trimethylsilyl) ester; Cyclopentadecanone, 2-hydroxy-) with QED values > 0.67 possess desirable drug-likeness and are thus selected for ADMET profiling.

Table 2 Drug-likeness properties of compounds identified in methanol extract of P. guajava

3.2 ADMET profiling

The Absorption, distribution, metabolism, excretion and toxicity (ADME) properties of the eight compounds with desirable QED are summarized in Table 3. The compound, 6-Methyl-4-propan-2-on-3-propyl-2, 6-dioxo-4,5,6,7-tetrahydro-1,2,3-triazolo[4,5-d]pyrimidine is both a potent inhibitor and substrate of P-glycoprotein while decamethyl-tetrasiloxane and arsenous acid, tris(trimethylsilyl) ester may not be readily absorbed in the human intestine. Furazan-3-amine,4-(4-methyl-1-piperidylcarbonyl)- and arsenous acid, tris(trimethylsilyl) ester also show poor Caco-2 permeability. Four compounds (6-Methyl-4-propan-2-on-3-propyl-2, 6-dioxo-4,5,6,7-tetrahydro-1,2,3-triazolo[4,5-d]pyrimidine, furazan-3-amine,4-(4-methyl-1-piperidylcarbonyl)-; benzeneethanamine, 4-methoxy- and cyclopentadecanone, 2-hydroxy-) may be able to cross the blood–brain barrier. 2-(2-Hydroxyphenoxy)-1-phenylethanol, decamethyl-tetrasiloxane, arsenous acid, tris(trimethylsilyl) ester and cyclopentadecanone, 2-hydroxy-may be tightly bound to plasma protein with 2-(2-Hydroxyphenoxy)-1-phenylethanol and cyclopentadecanone, 2-hydroxy- having free unbound volume of less than 5%. 4-[5-(4-pyridinyl)-1,2,4-oxadiazol-3-yl]-1,2,5-Oxadiazol-3-amine, 2-(2-Hydroxyphenoxy)-1-phenylethanol and decamethyl-tetrasiloxane are likely to exhibit CYP450 inhibition promiscuity as they inhibit more than one isoform of the enzyme. Only furazan-3-amine,4-(4-methyl-1-piperidylcarbonyl), benzeneethanamine, 4-methoxy- and cyclopentadecanone, 2-hydroxy- have a good systemic clearance rate. Although none of the compounds is a potential blocker of hERG (human ether a-go-go) K + channels and may not be mutagenic, 4-[5-(4-pyridinyl)-1,2,4-oxadiazol-3-yl]-1,2,5-Oxadiazol-3-amine and furazan-3-amine,4-(4-methyl-1-piperidylcarbonyl) could cause drug-induced liver injury and lead to hepatocytes toxicity in human. The two compounds, with 6-Methyl-4-propan-2-on-3-propyl-2, 6-dioxo-4,5,6,7-tetrahydro-1,2,3-triazolo[4,5-d]pyrimidine are potential carcinogens. Only the compounds with no toxic profile were selected for the molecular docking study.

Table 3 ADMET profiles of compounds with desirable QED

3.3 Homology modelling

LsrK of S. Typhi was modelled using a crystal structure of Escherichia coli K-12 LsrK and HPr complex (PDB id: 5YA0) as a template, having a sequence identity of 83.02% and a GMQE (Global Model Quality Estimate) value of 0.89 with QMEAN (Qualitative Model Energy ANalysis) of 0.88 ± 0.05. Analysis of the Ramachandran plot of the modelled protein for the ϕ and ψ angles of individual amino acid residues revealed that 95.55% of residues were present in the favoured region while 3.64% and 0.81% residues were in the allowed and outlier regions respectively (Fig. 3). Apart from the Ramachandran outliers, the protein also has Rotamer outliers of about 1.01%. The total number of bad angles present in the chain is 78 out of 5,252.

Fig. 3
figure 3

Ramachandran plot and 3D structure of LsrK from SWISS-MODEL

The template for SdiA of S. Typhi was a 3-D crystal structure of E. coli SdiA in complex with 3-oxo-C6-homoserine lactone (PDB id: 4Y15) with a sequence identity of 72.08% and a GMQE value of 0.88 with QMEAN of 0.85 ± 0.05. The Ramachandran plot and the 3-D structure of the model are presented in Fig. 4. Analysis of the Ramachandran plot of the modelled protein for the ϕ and ψ angles of individual amino acid residues revealed that 97.01% and 2.99% residues were present in the favoured and allowed regions respectively. There was no Ramachandran outlier while Rotamer outliers were about 0.48%. The total number of bad angles present in the chain is 16 out of 5398.

Fig. 4
figure 4

Ramachandran plot and 3D structure of SdiA from SWISS-MODEL

3.4 Molecular docking

Molecular docking revealed that only two compounds (Benzeneethanamine, 4-methoxy- and Cyclopentadecanone, 2-hydroxy-) have an interaction with each of the four proteins studied (Table 4). The energy of affinities of these two compounds was only lower than that of ciprofloxacin in their interactions with SdiA. For all the proteins, Cyclopentadecanone, 2-hydroxy-has a better (lower) affinity energy than Benzeneethanamine, 4-methoxy-. Analysis of the docking output (Table 5 and Fig. 5) revealed that, except for SdiA, Benzeneethanamine, 4-methoxy-interacts with chain A of each protein with more hydrogen bonds than with any other type of (non-covalent) bonds. Almost all the interactions are with polar amino acids. For Cyclopentadecanone, 2-hydroxy-, analysis of the docking output (Table 6 and Fig. 6) revealed fewer hydrogen bond interactions with each protein. Most of the interactions are with basic or neutral amino acids.

Table 4 Binding affinities of selected compounds of P. guajava with selected QS proteins in S. Typhi
Table 5 Binding properties of Benzeneethanamine, 4-methoxy- with selected proteins
Fig. 5
figure 5

3-D and 2-D interactions of Benzeneethanamine, 4-methoxy-) with selected QS proteins of S. Typhi

Table 6 Binding properties of Cyclopentadecanone, 2-hydroxy- with selected proteins
Fig. 6
figure 6

3-D and 2-D interactions of Cyclopentadecanone, 2-hydroxy- with selected QS proteins of S. Typhi

4 Discussion

The process of drug discovery has been made easier with the help of computational drug designing which uses various computational methods to predict interactions between potential drug targets and small molecules to determine the best lead compounds.

Virtual screening of the drug-likeness of small molecules is being estimated using the physicochemical properties of such molecules. Many researchers have come up with different sets of rules regarding the suitability or otherwise of compounds for lead-likeness based on their physicochemical properties, however, such rules are based on a few of those properties. For example, Lipinski's Rule of 5 is based on 4 of those parameters (molecular weight, LogP, Number of hydrogen bond acceptors and donors) [26]. Conversely, QED is a multi-criteria optimization that offers quantitative metrics based on 8 physicochemical parameters and offers a clear-cut distinction between desirable and undesirable compounds (Bickerton et al. [5]). Although many of the compounds studied did not violate more than one of Lipinski's Rule of 5, only eight of them have the 'desirable' value of QED and were thus selected for ADMET profiling. This would ensure that scarce resources are not wasted on molecules which are likely to be abandoned at a later stage of drug design.

In silico prediction ofADMET profiles of small molecules offers the advantage of early detection of poor pharmacokinetic and toxic potentials, avoiding costly late-stage failure in drug development, which is less expensive and time-saving. P-glycoprotein substrate or inhibitor, human intestinal absorption and Caco-2 permeability were used to predict the absorption level of the compounds. Inhibition of P-glycoprotein by 6-Methyl-4-propan-2-on-3-propyl-2, 6-dioxo-4,5,6,7-tetrahydro-1,2,3-triazolo[4,5-d]pyrimidine, low human intestinal absorption of compounds Decamethyl-Tetrasiloxane and Arsenous acid, tris(trimethylsilyl) ester, as well as low apparent Caco-2 permeability of furazan-3-amine,4-(4-methyl-1-piperidylcarbonyl)- and Arsenous acid, tris(trimethylsilyl) ester could lower the bioavailability of these compounds in human. 6-Methyl-4-propan-2-on-3-propyl-2, 6-dioxo-4,5,6,7-tetrahydro-1,2,3-triazolo[4,5-d]pyrimidine, furazan-3-amine,4-(4-methyl-1-piperidylcarbonyl)- and Cyclopentadecanone, 2-hydroxy- could cross the blood–brain barrier into the brain and cause neurological effects. Whether or not the effect will be beneficial or otherwise is a subject of further investigation as some strains of salmonella have been reported to cause post-typhoid neurological complications [24] and drug molecules with the ability to penetrate the BBB would be valuable in such cases [35]. Five isoforms of cytochrome-P450 enzymes (1A2, 3A4, 2C9, 2C19 and 2D6) metabolize about 90% of xenobiotics. Inhibition of these important enzymes can cause clinically important drug-drug interactions that can lead to adverse reactions or treatment failures [27]. 4-[5-(4-pyridinyl)-1,2,4-oxadiazol-3-yl]-1,2,5-Oxadiazol-3-amine, 2-(2-Hydroxyphenoxy)-1-phenylethanol, furazan-3-amine,4-(4-methyl-1-piperidylcarbonyl)- and Benzeneethanamine, 4- methoxy- inhibit at least one isoform of cytochrome-P450 and this poses a hindrance to their suitability as drug candidates unless they are improved upon at the optimization stage. Five toxicity parameters were considered for screening of small molecules in this study viz: hERG, H-HT, DILI, Ames and carcinogenicity. Due to the central role played by the liver in human metabolism, human hepatotoxicity and drug-induced liver injury are important parameters to consider in any prospective drug molecule [6]. Carcinogenicity prediction is also an important parameter as it measures the probability that a molecule may induce cancer. Compounds with potential mutagenicity and carcinogenicity are not further considered in drug design [21]. Based on these three parameters. Three of the compounds (4-[5-(4-pyridinyl)-1,2,4-oxadiazol-3-yl]-1,2,5-Oxadiazol-3-amine, 6-Methyl-4-propan-2-on-3-propyl-2, 6-dioxo-4,5,6,7-tetrahydro-1,2,3-triazolo[4,5-d]pyrimidine and Furazan-3-amine,4-(4-methyl-1-piperidylcarbonyl)-) were thus removed from further consideration.

Homology modelling provides a model of 3-D structures of proteins whose experimental 3-D crystal structures are not available in the Protein Data Bank. The reliability of such structures depends on metrics such as the Global Model Quality Estimation (GMQE) and Qualitative Model Energy Analysis (QMEAN) values. The GMQE value ranges between 0 and 1, and the higher the number, the higher the reliability of the predicted structure, while QMEAN < 4.0 shows reliability [29]. With GMQE and QMEAN of 0.89 and 0.88 ± 0.05 respectively for the modelled LsrK, the model is reliable enough for the molecular docking study. The SdiA model is also reliable (GMQE = 0.88, QMEAN = 0.85 ± 0.05). The torsional angles of this protein are good, according to the Ramachandran plot study. As a result, the structures are reliable.

Given the pivotal role played by the QS network in the development of both carrier status and antibiotic resistance in typhoid fever, and indeed other important bacterial infections, QS inhibitors are gaining more popularity as prospective alternatives in combating bacterial diseases (Jiang et al., [23]). Proteins in the QS network of S. Typhi selected for study are LuxS protein (synthesis of AI-2), LsrB (binding and transport of AI-2 into the cell), LsrK (intracellular phosphorylation of AI-2) and SdiA (transcription factor which is the receptor of the processed AI). Although LuxS does not affect the growth of Salmonella spp., it plays a role in the formation of biofilm as it catalyzes a crucial step in the synthesis of AI-2. It also controls the expression of polarization of the bacterial phase variation [33], the catalytic site of LuxS comprises a Zn2+ coordinated by HIS: A54, HIS: A58 and CYS: A126. These amino acids are crucial for its catalysis [20]. Our molecular docking results show that Benzeneethanamine, 4-methoxy- bind HIS: A58 of LuxS with both Pi-Akyl and Pi-Pi T-shaped bonds and this could inhibit the protein. There is also a hydrogen bond at the neighbouring GLUA:57. Also, Benzeneethanamine, 4- methoxy- also interacts with TRY B:95, ALA B:110 and GLY B:118 while Cyclopentadec anone, 2-hydroxy-interacts with ASP A:80, all of which form part of the binding site of SdiA [17]. Better interactions of the compounds with SdiA (− 6.20 kcal/mol for Benzeneethanamine, 4- methoxy- and − 7.70 kcal/mol for Cyclopentadecanone, 2-hydroxy-) relative to ciprofloxacin (− 4.30 kcal/mol) indicate that they fit well into the binding pockets of the protein and makes them promising S. Typhi QS inhibitors as inactivation of the processed AI receptors has been proven to reduce virulence in some pathogenic bacteria [45, 48]. Interestingly Cyclopentadecanone, 2-hydroxy- has been previously reported to inhibit DNA gyrase, another important protein involved in the replication of S. Typhi [2]. Thus, Cyclopentadecanone, 2-hydroxy-, and Benzeneethanamine, 4-methoxy- are being considered for lead optimization.

5 Conclusion

QED, being a multi-criteria optimization, gave a numerical output which made it straightforward in determining the drug-likeness of the phytocompounds studied. Only eight out of seventy-two compounds had drug-likeness desirability (QED > 0.67). Prediction of ADMET profile accords the removal of compounds that could later be withdrawn if made into drugs. This led to discontinuation with three compounds due to predicted human hepatotoxicity, drug-induced liver injury and carcinogenicity. Benzeneethanamine, 4-methoxy- and Cyclopentadecanone, 2-hydroxy- from P. guajava leaves show remarkable interaction with all four QS proteins of S. Typhi studied with negative binding energies. Future research focused on molecular dynamics of the interactions as well as optimization of the two compounds will give more insight into the possibility of exploring them through in vitro and in vivo experiments against the deadly salmonella specie.