Population Structure, Molecular Epidemiology, and β-Lactamase Diversity among Stenotrophomonas maltophilia Isolates in the United States

Multiple antibiotic resistance mechanisms, including two β-lactamases, L1, a metallo-β-lactamase, and L2, a class A cephalosporinase, make S. maltophilia naturally multidrug resistant. Thus, infections caused by S. maltophilia pose a big therapeutic challenge. Our study aims to understand the microbiological and molecular characteristics of S. maltophilia isolates recovered from human sources. A highlight of the resistance profile of this collection is the excellent activity of the ceftazidime-avibactam and aztreonam combination. We hope this result prompts controlled and observational studies to add clinical data on the utility and safety of this therapy. We also identify 35 and 43 novel variants of L1 and L2, respectively, some of which harbor novel substitutions that could potentially affect substrate and/or inhibitor binding. We believe our results provide valuable knowledge to understand the epidemiology of this species and to advance mechanism-based inhibitor design to add to the limited arsenal of antibiotics active against this pathogen.

IMPORTANCE Multiple antibiotic resistance mechanisms, including two ␤lactamases, L1, a metallo-␤-lactamase, and L2, a class A cephalosporinase, make S. maltophilia naturally multidrug resistant. Thus, infections caused by S. maltophilia pose a big therapeutic challenge. Our study aims to understand the microbiological and molecular characteristics of S. maltophilia isolates recovered from human sources. A highlight of the resistance profile of this collection is the excellent activity of the ceftazidime-avibactam and aztreonam combination. We hope this result prompts controlled and observational studies to add clinical data on the utility and safety of this therapy. We also identify 35 and 43 novel variants of L1 and L2, respectively, some of which harbor novel substitutions that could potentially affect substrate and/or inhibitor binding. We believe our results provide valuable knowledge to understand the epidemiology of this species and to advance mechanismbased inhibitor design to add to the limited arsenal of antibiotics active against this pathogen.
KEYWORDS Stenotrophomonas maltophilia, antibiotic resistance, beta-lactamases, molecular epidemiology S tenotrophomonas maltophilia is a nonfermentative, Gram-negative, environmental bacillus that has emerged as an important cause of health care-associated infections (HCAIs), especially in debilitated and immunocompromised patients. In persons with cystic fibrosis (CF), S. maltophilia is known for colonizing the airways and causing chronic infections (1,2). Although S. maltophilia is primarily associated with respiratory tract infections, this pathogen can cause a wide range of clinical syndromes, including bloodstream, catheter-associated, and skin and soft tissue infections (2)(3)(4). Furthermore, while S. maltophilia is not a highly virulent pathogen, its associated crude mortality rates range from 14% to 69% in patients with bacteremia (1,(5)(6)(7)(8). Recognized risk factors for S. maltophilia infection include underlying malignancy, the presence of indwelling devices, chronic respiratory disease, prior use of antibiotics (particularly imipenem or meropenem), and long-term hospitalization or intensive care unit admission. These risk factors reflect the tendency of S. maltophilia to form biofilms and colonize humid surfaces and its multidrug-resistant (MDR) phenotype (2,9).
S. maltophilia is recognized by the World Health Organization as one of the leading MDR organisms in hospital settings for which disease prevention and treatment strategies must be developed (10). In S. maltophilia, intrinsic antibiotic resistance is mediated by the expression of aminoglycoside-modifying enzymes, qnrB-like quinolone-resistant determinants, multidrug efflux pumps, and two inducible ␤-lactamases, L1 and L2. L1 is a class B3 metallo-␤-lactamase (MBL) that hydrolyzes carbapenems and other ␤-lactams, with the important exception of the monobactam aztreonam (ATM), and is resistant to all clinically available ␤-lactamase inhibitors (1,11,12). L2 is a class A cephalosporinase that confers resistance to extended-spectrum cephalosporins and ATM but can be inhibited by commercially available serine-␤-lactamase inhibitors such as clavulanic acid and avibactam (1,2,13,14). Infections with S. maltophilia isolates demonstrating resistance to antibiotics with historically good susceptibility rates, such as trimethoprim-sulfamethoxazole (TMP-SMX), ticarcillin-clavulanate (TIC-CLV), and fluoroquinolones, are increasingly common (2,5,15,16). Moreover, resistance can also emerge during therapy (14,15).
The genetic background of S. maltophilia strains was previously explored using various molecular methods, including pulsed-field gel electrophoresis (PFGE), amplified fragment length polymorphism (AFLP), and recently by multilocus sequence typing (MLST). These methods demonstrated a high level of molecular heterogeneity, which may be related to the wide environmental distribution of this pathogen (17)(18)(19)(20). Furthermore, allelic variation of the bla L1 and bla L2 genes has also been recognized (21). The implications of such molecular heterogeneity in ␤-lactamase function and the resistance phenotypes that they might confer have not been fully explored and are essential to anticipate emergence of antibiotic resistance and novel phenotypes. cluster with any known ST from a previously reported genomic group, possibly indicating the emergence of new genomic groups. Interestingly, clustering of strains was not observed according to their geographical location, physiological site of isolation, or CF/clinical status (Fig. S2). L1 and L2 ␤-lactamase characterization. Sequences of bla L1 were obtained for 73 out of 130 isolates included (56%). Using primers Smal_L1_F and Smal_L1_R, bla L1 was successfully detected in 27 of the remaining 56 isolates. Therefore, bla L1 could not be FIG 1 Phylogenetic analysis of S. maltophilia from concatenated sequences of the seven housekeeping gene fragments (3,591 nt). Consensus UPGMA tree shows the structure of the studied S. maltophilia population (n ϭ 130). Branch length represents the genetic distance, with the scale bar indicating the number of mutations per nucleotide position. The tree was rooted with a sequence from Stenotrophomonas spp. (not S. maltophilia). The genomic group affiliation according to MLST in the original study by Kaiser et al. (20) is indicated by colored clades: cyan, genomic group 6; blue, genomic group 1; purple, genomic group 4; magenta, genomic group C. Known STs belonging to each genomic group are highlighted in their corresponding color. detected in 29 samples (22%), probably due to sequence variation of the target. On the other hand, bla L2 was successfully sequenced in 116 of the 130 clinical isolates (89%) and detected in 12 of the 14 remaining isolates. Analysis of the coding sequences showed a higher degree of variability in the third codon position than in the first and second positions, by factors of 2 to 4 (Fig. S3). Mutations at position 3 (wobble position) often do not imply a change in the encoded amino acid (and those that do result in amino acid changes tend to conserve physicochemical properties). Overall, the two data sets display relatively high amino acid conservation (compared to examples of natural [22] or lab-generated [23] variation in enzymes), with most positions accepting substitutions by at most 5 amino acid residues different from the reference sequence. Analysis of amino acid residue preferences shows that substitutions largely conserve patterns of residue volume, steric hindrance, flexibility, polarity, and side chain charge. Examples of these conservative amino acid substitutions include Ala77Ser, Val133Leu, and Lys212Arg in L1 and (Val/Leu)169Met, Thr126Ala, and Glu197Asp in L2 (Dataset S1).
Multiple sequence alignment (MSA) of the deduced amino acid sequences of the mature protein (268 amino acids, starting from the EA/T/VPLP sequence after the predicted signal peptide of L1; and 276 or 277 amino acids, starting from the VSP sequence after the predicted signal peptide of L2) and a subsequent protein sequence logo showed that substitutions are scattered throughout the protein (Fig. S4). A sequence logo was further used to identify positions in which two, three, four, or more different residues were found. Mapping of these positions onto the crystal structures of the L1 monomer (PDB ID 2AIO), L1 homotetramer (PDB ID 2QIN), and L2 (PDB ID 1N4O) is shown in Fig. 2.
A dendrogram was then built with all 73 clinical L1 sequences, the 5 reference sequences described by Avison et al. (21), and the amino acid sequence of the FEZ-1 MBL of Legionella gormanii as the outgroup. As shown in Fig. 3, sequences grouped into four distinct clades, identified by different colors. The majority of clinical L1 enzymes were identical, with 100% amino acid identity (34 sequences, 46.6%), or highly related, with Ͼ85% amino acid identity (2 sequences, 2.7%), to the reference L1 B (red clade). Reference L1 C clustered with seventeen clinical L1 enzymes (23.3%), of which two were identical to this reference (green clade). References L1 A and L1 D clustered together with seven clinical L1s, representing 10% of the sequences (blue clade), while reference L1 E clustered with seven clinical L1s. Finally, three L1 clinical sequences were not grouped into any clade.
On the other hand, the dendrogram built using all 116 L2 sequences obtained from the clinical isolates, the four L2 (A to D) reference sequences described by Avison et al. (21), and the amino acid sequence of Klebsiella pneumoniae carbapanemase KPC-2 as the outgroup clearly divided the L2 clinical sequences into two distinct clades (Fig. 4). References L2 A, L2 B, and L2 C grouped with the majority of the clinical L2s (67 sequences; 57.8%) into the brown clade. This clade branches out into two clades: the first clusters 59 identical clinical L2s (18 sequences; 15.5%) or highly related L2s (33 sequences; 28.4%) to the L2 B reference, whereas the other branch groups references L2 A and L2 C with the remaining 8 sequences. The green clade clusters 47 clinical L2s (40.5%) with reference L2 D. This clade branches out into 4 clades.
Further MSAs carried out with the sequences of each clade helped identify repeated sequences, as well as unique L1 and L2 variants. A comparison with the reference types identified unique substitutions yielding 34 new variants of the L1 MBL and 43 new variants of L2, with percent identities ranging from 78% to 99% and from 58% to 99%, respectively (Fig. S5). A list of all the substitutions found in the clinical isolates is shown in Tables S1 and S2.
Finally, BIS2Analyzer was used to analyze amino acid variation within L1 and L2 variants. This is a server-based program tailored to detect residue covariation in highly conserved alignments and thus identify protein regions that might not be essential for function but do contribute to relevant aspects of structure and function (hence they are not strictly conserved but not fully tolerant of substitutions). Among the uniquesequence L1 variants, BIS2Analyzer detected two clusters of covarying residues, which were mapped onto the crystal structure of the monomer (PDB ID 2AIO) and tetramer (PDB ID 2QIN) in Fig. 5. The analysis revealed that strictly conserved residues involved in zinc binding and in tetramerization were surrounded by residues that tolerate substitution, but under covariation constraints. The cluster highlighted in green in Fig. 5 arises from covariation of residues surrounding zinc-binding residues and Met175, a residue key for tetramerization (24). The cluster highlighted in blue contains additional covarying residues that map to the tetramerization interfaces.
On the unique amino acid sequences of L2 variants, Bis2Analyzer identified a single cluster of residues that tolerate substitutions under covariation constraints. Figure 6 shows that residues of this network are scattered throughout most of the structure, enriched around the active site and in the ⍀-loop.

DISCUSSION
In this compilation of S. maltophilia clinical strains, collected over a 10-year period across the United States, we found that the majority (56%) of isolates were recovered from persons with CF or from respiratory samples, 21% of the strains were recovered from blood, and minor percentages were recovered from urine (2.3%) and soft tissue infections (3%). Although the predominance of respiratory samples is consistent with other studies, the small sample size prevents us from drawing any conclusion regarding human infection preference.
Antibiotic agents with in vitro activity against S. maltophilia include TMP-SMX, fluoroquinolones, aminoglycosides, and extended-spectrum penicillins/cephalosporins (2, 25). A comparison of susceptibility data found in the strains associated with infection in this collection to those previously reported for strains collected in the United States in the last 2 decades shows that the resistance toward TMP-SMX, ciprofloxacin (CIP), TIC-CLV, and ceftazidime (CAZ) is increasing (2,27,28,29). For instance, the percentage of strains susceptible to TMP-SMX in 1997 to 1999 was 95%; in 2001, it was 90.5%; in 2003 to 2008, it went up again to 97%; and in our collection, it ranged from 79% to 96%, being lower in the isolates collected from blood, likely reflecting collection after treatment of the patient with antibiotics, including TMP-SMX. Similarly, the susceptibility reported for CIP in 1997 to 1999 was 55%; in 2001, it was 52%; and in this study, it was 25% to 38%, the lowest corresponding to isolates from the bloodstream infections. The situation is more dramatic for the ␤-lactams: the susceptibility to True covarying residues are displayed in darker colors (red and dark blue), whereas residues affected by that covariance are represented in orange and a lighter shade of blue. In panels A, B, and E, clusters are shown as ribbons, whereas in panels C, D, and F, the covarying residues only are represented as dots. Panels B and D show the clusters from a different perspective, after a 180°turn. In all sets, important residues are displayed as sticks, gray for the Zn-binding residues, cyan for Met175, the main residue involved in tetramerization, and magenta for some other secondary interaction within the tetramer. Zn(II) ions are represented as black spheres, and active-site loops are shown in green.  (2,(27)(28)(29). A downward trend of susceptibility to antimicrobial agents and an upswing in the frequency of isolates with multidrug resistance were also recently found in a study conducted in China on 300 S. maltophilia strains collected over a 10-year period (16). In that study, however, the percentage of strains susceptible to minocycline (MIN) was found to be also decreasing, whereas our data show outstanding rates of susceptibility to this tetracycline. Importantly, we also show excellent susceptibility rates for the novel combination ceftazidime-avibactam and aztreonam (CZA-ATM), especially when the aztreonam is used at 8 mg/liter. Remarkably, we demonstrate that the susceptibility to CZA-ATM of the strains recovered from blood and from other infections (most of them respiratory samples) is higher than the susceptibility to TMP-SMX, which is the "first-line" antibiotic to treat S. maltophilia infections. The activity of CZA-ATM toward the strains recovered from patients with CF is comparable to the activity of TMP-SMX. Therefore, these results confirm previous findings regarding the in vivo and in vitro efficacy of CZA-ATM against S. maltophilia (13,14). Further observational and controlled studies are planned to add clinical data on the utility and safety of CZA-ATM therapy.
Previous genotyping studies showed that S. maltophilia is a highly heterogeneous species (18,30). The identification of a number of different STs and previously unrecognized allelic combinations demonstrates the high genetic diversity of the study sample. Consequently, specific associations between the genetic background and either the clinical origin, place, or time of collection were not identified. Nevertheless, ST5, previously found to be the predominant ST in France and reported in Germany, Austria, and Korea, was the most common ST in our collection (20,31,32). Larger studies should be performed to confirm the putative success of this genetic background, since only 6 of 83 strains were ST5 in the French study and 9 of 130 representatives of this ST were identified in the present study.
The initial genomic characterization of S. maltophilia by Hauben et al. (18) using amplified fragment length polymorphism (AFLP) suggested an association between some genomic groups and environmental or human sources (19). In that study, some genomic groups contained exclusively environmental strains (4, 9, and 10), whereas others, such as genomic groups 1, 6, and 7, included mostly clinical isolates (18). These findings were not only confirmed but also expanded by the MLST scheme proposed by Kaiser et al. (20). Since then, several other studies employed the genomic group determination based on the concatenation and alignment of the sequences corresponding to the seven housekeeping genes included in the S. maltophilia MLST scheme as an efficient approach to study the population structure of this pathogen (32)(33)(34). In line with other studies, we also report the predominance of genomic groups 6, 1, and C among our clinical samples, all of which have previously been associated with human sources (20,31). Moreover, our results show a putative association of genomic group 6 with B-like and C-like type L1 enzymes and ABC-like type L2 variants (Fig. 7).
The apparent emergence of strains with unique genetic backgrounds associated with human pathogenicity may indicate an ongoing process by which S. maltophilia adapts to this specific niche. Such adaptation processes might be driven by conditions A closer look into the ␤-lactamase-encoding genes reveals their molecular heterogeneity. Among the 130 strains included, we identified 34 novel allelic variants of bla L1 and 43 of bla L2 . In both cases, variable residues are scattered throughout the protein.
As observed in all cases of MBL allelic variants, the identity of key residues is preserved, such as the metal ligands in L1 (His116, His118, and His196, which coordinate Zn1, and Asp120, His121, and His263, which coordinate Zn2), which are fully conserved. The active site of L1 MBL is completed by the second-shell residues, which maintain the orientation of the Zn-binding residues through an extensive hydrogen-bonding network, in addition to hydrophobic interactions (35). Residues involved in the H-bonding network include Asp68, Asp84, Asp153, and Ser221. Interestingly, our MSA identified Asp68 as one of the most variable residues in L1; in fact, substitutions for Asp68 were found in 7 out of the 73 L1 sequences, being replaced by Asn, Gly, and Ser residues, i.e., the three smallest and most flexible amino acids (see Table S1 in the supplemental material). Studies of the second-shell residues Ser84 and Gly121 in SPM-1 (36) and Ser262 in IMP-1 (37, 38) B1 MBLs have demonstrated that these residues can modulate substrate preference. Considering that substitutions which tune flexibility result in modulation of the substrate spectrum in the related MBL from Bacillus cereus (39), we hypothesize that Asp68 substitutions in L1 could potentially result in subtle changes in the substrate preference.
The metal center of L1 is surrounded by two protruding loops, ␣3-␤7 and ␤12-␣5. The ␣3-␤7 loop comprises 22 residues (Arg148 to Arg172) and extends over the active site. Residues located at this flexible loop are thought to make hydrophobic interactions with large hydrophobic motifs at C-2 or C-6 of the ␤-lactam substrates and consequently determine the enzyme's substrate spectrum (35,40,41,42). On the other hand, the ␤12-␣5 loop includes 20 residues (Arg219 to Leu240) and sits adjacent to the active site. From this loop, two important residues project into the active site and serve as an indirect ligand of Zn2 (Ser221) and as a second-shell residue of His196 (Ser223) (35). In our set of 73 clinical isolates, 3 isolates were found carrying two novel substitutions at the ␣3-␤7 loop: Gly158Ala and (Gly/Glu)161Asp. Gly158 is located at the tip of the loop, next to Phe157, whose role in substrate binding and catalysis has been evidenced by molecular modeling, crystal structures of L1 in complex with ligands, directed mutagenesis, and kinetic studies (35,(40)(41)(42). The Gly158Ala substitution could disturb the positioning of Phe157 by restricting the flexibility of the peptide chain. Likewise, the Gly161Asp substitution could alter the enzyme's catalytic efficiency, as is the case for Gly161Glu, which decreases the overall performance of the L1 E variant toward nitrocefin and imipenem, compared with that of the L1 A variant (21). Notably, substitutions at this residue are found in isolates that cluster together with the L1 E variant (Fig. 3 and Table S1).
Five isolates carried novel substitutions in the ␤12-␣5 loop: Gln230Arg, Gly233Asp, (Lys/Gln)232Arg, and (Pro/Ala)235Val. Although none of these residues has been shown to directly interact with the substrate, catalytically important residues like Ser221 and Ser223 protrude into the active site from this loop. Ser221 coordinates the Zn2-bound apical water, whereas Ser223 directly interacts with the substrate C 4 carboxylate group (42). Consequently, substitutions that lead to changes in the overall conformation of this loop, either by strengthening or weakening the H-bonding network that maintains it in place or by changing its electrostatic characteristics, could ultimately affect substrate catalysis.
L1 is an unusual ␤-lactamase, since it is biologically functional as a homotetramer (24,35). The crystal structure of the L1 enzyme submitted under PDB ID 2QIN shows that the tetramer is formed by three main sets of intramolecular interactions that result in each subunit making contact with each of the other three (35) (Fig. S6). First, there are hydrophobic interactions and hydrogen bonds between residues located in the extended N terminus. Second, Leu26 and Leu29 from subunit B (and D) dock in hydrophobic pockets formed by Ala31 and Tyr32 and by Met87, Pro89, Gln90, Met91, and His94 of subunit A (and C), respectively. Remarkably, none of our clinical strains demonstrate any variation affecting these interactions. In fact, only the L1 E variant possesses a His94Tyr substitution. In the third and most important intramolecular interaction, the side chain of Met175 from subunit C (and B) penetrates a hydrophobic pocket formed by several residues in subunit A (and D), including Pro237, Tyr236, Leu154, and Pro198. The importance of Met175 in the tetramer formation was demonstrated by Simm et al., when their Met175Asp variant produced only monomeric L1 (24). Conservation of these residues among all clinical samples and the five L1 references indicates that tetramerization is fundamental for the function of this enzyme.
A minor set of interactions maintain contact between subunits A and D and between subunits B and C of the homotetramer. In this, Lys135 in subunit A (and B) form hydrogen bonds with Lys129, Thr132, and Ala134 in subunit D (and C). Likewise, Arg107 in subunit A (and B) form hydrogen bonds with Asp78 in subunit D (and C). In contrast to what was observed for the main interactions, sequence variability was found at positions 129, 135, and 132 in the clinical isolates. In fact, the identity of residues at position 135 is quite variable, as it can be a Glu, His, Lys, Arg, Met, or Ser. In the other cases, only one substitution was found (Lys129Met or Thr132Ser). Given that this set of interactions plays a minor role, it is unlikely to affect tetramerization.
In contrast to the L1 ␤-lactamase, sequences of the L2 ␤-lactamase were of two distinct types, sharing approximately 70% homology. Of the 116 L2 sequences obtained, 67 (59%) belonged to type ABC and 47 (41%) belonged to type D. As expected, we did not find mutations in the highly conserved structural motifs of class A ␤-lactamases: the active-site pocket Ser70-X-X-Lys73, the SDN loop Ser130-Asp131-Asn132, and Lys234-Thr(Ser)235-Gly236. Within the SDN loop, Ser130 is part of a critical proton shuttle during both the acylation and deacylation of ␤-lactams (43)(44)(45). Furthermore, the SDN loop is associated with maintaining the structure of the active-site cavity, enzyme stability, and stabilization of the enzyme transition state (43). As a result, the alpha-helixes before and after this motif are expected to be highly conserved. However, we found two L2 variants (three total sequences) containing the mutations Thr126Ala and Thr129Ile, just preceding the SDN loop. The hydroxyl side chain of both threonine residues interacts directly with Asp131 (of SDN) in the L2 crystal structure (PDB ID 1N4O). Loss of this interaction could directly affect the catalytic mechanisms of these enzymes by changing the structure of the active-site pocket, including the positioning of Ser130.
Another important structural element of serine ␤-lactamases is the ⍀-loop, which encompasses residues 164 to 179 and constitutes the "back wall" or "floor" of the active site that shapes the binding cavity (46). As expected, important residues within this loop, like Arg164, Glu166, and Asp179, were conserved among all the L2 variants. However, as shown in Fig. S4B, we found a significant number of substitutions in this region, including Leu165Asn and Ser171Leu. In the L2 enzyme submitted under PDB ID 1NO4, Leu165 forms a small hydrophobic cluster underneath the polar surface, together with Phe139, Pro145, and Leu169. However, the slightly exposed position of residue 165 would perfectly allow for a polar amide side chain of an Asn to come into contact with the surrounding solvent and possibly form an H bond with Asn136. On the other hand, the OH group of Ser171 snorkels out to the surface and is surrounded by three pockets with crystallographic water. Substitution by Leu could be relaxed by displacing water molecules from one of these cavities to locate the longer, hydrophobic side chain. Since the ⍀-loop plays a crucial role in substrate recognition and catalysis, amino acid substitutions in this loop are associated with the expansion of the ␤-lactamase substrate spectrum (47)(48)(49). The study of the effect of these novel substitutions on the catalytic activity and inhibitor binding to L2 is warranted (Fig. S7).
Finally, the Leu103Gln substitution was observed in a single sequence. The change from a hydrophobic to a polar side chain could have an impact, especially since this amino acid resides on the loop located at the entrance of the active site (residues 102-110), alongside the critical His105, which forms direct interactions with ␤-lactam substrates. An amino acid substitution at position 103 could alter the position of this loop, as well as the orientation of residue 105 (50)(51)(52). The remaining nonconservative substitutions occurred almost exclusively in catalytically unimportant regions on the periphery of the enzyme, including the N and C termini and the solvent-exposed regions. The effect of these substitutions is yet to be determined.
Conclusions. In this work, we explored the microbiological characteristics, population structure, and sequence diversity of the ␤-lactamase-encoding genes in clinical isolates of S. maltophilia. Overall, our results raise the concern of an increasing trend in resistance toward traditional antibiotics, such as TMP-SMX, and highlight the excellent in vitro performance of the novel combination CZA-ATM against this pathogen. Our data also confirm the predominance of genomic group 6 among strains responsible for hospital-associated infections. The putative association of this genomic group with specific variants of L1 and L2 points to an adaptation of S. maltophilia to human pathogenicity and warrants further studies aimed at determining the biochemical characteristics of these enzymes. Second-shell mutations in MBLs and mutations in the ⍀-loop of class A enzymes have been shown to affect substrate and inhibitor binding. Therefore, the allelic variations should be taken into account for the design of effective inhibitors and the preservation of current antibiotics. Where breakpoints for S. maltophilia were not available, those for Pseudomonas aeruginosa were used, as has been previously done (13).

MATERIALS AND METHODS
MLST. Total DNA was extracted using the UltraClean microbial DNA isolation kit (MoBio Laboratories, Carlsbad, CA) and used as the template to amplify and sequence seven housekeeping genes of S. maltophilia, namely atpD, gapA, guaA, mutM, nuoD, ppsA, and recA, according to the protocol proposed by Kaiser et al. (20) and the S. maltophilia multilocus sequence typing (MLST) database available at http://pubmlst.org/smaltophilia/. Clonal analysis was performed using eBURST v3 (http://eburst.mlst.net). Additional genetic relatedness was estimated based on alignments of the concatemers of the seven allelic sequences of the seven housekeeping genes (final length, 3,591 nucleotides [nt]) generated using Clustal Omega and obtained using iTOL (https://itol.embl.de), using branch length, and rooted on corresponding Stenotrophomonas spp. (not S. maltophilia) sequences. Genomic groups were assigned based on the initial classification by Hauben et al. and expanded by Kaiser et al. (18,20).
Sequence analysis. DNA and deduced protein sequences were analyzed by using the Lasergene Molecular Biology Suite software (DNASTAR, Inc., Madison, WI). Mature protein sequences were obtained after identification and removal of the signal peptide by using standard parameters on the SignalP 4.1 server (http://www.cbs.dtu.dk/services/SignalP/). The protein sequence logo was generated at the host server of the University of California at Berkeley (http://weblogo.berkeley.edu/logo.cgi). The amino acid sequence of mature proteins was aligned using the Clustal Omega algorithm against L1 and L2 reference variants (21), and unique substitutions were identified. Protein dendrograms were plotted using iTOL (https://itol.embl.de/), based on the guide tree constructed by Clustal Omega using the unweighted pair group method using average linkages (UPGMA method). Discovery Studio (Dassault Systèmes BIOVIA, Discovery Studio Modeling Environment, release 2017, San Diego, CA; Dassault Systèmes, 2016) software was used to map unique substitutions onto the crystal structure of the L1 monomer (PDB ID 2AIO), the L1 tetramer (PDB ID 2QIN), and L2 (PDB ID 1N4O). Inspection of site-specific amino acid variability was guided by analysis using the PsychoProt server (53,54).
Covariance analysis. Amino acid sequences of the detected novel L1 and L2 variants plus the references previously described by Avison et al. (21) were aligned using the Clustal Omega algorithm. Alignment was then submitted to the BIS2analyzer website, which is tailored for data sets of high conservation (http://www.lcqb.upmc.fr/BIS2Analyzer/submit.php#) (55), and using the NJ algorithm, clusters of covarying amino acid residues were identified. Clusters were then mapped onto the crystal structure of the L1 monomer (PDB ID 2AIO), the L1 tetramer (PDB ID 2QIN), and L2 (PDB ID 1N4O).
Accession number(s). The nucleotide sequences of the new bla L1 and bla L2 alleles are available under NCBI reference sequence accession numbers MK871676 to MK871737 and MK888855 to MK888979, respectively.