Introduction

Transforming growth factors β (TGFβs) are members of a superfamily of conserved cytokines, which also includes bone morphogenetic proteins, activins, and inhibins1,2,3. TGFβs were discovered in the early 1980s4, and three close orthologs are found in mammals (TGFβ1, TGFβ2, and TGFβ35). They are ubiquitously expressed and secreted throughout the body2,6, where they are pleiotropically engaged in the physiology of nearly all tissues and cell types after activation and binding to TGFβ receptors of type I, II and III (TGFR-I, -II and -III)3,7,8,9,10. TGFβs mainly act through the SMAD pathway, which features a family of genes similar to Caenorhabditis “small worm phenotype” and Drosophila “mothers against decapentaplegic”. This pathway entails sequential binding of TGFR-II and TGFR-I by TGFβs to form a ternary complex that triggers phosphorylation of the latter by the former11,12. Thus, TGFβs regulate intracellular processes including transcription, translation, microRNA biogenesis, protein synthesis, and post-translational modifications1,13, which cascade into roles in growth, proliferation, differentiation, plasticity, migration, and death of epithelial and endothelial cells, as well as lymphocytes14. TGFβs are essential in biological events such as embryogenesis, wound healing, and immunity2, but also in pathologies such as Marfan syndrome, Parkinson’s disease, AIDS, organ fibrosis, autoimmune diseases, atherosclerosis and hypertension, asthma, diabetes, rheumatoid arthritis, encephalomyelitis, colitis, and most epithelial cancers, including prostate, breast, lung, colorectal, pancreatic, gastric, and skin cancers2,13,15,16,17,18,19,20.

Human TGFβs are produced as latent glycosylated precursors (pro-TGFβs), which consist of an ~250-residue N-terminal pro-domain dubbed latency-associated protein (LAP) and a C-terminal ~110-residue mature growth-factor moiety (GF). Two such precursors are joined through disulfides in the homodimeric small latent complex. These complexes are disulfide-linked to a third protein, one of three latent TGFβ binding proteins, in the large latent complexes21,22,23. After secretion, these complexes are targeted to fibrillin-rich microfibrils as inactive species that are covalently bound by tissue transglutaminase to the extracellular matrix for storage. Finally, the large latent complexes are transformed into biologically active GFs by thrombospondin 1, reactive oxygen species, integrins, and/or peptidases such as furin and other related pro-protein convertases. Proteolytic cleavage by these enzymes severs the linker between GF and LAP, but the two proteins remain noncovalently associated until physically separated for function2,10,13,24,25. In this way, most TGFβ in the body is latent and sequestered within the extracellular matrix, although it is also found on the surface of immune cells or in granules of platelets and mast cells2. Thus, localization and compartmentalization ensure tight spatial and temporal regulation of the GFs22.

Although TGFβ GFs are very similar (70–82% sequence identity in humans) and have partially overlapping functions, they also have distinct roles. This is reflected in their differential expression during embryogenesis and specific roles in renal fibrogenesis and regulation of airway inflammation and remodeling26. In particular, human TGFβ2 is biosynthesized as a latent 394-residue precursor (pro-TGFβ2) arranged as a glycosylated homodimer of 91 kDa containing three glycan chains attached to each LAP. Peptidolytic activation at bond R302-A303 (see UniProt [UP] entry P61812 for residue numbers of human TGFβ2 in superscripts27,28) produces the GF, which spans 112 residues and is arranged as a disulfide-linked 25-kDa homodimer noncovalently associated with the respective LAP moieties. Given the importance of this cytokine and its potential in therapeutic applications29,30, here we developed a new high-yield human expression system for tagged pro-TGFβ2. We report the X-ray crystal structure of its GF, which was obtained in a different crystallographic space group and deviates from current structures.

Results and Discussion

A new recombinant overexpression system for pro-TGFβ2

After its discovery, TGFβ2 GF was initially purified from human and porcine platelets31,32 and from bovine bone33. However, to obtain large amounts, recombinant expression systems are generally the option of choice as the resulting homogeneity and purity is greater than that of proteins purified from tissues or fluids. Mature human and mouse TGFβ2 GFs were obtained in high yields (~8–10 mg/liter of cell culture) from Escherichia coli systems34. However, these proteins lacked glycosylated LAP, which has (glycan-mediated) functions beyond latency maintenance35,36,37,38. In addition, these systems produced insoluble protein in inclusion bodies that had to be refolded34,39. From a functional perspective, mammalian expression systems are preferred for human cytokines as they provide native environments for folding and disulfide formation in the endoplasmic reticulum, glycosylation in the Golgi, and subsequent quality control in the endoplasmic reticulum. These factors ensure that only correctly folded proteins are secreted. However, the development of such systems has proven difficult for TGFβs29,34,40,41. Most initial trials—based on murine CHO and human HEK293 cells—outputted well below ~1 mg of pure protein per liter of cell culture, owing to low expression levels and multiple purification steps42,43,44,45. It was not until 2006 that Zou and Sun reported expression of pro-TGFβ2 from stable CHO-lec cell lines with a significantly higher yield40. However, this system was based on stable cell lines, which generally require long selection processes for establishment, are prone to contamination, require more time for preparation and for protein expression, and have little flexibility with respect to the constructs tested. In addition, the lack of equivalence of recombinant proteins produced in CHO and HEK cells owing to differences in the glycosylation patterns between hamsters and humans has been widely documented46,47,48.

Here, we developed a new homologous production procedure for recombinant full-length human pro-TGFβ2 based on transient transfection of human Expi293F cells, a HEK293 cell-line variant that was adapted to high-density suspension growth and selected for high transfection efficiency and protein expression. Such transient expression systems are flexible, the constructs can be changed and retested fast, they have comparable yields to the stable systems, and they can be much more easily shared among scientists by just sending the plasmid49. We obtained ~2.7 and ~2.3 mg of N-terminally octahistidine-tagged and Strep-tagged forms, respectively, per liter of cell culture in only three days (Fig. 1A–D). The protein was in a disulfide-linked dimeric state, and a major fraction was cleaved at the inter-domain linker due to endogenous proteolytic processing, as previously reported for this and other TGFβs34,45. However, the LAP and the GF remained associated in size-exclusion chromatography and native polyacrylamide gel electrophoresis (PAGE). Our efforts to separate them by size-exclusion chromatography in the presence of high salt contents (1 M sodium chloride), chaotropic agents (1 M/4 M urea), detergents (0.05% sodium dodecylsulfate [SDS]), reducing agents (tris[2-carboxyethyl]phosphine or 1,4-dithiothreitol), and low-pH buffers (glycine pH 3.0) failed. We could not isolate them either by ionic exchange chromatography with 20 mM sodium acetate pH 4.0 as buffer and a sodium chloride gradient. This is consistent with previous reports on purified and recombinant pro-TGFβ145,50 and with the very high affinity of GF and LAP for each other, with dissociation constant values in the low nanomolar range51. In contrast, other publications reported activation of TGFβs in vitro but only under harsh conditions including extreme pH values, high temperature, presence of peptidases and glucosidases, and presence of SDS and urea34,40,52,53.

Figure 1
figure 1

Production, purification and crystallization of human TGFβ2. (A) Reducing SDS-PAGE depicting N-terminally octahistidine-tagged pro-TGFβ2 after Ni-NTA affinity purification. M, molecular mass marker; FT, flow-through; W, wash step; 1E, first elution; and 2E, second elution. Black arrows pinpoint (top to bottom) intact pro-TGFβ2, LAP, and the GF in lane 1E. (B) Reducing SDS-PAGE of fractions (F15-F21) of the size-exclusion chromatography purification step (left panel) and Western-blot analysis of fractions F15-F21 employing an anti-histidine-tag antibody (right panel). (C,D), same as (A,B) for N-terminally Strep-tagged pro-TGFβ2. In (D), an anti-Strep-tag antibody was used. (E) Representative tetragonal crystals of mature TGFβ2 of ~20 microns maximal dimension. (F) Reducing SDS-PAGE of ~90 collected, carefully washed and dissolved diffraction-grade crystals revealing they contain mature TGFβ2 (black arrow). (G) Gelatin zymogram of pooled and purified crystallization drop supernatant showing a band pinpointed by an arrow corresponding to the mass of human α2-macroglobulin associated with gelatinolytic activity.The original gels used for panels A-D, F and G can be found in the SupplementaryInformation.

Separation and crystallization of mature TGFβ2

During studies of the interactions between induced human α2-macroglobulin (hα2M), a ~720-kDa homotetrameric pan-peptidase inhibitor purified from blood, and recombinant human pro-TGFβ2 (Marino-Puertas, del Amo-Maestro, Taulés, Gomis-Rüth & Goulas, manuscript in preparation), a mixture of both proteins was set up for crystallization. Diffraction-grade protein crystals appeared after five days with 20% isopropanol, 0.2 M calcium chloride, and 0.1 M sodium acetate pH 4.6 as reservoir solution (Fig. 1E). Reducing SDS-PAGE (Fig. 1F) and peptide mass fingerprinting of carefully washed and dissolved crystals indicated that the crystallized species was the GF. The crystals belonged to a hitherto undescribed, tightly-packed tetragonal space group, diffracted to 2.0 Å resolution, and contained half a GF disulfide-linked dimer per asymmetric unit. Diffraction data processing statistics are provided in Table 1. These crystals also appeared when pro-TGFβ2 alone was subjected to similar crystallization conditions. In this case, the crystals diffracted to lower resolution, i.e. hα2M apparently played a favorable role as an additive for crystallization. To pursue this further, we collected the supernatant from crystallization drops that had given rise to crystals, purified it by size-exclusion chromatography, and assayed a fraction that migrated according to the mass of hα2M by gelatin zymography. We detected gelatinolytic activity (Fig. 1G), which points to a contaminant present in the purified hα2M sample. Accordingly, we conclude that the low pH of the crystallization assay, together with the crystallization process, achieved the separation of the LAP and GF moieties that we could not obtain by chromatography (see above). This process was probably facilitated by a peptidolytic contaminant present in the purified hα2M sample, as peptidases trapped within the induced tetrameric hα2M cage are known to still possess activity54,55,56.

Table 1 Crystallographic data.

Structure of mature TGFβ2

The crystal structure was solved by maximum likelihood-scored molecular replacement, which gave a unique solution for one GF per asymmetric unit at final Eulerian angles and fractional cell coordinates (α, β, γ, x, y, and z) 348.4, 21.1, 119.9, 0.127, 0.696, and −0.899. The initial values for the rotation/translation function Z-scores were 6.0/14.0 and the final log-likelihood gain was 132. Taken together, these values indicated that P41212 was the correct space group, and the final model after model rebuilding and refinement contained residues A303-S414 and 23 solvent molecules. Segments P351-W354 and N371-S377 were flexible but clearly resolved in the final Fourier map for their main chains. Table 1 provides refinement and model validation statistics.

Human TGFβ2 GF is an elongated α/β-fold molecule consisting of an N-terminal helix α1 disemboguing into a fourfold antiparallel sheet of simple up-and-down connectivity (Fig. 2A). Each strand is subdivided into two, β1 + β2, β3 + β4, β5 + β6, and β7 + β8, and the sheet is slightly curled backwards with respect to helix α1. The most exposed segment of the moiety is the tip of hairpin β6β7, and β-ribbon β2β3 is linked by a second α-helix (α2) as part of a loop, whose conformation is mediated by the cis conformation of residue P338. Finally, a long loop segment connects strands β4 and β5 and contains helix α3, whose axis is roughly perpendicular to the β-sheet. TGFβ2 is internally crosslinked by four disulfides forming a cysteine knot (Fig. 2A) and a further intermolecular disulfide by symmetric C379 residues links two crystallographic symmetry mates to yield the functional dimer. Here, helix α3 of one protomer nestles into the concave face of the β-sheet of the other protomer (Fig. 2B). The respective tips of hairpins β6β7, as well as β-ribbons β2β3 with connecting helices α2, are exposed for functional interactions.

Figure 2
figure 2

TGFβ2 in a new conformation. (A) Ribbon-type plot of human TGFβ2 (left panel) and after vertical rotation (right panel). The eight β-strands β1 (residues C317-R320), β2 (L322-D325), β3 (G340-N342), β4 (F345-A347), β5 (C379-S382), β6 (D384-I394), β7 (T397-S404), and β8 (M406-S414), as well as the three helices α1 (A306-F310), α2 (F326-L330), and α3 (Q359-I370), are labeled, as are the N- and the C-terminus. The four intramolecular disulfides are depicted for their side chains and labeled (C309-C318), (C317-C380), (C346-C411) and (C350-C413). The cysteine engaged in a symmetric intermolecular disulfide (C379) is further labeled as . (B) Human mature TGFβ2 dimer with one protomer in the orientation of A (left panel) in blue and the second in pale yellow (left), which is related to the former through a horizontal crystallographic twofold axis. An orthogonal view is provided in the right panel, the intermolecular disulfide is depicted as red sticks. (C) Superposition of the Cα-traces in ribbon presentation of the dimers of previously reported structures of mature TGFβ2 (PDB 2TGI, pale yellow; PDB 1TFG, red; PDB 4KXZ dimer AB, green; and PDB 4KXZ dimer DE, aquamarine) after optimal superposition of the respective top protomers. Magenta arrows pinpoint the only points of significant deviation, i.e. the tips of respective β-ribbons β6β7. The view is that of B (right panel). (D) Superposition of the protomers of PDB 2TGI in pale yellow onto the current structure (PDB 6I9J) in the view of A (left panel). The region of largest deviation (Y352-C380) is pinpointed by magenta arrows and framed. (E) Close-up in cross-eye stereo of the framed region of (D), with ribbon and carbons in pale yellow for PDB 2TGI and in blue/cyan for PDB 6I9J. Y352 and C380 are pinpointed by a magenta and a green arrow, respectively. (F) Superposition of the dimers of PDB 6I9J (top protomer in blue, bottom protomer in aquamarine) and PDB 2TGI (top protomer in pale yellow, bottom protomer in orange) in the views of (B). Owing to the different chain traces of segment Y352-C380 (top magenta arrow in the left panel, see also [D]), substantial variations are observed at distal regions of the bottom protomers. (G) Close-up in stereo of F (left panel) depicting PDB 6I9J (dimer in blue/aquamarine); PDB 2TGI and 4KXZ dimer AB, both in pale yellow/orange; and human TGFβ3 as found in its complex with the ectodomains of human TGFR-I and -II (PDB 2PJY; protomers in pale yellow and red). (H) Superposition in stereo of the dimers of PDB 6I9J (dimer in blue/aquamarine) and human TGFβ3 (PDB 2PJY dimer in yellow/magenta) in complex with the ectodomains of human TGFR-I (dark grey) and -II (white) in the orientation of B (right panel). TGFβ2 in PDB 6I9J must rearrange to bind the receptors as performed by TGFβ3.

Comparison with previous TGFβ2 structures

The structure of isolated human TGFβ2 GF was solved in 1992 by two groups simultaneously. They obtained the same trigonal crystal form with half a disulfide-linked dimer per asymmetric unit (Protein Data Bank access codes [PDB] 1TFG57 and 2TGI58, see Table 2). In 2014, the complex of the GF with the Fab fragment of a neutralizing antibody was reported in an orthorhombic crystal form (PDB 4KXZ;26). These crystals contained two GF dimers per asymmetric unit, each bound to two Fab moieties. Three more structures of TGFβ2 GF were reported in 201759 (Table 2). These corresponded to engineered monomeric forms from human (PDB 5TX4; in complex with TGFR-II ectodomain) and mouse (PDB 5TX2 and 5TX6) that lacked helix α3 and encompassed several point mutations.

Table 2 Crystal structures of TGFβ2.

Superposition of the GF structures of PDB 2TGI, 1TFG, and 4KXZ (Fig. 2C) revealed very similar conformations of bound and unbound protomers and dimers, despite distinct chemical and crystallographic environments, which can generally contribute to different conformations owing to crystallographic artifacts60. The rmsd values with respect to PDB 2TGI (considered hereafter the reference structure) upon superposition of one protomer were 0.26 Å (1TFG), 1.05 Å (4KXZ dimer AB), and 1.22 Å (4KXZ dimer DE) for 112, 111, and 111 common Cα atoms, respectively. Only minor variations occurred at the tips of hairpins β6β7 and at helix α2 (Fig. 2C). Interestingly, also the engineered monomeric variants kept the overall structure of the native TGFβ2 protomer, with the exception of a loop that replaced helix α359. This close similarity was also found with the unbound human ortholog TGFβ3 (PDB 1TGJ61).

Inversely, the present GF structure (PDB 6I9J; hereafter the current structure) showed deviations from the reference structure, which are reflected by an rmsd of 1.90 Å for 107 common Cα atoms. The central β-sheets nicely coincide and only minor differences are found within the deviating regions of the above GF structures. However, a major rearrangement is found within segment Y352-C380, which encompasses helix α3 plus the flanking linkers to strands β4 and β5 (Fig. 2D,E). Owing to an outward movement of segment W354-D357, α3 is translated by maximally ~5.5 Å (at I370). This, in turn, causes downstream segment E373-C379 to be folded inward, with a maximal displacement of ~6.7 Å (at A374). This displacement cascades into a slight shift (~2 Å) of C379 and, thus, the intermolecular disulfide. These differences within protomers further result in variable arrangement of the dimers (Fig. 2F). The second protomer is rotated by ~10°, so that K396 at the tip of hairpin β6β7 is displaced by ~9 Å. In addition, the central β-sheets overlap but are laterally shifted with respect to each other.

Finally, inspection of the respective crystal packings of the current and reference structures (Fig. 3A,B) reveals that while the latter is loosely packed in a hexagonal honeycomb-like arrangement, with large void channels of > 40 Å diameter and high solvent content (61%), the former is tightly packed (solvent content of 43%). In particular, the tip of hairpin β6β7 is not engaged in significant crystal contacts in either crystal form, which supports that both conformations are authentic and do not result from crystallization artifacts.

Figure 3
figure 3

Crystal packing. (A) Cross-eye stereoplot showing the crystal packing of the current TGFβ2 structure (PDB 6I9J), with the protomer in the asymmetric unit in red and the surrounding symmetry mates in gold, plum and aquamarine. The tip of β-ribbon β6β7 is pinpointed by a blue arrow. (B) Same as (A) showing the crystal environment of the reference TGFβ2 structure (PDB 2TGI). The top right inset shows a view down the crystallographic threefold axis to illustrate the solvent channels. The protomer in the asymmetric unit is in red, the symmetry mates setting up the crystal lattice in blue.

Potential implications of a new dimeric arrangement

Structural information on TGFR binding is available for human TGFβ1 and TGFβ3, whose wild-type forms were crystallized in complexes with the ectodomains of TGFR-I and -II (PDB 3KFD62; PDB 1KTZ63; and PDB 2PJY12). In the binary complex of TGFβ3 with TGFR-II (PDB 1KTZ), the cytokine was observed in a unique dimeric arrangement, termed “open”, in which the respective helices α3 were completely disordered. This led the second protomer to be rotated by ~180° and the tip of hairpin β6β7 to be displaced by ~36 Å when compared with the reference structure63. The functional significance of this structure, which differs from any other mature TGFβ dimer, is not clear.

By contrast, the other two complex structures, which contained both receptors, showed that the TGFβ1 and TGFβ3 isoforms dimerize very similarly to the unbound reference structure when bound to receptors (rmsd of 1.43 Å and 1.61 Å for 222 and 219 common Cα atoms for PDB 3KFD and PDB 2PJY, respectively). In contrast, they differed from the current structure (Fig. 2G), which shows a rmsd of 2.94 Å for 216 common Cα atoms when compared with the reference. In addition, both cytokine isoforms—and by similarity probably also TGFβ2—bound their receptors in an equivalent manner. TGFR-II was contacted by the tip of hairpin β6β7 and the linker between α2 and β3 of one protomer, and TGFR-I was liganded by the interface between protomers created by strands β5-β8 from one protomer and α1 plus the region around α3 of the other protomer. In addition, the TGFRs further contacted each other through the N-terminal extension of TGFR-II (Fig. 2H). Notably, the binding mode of TGFR-II was shared with the engineered monomeric variant of TGFβ2 (PDB 5TX459). In contrast, superposition of the cytokine dimers of the current structure and the triple complexes evinced that among other structural elements hairpin β6β7 is displaced away from interacting segment S52-E55 of TGFR-II (see PDB 2PJY) by ~2.7 Å. This hairpin rearrangement also impairs interaction of the β5β8 ribbon with TGFR-I segment I54-F60. Thus, TGFRs could not bind the current structure in the same way they bind TGFβ1 and TGFβ3 without rearrangement.

Conclusion

We developed a recombinant human overexpression system based on Expi293F cells grown in suspension, which produces high yields of well-folded human N-terminally histidine- or Strep-tagged pro-TGFβ2 with native post-translational modifications.

We further crystallized mature TGFβ2 in a different crystallographic space group, which deviates mainly in the region around helix α3 from current functional TGFβ1, TGFβ2 and TGFβ3 structures. Importantly, this region is the segment that shows the highest variability in sequence among TGFβs26. Although crystal packing artifacts cannot be completely ruled out, the fact that several different crystal forms of the three TGFβ GFs had produced highly similar structures to date suggests that the new conformation may be authentic and have functional implications.

The divergent conformation of the region around helix α3 led to differences in the dimer, which could not bind cognate TGFR-I and -II in the same way as TGFβ1 and TGFβ3 do. Moreover, as revealed by pro-TGFβ1 structures from pig (PDB 5VQF64) and human (PDB 5VQP65 and PDB 6GFF66), this region differed substantially between latent and mature moieties owing to the presence of the respective LAPs. Hence, the GF protomers associated differently within the respective dimers. Large differences were also found between the mature form and the pro-form of the more distantly related TGFβ family member activin A (PDB 2ARV67 and PDB 5HLY68). In contrast, bone morphogenetic factor 9 showed deviations in hairpin β6β7 but kept the dimer structure (PDB 5I0569 and PDB 4YCG70). Overall, we conclude that our mature structure may represent an inactive variant or be one of an ensemble of conformational states, which may still undergo an induced fit or selection fit mechanism to form a functional ternary ligand-receptor complex.

Materials and Methods

Protein production and purification

The coding sequence of human pro-TGFβ2 without its signal peptide, i.e. spanning the LAP (L21-R302) and the GF (A303-S414), was inserted in consecutive PCR steps into the Gateway pCMV-SPORT6 vector (ThermoFisher) in frame with the Kozak sequence. The signal peptide from the V-J2-C region of a mouse Ig κ-chain was used instead of the native leader sequence, as this was reported to be more efficient for expression40. This vector attached an N-terminal octahistidine-tag to the protein of interest. For the five PCR steps, oligonucleotides 5′-TCACCACCACCATCATCTCAGCCTGTCTACCTGCAGCA-3′; 5′-GGTTCCACTGGTGACCACCACCATCACCACCACCATC-3′; 5′-GGGTACTGCTGCTCTGGGTTCCAGGTTCCACTGGTGAC-3′; 5′-GACAGACACACTCCTGCTATGGGTACTGCTGCTC-3′; and 5′-CAATCCCGGGGCCACCATGGAGACAGACACACTCC-3′ were used as forward primers, respectively, and 5′-CAATCTCGAGCTAGCTGCATTTGCAAGACTTTAC-3′ was employed as the reverse primer. The intermediate PCR products were purified with the EZNA Cycle Pure Kit (Omega Bio-tek, USA) prior to the next step. The final PCR product was digested twice with SmaI and XhoI restriction enzymes (1 h at 37 °C and o/n at 37 °C), with an intercalated PCR product purification step. This product was ligated into pre-digested pCMV-SPORT6 by adding 2 μL of T4 DNA Ligase (ThermoFisher) per 20 μL of reaction (o/n at r.t.). The resulting plasmid (pS6-TGFB2-H8) was verified for its sequence (GATC Biotech) and transformed into competent Escherichia coli DH5α cells for vector storage and production.

Prior to transfection into mammalian cells, pS6-TGFB2-H8 was produced in E. coli DH5α, purified with the GeneJET Plasmid Maxiprep Kit (ThermoFisher) according to the manufacturer’s instructions, and stored in Milli-Q water at 1 mg/mL. Expi293F cells (ThermoFisher), which had been kept in suspension in FreeStyle F17 Expression Medium (Gibco) plus 0.2% Pluronic F-68 at 150 rpm in a humidified atmosphere with 8% CO2 in air at 37 °C, were transfected at a density of 1 × 106 cells/mL with a mixture of 1 mg of purified pS6-TGFB2-H8 and 3 mg of linear 25-kDa polyethylenimine (Polysciences) in 20 mL of Opti-MEM Medium per liter. The mixture of the reagents was incubated for 15–20 min at room temperature and then added dropwise to the cells in 1 L-disposable Erlenmeyer flasks (Fisherbrand) after proper mixing. Cells were harvested for 72 h and then centrifuged at 2,800 × g in the JLA9.1000 rotor of an Avanti J-20XP centrifuge (Beckman Coulter) for 20 min, and the supernatant was subjected to single-step Ni-NTA affinity chromatography (washing buffer: 50 mM Tris·HCl pH 8.0, 250 mM sodium chloride, 20 mM imidazole; elution buffer: 50 mM Tris·HCl pH 8.0, 250 mM sodium chloride, 300 mM imidazole). Elutions were pooled, concentrated and subjected to size-exclusion chromatography in a Superdex 75 10/300 column (GE Healthcare) attached to an ÄKTA Purifier system (GE Healthcare) at r.t. with 20 mM Tris·HCl pH 8.0, 150 mM sodium chloride as buffer. Purified protein was routinely concentrated with Vivaspin centrifugal devices (Sartorius) with a 10-kDa cutoff, and concentrations were determined by A280 in a NanoDrop Microvolume spectrophotometer (ThermoFisher). Protein identity was confirmed by peptide mass fingerprinting and purity was assessed by 10% SDS-PAGE using Tris-Glycine buffer and Coomassie Brilliant Blue or silver staining, as well as by Western-blot analysis with a histidine antibody horse radish peroxidase conjugate (His-probe Antibody H-3 HRP, Santa Cruz Biotechnology) at 1:5,000 in PBS buffer further 0.1% in Tween 20.

To produce Strep-tagged pro-TGFβ2, plasmid pS6-TGFB2-H8 was modified by PCR to replace the histidine-tag with a twin Strep-tag with an intermediate G/S spacer (sequence WSHPQFEKGGGSGGGSGGSAWSHPQFEK) by using forward primer 5′-GGTGGAGGTTCTGGAGGTGGAAGTGGAGGTAGCGCATGGAGCCATCCACAATTCGAAAAGCTCAGCTGTCTACCTGC-3′ and reverse primer 5′-CTTTTCGAATTGTGGATGGCTCCAGTCACCAGTGGAACCTGGAACCCAGAGCAG-3′. The resulting PCR product was purified with the EZNA Cycle Pure Kit, phosphorylated with 1 μL of T4 polynucleotide kinase (ThermoFisher) per 20 μL of reaction (o/n at 37 °C), and ligated as above to generate plasmid pS6-TGFB2-STREP. This plasmid was further processed and transfected into Expi293F cells for protein production as above. Cells were harvested and centrifuged as aforementioned, and the supernatant was extensively dialyzed against 100 mM Tris·HCl pH 8.0, 150 mM sodium chloride and purified by single-step affinity chromatography using Strep-Tactin XT Superflow Suspension resin (iba) according to the manufacturer’s instructions (washing buffer: 100 mM Tris·HCl pH 8.0, 150 mM sodium chloride; elution buffer: 100 mM Tris·HCl pH 8.0, 150 mM sodium chloride, 50 mM biotin). Final polishing by size-exclusion chromatography, concentration, and purity assessment followed as above, except that Western-blot analysis was performed with a Streptavidin antibody horse radish peroxidase conjugate (Streptavidin-Peroxidase Antibody from Streptomyces avidinii; Sigma-Aldrich) at 1:1000 in PBS, supplemented with 0.1% Tween 20 and 1% BSA.

Crystallization of mature TGFβ2 and diffraction data collection

Octahistidine-tagged pro-TGFβ2, mostly cleaved in the linker between LAP and GF, at ~5 mg/mL in 10 mM Tris·HCl pH 8.0, 150 sodium chloride was incubated o/n at 4 °C with human induced α2-macroglobulin (hα2M) at ~7 mg/mL in 10 mM Tris·HCl pH 8.0, 150 mM sodium chloride. The hα2M was previously purified from blood and reacted with methylamine as reported71. The hα2M/pro-TGFβ2 reaction mixture was subjected to crystallization assays by the sitting-drop vapor diffusion method at the Automated Crystallography Platform of Barcelona Science Park. Reservoir solutions were mixed by a Tecan robot and crystallization drops of 100 nL were dispensed by a Cartesian Microsys 4000 XL (Genomic Solutions) robot or a Phoenix nanodrop robot (Art Robbins) on 96 × 2-well MRC nanoplates (Innovadyne). Crystallization plates were stored at 20 °C or 4 °C in Bruker steady-temperature crystal farms, and successful conditions were scaled up to the microliter range in 24-well Cryschem crystallization dishes (Hampton Research).

Diffraction-grade crystals of ~20 microns maximal dimension were obtained at 20 °C with protein solution and 20% isopropanol, 0.2 M calcium chloride, 0.1 M sodium acetate pH 4.6 as reservoir solution from 1 or 2 μL: 1 μL drops. Carefully washed and pooled crystals revealed the presence of a species of ~12 kDa in SDS-PAGE, which was identified as TGFβ2 by peptide mass fingerprinting. Crystals were cryoprotected by immersion in reservoir solution plus 15% glycerol and flash cryocooled in liquid nitrogen. Diffraction data were collected at 100 K on a Pilatus 6 M pixel detector (Dectris) at beam line XALOC72 of the ALBA synchrotron in Cerdanyola (Catalonia, Spain). These data were processed with programs XDS73 and XSCALE74, and transformed with XDSCONV to MTZ format for the CCP4 suite of programs75. Crystals belonged to space group P41/3212 based on the systematic extinctions, contained one mature TGFβ2 molecule per asymmetric unit (VM = 2.14; 42.6% solvent contents according to76), and diffracted to 2.0 Å resolution.

Proteolytic assay of crystallization-drop supernatant

Supernatant from crystallization drops that had produced crystals of mature TGFβ2, i.e. which contained methylamine-induced hα2M, was pooled and subjected to size-exclusion chromatography in a Superose6 10/300 column (GE Healthcare) at r.t. with 10 mM Tris·HCl pH 8.0, 150 mM sodium chloride as buffer. Fractions corresponding to hα2M (~720 kDa) were concentrated with Vivaspin centrifugal devices (Sartorius) with a 30-kDa cutoff and employed for zymography studies. To this aim, 10% SDS-PAGE gels were prepared containing 0.2% (w/v) gelatin. Protein samples were subjected to electrophoresis at 4 °C with equal volume of SDS-PAGE sample buffer without β-mercaptoethanol. Gels were washed twice in 20 mM Tris·HCl pH 7.4, 150 mM sodium chloride, 10 mM calcium chloride, 2.5% (v/v) Triton X-100 for 15 min and then incubated in the same buffer without detergent for 16 h at 37 °C under gentle shaking. Gels were stained with Coomassie Brilliant Blue.

Structure solution and refinement

The structure of mature TGFβ2 was solved by likelihood-scoring molecular replacement with the PHASER77 program and the coordinates of a GF protomer crystallized in a different space group and unit cell (PDB 2TGI58). Subsequently, an automatic tracing step was performed with ARP/wARP78, which outputted a model that was completed through successive rounds of manual model building with the COOT program79 and crystallographic refinement with the PHENIX80 and BUSTER/TNT81 programs. The latter included TLS refinement.

Miscellaneous

Structure figures were made with the CHIMERA program82. Structures were superposed with the SSM program83 within COOT. A search for structural similarity against the PDB was performed with DALI84. The final model of human TGFβ2 GF was validated with the wwPDB Validation Server (https://www.wwpdb.org/validation)85 and is available at the PDB at https://www.rcsb.org (access code 6I9J).