Structural insights into the interaction between adenovirus C5 hexon and human lactoferrin

ABSTRACT Adenovirus (AdV) infection of the respiratory epithelium is common but poorly understood. Human AdV species C types, such as HAdV-C5, utilize the Coxsackie-adenovirus receptor (CAR) for attachment and subsequently integrins for entry. CAR and integrins are however located deep within the tight junctions in the mucosa where they would not be easily accessible. Recently, a model for CAR-independent AdV entry was proposed. In this model, human lactoferrin (hLF), an innate immune protein, aids the viral uptake into epithelial cells by mediating interactions between the major capsid protein, hexon, and yet unknown host cellular receptor(s). However, a detailed understanding of the molecular interactions driving this mechanism is lacking. Here, we present a new cryo-EM structure of HAdV-5C hexon at high resolution alongside a hybrid structure of HAdV-5C hexon complexed with human lactoferrin (hLF). These structures reveal the molecular determinants of the interaction between hLF and HAdV-C5 hexon. hLF engages hexon primarily via its N-terminal lactoferricin (Lfcin) region, interacting with hexon’s hypervariable region 1 (HVR-1). Mutational analyses pinpoint critical Lfcin contacts and also identify additional regions within hLF that critically contribute to hexon binding. Our study sheds more light on the intricate mechanism by which HAdV-C5 utilizes soluble hLF/Lfcin for cellular entry. These findings hold promise for advancing gene therapy applications and inform vaccine development. IMPORTANCE Our study delves into the structural aspects of adenovirus (AdV) infections, specifically HAdV-C5 in the respiratory epithelium. It uncovers the molecular details of a novel pathway where human lactoferrin (hLF) interacts with the major capsid protein, hexon, facilitating viral entry, and bypassing traditional receptors such as CAR and integrins. The study’s cryo-EM structures reveal how hLF engages hexon, primarily through its N-terminal lactoferricin (Lfcin) region and hexon’s hypervariable region 1 (HVR-1). Mutational analyses identify critical Lfcin contacts and other regions within hLF vital for hexon binding. This structural insight sheds light on HAdV-C5’s mechanism of utilizing soluble hLF/Lfcin for cellular entry, holding promise for gene therapy and vaccine development advancements in adenovirus research.

resulting in viral shedding for months (6).Historically, adenoviruses have played pivotal roles as molecular biology tools in the discovery of eukaryotic biological mecha nisms such as RNA splicing and protein folding (7,8).In addition, the non-integrating nature of adenoviruses has been harnessed in the field of gene therapy (9) where a major obstacle to overcome for the use of HAdVs in gene therapy is the presence of pre-existing immunity (10) and the relatively poor transduction efficiency in the target tissue (11).Nevertheless, after decades of research, a HAdV-based gene therapy was recently approved by regulators in the United States of America (9).In addition, vaccines based on modified adenoviruses against Ebola (12), HIV (13), ZIKV (14), RSV (15), and SARS-CoV2 (16) are currently in various phases of clinical trials or already being used in the clinic.
Species C adenoviruses infect respiratory epithelial cells by interacting with the cellular receptor CAR using the protruding fiber protein, followed by secondary interactions between the penton base and α v β 3 integrins that trigger entry (1,17,18).This fundamentally simple model is likely used in immortalized cell lines but is highly unlikely in the polarized epithelium where CAR is hidden deep within the tight junctions (19)(20)(21).However, additional models and mechanisms have been suggested to overcome this dilemma.HAdV-C5 infection may be facilitated by airway macrophages that take up invading HAdV particles resulting in the secretion of IL-8 as part of their inflammatory response (22).The secretion of IL-8 results in the apical migration of the CAR isoform CAR Ex7 and α v β3 from the basolateral membrane while enhancing the expression of the CAR Ex8 at the apical membrane (22).In addition, several CAR-independent cell entry pathways have been identified for HAdV-C5, such as the use of heparin on hepatocytes (23), α v β 5 on melanoma and breast cancer cells (24), and Toll-like receptor 4 on dendritic cells (25).Finally, several different types of HAdVs have been shown to use different soluble components present in body fluids to enhance the infection.For example, HAdV-5C has been shown to recruit and hi-jack coagulation factor X (FX) for infection of hepatocytes (23), which not only enhances infection but also controls the tropism by directing the virus to the liver.During infection, FX binds to protruding loops of the HAdV-hexon on the capsid thus allowing FX to bridge an interaction between the cellular receptor and the virus particle.Hexon is a trimeric, barrel-shaped protein that represents the main building block of the virus capsid.In addition to coagulation factors, species C HAdVs have been shown to use hLF and its proteolytic cleavage product lactoferricin (Lfcin), a highly positively charged, 49 amino acid peptide with antimicrobial activity (26), for enhanced infection of epithelial cells (27,28).Even though this mechanism is less well described, both hLF and Lfcin have been suggested to bind the HVR-1 of hexon acting as a bridge between the virus capsid and cell surface receptor(s).Interestingly, similar mechanisms have been described for species A HAdVs that bind factor IX for a more effective infection (29,30) indicating that this mechanism might be an efficient way for HAdVs to bypass initial infection using CAR.
In this study, we present a new cryo-EM structure of HAd-5C hexon at FSC 0.143 of 2.9 Å together with a detailed hybrid structure of hLF in complex with HAdV-5C hexon providing structural insight into the previously reported interaction between hLF and HAdV-C5 hexon (28).We show that hLF engages the HVR-1 of one hexon protomer via its N-terminal Lfcin region with additional stabilizing contacts between the hLF C-terminal lobe and a second hexon protomer.In addition, structure-based mutational studies identified key residues within Lfcin as well as an adjacent helix in the N-lobe.Taken together, our structural and biophysical characterizations shed more light onto the mechanism by which HAdV-C5 utilizes soluble hLF/Lfcin for cellular entry.These detailed insights may eventually aid the development of novel adjuvants for use in HAdV-C5-based gene therapy.

The interaction between HAdV-C5 hexon and hLF is charge driven
Based on a series of viral transduction experiments, it was previously suggested that the interaction between HAdV-C5 hexon and hLF involves the stretch of acidic amino acid residues in the HAdV-C5 hexon HVR-1 and the basic N-terminal region of hLF (28) (Fig. 1A and B).When proteolytically liberated, the N-terminal region of hLF, Lfcin, is equally capable of promoting transduction (26).Here, we provide direct biophysical evidence for a charge-driven interaction.Using size exclusion chromatography (SEC) and native mass spectroscopy (MS), we could show that the stability of the complex is highly dependent on the ionic strength of the buffer used which is typical for charge-driven interactions (Fig. 1C and D).In SEC, the HAdV-C5 hexon:hLF complex stays intact only at a salt concentration of ≤5 mM (Fig. 1C).Any further increase in salt concentration leads to gradual disruption of the complex.At 10 mM NaCl, the complex largely disintegrates with free hLF adsorbing to the matrix of the SEC column.Therefore, at 10 mM NaCl, no additional peak corresponding to free hLF such as in the presence of 150 mM NaCl, can be observed.Excess hLF not bound to hexon could afterward be eluted from the column with a higher salt concentration.To rule out aberrant behavior of the protein due to nonspecific interaction with the stationary phase of the column under low salt conditions, we carried out native MS measurements.Consistent with observations made in SEC, the binding of hLF to hexon could only be observed under low ionic strength conditions (Fig. 1D; Fig. S1).In this case, ammonium acetate (AA) was used as an MS-compatible buffer surrogate.In the presence of 20 mM AA, the hexon:hLF complex was not stable.Compared to free hexon, only a fraction of hexon:hLF at 3:1 (hexon protomers:hLF) stoichiometry was detectable.Upon dilution from 20 mM AA to around 5 mM, predominantly a complex with 3:1 stoichiometry was observed with smaller amounts of a complex with 2 hLF molecules per hexon trimer as well as free hexon.

Cryo-EM structure of HAdV-C5 hexon at 2.9 Å
The structure of the HAdV-C5 hexon was reconstructed to FSC 0.143 of 2.9 Å (Fig. 2A and B; Table S1).To our knowledge, this is the highest resolution structure of HAdV-C5 hexon determined by cryo-EM.Due to the high stability of the trimer, the resolution is relatively uniform across the whole structure and thus it could be modeled unambiguously with a high average cross-correlation of 0.736 (Fig. 2C).Throughout the length of the hexon, the protomers are in close contact forming large interfaces in addition to intertwined regions at the "turrets" and the base.Each protomer contributes approximately 16,000 Å 2 to the interface with the two other protomers.Significantly lower resolution is observed only for the hypervariable loop regions, the "turrets" at the top of the molecule which are largely disordered and could not be built due to a lack of traceable density in the final reconstruction.Specifically, regions 137-164, 188-190, 252-256, 271-278, and 431-436 could not be sufficiently reconstructed.These regions are also lacking in the crystal structure (3TG7) of hexon.The same structure was used as a starting model to build hexon in the cryo-EM reconstruction but no major conformational changes were observed.The RMSD between both structures is 0.9 Å, indicating high overall similarity (Fig. S2).

Integrative modeling of the HAdV-C5 hexon:hLF complex
Despite many attempts, the strong preferential orientation of the hexon:hLF complex on EM grids could not be overcome.Unlike for hexon alone, which also showed a higher number of top and bottom views (Fig. 2B), this phenomenon was exacerbated in the complex and precluded its reconstruction.Standard remedial approaches such as stage tilting and the use of detergents were ineffective.Furthermore, grid preparation was highly irreproducible with only small percentages of intact complex particles being detectable.Likewise, reconstructing hLF bound to whole virus particles was Full-Length Text unsuccessful.Here, the primary challenge arose from the different occupational states of the hexon trimer.Data from native MS and single-particle EM have shown that the hexon:hLF complex exhibits stoichiometries of 1:3 and 2:3, in addition to the presence of unbound hexon.This heterogeneity in coating complicates the use of symmetry operators for virus reconstruction.As a result, achieving high-quality reconstructions would require the collection of very large data sets, given the expected diversity in particle morphology.
Despite the lack of side views of the hexon:hLF complex on EM grids, good 2D class averages were obtained for the top and bottom views of the complex.We therefore set out to use this information in the calculation of a hybrid atomic model of the complex, adapting a protocol by Walti et al (31).Implemented in the program XPLOR-NIH (32), this protocol allows the fitting of protein models to two-dimensional electron density projection images.In the first step of our integrative modeling approach, we performed crosslinking mass spectrometry (XL-MS) with the HAdV-C5 hexon:hLF complex using the homobifunctional crosslinker disuccinimidyl dibutyric urea (DSBU).Two pairs of crosslinks were identified (Fig. 3A; Fig S3).The hexon residues (S174, S446) crosslinked to hLF residues are present in the "turrets, " while one of the corresponding hLF residues is located in the N-lobe (K282) and the second (K676) one at the junction of N-and C-lobe.Since, hexon is a homotrimer, in principle, hLF could be crosslinked to residues belonging to the same protomer or residues on different protomers of hexon.However, only when hLF was assumed to be crosslinked to the same hexon protomer did the Lfcin region face hexon.As prior evidence exists for the involvement of Lfcin in hexon binding (28), it was assumed that hLF is crosslinked to residues on the same protomer of hexon (hexon S446: hLF K282; hexon S174: hLF K676).Next, we carried out a molecular dynamics simulation of hexon alone (Fig. 3A) and selected a conformation from the latter part of the trajectory, post-convergence of RMSD, that closely mirrors the observed hexon conformation in experimental 2D classes of the complex (Fig. 3B; Fig. S4).Complex projections suggest a conformational change in the flexible "turret" region of hexon upon binding of hLF.
These steps created the hexon input model and established the principal binding site and distance constraints for docking of the complex in XPLOR-NIH.The docked and energy-minimized complex (Fig. 3A) was then rigid body refined against the 2D projec tions and models were calculated (Fig. 3B and C).Models without crosslink distance violations were ranked according to their correlation with the experimental projections of the complex (Fig. 3B; Fig S5).The model of HAdV-C5 hexon:hLF complex with the highest C-value was selected as the final model and used for further analysis (Fig. 3C).In the selected model, residues 1-49 of hLF, constituting Lfcin, are positioned facing the hexon HVR-1, while another previously not reported point of close contact is near crosslink 2 (hexon S174-hLF K676) between the C-lobe of hLF and an adjacent protomer within hexon.

Residues on both hLF lobes contribute to HAdV-C5 hexon binding
To validate our model and obtain further molecular insight, the mutational analysis of hLF binding to hexon was carried out.Guided by the 2D class averages and the gener ated hybrid model, surface exposed and solvent accessible hLF residues in the regions of two apparent contact points were mutated, and binding affinities to fluorescently labeled hexon were measured using microscale thermophoresis (MST) (Fig. 4A).Single, charge-reducing mutations in Lfcin (mut1-4; R24S, R27A, K28A, and R30A) all resulted in reduced binding affinity of the respective hLF mutant.The importance of each of these residues was shown by their additive effect in a quadruple mutant (mut5; R24S/R27A/ K28A/R30A) which showed the lowest binding affinity within the set of mutants, retaining less than 20% affinity of the WT protein.Point mutations were also introduced in a region directly adjacent to Lfcin which also harbors crosslink 1 (Hexon S446-hLF K282).Interestingly, this region, which has not been implicated in hexon binding before, also seems to play an important role in complex stability (Fig. 4A and B).A triple mutant (mut6; K277S/K280A/K285S) of hLF was found to be similarly impaired in its binding to hexon.Although not measured, it is plausible that a mutant combining mutations from mut5 and 6 might not interact with hexon anymore.Full-Length Text Double mutants mut7 (N673A/S678A) and mut8 (E687A/K691A) were introduced near the site of crosslink two which appears to be in proximity to hexon in our hybrid model (Fig. 4B and C) as well as 2D class averages (Fig. 3C).Although, to a lesser extent than Full-Length Text mutations in the tip of the N-lobe, both double mutants are also significantly impaired in their binding to hexon.Binding affinities of these mutants are reduced by 40% and 70% for mut7 and mut8, respectively, pointing to a considerable role of the C-lobe in hexon binding (Fig. 4A).

DISCUSSION
The human lung has developed an intricate system for protecting its epithelial lin ing against invading pathogens (33).For species C HAdVs, the main mechanism for attachment and entry in respiratory epithelial cells uses CAR followed by secondary attachment to integrins that trigger uptake.This mechanism is well established and is likely the main port of infection in various immortalized cell lines but the relevance for the respiratory epithelium is questionable, given the hidden location of CAR deep within cellular junctions sealed off by tight junctions (19)(20)(21).Therefore, it is not surprising that alternative mechanisms for initial infections have been identified, such as using soluble components of various body fluids to gain cell entry.One such mechanism is to utilize hLF, and Lfcin, both of which are present in high concentrations within mucosal tissue (34).Naturally, hLF is a protein of the innate protection of the lung and has been shown to interfere with the infection ability of, for example, RSV (35) and HIV (36).In inflamed tissue, neutrophils have been shown to secrete hLF, which apart from its role as an antimicrobial is also regarded as the suppressor of inflammation (37).Therefore, it is very interesting that a human pathogen has evolved to highjack this system for infection.
Unfortunately, very limited detailed information is available for this mechanism.In this study, we provide structural and biochemical data for the interaction between hLF and the HAdV-C5 hexon, which fills an important gap in our understanding of how HAdVs engage hLF for a CAR-independent infection of epithelial cells.
A recent study has shown that the interaction with hLF likely involves the HVR-1 of HAdV-C5 hexon, which is the longest HVR of the protein (28).In addition, all species C adenoviruses have a longer HVR-1 containing multiple negatively charged residues.Substituting HVR-1 of HAdV-C5 with a significantly shorter and less anionic HVR of another species has significantly reduced viral transduction in vitro (28).It has therefore been suggested that the length and charge of HVR-1 are essential for the ability to utilize lactoferrin for infection.We show here that the interaction between hLF and hexon is highly dependent on the ionic strength of the environment as buffers (20 mM HEPES) with NaCl concentrations above 5 mM disrupt the complex making it impossible to purify it by size exclusion chromatography.This was further verified by native mass spectrometry where a stable complex is present at ammonium acetate of 5-6 mM but not at 20 mM.These observations present a contradiction when compared to previous in vitro infection studies with HAdV-C5 (28), which demonstrated successful transduction in cell culture media with higher ionic strength.This discrepancy is not only surprising from a biological standpoint but also raises questions about the biological significance of the interaction observed.
Although the underlying molecular mechanisms facilitating LF-enhanced adenovirus transduction in higher ionic strength conditions (e.g., cell culture) remain speculative, we propose two factors that might contribute to mitigate the pronounced salt sensitiv ity observed with isolated, purified components.First, as binding occurs on the cell surface, it may be further stabilized by secondary interactions with cellular receptors.Second, infection studies in cell cultures employed entire virus particles, in contrast to our experiments using isolated hexon proteins.The collective binding strength (avidity) provided by 240 hexon trimers on each capsid (38) could partially offset the diminished affinity that was observed in higher ionic strength conditions (Fig. S6).However, it is entirely possible that viruses also infect cells under low ionic strength conditions which could be utilized by viruses to increase cellular entry.In both the lungs and the eye, tissues targeted by species C adenoviruses, there are local microenviron ments present in which the ionic strength is regulated (39).The epithelial cell layer of the respiratory tract, including the trachea and bronchi, is covered by a thin layer of fluid, or mucus that ensures the proper function of the beating ciliated cells as well as protecting the epithelial cells from invading pathogens (40,41).In this air-liquid interface, the balance of ions such as sodium, chloride, and bicarbonate is essential to preserve the characteristics of the mucins to ensure a normal mucociliary clearance (42,43).Like in the conducting airways, the ionic strength in the tear film covering the cornea is tightly regulated to ensure proper function (44).Thus, as ions are pumped in and out of the cells, the local ionic strength is altered to a degree that may, at times, be beneficial for the hLF-mediated infection of the epithelium.Even if the concentration of sodium has been determined in the sputum to be about 100 nM (±27 nM), the concentration in the periciliary fluid is unknown and virtually impossible to determine (40).
The trimeric structure of the HAdV-C5 hexon suggests a potential interaction with an equivalent number of hLF molecules in a 3:3 stoichiometric ratio to maximize avidity.Our native MS data, in agreement with 2D class averages of the complex, however, suggest a 3:1 stoichiometry of hexon to lactoferrin within the complex.Occasionally, a second molecule of lactoferrin was found to be bound, both in native MS (Fig. 1D) and in 2D projections (Fig. S7) while a 3:3 stoichiometry was not observed.A possible explanation could be an allosteric effect of the first binding event that introduces conformational changes in hexon resulting in reduced affinities for the second and third molecules of hLF.Alternatively, with two protomers of the hexon trimer already being occupied, the binding of a third hLF might be sterically impaired.
An increasing amount of hybrid protein structures is currently being determined making use of an integrative approach combining experimental data and computational techniques (45).The structure of HAdV-C5 hexon in complex with human lactoferrin was well suited for this approach.Although we were unable to reconstruct a 3D model from our cryo-EM data alone, 2D class averages clearly showed assembly of the complex and identity of its components.Combining crosslinking data and refinement of the docked complex against the experimental 2D projections resulted in a likely model of the interaction.We believe that the workflow described here could also serve as a template to generate hybrid models of other difficult to obtain structures.While our hybrid model realistically depicts the overall assembly of the complex, it insufficiently describes direct contacts between interface residues in both structures.This is due to the inherent low resolution of the 2D class averages which only allow for rigid body refinement in addition to the high flexibility of hexon's hypervariable loops in the hexon.These are typically largely disordered and therefore cannot be modeled or computation ally predicted with accuracy.Furthermore, our hybrid model does not sufficiently take into account conformational changes that occur in both molecules upon binding.These result in small, observable disagreements between experimentally obtained 2D class averages and calculated 2D projections of the complex.
Further molecular insight was therefore obtained through site-directed mutagenesis of selected hLF residues present at its interface with hexon.Hereby, we could confirm that a stretch of basic residues of hLF between amino acid positions 24-30 within the Lfcin region shows the highest contribution to the overall binding affinity.These residues appear to face the HVR-1 of hexon in the proposed model.This finding is in agreement with the earlier observation that an LFcin peptide comprising residues 1-18 did not increase HAdV-C5 transduction (28).Likewise, bovine LF (bLF), which carries substitutions in the mutated human lactoferrin residues, lacked transduction-enhancing activity (28) (Fig. S8).
The described region forms a continuum with an adjacent patch of basic residues outside the Lfcin region which also contribute significantly to the binding affinity, or the lack thereof when mutated to Ala or Ser.It therefore seems that the region in the N-lobe of hLF that interacts with hexon HVR-1 might not be limited to the Lfcin peptide but comprises a larger area.
Interestingly, a possible second attachment point between hLF and hexon was observed in 2D class averages which appeared to involve the region around the junction of N-and C-lobe of hLF.The presence of such a site was corroborated by the identification of a crosslink connecting hLF K676 (present at the junction of N-and C-lobe) with hexon S174 (present in the "turrets").Based on the electrostatic potential distribution in both molecules the interaction between the C-lobe of hLF and a second hexon protomer seems to be less charge dependent (Fig. 4E).Mutational analysis also indicated a significant but lesser contribution to overall affinity within the complex.It is therefore possible that this second contact site is less crucial for overall complex stability but rather for proper orientation of LF toward its cellular receptor.
Taken together, our data provide novel structural insight into the interaction between HAdV-C5 and hLF.Combining computational modeling with cryo-EM, XL-MS, and biophysical characterization, our hybrid model agrees well with mutational data, realistically depicting the interaction between HAdV-5C hexon and hLf.In the future, our results may be used for the generation of Lfcin-based antivirals that block the interaction between HVR-1 and hLF.In addition, novel Lfcin-based adjuvants may be developed that improve transduction efficiency in the lung during gene therapy.

Human lactoferrin (hLF)
A commercially synthesized DNA fragment (Genewiz) encoding hLF (UniProt: Q19KS2) was cloned into the pHLsec vector (46) using the Gibson assembly method (New England Biolabs).Eight hLF mutants were produced by introducing desired point mutations during the synthesis of DNA fragments (GenScript).The plasmids encoding the wild-type (WT) LF, and its mutants were transfected and transiently expressed in Expi293F cells (ThermoFisher Scientific) following the manufacturer's protocol.
LF and its mutants were purified from the culture supernatant obtained after centrifugation.After dialysis against 20 mM HEPES pH 7.2, 30 mM NaCl (buffer A), the supernatant was loaded onto a Mono S cation exchange chromatography column (GE Healthcare) equilibrated with buffer A using an ÄKTA Pure system (GE Healthcare).LF and its mutants were eluted by a linear NaCl gradient using buffer A and 20 mM HEPES pH 7.2, 1 M NaCl (buffer B) and subsequently analyzed by SDS-PAGE.Desired fractions were incubated with 0.5 mM FeCl 3 in the presence of 200 mM NaHCO 3 for 1 h at room temperature and further purified SEC on a S200 16/600 column (GE Healthcare) equilibrated with 20 mM HEPES pH 7.2, 150 mM NaCl (buffer C).The purified LF was concentrated using Amicon Ultra centrifugal filters (Merck Millipore) and stored at −80°C until further use.

Hexon
Hexon was purified from virus particles propagated in A549 cells (ATCC CCL-185).A549 cells were maintained in DMEM (Sigma) containing 5% (vol/vol) fetal bovine serum (FBS; Gibco), and 1% (vol/vol) penicillin-streptomycin mix (Gibco) at 37°C under 5% CO 2 .Cells from a confluent culture were seeded into 5 × 175 cm 2 flasks.Upon reaching confluence, the medium was removed and fresh 5 mL medium containing 2% (vol/vol) FBS was added before infecting with HAdV-5C (200 µL).The flasks were incubated at 37°C for 90 min and gently mixed every 15 min.Later, the medium was replaced with fresh 30 mL medium containing 2% (vol/vol) fetal bovine serum and flasks were left undisturbed for 4 days.Cells were harvested by tapping the flasks on the bench followed by centrifugation of the culture medium at 5,000 g for 10 min.Prior to centrifugation, 1 mL culture from a flask was collected to be used as infectious material for the next round of hexon purification.A cell pellet from 5 × 175 cm 2 was re-suspended in 5 mL 20 mM HEPES, 5 mM NaCl pH 7.2 (buffer D).To disrupt the cells, three freeze-thaw (37°C/−80°C) cycles were done and then 6 mL of Vertrel XF was added (1,1,1,2,3,4,4,5,5,5-decafluoropentane; Sigma-Aldrich).The cell suspension was mixed by gentle shaking of tubes by hands for 3 min followed by centrifugation at 3,000 g for 10 min.The contents of the clear top layer above the Vertrel phase were collected and separated over a CsCl gradient ultracentrifuged using the SW41 rotor (Beckman Coulter) at 25,000 rpm, 4°C for 90 min.The clear top phase obtained above a white band was collected and separated over a 5 mL HiTrap Q FF column (GE Healthcare) equilibrated with buffer A. Hexon was eluted with a linear NaCl gradient using buffers A and B. The elution fractions were analyzed by SDS-PAGE, concentrated, and further purified by SEC on S200 16/600 column (GE Healthcare) equilibrated with buffer C. The purified hexon was stored at −80°C until further use.

Purification of hexon:hLF complex
Both hexon and hLF were dialyzed against buffer D before the preparation of the complex.A twofold molar excess of LF was mixed with 925 pmoles of hexon and incubated on ice for 5 min followed by SEC on Superose 6 increase 10/300 (GE Health care) equilibrated with buffer D. Peak fractions were analyzed by SDS-PAGE to confirm the co-elution of hexon and hLF.The fractions containing the complex were concentra ted to 1 mg/mL using a Vivaspin centrifugal concentrator (Sartorius).

Native MS
Prior to native MS measurements, hexon:hLf complex purified in buffer D was buffer exchanged into 20 mM aqueous ammonium acetate solution pH 7.0 (AA; ≥99.99% trace metals basis, Honeywell Research Chemicals) using two cycles of spin gel-filtration (MicroBioSpin P-6, Bio-Rad).The complex was subsequently diluted by 20 mM AA to 0.5 µM (estimated based on 3 + 3 stoichiometry).Alternatively, for low ionic strength conditions, the dilution was performed with LC-MS grade water instead, resulting in a 5.85 mM AA concentration.Reference spectra of isolated hexon and hLF were obtained from 5 µM (monomer) samples in 150 mM AA pH 7.0.
After short equilibration on ice, the samples were introduced into an Orbitrap Q Exactive UHMR mass spectrometer (Thermo Scientific) via static nanoelectrospray ionization.The samples were electrosprayed from gold-coated borosilicate glass capillaries Kwik-Fil 1B120F-4 (World Precision Instruments) prepared in-house essentially as described before (49,50).The mass spectrometer was tuned for best-signal quality and intensity while maintaining minimal ion activation.Specifically, electrospray voltage was kept at 1.25 kV, source desolvation temperature 120°C, in-source trapping −50 V, ion transfer profile "high m/z, " analyzer profile "low m/z, " analyzer target resolution 1,500 acquiring in a mass range 1,000-20,000 m/z, averaging five microscans.Further desolvation and collisional cooling in an HCD cell were performed with nitrogen at a relative gas pressure setting of 8.0 with gentle collisional activation by 10 V HCD voltage gradient.Raw spectra were averaged over at least 50 scans in Thermo Xcalibur Qual Browser 4.2.47 (Thermo Scientific) and mass deconvoluted in the UniDec 6.0.3 package (51).

Cross-linking MS
Hexon:hLF complex was crosslinked using the homobifunctional crosslinker disuccini midyl dibutyric urea (DSBU; CF Plus Chemicals).The complex was mixed with the excess of DSBU at a 1:500 molar ratio.The reaction mixture was then incubated at room temperature for 1 h and quenched with Tris at 30× higher concentration than DSBU.
Samples were diluted with 50 µL of 50 mM ammonium bicarbonate.Proteins were reduced by 10 mM TCEP and alkylated by 30 mM iodoacetamide.For deglycosylation of hLF, PNGase F (New England Biolabs) was added to the sample in a 1:20 (wt/wt) ratio and incubated for 4 h at 37°C.Proteins were digested by trypsin (Promega) overnight at 37°C.After the digestion, samples were off-line desalted using a Microtrap C18 cartridge (Optimize Technologies) and dried using a SpeedVac concentrator.Samples were re-suspended in 40 µL of 2% (vol/vol) acetonitrile, 0.1% (vol/vol) TFA and analyzed using the Vanquish liquid chromatography system (ThermoScientific), connected to the timsToF SCP mass spectrometer equipped with captive spray (Bruker Daltonics).The mass spectrometer was operated in a positive data-dependent mode.One microliter of the peptide mixture was injected by autosampler on the C18 trap column (PepMap Neo C18 5 µm, 300 µm x 5 mm, Thermo Scientific).After 3 min of trapping, peptides were eluted from the trap column and separated on a C18 column (DNV PepMap Neo 75 µm x 150 mm, 2 µm, Thermo Scientific) by a linear 35 min water-acetonitrile gradient from 5% (vol/vol) to 35% (vol/vol) acetonitrile at a flow rate of 350 nL/min.The trap and analytical columns were both heated to 50°C.The TIMS scan range was set between 0.6 and 1.6 V s/ cm 2 with a ramp time of 100 ms.The number of PASEF MS/MS ramps was 4. Precursor ions in the m/z range between 100 and 1,700 with charge states ≥ 1 + and ≤8 + were selected for fragmentation.The active exclusion was enabled for 0.4 min.
The raw data were processed by Data Analysis 5.3 (Bruker Daltonics).The mgf files were uploaded to MeroX 2.0.1.4software (52).The search parameters were set as follows: enzyme-trypsin (specific), carbamidomethylation as a fixed modification, oxidation of methionine as variable modifications.Precursor precision was set at 10.0 ppm and fragment ion precision was set at 0.1 Da.

Cryo-EM sample preparation and data collection
HAdV-C5 hexon 3.5 µL of Hexon (0.7 mg/mL) was applied to freshly glow discharged Au, 300 mesh, R1.2/1.3TEM grids (Protochips) coated in-house with a graphene monolayer.The grids were blotted at 100% humidity and 4°C (Vitrobot, ThemoFisher Scientific) and vitrified by plunging into liquid ethane.The grids were imaged using a Titan Krios G3i microscope (ThermoFisher Scientific) operated at 300 kV.Automated data collection was performed using EPU software (ThermoFisher Scientific) software.In all, 1,063 movies (40 frames/ movie; 5-s exposure) were collected using a Quantum K2 LS detector (Gatan) at a magnification of 165,000×, resulting in a pixel size of 0.828 Å.The nominal defocus during data collection ranged from 1.2 µm to 2.5 µm and the total dose used was 55 e − /Å 2 .

HAdV-C5 hexon
The recorded movies were entirely processed using cryoSPARC (version 3.2 and later) (53).Motion correction to obtain averaged micrographs from multi-frame movies followed by CTF estimation was performed using Patch Motion Correction, and CTFFIND4 algorithms integrated in cryoSPARC (53,54).Hexon particles were picked using the Gaussian Blob Picker in cryoSPARC.Particles were extracted with a box size of 300 px and subjected to iterative, reference-free 2D classification until no ill-defined class averages were obtained.The final selected subset of 98,640 particles was used to produce three ab initio models.The class containing the maximum number of particles (53,996) closely resembled the hexon and was chosen for homogeneous refinement with C3 symmetry imposed to yield the reconstructed hexon.

HAdV-C5 hexon:hLF
The recorded movies were imported into cryoSPARC (version 3.2 and later) and averaged micrographs from multi-frame movies were obtained via the Patch Motion Correction algorithm.Micrographs were CTF corrected using the Patch CTF Estimation algorithm.A Blob Picker was employed to pick particles which were extracted with a box size of 230 px.The set of extracted particles was cleared of ill-defined particles by itera tive, reference-free 2D classification to obtain the final set of class averages, used for refinement of the HAdV-C5 hexon:hLF hybrid structure.

Model building HAdV-C5 hexon cryo-EM structure
To generate a model for hexon using the cryo-EM reconstruction, the crystal structure of hexon (PDB: 3TG7) was docked into the cryo-EM coloumb potential map using the dock_in_map procedure of Phenix ( 55).This docked model was subjected to automatic real-space refinement by Phenix.The obtained model was iteratively refined using multiple rounds of manual refinement performed in Coot followed by another round of automatic refinement in Phenix.After each round of refinement, the model was validated using MolProbity (56) until satisfactory parameters were achieved.

System preparation
The model of hexon was trimmed down for more efficient calculation by discarding the rigid base, that is, only residues 129-315 and 415-460 of all three protomers were retained in the final model.Hydrogens were added using the tLEaP program of the AMBER20 suite (57).Titratable residues were modeled in their standard state at pH = 7.4.Protein force field ff19SB (58) was used along with a truncated octahedral box of SPC/E water molecules (59) extending 12 Å from the solute.Sodium and chloride counter ions were added.Prior to molecular dynamics (MD) simulation, the systems were relaxed, heated to 300 K, and equilibrated at 300 K and 1 atm pressure using the published protocol (60).

Molecular dynamics
The production run was 600 ns long in a NpT ensemble, that is, keeping the number of particles (N), pressure (p), and temperature (T) constant.The SHAKE algorithm (61) was used to restrain all bond vibrations and hydrogen mass repartitioning to 3 Da (62) allowed us to apply a time step of 4 fs.Residues adjacent to those of the discarded base were fixed with harmonic restraint of 500 kcal mol −1 Å −2 to maintain the conformations compatible with linking to the base.All the analyses were done in the CPPTRAJ program (63).

Docking of HAdV-C5 hexon:hLF complex and refinement of the docked model against 2D class averages
All simulations were performed in XPLOR-NIH v 3.4 (32) starting from the MD structure of Hexon and the structure of hLF (PDB: 1LFG).Protons and missing side chain atoms in both proteins were added in XPLOR-NIH, followed by minimization of the energy function consisting of the standard geometric (bonds, angles, dihedrals, and impropers) and steric (van der Waals) terms.Based on the published analysis ( 64), the XL-MS data were converted into pairwise S446 (Hexon)-K282 (hLF) and S174 (Hexon)-K676 (hLF) intermolecular distance restraints, defined as the upper limit bounds of 24 Å between the Cα atoms and 15 Å bounds between the corresponding Ser OG and Lys NZ side chain atoms.The cryo-EM 2D class averages imported into XPLOR-NIH as two-dimensional coloumb potential maps (31,65) were subsequently used to fit the protein molecules.
Following the computational protocol of Clore and co-workers (31), the refinement of the hexon:hLF complex was carried out in three steps.First, we performed a rigid-body protein-protein docking, driven by the intermolecular distance restraints.To this end, the positions of the hexon atoms were kept fixed, except for the "turret" loops (residues 136-164, 173-217, 268-287, and 429-448 in each subunit of the hexon homo-trimer, which includes the crosslink residues S174 and S446), which were given full torsional degrees of freedom.The incoming hLF molecule was treated as a rigid-body group, except for the crosslink residues K282 and K676, whose side chains were allowed to move.Starting from randomly oriented proteins separated by 100 Å, the docking comprised an initial rigid-body simulated annealing step, followed by the full side-chain energy minimization (32).In addition to the energy terms mentioned above, the total energy function included the intermolecular restraint term and a knowledge-based dihedral angle potential (32).In this first step, 100 structures were calculated, and the 10 lowest-energy solutions-showing no distance restraint violations, imposed by crosslinks-were retained for the subsequent refinement.
In the second step, energy terms for the 2D projections were added sequentially, with the map centering and (initially random) orientation optimized for each image from the cryo-EM 2D class stack.The back-calculated projections were computed as a sum over all heavy atoms, where each atom contributed a 2D quartic density according to its van der Waals radius and weighted by the atom's number of electrons (31).The Gaussian blurring was employed throughout, with Gaussian diameter given by the resolution of the 2D class, and the agreement between the experimental and back-calculated electron density projections was estimated by the cross-correlation C metric (31).Out of 10 slices in the 2D classes image stack, one consistently failed to yield a good fit (C < 0.9) and, thus, was discarded in the subsequent refinement rounds.
In the third and final step, the hexon:hLF complex was refined by allowing full rigid-body and torsional degrees of freedom in the motional setup described above, with simultaneous minimization of all constituent energy terms, including those for the intermolecular restraints and the nine individual electron density projections (31).In this run, 100 structures were calculated and, 10 lowest-energy solutions-showing the best agreement with the experimental cryo-EM data (highest C values) and no distance restraint violations-were retained for the final analysis.Individual back-calculated 2D projection images and their overlays with the experimental 2D class densities were output for visualization.

Microscale thermophoresis
Microscale thermophoresis (MST) analysis was carried out to determine the binding affinity between hexon and hLF using Monolith NT.115 (Nanotemper Technologies).Hexon was labeled with AlexaFluor594 (ThermoFisher Scientific) according to the manufacturer's instructions.Excess dye was removed by SEC on S200 16/600 column (GE Healthcare) equilibrated with buffer C. MST experiments were performed in buffer D containing 0.05% (vol/vol) Tween 20.A range of 16 different concentrations of hLF or hLF mutants was generated by serial dilution (1:1) while the same amount of labeled hexon was added to each dilution.Following 5-min incubation at 25°C, the samples were taken up in capillaries for MST measurements that were performed at 25°C at 50% LED power, and 40% MST power.The MST curves were fitted using MO.Affinity analysis software (Nanotemper Technologies).

FIG 1
FIG 1 Effect of ionic strength on hexon:hLF complex formation.(A) Surface views of lactoferrin (PDB: 1LFG) (left) and hexon, completed with AlphaFold2 (see methods) (right).The Lfcin peptide on the N-terminal lobe of LF is colored in blue.The three protomers of hexon are shown in different colors: orange, yellow, and blue.The HVR-1 loop of each subunit is marked with an arrow.Surface representations colored according to electrostatic potential are depicted below.Notably, the area on lactoferrin corresponding to the Lfcin peptide exhibits a strong positive charge while the HVR-1 loops of hexon are strongly negatively charged.(B) Net charge per residue (NCPR) distribution along the amino acid sequence of hLF (left) and hexon (right).Blue color indicates positive NCPR and red color indicates negative NCPR.Residues corresponding to the Lfcin and HVR-1 have been highlighted.NCPR distribution was calculated via CIDER with the default blob size.(C) Hexon:hLF complex formation at different ionic strengths during size exclusion chromatography (top).Reducing SDS gels (bottom) confirms the co-elution of hexon and hLF.Fractions loaded onto the SDS gel have been highlighted with colored bars in the corresponding chromatogram (D) Native MS of 0.5 µM hexon:hLF complex electro-sprayed from 20 mM (left) and 5.85 mM (right) ammonium acetate (AA) pH 7.0.Hexon is detected exclusively trimeric (black) with up to 2 hLF molecules bound (yellow).The main charge states and corresponding molecular weights are indicated.Small amounts of dimeric hexon trimer are artifacts due to ESI concentration effects.For a more detailed peak assignment, see Fig. S1.

FIG 2
FIG 2Structure of HAdV-C5 hexon determined by single particle cryo-EM at 2.9 Å. (A) Visual representation of the workflow adopted for data processing using cryoSPARC.A particle set identified by iterative, reference-free 2D classification was used to obtain three ab initio models.The model with the most recognizable features was selected (boxed) and homogeneously refined to reconstruct the trimeric hexon.53,396 particles contributed to the final reconstruction.A representative micrograph (top left) is shown alongside a panel of selected 2D class averages.(B) The final, unsharpened map of the hexon reconstruction is shown in gray and colored according to local resolution.Next to the gold-standard FSC curves is the angular distribution plot of the reconstruction.(C) Side and bottom view of the hexon model built from the cryo-EM reconstruction.The three protomers of the hexon are depicted in the color scheme used previously.Insets show close-up views of the model inside the unsharpened map from different regions of the hexon.The plot of the Map vs Model cross-correlation for one protomer (boxed) shows a high average cross-correlation of 0.736 (red line) at FSC 0.143 : 2.9 Å.

FIG 3
FIG 3 Visual representation of computation workflow for a hybrid model of the hexon:hLF complex.(A) Top, Crosslinking MS revealed two crosslinks, locations of identified peptides are highlighted on surface representations of hLF (PDB: 1LFG) and hexon in cyan and green.Peptide sequences are shown in the corresponding color, crosslinked residues are marked with a black line.Bottom left; Hexon MD simulation-the evolution of backbone RMSD, overlay of several snapshots from the MD trajectory.Bottom right; Identified crosslinks were used as distance restrains in the subsequent docking and energy minimization of the complex in XPLOR-NIH, visualized by representative schematic.(B) Refinement of the docked hexon:hLF complex against experimental 2D class averages of the complex using XPLOR-NIH (see methods).The left column shows the experimentally obtained 2D class averages for the hexon:hLF, the central column shows the calculated back projections for the hexon:hLF complex obtained after docking and energy minimization, and the right column shows the result of the superposition of experimental (green) and calculated (magenta) projections.Cross-correlation (C) of experimental and calculated projections is shown for individual projections.(D) Different views of the final hybrid model (best C-value) of hexon:hLF complex.LF is colored pink, the protomers of hexon are individually colored using the color scheme as before.

FIG 4
FIG 4 Validation of the hexon:hLF hybrid model.(A) Microscale thermophoresis (MST) was used to determine the effect of mutations in hLF interface residues on the affinity to hexon.The fraction of hLF bound to fluorescently labeled hexon was determined and plotted as a dose-response curve (n = 2).The binding curve of WT hLF is shown in gray, tested hLF mutants are shown in color.The colors of dose-response curves are identical to the colors used in bar graphs.The left diagram depicts the measured affinity; the right diagram shows the relative binding capacity of lactoferrin mutants to hexon in comparison to the WT.The percentage of lactoferrin binding is shown inside the bar.Residues mutated in hLF are shown next to their respective bar.(B) Top view of the hexon:hLF hybrid model in cartoon representation.Mutants, with the exception of Mut5, have been colored using the same color scheme as in (A).In addition, HVR-1 on hexon has been colored red, Lfcin on the lactoferrin is colored blue.Insets show close-up views of the mutated residues.Amino acids in identified crosslinks are depicted in gray, the crosslinks are symbolically depicted as dotted lines.(C) Surface representation of the computed hexon:hLF complex in the color scheme used throughout and colored according to the electrostatic potential.HVR-1 and LFcin have been highlighted on both representations with a black outline.