ESI-MS and Stavrox 3 . 6 . 0 . 1 Investigations of Crosslinking by an Aryl-Azido-NHS-Heterobifunctional Crosslinker

Chemical-crosslinking-mass spectrometry-bioinformatics has become an important technique for studying large scale protein-protein interactions, especially for capturing transient interactions [1-18]. This has become possible by the availability of a wide variety of crosslinkers, viz. both homo-and hetero-bifunctional crosslinkers. The former has identical displaceable groups on the two ends, e.g., the amine displaceable N-hydroxysuccinimide (NHS) group. One of more popular reagent of this type is BS2G (bis[sulfosuccinimydyl]glutarate), which has been used extensively.

For enhancing intermolecular crosslinking, more efficient crosslinkers are required.This limitation is overcome by using heterobifunctional crosslinkers, where two different groups are present on the two ends.One of these groups being thermally reactive (e.g., the NHS group) and the other one being photoreactive (e.g., the azide group), which also provides greater flexibility in experimental protocols.Small crosslinkers are known to be more effective for mapping interfaces, while the larger crosslinkers are more useful for identifying binding sites.X-ray diffraction and NMR continue to be the gold standards, but both have their own limitations.The former requires a single crystal, while the latter requires solubility in specific solvents and also demands larger amounts of the sample.Thus, both these techniques are not yet suitable for dynamic studies.
Despite the availability of a range of crosslinkers (including cleavable and isotope labeled crosslinkers) many laboratories continue to use conventional crosslinkers like formaldehyde, glutaraldehyde as crosslinkers, even though these lead to undesirable and extensive crosslinking.Even "zero crosslinkers" like DCC are still in use.This arises mainly due to the lack of awareness amongst many biochemists and others of the immense potential of this new technique of 'chemical crosslinking-mass spectrometry-bioinformatics' particularly for studying dynamic interactions in living cells.It has been proposed that structures and functions of large protein complexes at the molecular or atomic level in dynamic situations can be studied by combining cryo-electron microscopy (cryo-EM) with crosslinking-mass spectrometry (CX-MS) [19].
Traditional crosslinking strategies generate an enormous amount of mass spectrometry data, which is extremely difficult to analyze with routine software tools.The situation has often been compared to "finding a needle in a haystack".Tremendous advancement in mass spectrometry (MALDI-MS, MS/MS, ESI-MS) have provided much impetus in improving the quality of crosslinking studies.The use of different algorithms in each laboratory has led to better bioinformatics tools such as GPMAW, MSX3D, ProteinPilot, XQuest, ExPASy, StavroX, xTract.These have contributed greatly to making this technique a valuable tool for the identification of protein-protein interactions in dynamic systems.
The protocol (Figure 1) commonly used for heterobifunctional crosslinkers involves, incubating the first protein with the crosslinker at room temperature and then in the second step the incubated sample is subjected to photolysis.In the second step, which could be done using either 254 nm or 366 nm UV lamp, a second and a different protein Much optimization of the duration of incubation, ratio of protein to the crosslinker ratio; exposure time, etc. are required before experimental success can be ensured.The sample at this stage is then subjected to SDS-PAGE.To establish intermolecular crosslinking, the 'dimeric' band is excised, trypsin digested and subjected to mass spectrometric analysis.The huge amount of data thus generated is then analyzed by suitable bioinformatics tools.As most of the protein (s) is not crosslinked, unlabeled peptide fragments usually dominate.Unlike the latter, labeled peptide fragments carry a positive charge (+1, +2, +3…) and can be thus enriched by strong cation exchange(SCX) ('zip tip') tips to separate them from the un-crosslinked fragments, which usually do not carry a charge.This has been referred to as the "double needle in a haystack" problem.
We employed the above protocol with the twin aims of confirming whether our new small heterobifunctional crosslinker does indeed bring about intermolecular crosslinking successfully and whether the same could be established using modern mass spectrometric methods (MALDI-MS, MS/ MS; ESI-MS) combined with StavroX 3.6.0.1.a bioinformatics software, especially suited for identifying intermolecular crosslinks reliably.
Our work arose out of Hagan Bayley's [20] correct prediction based on the work of Banks et al. [21] on perfluorophenylazides, that reagents based on the latter could serve as efficient photo-affinity labelling agents, as such reagents involve 'long-lived 'transients, which leads, in turn, to enhanced intermolecular reaction rates.This was confirmed by Platz et al. [22] who also showed that in this case, the singlet-triplet nitrene energy gap increases.The involvement of a "slippery potential energy surface" has also been demonstrated [23].The concepts described above have found applications in diverse areas [24].
We have also published a series of papers in this area.Our reactions, however, do not involve any fluorination, which is a demanding step and yet we obtain comparable results.Further, in our cases, the initially formed singlet nitrene does not flip to the triplet state and instead forms the corresponding carbene (Nitrenecarbene conversion; Crow-Wentrup pathway) [25], which has also been substantiated by computational studies, yet our results parallel those based on perflourinated aryl azides.We first observed an unusual reaction involving a concomitant ring expansion and ring extrusion during thermolysis of 'Dimethyl-Azido-m-hemipinate' [26].This was followed by establishing "nitrene insertion into an adjacent orthomethoxy group followed by the addition of the amine intermediate on to the corresponding heterocumulene intermediate" along with the isolation of carbene based products [27].The life span of one such 'long lived' transient, in our case, has been shown to be 700 picoseconds [28].Reaction of the somewhat more rigid azido-m-meconine led to intramolecular cyclisation via reaction with the adjacent o-methoxy group [29,30].The reaction of the para analog, viz.'Dimethyl-azidosuccinylosuccinate' led to similar results [31].Recently we have published on crosslinking studies using a new heterobifunctional crosslinker based on an "introverted" carboxylic acid [32].The present work is thus a part of our continued investigations in this field.

Results and Discussion
The new aryl azido-N-hydroxysuccinimide heterobifunctional crosslinker (I) was synthesized as shown in scheme I (SI-I).Thus, 'dimethyl azido-m-hemipinate' was subjected to selective alkaline hydrolysis, which was followed by reaction with N-hydroxysuccinimide (NHS) and dicyclohexylcarbodiimide (DCC) to yield (1), m.p. 165°C.(I) contains a photo-reactive aryl azido (-N 3 ) group and an amine reactive N-hydroxysuccinimide (NHS) moiety, facilitating the two step protocol, viz.an initial incubation step followed by the photolysis step or vice-versa.The structure of (I) was established by detailed spectroscopic studies including, FT-IR, UV, MALDI-MS and MS/MS studies.NMR studies on (I) will be published separately.
The new heterobifunctional crosslinker (I) was incubated with Lysozyme.This was followed by photolysis at 366 nm (using a 6W TLC visualization UV lamp) and SDS-PAGE (SI-III) with standard protocol (SI-IV,) gave a 'dimeric' band at 28 kDa.This confirmed successful intermolecular cross-linking.The 'dimeric' band was excised, trypsin digested (SI-V) and enrichment by using zip tip C-18 with standard protocol (SI-VI).In early work on this project, we had no access 'in house' mass spectrometric facilities and we received only MALDI-MS data with no MS/MS data.Even the bioinformatics software had limitations.In the current centre we had access to all modern MS  and the red bars represent the number of false positive candidates from a decoy data set, which is obtained from the inverted sequence of the FASTA file (SI-XII).More enriched real data set candidates in the score region indicates towards better crosslinking.The decoy analysis data for fragment at m/z 1404.900 is shown in Figure 3.
Out of the candidates with high scores, the detailed spectrum for the peak value m/z 1404.900, the peptide fragments involved in the process of intermolecular cross-linking are shown in Figure 4.The spectrum panel shows the MS 2 spectrum for the identified peaks.This one example with its annotation is shown here as a representative.In the deviation diagram (printed below the spectrum panel, deviation of the identified signals is plotted against the m/z values) less deviation in the annotation, points towards better results in the crosslinking experiment.Similar detailed spectra of the other intermolecularly crosslinked fragments have also been obtained via StavroX 3. 6. 0.1, from our experimental data, but these have not been exhibited here (Tables 2 and 3).
An intermolecularly crosslinked peptide fragment m/z 1404.900 is shown in Figure 5.It contains Peptide 1 (α-peptide) "KVF" crosslinked to peptide 2 (β-peptide) "LAAAmKR", which shows that intermolecular cross-linking has occurred between K1 (Lysine-1 in the primary structure of Lysozyme) of the α-peptide with L8 (Leucine-8 in the primary structure of Lysozyme) of the β-peptide of another moiety of Lysozyme.This is shown as a representative example from many such intermolecular crosslinks established by StavroX 3.6.0.1.

PyMol software
Using the software PyMol, 3D (SI-XIII) representation of the intermolecular cross-link between the two Lysozyme molecules as obtained by using our new heterobifunctional cross-linker and StavroX 3.6.0.1 analysis is depicted in Figure 6.
Crosslinking studies on Lysozyme using homobifunctional crosslinkers was discussed in a seminal and highly cited paper by A. Sinz's group [3].Cross-linking reactions with sulfo-DST and sulfo-EGS yielded no cross-linking products, while the cross-linking reaction with BS 3 gave two cross-linking products.The details of the nature of this cross-linking (mostly intramolecular cross-linking) are included in Table 4, reproduced from this earlier work.
In addition, our experiments also identified many intermolecular cross-linking in MS and ESI-MS based crosslinked peptide not detected by the earlier workers.In comparison, our experiments have led to enhanced intermolecular cross-linking (Table 5).
Our results thus justify the hypothesis originally put up by Hagan facilities under one roof.Even here, too, we initially carried out only MALDI-MS (SI-VII and SI-VIII) investigations.We had to switch to ESI-MS, MS/ MS as ESI-MS data ((SI-IX) which works better for the bioinformatics software, StavroX 3.6.0.1, (used by us in the current study) works on ESI-MS data alone.The ESI-MS chromatogram for the crosslinked Lysozyme is shown in (SI-X).The ESI-MS data thus obtained was stored in the form of .mgffile and then fed into the StavroX 3.6.0.1 software (SI-XI).

StavroX 3.6.0.1 software
For analysis, the original FASTA sequence of Lysozyme along with the ESI-MS data as .mgffile was uploaded into StavroX 3. 6. 0. 1 software to identify the intermolecularly cross-linked peptides.StavroX 3.6.0.1 is a recent version of this software and it enables quick and efficient identification of the intermolecularly cross-linked peptides.This software calculates the theoretical cross-links and estimates them to the precursors of the ESI-MS data stored in the form of .mgffile [33].This further leads to the identification of the hits and scores which are then tabulated.The software provides options to select the desired crosslinker along with the scope to add new cross-linkers.The software itself calculated the actual mass of our new crosslinker as 236.2803 amu, based on the loss of one N-hydroxysuccinimide (NHS) unit and one molecule of N 2 (loss of nitrogen from the azide), which are lost during the incubation and the photolysis steps, respectively.No changes were made in the amino acid sequence section.An unspecific digest option was selected along with minimum peptide length as 1 and maximum peptide length as 10.The precursor precision was selected to be 250.0ppm, fragment ion precision as 1.0 Da with the lower mass limit as 200.0 Da and upper mass limit as 6000.0Da.The S/N ratio was selected to be 2.Only 'b' and 'y' ions were selected with the score cut-off of -1 and pre-score intensity greater than 10%.
After the data was uploaded into the StavroX 3.6.0.1 software along with the FASTA sequence of Lysozyme, appropriate settings were selected as stated above and the software was run to allow the data to be processed.As a result, 123 spectral peaks were compared to 1024687 theoretical candidates out of which, 3791 possible cross-links were identified within 1 minute and 17 seconds of the run.Out of these, the top nine crosslinked peptide fragments with their m/ z values and scores for intermolecular crosslinking are shown in Table 1.Out of all the possible cross-linked candidates, the highest score obtained was 98 (for the m/z 1383.523fragment).
After the analysis was done, a window opened on top, showing a bar chart, where the number of candidates identified in a certain score range (number of hits) to the score hit is plotted.The Decoy analysis figure helped to estimate the quality of the score in our experiment.The blue bars represent the number of candidates from our data set    which is the case with our new hetero-bi-functional cross-linker.As stated earlier, this observation can have an impact on diverse areas of science.

Conclusions
A new small aryl-azido-NHS-hetero-bifunctional cross-linker has been synthesized and characterized spectroscopically.It has been successfully used to crosslink Lysozyme as a 'proof of concept'.This is done in two steps, i.e., via an initial incubation step, which is then followed by the second step of photolysis (366 nm) using a 6 W TLC visualization UV lamp.It was then subjected to SDS-PAGE, excision of the 'dimeric' band, trypsin digestion, desalting using ziptip and analysis by ESI-MS.The data thus obtained was fed into the StavroX 3.6.0.1, a bioinformatics tool, especially suited for studies on intermolecular crosslinking, which established the crosslinking sites.
The above study confirms that the new crosslinker successfully brings about intermolecular crosslinking and that the use of ESI-MS/ MS/MS along with StavroX 3.6.0.1, the bioinformatics software, greatly facilitates the analysis of intermolecular crosslinking of the two protein molecules.
The above technique has implications in diverse fields, e.g., for studies on protein-protein interactions, for proteomics/lipidomics and in systems and structural biology.It is expected to help in preparing monoclonal antibody-drug conjugates, which specifically target tumor cells representing "the pinnacle of such targeting efforts."Recently it has been shown that combining cryo-electron microscopy (cryo-EM) with chemical crosslinking will pave the way for highly efficient in vivo studies.The technique is also important in many areas of materials science.

Volume 9 •
Issue 2 • 1000402 J Anal Bioanal Tech, an open access journal ISSN: 2155-9872 could be introduced.Shorter wavelengths could potentially damage proteins, while longer wavelength exposures are safer.

Figure 1 :
Figure 1: Schematic representation of the chemical cross-linking-mass spectrometry-bioinformatics methodology.

Figure 3 :
Figure 3: Screen shot of the decoy analysis for m/z 1404.900.

Figure 4 :
Figure 4: Detailed spectrum of the intermolecularly cross-linked candidate, m/z 1404.900fragment, with the score of 71, along with its annotation.Modified fragment ions for m/z 1404.900obtained from the StavroX 3.6.0.1 analysis are shown in Table2.

Table 1 :
Major peaks of the crosslinked peptide fragments.

Table 4 :
Earlier reported Identification of crosslinking of Lysozyme with the homo bifunctional cross-linker, the di-NHS ester, BS3.

Table 5 :
Intermolecular cross-links identified by StavroX 3.6.0.1 with our new heterobifunctional cross-linker.Bayley [loc.cit.]based on perflourophenylazides and extended by us to our aryl azides, which do not require ortho-flanking fluorines, as in the case of perflourophenylazides.Aryl azides that lead to 'long-lived' transients bring about more efficient intermolecular cross-linking,