A Simple Carbamidomethylation-Based Isotope Labeling Method for Quantitative Shotgun Proteomics

In this study, we present a new isotope-coded carbamidomethylation (iCCM)-based quantitative proteomics, as a complementary strategy for conventional isotope labeling strategies, with providing the simplicity, ease of use, and robustness. In iCCM-based quantification, two proteome samples can be separately isotope-labeled by means of covalently reaction of all cysteinyl residues in proteins with iodoacetamide (IAA) and its isotope (IAA-13C2, D2), denoted as CM and iCCM, respectively, leading to a mass shift of all cysteinyl residues to be + 4 Da. To evaluate iCCM-based isotope labeling in proteomic quantification, 6 protein standards (i.e., bovine serum albumin, serotransferrin, lysozyme, beta-lactoglobulin, beta-galactosidase, and alpha-lactalbumin) isotopically labeled with IAA and its isotope, mixed equally, and followed by proteolytic digestion. The resulting CM-/iCCM-labeled peptide mixtures were analyzed using a nLC-ESI-FT orbitrap-MS/MS. From our experimental results, we found that the efficiency of iCCM-based quantification is more superior to that of mTRAQ, as a conventional nonisobaric labeling method, in which both of a number of identified peptides from 6 protein standards and the less quantitative variations in the relative abundance ratios of heavy-/light-labeled corresponding peptide pairs. Finally, we applied the developed iCCM-based quantitative method to lung cancer serum proteome in order to evaluate the potential in biomarker discovery study


Introduction
Along with that multidimensional separation platforms coupled with advanced mass spectrometry (MS) have been continuously developed for enlarging the separation capacity, that provides the enhancement of proteomic datasets in the identification of proteins out of biological specimens (e.g., cells, tissues, sera, and etc.), MS or MS/ MS-based quantitative proteomics is also an inevitably important strategy in determining the quantity of targeted protein(s) and in understanding its diverse roles in a cell, related to the regulation of protein activity, protein interaction, and the immune responses. 1,2Furthermore, it is popularly straightforward that MS-or MS n -based quantitative profiling of targeted protein(s) from biological specimens is pathogenically useful for discovering a disease-specific biomarker and the clinical diagnosis of diverse disease states. 3rom these aims mentioned above, a number of stable isotope labeling methods are introduced and verified for obtaining more reliable quantification of proteome samples with high precision and accuracy, compared to label-free experiments. 4Differed from in vivo metabolic labeling such as a stable isotope labeling by amino acids in cell culture (SILAC), 5 an in vitro chemical labeling methods are more accessible than SILAC in sample treatment for isotope labeling of proteome and can be categorized into two groups according by isobaric and nonisobaric approaches.Among in vitro chemical labeling methods (e.g., iCAT, TMT, 18 O-labeling, reductive dimethylation), [6][7][8] an isobaric tags for relative and absolute quantification (iTRAQ) and a mass differential tags for relative and absolute quantification (mTRAQ) are, up to date, considered as a promising strategies for a MS-based relatively and absolutely quantitative method in the determination of Mass Spectrom.Lett.2014 Vol. 5, No. 3, 63-69 peptide(s) or protein(s) of interest through direct measuring the intensity ratios of unique low m/z reporter ions produced by CID fragmentation and corresponding precursor ion pairs with different masses, respectively. 9In mTRAQ, more than two protein samples are individually digested via an enzymatic treatment, and followed by which each of the resulting digests is mixed with mTRAQ's multiplexed reagents to produce the peptides, covalently labeled of the N-terminus and side chain amines of peptides with tags of varying masses (e.g., ∆0, ∆4, and ∆8).Thereafter, the resulting peptides labeled with the multiplexed tags at all N-terminus residues are pooled equally and then applied to a nanoflow liquid chromatography-electrospray-tandem mass spectrometry (nLC-ESI-MS/MS).Finally, the qualitative/quantitative experiments of the multiplex-labeled peptides are simultaneously achieved through direct comparing the relative abundance ratios of MS or MS/MS intensities of the corresponding peptides pair ions obtained by nLC-ESI-MS/MS run.Even if nonisobaric tag-based labeling considers as one of a promising strategies for quantitative proteomics with the robustness and accuracy, there isn't yet fully accessible to universal proteome samples, due to several considerations, for instance, in which the coexistence of thiols or primary amines (e.g., ammonium bicarbonate, ammonium citrate, tris-buffers, and etc.) with proteome sample induces to decreasing the labeling efficiency.Furthermore, the non-specific isotope labeling of all side chain amines of peptides, including hydroxyl residues such as serine and tyrosine, gives rise to an irreproducible results in quantitative experiments.
In this study, we presented a new isotope-coded carbamidomethylation (iCCM)-based isotope labeling for the quantitative proteomics with inexpensiveness, ease of use, simplicity, and robustness.In iCCM-based isotope labeling, the two protein mixtures originated from two different states, for instance, healthy versus cancer sera, are individually isotopically labeled with iodoacetamide (IAA) and its isotope (IAA-13 C 2 , D 2 ), denote as CM and iCCM, respectively, by means of covalently reacting with the cysteinyl residues in proteins, leading to a mass shift of all cysteinyl residues to be + 4 Da.Finally, two protein samples are mixed equally, proteolytically digested, and then quantified via a comparison of the relative abundance ratios of the corresponding pair ions or their fragment ions obtained by nLC-ESI-MS/MS run.To evaluate the efficiency of iCCM-based labeling with respect to reproducibility and precision in protein quantification, two aliquots (43 mg) of 6 protein standards, composed of bovine serum albumin (BSA, 64.4 kDa), serotransferrin (77.1 kDa), lysozyme C (16.2 kDa), beta-lactoglobulin (19.2 kDa), beta-galctosidase (116.5 kDa), alpha-lactalbumin (16.2 kDa), were individually treated with IAA and its isotope, respectively.Each of CM-and iCCM-labeled protein standards was pooled equally, proteolytically cleavaged, and followed by which the resulting peptides were analyzed using a nLC-ESI-MS/MS run in order to evaluate the efficiency of iCCM-based relative quantification, compared to that of mTRAQ using the same protein standards.Finally, we applied the developed iCCM-based quantitative method to human lung cancer serum proteomics.

Tryptic digestion for iCCM-based quantification
Prior to tryptic digestion, two aliquots (43 mg) of the 6 protein standards or human plasma sample was denatured and reduced with 50 mM ABC contained with 10 mM DTT for 2 hours at 37 o C. In order to alkylate the remaining thiol group, two aliquots were individually treated in an ice bath for 2 hours in the dark with 20 mM IAA and isotope labeled-IAA, respectively.L-cysteine (40-fold access of IAA) was added to both aliquots to remove excess IAA and its isotope at room temperature for 30 min.The resulting peptide digests labeled individually with CM and iCCM were mixed at the ratio of 1:1, and then digested with sequencing graded modified trypsin at a ratio of 50 : 1 (protein : trypsin, w/w) at 37 o C for 18 hours.In order to stop the tryptic digestion, the solution was treated with 1% formic acid (v/v).The digested peptides were stored in -80 o C before use.For human sera samples, the endoglycosidic treatment, in particular, was first carried out using a PNGase F, prior to tryptic proteolysis, to minimize the occurrence of trypsin miscleavage that might be caused by a steric hindrance of N-linked glycans covalently attached to asparagine (N) with the consensus motif as N-X-S/T (X for any amino acid except proline).

mTRAQ labeling
Prior to mTRAQ-based stable isotope labeling, the same 6 protein standards as used in iCCM-based isotope labeling was tryptically digested and desalted by an Oasis HLB cartridge to remove chemical hindrances (i.e., thiols or primary amines) containing in protein solution.Eluted peptides were lyophilized using SpeedVac, and reconstituted with 50 mM triethyl ammonium bicarbonate.Two aliquots (43 µg) of tryptic digests from 6 protein standards were isotopically labeled with each of mTRAQ reagents (∆0 and ∆4) with a 50 uL isopropanol for 1 hour at room temperature.Each of light (∆0)-and heavy (∆4)-labeled digests was re-lyophilized, reconstituted with 0.1% formic acid solution, and then pooled equally for next shotgun proteomic analysis.

NanoLC-ESI-MS/MS
nLC-ESI-MS-MS was carried out by a binary gradient elution and the model 1260 capillary LC system from Agilent Technologies coupled to a Q-Exactive Fourier Transform orbitrap-mass spectrometer (FT orbitrap-MS) from Thermo Scientific (Bremen, German) via electrospray ionization.The pre-column (RP) to trap sample was used with an analytical capillary RPLC column (150 mm × 75 µm-i.d.).A pulled tip analytical capillary column used in this study was prepared by packing with Magic C 18 AQ (3 µm, 100 Å) resins.For the purpose of RPLC separation for peptides mixture, the trap column was prepared in a capillary (100 µm-i.d., 360 µmo.d.) with an end frit (2 mm in length), and the capillary was packed in sequence with Magic C 18 AQ (3 µm, 200 Å) resins for 0.5 cm.The trap column and the analytical column were connected via a PEEK microcross. 10or the RPLC runs, two buffer solutions, (A) 2% ACN in water and (B) 93% ACN with 5% DMSO in water, both with 0.1% formic acid, were used as mobile phase solutions.The flow rate for RPLC run was 200 nL/min and the binary gradient run conditions were as follows: Samples were loaded into the trap column first by 0% B for 10 min, and initial increase to 8% B for 1 min then linearly increased to 15% up to 20 min., to 32% up to 55 min, ramped to 80% for 3 min, was held at 80% for 10 min to clean the RP column, and finally it was decreased to 0% over 2 min, and was held for 20 min to re-equilibrate the RP column.In human lung cancer proteomic, the RPLC runs were used the same as mentioned above, excepting that the gradient conditions was modified to be 25% B up to 115 min.from 32% B up to 55 min.The eluted peptides from the capillary column were directly fed into the FT orbitrap-MS via electrospray ionization in positive ion mode with 270 o C of capillary temperature and a voltage of 2.5 kV was applied through the Pt wire connected to the microcross.The column outflow rate was 200 nL/min by spliting the capillary (25 µm-i.d)attached to the microcross for MS precursor scan (m/z 300-1800).Data-dependent MS/MS was carried out for top ten prominent precursor ion from each precursor scan.Dynamic exclusion duration for MS/MS was 30 s, normalized collision energy was 27 eV for a higher-energy collisional dissociation (HCD) experiment.The resolution for full scan and tandem mass spectra was set to 70,000 and 35,000, respectively.The maximum injection time used 60 ms for full scan and data dependent MS/MS.The tandem MS spectra were analyzed by using Proteome Discoverer software (version 1.3.0.339) with the false positive option based on the nrNCBI proteome database (Version 20120623).The mass tolerance values were 1.2 Da for Precursor ions and 0.8 Da for product ions.The results were screened based on the following requirement; A cross-correlation (Xcorr) value larger than 1.9, 2.7, and 3.7 for singly-, doubly-, and triply-charged ions, respectively.The variable modification was set as oxidation of methionine (+15.99492Da), carbamidomethylation (+57.021464Da) and isotope-coded carbamidomethylation (+61.04073Da) of cysteine, as well as deamidation of asparagine (+0.98402Da) for endoglycosidic cleavage.When used mTRAQ's multiplexed reagents, tryptic peptides tagged separately with two different masses of ∆0 (+140.09496Da) and ∆4 (+144.10206Da) of histidine, lysine, serine, threonine, tyrosine, and N-terminus were included.Two tryptic miss-cleavages were defined.

Results and Discussion
The developed iCCM-based isotope labeling strategy presented in this study was aimed for providing more reliable quantitative datasets in proteomic analysis with high accuracy and precision, compared to conventional nonisobaric isotope labeling approaches.In particular, it is characteristic that an iCCM-based isotope labeling of proteome was easily achieved during general proteolysis without any other additional purification steps, as shown in Figure 1, whereas mTRAQ-based approach might be inevitably accompanied with buffer exchange step to remove several interfering substances (e.g., thiols, primary amines) from protein samples with respect to increasing a labeling efficiency, as well as the limited pH value of 8 and over.To evaluate the efficiency of iCCM-based quantification, each of two aliquots (43 mg) of 6 protein standards was individually reduced with 10 mM DTT, and then isotopically labeled with IAA for CM and its isotope for iCCM, respectively.The resulting CM-and iCCM-labeled proteins were pooled in the ratio of 1:1, tryptically digested, and followed by which the resulting peptide mixture were analyzed using a nLC-ESI-MS/MS run.
Figure 2 shows a base peak chromatogram (BPC) obtained from nLC-ESI-MS/MS experiments of the tryptic peptides (300 ng) derived from iCCM-based isotope labeling (Figure 2a), along with the MS scan spectrum eluted at t r = 47.33 min (Figure 2b).Out of the corresponding peptide pair ions detected in Figure 2b ) derived from both doubly charged peptide pair ions (m/z 677.82 and m/z 679.83) resultant from covalently incorporating each of IAA and its isotope into all cysteinyl residues of proteins during conventional proteolysis.
For the purpose of further evaluation, the same protein standards as used in former experiment were isotopically labeled using mTRAQ to make a comparison between the developed iCCM-based quantification and that of mTRAQ with respect to the efficiencies in a qualitative/quantitative experiment.Figure 3 shows the peptide sequence coverage of 6 protein standards and their relative abundance ratios of identified corresponding peptide pair ions differentially isotope-labeled with each of iCCM and mTRAQ.Based on the our quantitative experiments in triplicate, it is shown that the performance of an iCCM-based isotope labeling  approach is more superior to that of mTRAQ in both of a number of identified peptides and the reproducibility in quantitative variations.Resultingly, it is impressive that each sequence coverage of 6 protein standards was averagely more improved to at least ~10% than that of mTRAQ (Figure 3a).Furthermore, the standard deviation (S.D.) of 0.06 in iCCM reduced nearly two-fold less than that (0.13) of mTRAQ on the basis of the observed ratios of the corresponding peptide pair ions from 6 protein standards (Figure 3b), while it shown that each of the median observed ratios between those iCCM and mTRAQ was similar to be 0.92 for iCCM and 0.96 for mTRAQ, respectively.For the reason of which mTRAQ is a bit as lacking in quantitative reproducibility, it might be caused by two major reasons; i) the interfering substances (e.g., thiols and primary amines) that may hinder the covalently reaction of mTRAQ reagent with peptide, and ii) the nonspecific isotope labeling of all side chain amines of peptides, including hydroxyl residues such as serine and tyrosine.In iCCM, the tryptic peptide LSFNPTQLEEQCHI from beta-lactoglobulin, for instance, can be only detected a single corresponding peptide pair as LSFNPTQLEEQcHI (m/z 858.91, +2) for CM and LSFNPTQLEEQc*HI (m/z 860.91, +2) for iCCM, whereas more than two residues of tryptic peptides were randomly labeled in mTRAQ, as shown in Table 1.
Finally, we applied the developed iCCM-based stable isotope labeling to lung cancer serum proteome in order to assess the usefulness in the quantitative profiling of a large scaled protein samples.We first prepared the CM-/iCCMlabeled protein mixtures by which the liver cancer sera and controls are isotopically labeled separately with CM and iCCM, respectively, mixed with the ratio of 1:1 (lung cancer:control), and then subjected to the proteolytic treatment with a trypsin.Finally, the resulting CM-/iCCMlabeled digests were introduced to nLC-ESI-MS/MS experiments.Furthermore, the same sera samples as used in iCCM labeling also prepared via a non-isobaric labeling with mTRAQ in order to compare with quantitative results from those iCCM.The two BPCs obtained by shotgun proteomic analyses of each of iCCM-labeled peptides and mTRAQ-labeled one, as shown in Figure 4a and 4c, respectively.From shotgun analyses, 158 proteins from 378 peptides, including at least one or more cysteine residues, were identified out of total 243 proteins and simultaneously allowed the quantitative profiling of up-/down-regulated peptides through measuring the observed ratios of CM-/ iCCM-labeled corresponding pair ions.As a result, we found     4d).Moreover, it shown that the other 7 proteins in the levels of up-/downregulation were also similar to their quantitative results in literature for lung cancer sera.Consequently, the developed iCCM-based isotope labeling can be considered as a complementary strategy for quantitative proteomics with high reproducibility and robustness compared to conventional isotope labeling approaches.

Conclusion
In this study, we introduced a new iCCM-based isotope labeling strategy can be considered as one of promising strategies for quantitative proteomics, compared to conventional isotope labeling methods.Several advantages of iCCM-labeling method can be summarized as follows: i) stable isotope labeling in protein level can bypass the chances of unequal digestion in two different protein samples or of unequal retrieval of peptides compared to those stable isotope labeling strategies in peptide level, ii) there is not necessary of any purification step to remove some chemical hindrances in conventional isotope labeling methods, and iii) light-and heavy-IAA reagents required for iCCM are much cheaper than typical isotope labeling reagents that can be easily accessible.First evaluation of iCCM-based stable isotope labeling was carried out by means of which the tryptic peptides mixed equally with both of CM-and iCCM-labeled protein 6 protein standards and then quantitative analyzed using nLC-ESI-FT orbitrap-MS/ MS.From our shotgun proteomic experiments, the peptide mixtures labeled with CM-and iCCM, having a mass difference of + 4 Da between them, can be simultaneously identified and quantified during MS/MS experiments.In addition, even though iCCM-based isotope labeling method presented in this study has the limitation of which only proteolytic peptides containing cysteinyl residues are meaningful in the quantitative examination, iCCM-based isotope labeling in protein level spontaneously allows to the minimization of quantitative variations caused by the irreproducibility of the activity of proteases and the recovery of vial-to-vial, compared to mTRAQ whereby the nonisobaric labeling is carried out in peptide level.Consequently, we confirmed that an iCCM-based isotope labeling can be considered as a complementary method for the quantitative proteomics.

6 +
, both of m/z 677.82 [M+2H + ] 2+ and m/z 679.83 [M+2H + ] 2+ were dominantly observed with the mass difference of 2 Da between both ions (Figure2c), and simultaneously identified via a MS/MS experiments as DYELLcLDGTR for CM-labeled peptide and DYELLc*LDGTR for iCCMlabeled one from serotransferrin, respectively.From the superimposed MS/MS spectra in Figure2d, it can be also confirmed that the mass difference of 4 Da between the five y-series ions (y

Figure 1 .
Figure 1.The schematic workflow of iCCM-based isotope labeling method, leading to a mass shift of all cysteinyl residues to be + 4 Da, for quantitative proteins

Figure 2 .
Figure 2. Base peak chromatogram (BPC) obtained from nLC-ESI-MS/MS run of the tryptic peptides (100 ng) derived from iCCMbased sample treatment (a) along with the MS scan spectrum eluted at t r = 47.33 min (b).MS spectrum of both m/z 677.82 [M+2H + ] 2+ and m/z 679.83 [M+2H + ] 2+ observed with the mass difference of 2.01 Da between both ions (c), and simultaneously identified via a MS/ MS experiments as DYELLcLDGTR for CM-labeled peptide and DYELLc*LDGTR for iCCM-labeled one from serotransferrin, respectively (d).

Figure 3 .
Figure 3.The peptide sequence coverage of six protein standards (a) and their relative abundance ratios (b) of identified corresponding peptide pairs from six protein standards labeled with each of iCCM and mTRAQ, respectively.

Figure 4 .
Figure 4. Two BPCs were obtained by introducing two different isotope-labeling strategies of iCCM (a) and mTRAQ (c) to lung cancer sera sample (light) and controls (heavy).Out of identified peptides labeled with either CM and iCCM, the superimposed extracted ion chromatograms (EICs) whereby the comparison of two ions of m/z 749.67 (+3) and m/z 751.03 (+3), eluted at t r = 52.15min, were identically assigned as a CM-(lung cancer) and iCCM-labeled forms (control) of MSAVEGIcTSESPVIDHQGTK and MSAVEGIc*TSESPVIDHQGTK from Complement C5, show the relative abundant ratio of 2.63±0.07 for lung cancer : control.(b).The same corresponding peptide pair as that of iCCM were quantitatively observed as 2.22 via mTRAQ-based labeling (d).

Table 1 .
Comparison of three tryptic peptides of beat-lactoglobulin labeled with mTRAQ and iCCM and their modified sequences assigned using Proteome Discoverer software (version 1.3.0.339)

Table 2 .
A total 13 peptides (from 10 proteins) showing more than 2-folds of change in their levels of lung cancer patient samples compared to the control obtained by measuring the relative abundance ratios of CM-/iCCM-labeled peptide pair ions