Deletion of Immunoproteasome Subunits Imprints on the Transcriptome and Has a Broad Impact on Peptides Presented by Major Histocompatibility Complex I molecules*

Proteasome-mediated proteolysis plays a crucial role in many basic cellular processes. In addition to constitutive proteasomes (CPs), which are found in all eukaryotes, jawed vertebrates also express immunoproteasomes (IPs). Evidence suggests that the key role of IPs may hinge on their impact on the repertoire of peptides associated to major histocompatibility complex (MHC) I molecules. Using a label-free quantitative proteomics approach, we identified 417 peptides presented by MHC I molecules on primary mouse dendritic cells (DCs). By comparing MHC I-associated peptides (MIPs) eluted from primary DCs and thymocytes, we found that the MIP repertoire concealed a cell type-specific signature correlating with cell function. Notably, mass spectrometry analyses of DCs expressing or not IP subunits MECL1 and LMP7 showed that IPs substantially increase the abundance and diversity of MIPs. Bioinformatic analyses provided evidence that proteasomes harboring LMP7 and MECL1 have specific cleavage preferences and recognize unstructured protein regions. Moreover, while differences in MIP repertoire cannot be attributed to potential effects of IPs on gene transcription, IP subunits deficiency altered mRNA levels of a set of genes controlling DC function. Regulated genes segregated in clusters that were enriched in chromosomes 4 and 8. Our peptidomic studies performed on untransfected primary cells provide a detailed account of the MHC I-associated immune self. This work uncovers the dramatic impact of IP subunits MECL1 and LMP7 on the MIP repertoire and their non-redundant influence on expression of immune-related genes.

It has been assumed that the key role of IPs may hinge on their impact on the repertoire of peptides associated to MHC I molecules. Indeed, cell surface levels of MHC I molecules are reduced in spleen cells from Lmp7 Ϫ/Ϫ and Lmp7 Ϫ/Ϫ Mecl1 Ϫ/Ϫ mice (12). Furthermore, studies of selected epitopes revealed that some MHC I-associated peptides can be generated only by CPs, some only by IPs, and others by both types of proteasomes (7,(13)(14)(15)(16)(17)(18). In vitro proteasome digestion experiments suggest that, compared with CPs, IPs have greater efflux and cleavage rates, and generate more N-extended versions of MHC I epitopes (19,20). In addition, immunosubunits alter proteasome structure and cleavage site preferences (7,21). Nevertheless, the aforementioned studies cannot predict the overall impact of IPs on the MHC I peptide (MIP) repertoire in vivo, mainly for three reasons. First, in vitro proteasomal digestion may not reproduce in vivo conditions, where most (MIPs) derive from rapidly degraded proteins that translocate into the endoplasmic reticulum a few seconds after cleavage by proteasomes (22,23). Second, MIP presentation is orchestrated by several steps downstream of proteasomal digestion so that only a small fraction of peptides generated by proteasomes are presented by MHC I molecules (24 -26). Finally, previous studies did not take into account potential differences in transcription regulation by CPs and IPs. Whereas CPs clearly regulate transcriptional activation (27)(28)(29)(30), the potential impact of IPs on transcription remains unexplored. Conceivably, IPs and CPs might differentially regulate MHC I presentation of a given peptide not only by affecting degradation of the peptide's source protein but also by modulating transcription of the gene encoding that peptide. In this perspective, the goal of our work was to obtain a direct and global evaluation of the impact of IPs on the repertoire of MIPs. To this end, we used a recently described label-free quantitative approach to analyze the MIP repertoire of DCs expressing or not expressing IP subunits MECL1 and LMP7 (31). Also, we analyzed the gene expression profile of those two DC populations using microarrays.

EXPERIMENTAL PROCEDURES
Mice-Mouse cells were prepared on a C57BL/6 background and maintained in a specific pathogen free environment. Wild-type (WT) and ␤2-microglobulin (␤2m) Ϫ/Ϫ 1 mice were obtained from The Jackson Laboratory. Lmp7 Ϫ/Ϫ Mecl1 Ϫ/Ϫ (dKO) mice were generously provided by Dr T.A. Griffin from the Medicine College of the University of Cincinnati.
Peptide Extraction and MS Analyses-Three biological replicates (5 ϫ 10 8 DCs per replicate) were prepared from a total of 28 WT mice, 28 dKO mice and 47 ␤2m Ϫ/Ϫ mice. MIPs were analyzed as previously reported (31) with minor modifications. MIPs obtained after acid elution (34) were separated using an off-line 1100 series binary LC system (Agilent Technologies, Mississauga, ON, Canada) to remove contaminating species. Peptides were loaded on a homemade SCX column (0.3 mm internal diameter x 45 mm length) packed with strong cation exchange (SCX) bulk material (Polysulfoethyl A™, PolyLC). Peptides were fractionated with a gradient of 0 -25% B after 33 minutes, 25-60% B after 35 minutes (Solvent A ϭ 5 mmol/L ammonium formate, 15% acetonitrile, pH3; Solvent B ϭ 2 mol/L ammonium formate, 15% acetonitrile, pH3). MIPs were collected in five consecutive fractions and brought to dryness using a speedvac. MIP fractions were resuspended in 2% aqueous acetonitrile (0.2% formic acid) and analyzed by nanoLC-MS/MS on a LTQ-Orbitrap mass spectrometer (Thermo Fisher Scientific) (31). Full mass spectra were acquired with the Orbitrap analyzer operated at a resolving power of 60,000 (at m/z 400) and collision-activated dissociation tandem mass spectra were acquired in data-dependent mode with the quadrupole linear ion trap analyzer. Mass calibration used either an internal lock mass [protonated (Si(CH 3 ) 2 O)) 6 ; m/z 445.12057] or external calibration using Calmix (caffeine, MRFA and ultramark) and typically provided mass accuracy within 5 ppm for all nanoLC-MS/MS experiments.
MS/MS Sequencing and Peptide Clustering-Data were analyzed using Xcalibur software and peak lists were generated using Mascot distiller (version 2.1.1, www.matrixscience.com). Database searches were performed against an International Protein Index mouse database (version 3.23 containing 51,536 sequences and 24,497,860 residues) using Mascot (version 2.2, www.matrixscience.com) with a mass precursor tolerance of Ϯ 0.05 Da and a fragment tolerance of Ϯ 0.5 Da. Searches were performed without enzyme specificity and a variable modification of oxidized Met. All search results were filtered using an MHC motif filter based on the predicted mouse MHC I allele motifs. Raw data files were converted to peptide maps comprising m/z values, charge state, retention time and intensity for all detected ions above a threshold of 15,000 counts using in-house software (Mass Sense) (31). Peptide maps were aligned and clustered together to profile the abundance of Mascot identified peptides using hierarchical clustering with criteria based on m/z and time tolerance (Ϯ0.01 m/z and Ϯ1.5 min). This resulted in a list of non-redundant peptide clusters for all replicates of all samples to be compared. MIPs were further inspected for mass accuracy and MS/MS spectra were validated manually. The Sidekick resource (http://www.bioinfo.iric.ca/ sidekick/Main) was used to identify MIP source proteins. The InnateDB resource (35) was used to identify significantly enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways associated to peptide source genes from DCs and thymocytes. The list of MIPs reported in the present work has been provided to The Immune Epitope Database and Analysis Resource (http://beta.immuneepitope. org/) (36).
Cytotoxicity Assays-In vitro carboxyfluorescein succinimidyl ester (CFSE)-based cytotoxicity assays were performed as previously described with minor modifications (31). CFSE-based assays are more sensitive than classic 51 Cr-release cytotoxicity assays (37)(38)(39). Briefly, 10 6 WT and dKO DCs were injected intravenously into mice from both genotypes on days 0 and 7. On day 14, splenocytes from the four immunized mice (WT mice injected with WT or dKO DCs, dKO mice injected with WT or dKO DCs) were used as effector cells in cytotoxicity assays (39). Target cells were concanavalin A treated WT and dKO splenocytes. The percentage of specific lysis was measured as follows: [(number of remaining CFSEϩ cells after incubation of target cells alone Ϫ number of remaining CFSEϩ cells after incubation with effector cells)/number of CFSEϩ cells after incubation of target cells alone] ϫ 100.
Bioinformatic Analysis of Cleavage Motifs-All 417 peptides extracted from DCs were used in studies of amino acid usage in MIPs. In analyses of flanking regions, we eliminated peptides that can originate from multiple source proteins with different N-or C-terminal flanking sequences. Peptides used for further analyses of flanking regions (376 for upstream regions, 369 for downstream regions) were ranked according to their WT/dKO fold difference in abundance as determined by MS analyses. We next generated a Euclidean distance matrix to compare amino acid usage at each position. We thereby compared amino acid usage by MIPs located at the left versus the right of each ranked peptide. A bootstrap procedure (100,000 iterations) was performed to evaluate whether the distance measured was significant (p Ͻ 0.05 was considered significant). The analysis was performed for each position of the MHC peptides as well as for 10 residues upstream of the peptide N terminus and downstream of the C terminus. For positions that gave a p value Ͻ 0.001, we used the Kolmogorov-Smirnov statistical test to determine which specific amino acids were over-or under-represented in WT as opposed to dKO DCs. The R software was used to visualize amino acid distributions (http://www.R-project.org), and the program SEG (http://mendel. imp.ac.at/METHODS/seg.server.html) to determine unstructured regions in source proteins (window size ϭ 12, low complexity ϭ 2.5, high complexity ϭ 2.8) (40). The number of MIPs from unstructured regions in presence or in absence of IP was measured using a Chi-squared test (with p value Ͻ 0.05).
Microarrays and Genomic Analyses-Total RNA was extracted from WT and dKO DCs with TRIzol RNA reagent (Invitrogen) as instructed by the manufacturer. Samples were purified using DNase (Qiagen, Mississauga, ON, Canada) and the RNeasy Mini kit (Qiagen), and the overall quality was analyzed with the 2100 Bioanalyzer (Agilent Technologies). Purified RNA (10 g/sample) was hybridized on MM8 385K NimbleGen chips at the Genomics core facility of the Institute for Research for Immunology and Cancer according to the manufacturer's instruction. Arrays were scanned using a GenePix4000B scanner (Axon Instruments, Molecular Devices, Sunnyvale, CA) at 5 m resolution. Data were extracted and normalized using the NimbleScan 2.4 extraction software (NimbleGen Systems, Madison, WI). Further microarray analyses were performed using GeneSpring GX 7.3.1. The complete microarray datasets have been deposited in ArrayExpress (http://www.ebi.ac.uk/ arrayexpress) under accession number E-TABM-750. Two-sided Student t test was used to compare transcript abundance in WT versus dKO DCs. Spearman's rank correlation was used to evaluate the relation between MIP abundance and source mRNA expression. The Gene set organization/visualization module of the Web-Based Gene Set Analysis Toolkit (WebGestalt) (41) was used to represent the chromosomal localization of the differentially expressed genes, and the gene enrichment on specific chromosomes was measured using the Database for Annotation, Visualization, and Integrated Discovery (DAVID) Bioinformatics Resource (42).

RESULTS
Experimental Design of Peptidomic Studies-To evaluate the impact of IPs on the MIP repertoire, we elected to study DCs because they are quintessential antigen-presenting cells and constitutively express IPs. Using a recently described label-free quantitative proteomics method (31), we analyzed MIPs eluted from mature WT and double knockout (dKO) DCs (Fig. 1A). WT DCs express both CPs and IPs (alike mature DCs under physiological in vivo conditions (9)) whereas dKO DCs do not express the IP-subunits LMP7 and MECL1 (Fig.  1D). IP subunits are cooperatively incorporated into proteasomes, thereby curtailing the formation of mixed proteasomes containing IP and CP subunits (43). In line with this result, we noted a 50% decrease in the amount of LMP2 protein in dKO cells, suggesting that LMP2 is unstable in the absence of the two other immunosubunits (Fig. 1D). As a negative control we analyzed DCs derived from ␤2m-deficient mice. Since ␤2m is essential for formation of stable peptide-MHC I complexes, cells lacking ␤2m are MHC I-deficient. DCs generated from WT, dKO, and ␤2m-deficient mice shared a mature (CD11c ϩ CD86 ϩ IA bϩ ) myeloid (CD8␣ Ϫ CD11b ϩ ) phenotype (Fig. 1B,C).
Peptides eluted from DCs were fractionated by off-line LC using a SCX column, then analyzed by nano-LC combined with tandem MS (nanoLC-MS/MS). Peptide maps were generated from each analysis to define the coordinates (m/z, retention time, ion abundance) of identified ions. The corresponding peptide maps were then clustered across conditions and replicate analyses to identify unique peptide ions and profile their changes in abundance. Subtraction of "contaminant peptides" eluted from ␤2m-deficient cells allowed specific identification of genuine MIPs (Fig. 1A) (31). MS/MS spectra were manually verified for all MIPs. The following softwares were used to associate peptide sequences with specific MHC I allelic products: smm, SYFPEITHI (H2D b and H2K b ), and Rankpep (Qa2) (31). This resulted in the identification of 417 unique MIPs (derived from 389 source proteins) in WT DCs (supplemental Table  S1, Fig. S2).
The MIP Repertoire of DCs Conceals a Unique Signature-We reported previously that MIPs eluted from thymocytes derived preferentially from transcripts whose abundance was higher in the thymus than in other tissues (31). This suggested that the MIP repertoire might conceal a cell typespecific signature. To directly test this concept, we compared the MIP repertoire of DCs (reported herein) to that of thymocytes (31) derived from the same strain of mice (WT, C57BL/ 6). We identified more peptides in DCs than in thymocytes (417 versus 189; Fig. 2A). This discrepancy can be attributed, at least in part, to the fact that MHC I molecules are much more abundant on DCs than thymocytes (Fig. 2B). In both cell types we found more peptides associated to MHC Ia (H2K b , H2D b ) than to MHC Ib (Qa1, Qa2) allelic products (Fig. 2C). Of note, the proportion of peptides with a Qa2-binding motif was greater in DCs than in thymocytes (p Ͻ 0.01; chi-square test). However, the key finding was that 72 of 189 peptides eluted from thymocytes were not detected in DC eluates, although we recovered less peptides from thymocytes (189) than from DCs (417) (Fig. 2A). This means that about 60% of MIPs present at the surface of thymocytes are also present on DCs whereas 40% are thymocyte-specific. Of 417 peptides recovered from DCs, 117 were shared with thymocytes, whereas 300 were DC-specific.
To further evaluate whether the MIP repertoire might reflect cell type-specific intracellular signaling events, we used the InnateDB resource (35) to analyze pathways catalogued in the KEGG database. We specifically evaluated whether specific KEGG pathways were overrepresented in the group of genes encoding peptides eluted from DCs and/or thymocytes (Fig.  2D). In each dataset (DCs and thymocytes), about 28% of peptide source genes were linked to specific KEGG pathways. Notably, 45% of pathway-connected genes (12.5% of the whole dataset) were associated with pathways significantly overrepresented in the DC and/or thymocyte gene dataset. Many MIP source genes were connected to pathways overrepresented in both DCs and thymocytes (e.g., p53 signaling and ribosome biogenesis). Of special interest, peptide source genes belonging to specific pathways were enriched uniquely in DCs or thymocytes. Many of these pathways reflected the function and differentiation of DCs and thymocytes. For instance, the MIP repertoire of DCs was enriched in peptides whose source genes are involved in myeloid differentiation, proteasome function and Toll-like receptor signaling. Besides, peptide source genes involved in tight junction, purine metabolism and cell cycle were overrepresented in the thymocyte MIP repertoire. For DC and thymocyte gene datasets, a complete list of overrepresented pathways and their constituent genes can be found in supplemental Table S2. Together, these results show that the MIP repertoire conceals a cell type-specific signature that reflects singular functional properties.
IPs Increase the Abundance and Diversity of MIPs at the Surface of DCs-To evaluate the impact of IPs on the MIP repertoire, we compared MIPs eluted from WT and dKO DCs by MS analysis as depicted in Figure 1A. In accordance with previous studies (31), we found that 95% of peptide ions showed a variation of less than Ϯ 3-fold in abundance across biological replicates (n ϭ 3). Therefore, we considered that peptides were differentially presented by MHC I molecules when the fold difference in abundance between WT and dKO DCs was greater than 3. Of the 417 peptides eluted from WT DCs, 212 were expressed at similar levels (within 3-fold) in dKO DCs (Fig. 3A). Remarkably, 199 peptides were overexpressed in WT relative to dKO DCs. Among those 199 peptides, 60 were detected exclusively in WT DCs. Only six peptides were slightly overexpressed (3-to 5-fold) in dKO relative to WT DCs and none were unique to dKO DCs. Peptides with the largest fold differences in abundance are listed in Table I, and the full list of peptides is available in supplemental Table S1. In accordance with our data on MIP abundance, flow cytometry analyses revealed that, as previously shown using WT versus dKO spleen lymphocytes (12), expression of cell surface H2D b and H2K b was higher by approximately 2.1-fold on WT than dKO DCs (Fig. 3B). However, protein immunoblot analyses on whole cell lysates showed that total cellular amounts of H2D b and H2K b heavy chains were similar in both types of DCs (Fig. 3C), suggesting that the lower level of surface expressed MHC I on dKO DCs was caused by a limited peptide supply and not an altered level of MHC I molecules available.
After immunization with WT DCs, dKO mice generated WTspecific cytotoxic T cells (Fig. 3D,G). However, WT mice did not generate cytotoxic effectors against dKO cells (Fig. 3E,F). That unidirectional immunogenicity supports our peptidomic analyses showing that numerous peptides were uniquely detected on WT DCs, whereas no peptides were found only on dKO DCs (Fig. 3A). Furthermore, it is consistent with the previous demonstration that WT splenocytes were immunogenic for Lmp7 Ϫ/Ϫ mice, but not vice versa (44). However, dKO DCs may not be optimal antigen-presenting cells. Indeed, when dKO and WT DCs were coated with exogenous SIINFEKL and injected in mice bearing K b /SIINFEKL-specific T cells, dKO DCs proved to be immunogenic but less so than WT DCs (supplemental Fig. S1). Collectively, these results show that the presence of IPs has a major impact on the global MIP repertoire, by increasing both the abundance and the diversity of MIPs.
IPs Have Specific Cleavage Preferences-Proteasomal cleavage generates the final C terminus of MIPs whereas their N terminus can be further trimmed by aminopeptidases in the cytosol and the endoplasmic reticulum (22,25). Proteasomal cleavage can be influenced by approximately five to seven residues flanking the cleavage site on either side (44,45). To determine whether the MIP repertoire generated in the presence or absence of IPs might reveal discrete cleavage preferences, we analyzed the amino acid composition of MIPs and their flanking residues. The 417 peptides extracted from DCs were used in studies of amino acid frequencies in MIPs.  MIPs that were not detected on dKO cells were attributed the intensity value 15,000, which represents the threshold for limit of detection. Ratios with the symbol Ն indicate peptides for which the exact fold change could not be measured because they were detected only in WT DCs. See supplemental Table S1 for a complete list of MIPs detected on DCs.

The Role of Immunoproteasomes
In analyses of flanking regions, we eliminated peptides that can originate from multiple source proteins with different N-or C-terminal flanking sequences. Remaining peptides were ranked according to their WT/dKO fold difference in abundance as determined by MS analyses. Peptides with high WT/dKO ratios are IP-dependent. We next generated a Euclidean distance matrix to compare amino acid usage at each position. Each ranked peptide was used consecutively as a reference. We thereby compared amino acid usage by MIPs having higher versus lower WT/dKO ratios than the reference peptide. A bootstrap procedure (100,000 iterations) was performed to evaluate whether the distance measured was significant. The analysis was performed for each position of the MHC peptides as well as for 10 residues upstream of the peptide N terminus and downstream of the C terminus. We detected no bias in amino acid frequencies at various positions of the MIPs per se (data not shown). However, we found highly significant deviations of amino acid frequencies at two peptide flanking positions: N-5 upstream of the N terminus and Cϩ2 downstream of the C terminus of MIPs (Fig. 4A,B; p Ͻ 0.001). We performed a detailed analysis of amino acid frequencies at positions N-5 and Cϩ2 using the Kolmogorov-Smirnov test. At position N-5, peptides overexpressed in WT cells showed decreased frequencies of glycine and asparagine residues (Fig. 4C). It is not clear how cytosolic and endoplasmic reticulum aminopeptidases processing MIP precursors choose their substrates (46,47). Therefore, further work is needed to decipher the significance of the amino acid bias at position N-5. For peptides that have high WT/dKO ratios, deviation at position Cϩ2 was particularly dramatic and was characterized by an increased usage of proline and polar residues [lysine and glutamine], with a decreased frequencies of asparagine and hydrophobic residues [leucine and isoleucine] (Fig. 4D). Asparagine and hydrophobic residues are enriched in ␣-helices and ␤-sheets, whereas proline and polar residues are enriched in unstructured protein regions, which represent 20 -30% of the mammalian proteome (48,49). The program SEG, which computes sequence complexity, has been used successfully to predict unstructured regions (i.e., lacking secondary and tertiary structures) (40). Using SEG, we found that amino acids C-terminal of IP-dependent peptides (high WT/dKO ratio) derived more frequently from unstructured protein regions than amino acids C-terminal of IP-independent peptides (Fig. 4E). The IP bias toward unstructured protein domains was specific for amino acid residues next to the MIP C terminus (e.g., Cϩ2), and was not detected for the MIP themselves nor for residues upstream of their N terminus (data not shown). We conclude that IPs display specific cleavage preferences and propose that the presence of IPs leads to enhanced MHC I presentation of peptide sequences adjoining the unstructured proteome.
IPs Have a Non-Redundant Impact on the Transcriptome of DCs-Aside from their role in protein degradation, CPs also regulate transcription (27)(28)(29)(30). Whether IPs may regulate tran-scription differently than CPs is unknown. Thus, we could not assume a priori that peptide overexpression in IP-expressing DCs was due solely to enhanced degradation of peptide source proteins by IPs. Theoretically, IPs might also mold the MIP repertoire by differential regulation of peptide source genes. To test this assumption, we compared the transcriptome of WT and dKO DCs using NimbleGen MM8 385K microarrays. We found that 226 transcripts, representing 171 genes and corresponding to 0.5% of the transcriptome, were differentially expressed between WT and dKO DCs (Fig. 5A; Supplemental Tables S3 and S4). There was no correlation between transcript and MIP abundance (Fig. 5B). We therefore conclude that differential expression of MIPs in WT versus dKO DCs cannot be ascribed to differential transcription of peptide source genes. Nevertheless, a selected set of transcripts was differentially expressed in the presence or absence of IPs. The loci encoding those transcripts were not randomly distributed in the genome. Somewhat unexpectedly, they were clustered in discrete regions located primarily in chromosomes 4, 8, 9, and 17, and practically absent in chromosomes 3, 10, 13, 14, 16, 19, and X. The gene clusters were particularly enriched in chromosomes 4 and 8 (p ϭ 10 Ϫ12 and 10 Ϫ8 , respectively; Fig. 5C).
To evaluate whether differentially expressed genes might be relevant to DC function, we focused on transcripts for which functional annotation data was available (50% of differentially expressed transcripts; Fig. 5D). Specifically, we excluded transcripts for which no biological data were available, or when the sole available evidence was inferred from electronic annotation that was not assigned by a curator (ND and IEA GO codes) (Fig. 5D). Remarkably, 56% of functionally annotated genes had a demonstrated or putative role in DC function or immune signaling (Fig. 5D). Those 48 genes were aggregated into six functional categories: resistance to infection, antigen presentation, phagocytosis, immune signaling, DC maturation, and DC migration. A complete annotated list of genes belonging to these categories is available in supplemental Table S3. We conclude that IPs have a non-redundant impact on expression of a selected of set of genes that regulate different aspects of DC function.

DISCUSSION
By using a label-free quantitative proteomics approach, we gained valuable insights into the impact of IPs on the molecular composition of the immune self. Our peptidomic studies allowed us to generate a most comprehensive biochemical definition of the MIP repertoire. Since we achieved this using untransfected primary DCs, our data provide a broad and faithful representation of the MHC I-restricted immune self. The present work yielded three major observations. First, the MIP repertoire conceals a cell type-specific signature. Though the MIP repertoire of DCs and thymocytes partially overlap, no less than 40% of their MIPs were cell type-specific. The large proportion of cell type-specific MIPs observed herein argues against the notion that MIPs derive primarily from ubiquitously expressed proteins (50,51). MIPs unique to DCs or thymocytes reflected cell function and differentiation. The DCs that we studied were antigen-processing cells with a myeloid phenotype whose maturation was induced by LPS (a toll-like receptor-4 ligand). Quite remarkably, the MIP repertoire of those DCs was enriched in peptides encoded by genes regulating proteasome function, myeloid differentiation and tolllike receptor signaling. Thymocytes have a high proliferation index and undergo a myriad of sequential interactions with subpopulations of stromal cells during their three-week journey in the thymus. Their MIP source genes were biased to- ward cell cycle regulation, purine metabolism and tight junction formation. The notion that a substantial proportion of MIPs are cell type-specific leads us to infer that, at the organismal level, the composition of the MHC I-restricted immune self is highly complex. The cell types studied herein (DCs and thymocytes) are rather closely related in the cell lineage tree because they both derive from hematopoietic stem cells. In the future, it will be interesting to compare the MIP repertoire of DCs to that of non-hematopoietic cells.
Second, our proteomics analyses of MIPs from WT and dKO DCs show that IPs dramatically increase the abundance and diversity of MIPs. In agreement with that, WT DCs were immunogenic for dKO mice, but not vice versa. Our study being based on comparison of WT and dKO DCs, it was aimed specifically at discovering the non-redundant roles of IPs, and not the non-redundant roles of CPs. If anything, our results might slightly underestimate the impact of IPs on the MIP repertoire because, in addition to CPs, DCs from dKO mice may harbor a few mixed proteasomes containing LMP2 admixed with CP ␤2 and ␤5 catalytic subunits. It must be realized, however, that eliminating all vestiges of IP activity is not a trivial task. Since the Lmp2 and the Lmp7 genes are closely linked, it is practically impossible to generate triple KO mutants by breeding (LMP7 Ϫ/Ϫ Mecl1 Ϫ/Ϫ ) dKO mice with LMP2 Ϫ/Ϫ mice. A plausible alternative would be to treat dKO cells with a pharmacologic LMP2 inhibitor (52). However, this strategy would also be fraught with a caveat: pharmacological inhibitors block the proteolytic activity of IP subunits but not other conformational effects that individual IP subunits have on IP function. Indeed, evidence suggests that incorporation of immunosubunits results in structural changes of the whole 20S IP complexes and thereby influences their biologic properties. For example, a model HBV epitope is not generated in LMP7-deficient cells but can be generated in the presence of a catalytically inactive LMP7 subunit (in which Thr1 is mutated to Ala) (7,53). Thus, the generation of epitopes like that one would not be blocked by pharmacological inhibitors. Our observation that IPs increase MIP abundance fits well with in vitro proteasome digestion experiments suggesting IPs have greater efflux and cleavage rates than CPs (19,20) and with the decreased cell surface levels of MHC I molecules on Lmp7 Ϫ/Ϫ and Lmp7 Ϫ/Ϫ Mecl1 Ϫ/Ϫ splenocytes (12). For reasons presented in the Introduction, it was not possible to extrapolate from previous in vitro proteasome digestion experiments the overall impact of IPs on the diversity of the MIP repertoire generated in vivo. Our work now provides a direct and global evaluation of the impact of IPs on MIP diversity. Of 417 peptides eluted from DCs, 199 were overexpressed in WT relative to dKO DCs and 60 were detected exclusively in WT DCs. Thus, about 14% of MIPs (60 of 417) were totally IP-dependent.
Third, our results also suggest that IPs possess distinct cleavage properties that impinge on the MIP of primary DCs. The most salient difference between IP-dependent and -independent peptides was found at position Cϩ2. For IP-dependent peptides, deviation at Cϩ2 was characterized by an increased usage of proline and polar residues with decreased frequencies of asparagine and hydrophobic residues. The most significant difference was the decreased frequency of leucine residues at Cϩ2 in IP-dependent relative to IP-independent peptides. That observation is remarkably coherent with the seminal work conducted by Toes et al. who digested yeast enolase-1 in vitro with CPs or IPs, analyzed fractionated peptide fragments by MS, and found that leucine at Cϩ2 was disfavored by IPs (21). Furthermore, our bioinformatic analyses predict that IPs have a bias toward unstructured protein regions and lead to enhanced MHC I presentation of MIPs adjoining the unstructured proteome. The lack of secondary and tertiary structure confers several properties such as increased interaction surface area, conformational flexibility and accessible posttranslational modification sites (48). Consequently, largely unstructured proteins are especially prone to make promiscuous molecular interactions and their overexpression is particularly dangerous for a cell as it frequently leads to cell death or neoplastic transformation (48,54). The MIP repertoire allows presentation of only a tiny fraction of the proteome to CD8 T cells (22,55). Therefore, the bias of IPs toward unstructured protein regions could be of considerable relevance, for example in cancer immunosurveillance.
Integration of our peptidomic data with global profiling of the DC transcriptome revealed that differential expression of MIPs in WT versus dKO DCs cannot be ascribed to differential transcription of peptide source genes. However, we found that IPs have a non-redundant impact on expression of a selected set of transcripts. Of note, IPs may affect expression of numerous other genes redundantly with CPs, but our study design was poised to selectively identify genes differentially regulated by IPs and CPs. Recent evidence suggests that the role of IPs is not limited to processing peptides for MHC presentation (56). For instance, MECL1 is a T-cell-intrinsic factor regulating homeostatic expansion, T cells from dKO mice hyperproliferate in response to polyclonal mitogens, and selective inhibition of LMP7 blocks cytokine production by activated monocytes and T cells (12,57,58). The present work suggests that these pleiotropic effects of IPs may be mediated by a non-redundant effect of IPs on gene expression. Differential expression of immune genes could explain why dKO DCs pulsed with optimal levels of exogenous SIINFEKL peptide are less immunogenic than WT DCs (supplemental Fig. S1). Further work is needed to discover how IPs may regulate gene expression. Nevertheless, it is interesting to note that genes on which IPs had a non-redundant effect were clustered in the genome (Fig. 5C). Gene order in eukaryotes is not random. In all well-studied genomes, genes of similar and/or coordinated expression tend to be linked in clusters that can extend up to many megabases (59). Gene clustering often results from the sharing of regulatory elements (60). In line with this, proteasome 20S core particles regulate transcriptional activation by controlling the localization, abundances and activity of transcriptional activators and repressors through proteolytic degradation (27)(28)(29)(30). We therefore propose that the non-redundant effect of IPs on gene expression may result from proteolysis of transcriptional modulators or their regulators. The overarching conclusion of our work is that IP subunits MECL1 and LMP7 have more than one non-redundant role. They have a dramatic impact on the MIP repertoire and a heretofore unrecognized impact on expression of immune-related genes. Both of these effects are probably of great importance in adaptive immune responses and may be instrumental in the remarkable conservation of IPs in gnathostomes.
Acknowledgments-We thank Dr T. A. Griffin for providing the LMP7 Ϫ/Ϫ Mecl1 Ϫ/Ϫ mice, Jean-Philippe Laverdure for help with bioinformatic analyses and Martin Giroux for thoughtful suggestions. We are grateful to the staff of the following core facilities at the Institute for Research in Immunology and Cancer (IRIC) for their outstanding support: Animal facility, Bioinformatics, Flow cytometry, Genomics, and Proteomics.