M6A Methylation Modification–Mediated Mucosal Immune Microenvironment in Crohn's Disease

doi:10.21203/rs.3.rs-2565800/v1

Download PDF

Research Article

M6A Methylation Modification–Mediated Mucosal Immune Microenvironment in Crohn's Disease

https://doi.org/10.21203/rs.3.rs-2565800/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Objective

To explore the pathogenesis of Crohn's disease by revealing the relationship between m6A methylation and Crohn's disease

Methods

The GEO (GENE EXPRESSION OMNIBUS) database was used to download the dataset GSE186582 on Crohn's disease, including standard tissue samples and Crohn's disease tissue samples, and the Expression of M6A-related genes in the calibrated dataset was obtained. Through the observation and comparison of the random forest tree method and machine learning method, it was determined that the random forest tree model could be used to screen the characteristic genes of diseases. Samples were divided into subtypes by the expression of m6A-related genes, and the relationship between different types and immune cells was analyzed and verified by principal component analysis. The expression of M6A-related genes and the relationship between the genotyped samples and immune cells were analyzed. We classified Crohn's disease according to the expression of differential genes, finally established the corresponding relationship between different types by Sankey diagram and analyzed the expression of Crohn's disease-related disease genes in two different types.

Results

By comparing the model construction methods, we found that the residual value of the random forest tree model method was low, and the area under the ROC curve was 1. Therefore, we chose the random forest tree method to construct the model and screen characteristic genes and found 11 methylation-related genes related to m6A in Crohn's disease, such as RBM15, YTHDF3 and RBM15B. According to the expression of 11 M6A-related genes, the samples were divided into two subtypes: activated B cells, immune B cells and MDSC (myeloid-derived inhibitory cells) expressed more than the B subtype (P value is less than 0). There was a significant positive correlation between the METTL3 gene, M6A recognition enzyme HNRNPA2B and activated CD4 + T cells. The expressions of activated B cells, MDSC and immune B cells in genotype B were significantly higher than those in genotype A (P < 0.05).

Conclusion

m6A modulators play an essential role in Crohn's disease, and the study of their patterns can guide future immunotherapy strategies for Crohn's disease

Crohn's Disease

M6A

GEO

characteristic gene

Crohn's disease (CD) is an inflammatory bowel disease of unknown etiology that occurs mainly in the terminal ileum and right colon[1]. This disease and chronic nonspecific ulcerative colitis are collectively known as inflammatory bowel disease (IBD)[2]. The clinical manifestations of Crohn's disease are abdominal pain, diarrhea, intestinal obstruction, accompanied by fever, nutritional disorders and other parenteral manifestations[3]. The course of the disease is more recurrent and not easy to heal[4]. Crohn's disease complications can cause inflammation when it appears, even fibrosis, which will cause the ileum intestinal lumen stenosis, then the need for surgical intervention[1]. There is no gold standard for diagnosing Crohn's disease, so early identification of those at high risk of developing Crohn's disease is critical. Given the extensive development of Crohn's disease research, Crohn's disease is now considered to be a disease that interacts with genetic changes and the environment[5].

N6-methyladenosine, also called m6A, is a widespread base modification behavior on mRNA[6]. The most common internal modifications of mRNA include N6-adenylate methylation (m6A), N1-adenylate methylation (m1A), and cytosine hydroxylation (m5C). N6-methyladenine (m6A) accounts for the most significant proportion of internally modified bases in mRNA, mainly distributed in G (m6A) C (70%) or A (m6A) C (30%) conserved sequences[7, 8]. The function of mRNA internal modification is mainly used to maintain the stability of mRNA.

M6A is an essential epigenetic modification regulated by methyltransferase, demethylase and methylated reading protein. Methylated transferases, including METTL3/14, WTAP and KIAA1429, mainly catalyze the m6A modification of adenylate on mRNA. However, demethylases, including FTO and ALKHB5, act to demethylate the bases modified by m6A[9]. The primary function of reading proteins is to recognize the bases modified by m6A to activate downstream regulatory pathways such as RNA degradation and miRNA processing[10]. Methyltransferase, also known as Writers, modifies the bases on mRNA with m6A methylation. METTL3, METTL14, WTAP and KIAA1492 are all core proteins of m6A methylated transferase. These proteins form complexes that act together as catalysts[11].

Recently, many research results have shown that the m6A modified through the influence of the expression of tumor-associated genes plays an essential role in tumor development. However, the role of m6A modulators in Crohn's disease remains unknown.

In this study, the role of m6A methylation-related genes in the diagnosis and subtype classification of Crohn's disease was comprehensively evaluated based on the GSE186582 dataset from the Integrated Gene Expression Database (GEO) database. A gene model based on five candidate m6A regulators (RBM15, YTHDTC2, YTHDF3, LRPPRC, WTAP) was developed to predict susceptibility to Crohn's disease. In addition, two distinct m6A patterns were revealed, which are highly consistent with T helper cell type 1 (Th1) dominant and Th2 dominant immunity, suggesting that the m6A pattern can be used to distinguish Crohn's disease from non-Crohn’s disease and guide subsequent treatment.

1. DA (Data Acquisition)

The GSE186582 dataset (including 343 Crohn's disease patients with inflammatory ileum and 25 healthy ileum controls) was downloaded from the GEO database and analyzed by LIMMA, DESeq2, edgeR R package, and M6A-related genes were extracted from the results.

2. Selection of models

We constructed a random forest tree model through the random forest R package and the SVM model through the kernlab R package as a training model to predict the occurrence of Crohn's disease. The model was also drawn out to evaluate the model using the reverse cumulative distribution of residual residues, the residual boxplot, and subject operating characteristics (ROC) curves.

3. The nomogram models

We selected candidate m6A regulators using the training model and constructed a nomogram model on them. We used the rms R package to predict the prevalence of Crohn's disease, and we used the calibration curve to assess whether the predicted values are consistent with reality. We also performed decision curve analysis (DCA) and plotted clinical impact curves to assess whether model-based decision-making benefited patients.

4. Construction of the two molecular subtypes of CD

Consensus Cluster Plus R package was used to identify different m6A patterns based on essential m6A methylation-related genes. The samples were classified according to the expression of M6A-related genes and the expression of differential genes.

5. Identification and functional analysis of differentially expressed genes among different m6A molecular subtypes

The limma R package was used to screen for differentially expressed genes (DEG) between different m6A patterns. Post-corrected p-value < 0.05 was selected as the screening criteria. A GO functional enrichment analysis was performed using the cluster profile R package to understand the possible mechanisms of DEGs involved in Crohn's disease and to visualize the results.

6. Scores of m6A

The principal component analysis (PCA) algorithm was used to calculate the m6A score of each sample. First, PCA was performed to distinguish the m6A pattern. The m6A score was then calculated using the following formula: m6A score = PC1i, where PC1 represents principal component 1, and i represents the expression of differential genes. The m6A scores of different types of samples were analyzed.

7. Immune infiltration analysis

Single sample Gene set enrichment analysis (ssGSEA) was performed using GSEA Base, GSVA, and LIMMA R packages to assess the abundance of immune cells in Crohn's disease samples. First, the expression levels of the genes in the samples were sequenced using ssGSEA to obtain their rank scores. These genes were then searched in the input data set, and their expression levels were summed to obtain the abundance of immune cells in each sample.

Expression of the 23 m6A-related genes in the differential genes was extracted.

We used the limma R package to select a total of 23 m6A-related genes in the differentially expressed gene dataset(Figure.1a), and we screened out 11 m6A-related genes with expression differences(Figure.1b), including R BM15B, WTAP, YTHDF2, METTL14, RBM15, YTHDF3, IGF2BP1, YTHDC2, LRPPRC, METTL3 and HNRNPA2B1. We found that WTAP, M ETTL14, R BM15, R BM15B, YTHDF2, and YTHDF3 were highly expressed in the experimental group (CD), and the rest were highly expressed in the healthy ileal control group.

2. Construction Of The Rf Model And The Svm Model

RF and SVM models were established to select candidate M6A-related genes from 23 M6A-related genes to predict the occurrence of Crohn's disease. We found the random forest tree models with smaller residuals by the "reverse cumulative distribution of residuals" and the "residue boxplot"(Figure.1c-d). Therefore, we considered the RF model the best model to predict the occurrence of Crohn's disease. After ranking these genes according to importance, we visualized the 11 m6A-related genes.

As shown in Figure.1f, the abscissa for the number of trees, the ordinate is the cross-validation error. Further, the red line represents the error of the experimental group (Crohn's disease group), the green line represents the error of the control group, and the black line represents the error of all samples(Figure.1h). We found a slight error of 106 for the optimal number of trees in the curve and then reconstructed the forest tree model. Genes are of more importance and significance than two were selected as signature genes(Figure.1g). We found that the number of m6A regulators in the top five positions (FMR1, KIAA1429, WTAP, YTHDC2, and ZC3H13) were selected as candidate genes. We found that the number of m6A regulators in the top five (FMR1, KIAA1429, WTAP, YTHDC2 and ZC3H13) were selected as candidate genes. Finally, the ROC curve was drawn to evaluate the model(Figure.1e), and its AUC value also shows that the RF model has higher accuracy than the SVM model.

3. Construction Of Nomogram Model

The "RMS" R package used constructed nomogram models based on five candidate M6A-related genes to predict the prevalence of Crohn's disease(Figure.1k). The calibration curve shows that the nomogram model is accurate in its prediction(Figure.1i). In the DCA curve(Figure.1j), we found that the red lines mainly remained above the gray and black lines on a scale from 0 to 1, indicating that decision-making based on nomogram models may benefit patients with Crohn's disease. On the clinical impact curve, we found that the nomogram model's predictive power was very significant.

4. Differences In Connections Between Different Molecular Subtypes

We divided the samples into cluster A and B subtypes based on the 11 most critical m6A-related genes (Figure.2a). Among them, subtype A included 129 cases, and subtype B included 214 cases. We subsequently showed the difference in the expression situation of the essential m6A-related genes between the two isoforms by heatmaps and histograms, and we found that the expression was higher in isoform A than in isoform B. Conversely, RBM15 and YTDHF3 were not significantly different in isoforms A and B. We found in PCA that nine essential m6A-related genes could completely distinguish between the two m6A patterns(Figure.2b). A total of 315 m6A were selected between the two m6A patterns, with associated differentially expressed genes. We applied to GO functional enrichment analysis to obtain possible mechanisms for these differential genes in Crohn's disease and to visualize the results with histograms(Figure.2f-i). We found that GO: 0015711, GO: 0015849, and GO: 0046942 correlated with ion transport.

When ssGSEA calculated immune cell abundance in Crohn's disease samples and evaluated the correlation between 11 important m6A modulators and immune cells(Figure.2c-d), we found that HNRNPA2B1 was positively associated with many immune cells. We explored the differential immune cell infiltration between patients with high and low HNRNPA2B1 expression(Figure.2e). The results showed that patients with Crohn's disease with high HNRNPA2B1 expression were immune immunity.

Finally, we analyzed the differential immune cell infiltration between the two m6A patterns. We found that cluster A was associated with MDSC immunity, while cluster B was associated with monocyte immunity.

5.identification Of Two Different M6a Gene Patterns And The Generation Of M6a Signature Genes

Based on 315 M6A-related differentially expressed genes(Figure.3a), Crohn's disease patients were divided into different genomic subtypes using consensus clustering. Consistent with grouping M6A-related gene subtypes, we found two distinct genotypes of m6A (gene cluster A and gene cluster B). Figure.3b shows the expression levels of 11 differentially expressed genes related to m6A in gene clusters A and B.As shown in Figure.3c, the differential expression levels of 11 crucial m6A regulators and immune cell infiltration between gene cluster A and gene cluster B are also contrary to the results of M6A-related gene grouping subtypes, which again validates the accuracy of our grouping by consensus clustering method. To quantify the m6A pattern, we used the PCA algorithm to calculate the m6A score for each sample. Then the m6A scores of m6A pattern typing and M6A-related differential gene pattern typing were compared. It was shown that a higher m6A score was found in subtype B than in subtype A in m6A pattern typing, while in M6A-related genotyping, a higher m6A score was found in subtype A than in subtype B. The relationship between m6A methylation-related gene pattern typing, m6A gene pattern and m6A score was visualized in the Sankey plot. It was found that there was a negative regulatory relationship between different genotypes(Figure.3d-e).

6. The Role Of The M6a Molecular Subtype In Differentiating Crohn'S Disease

To further reveal the relationship between the m6A grouping subtypes and Crohn, we investigated the correlation between the m6A patterns and gene cluster(Figure.4a). We found that PHOX2B and ATG16L were higher than genotyping B, while NCF4, I L-33, and NOD2 were higher than genotyping B than genotyping A. In m6A typing A, NCF4, I L-33, and NOD2 were higher than B, while PHOX2B and ATG16L were higher than genotyping A. PHOX2B, NCF4, and NOD-2 have been closely linked to the development of Crohn's disease(Figure.4b-c).

Crohn's disease (CD) is a chronic inflammatory bowel disease of unknown etiology that tends to occur in young and middle-aged people[12]. At present, m6A is involved in regulating microglial inflammatory effects[13]; however, the role of m6A in Crohn's disease remains unknown. Our study aimed to explore the role of M6A-related regulators in Crohn's disease.

Eleven essential m6A methylation-related genes were identified among 23 m6A regulators by differential expression analysis between non Crohn's and Crohn's disease patients. To establish the RF model, five candidate m6A methylation-related genes (FMR1, KIAA1429, WTAP, YTHDC2 and ZC3H13) were selected from 21 m6A methylation-related genes to predict the occurrence of Crohn's disease. However, we could not validate our model in independent datasets due to the lack of the m6A dataset for Crohn's disease patients in public databases. Nomogram models based on five candidate m6A regulators were constructed, and DCA curves indicated that nomogram model-based decision-making might benefit patients with Crohn's disease.

LRPPRC has been shown to have a specific relationship with pancreatic cancer prognosis. Its expression is significantly higher than that in adjacent tissues, and LRPPRC negatively correlates with overall survival[14]. Knockdown of LRPPRC inhibited the malignant biological behavior of PANC-1 cells, including proliferation and migration[15]. We also show that WTAP plays an essential role in many physiological processes in the cell, including binding to the 3 ′ untranslated region of the mRNA to improve mRNA stability[16]. WTAP can act as an essential component of MTC and promote the formation of m6A[17]. The oncogene yes-related protein (YAP) targets YTHDC2 in GC cells[18]. It has been confirmed that knocking down YTHDC2 significantly reduces the size of gastric cancer tumors in vivo, and high YTHDC2 is strongly and positively correlated with high YAP in clinical GC tissues[19].

While there is evidence that YTHDF3 positively regulates cell migration, invasion, and EMT in breast cancer cells, ZEB1 was identified as a critical downstream target of YTHDF3, which can enhance ZEB1 mRNA stability in an m6 A-dependent manner[20, 21]. Inhibition of YTHDF3 reduced tumor cell migration, invasion, and invasion, all of this biological behavior was reversed after ZEB1 overexpression[21].

Currently, most researchers believe that the occurrence of Crohn's disease is often associated with the immune microenvironment. Loss of NOD function in Crohn's disease with NOD2 may cause an imbalance of macrophage-fibroblast homeostasis. CD14 + PBMCS in NOD risk gene carriers showed high expression of collagen fibers. Intestinal stenosis was also observed in the NOD2 knockout zebrafish model of intestinal inflammation[22–24]. This study mainly revealed a clear association between the NOD2 gene and Crohn's disease fibrosis.

At the same time, in comparing immune T cells in inflammatory and non-inflammatory tissues and control tissues of patients with Crohn's disease. It was found that the activated T cell subsets of intestinal intraepithelial lymphocytes in the terminal ileum of inflammatory tissues increased Th17 cells but decreased CD8 + T, γδT, Tfh and Treg cells[25, 26]. However, in the lamina propria of inflammatory tissue, CD8 + T cells were increased, while CD4 + T cells were decreased. This indicates that the occurrence and progression of Crohn's disease are closely related to the immune cell microenvironment.

In conclusion, five candidate m6A modulators were selected, and a nomogram model was developed that accurately predicted Crohn's disease in this study. Based on 11 essential m6A methylation-related genes, we further identified two m6A subtypes, of which typing A may be associated with Crohn's disease.

Consent for publication: Not applicable

Ethical Approval and Consent to participate：

This study did not use human, human body tissues or animals (such as mice) as the source of research materials.

Human Ethics：

This study does not involve relevant human ethics.

Availability of data and materials: All the data we use comes from the GEO database（http://www.ncbi.nlm.nih.gov/geo）, which is open to use and has been described in the article. We used the following GEO data to download the data set GSE186582 about Crohn's disease.

Competing interests: We declare that we have no conflict of interest.

Funding：

No Funding.

Authors' contributions：

Lan Shui-Qing is responsible for the writing of articles and the construction of article ideas.

Huang Gui-Liu is responsible for the construction and layout of pictures.

Huang ZanSong is responsible for writing the overall idea of the article and formulating the method of the article.

Acknowledgements:We thank all the authors who participated in the design and writing of the article.

Roda G, Chien Ng S, Kotze PG, Argollo M, Panaccione R, Spinelli A, Kaser A, Peyrin-Biroulet L, Danese S: Crohn's disease. Nat Rev Dis Primers 2020, 6(1):22.
Pizarro TT, Stappenbeck TS, Rieder F, Rosen MJ, Colombel JF, Donowitz M, Towne J, Mazmanian SK, Faith JJ, Hodin RA et al: Challenges in IBD Research: Preclinical Human IBD Mechanisms. Inflamm Bowel Dis 2019, 25(Suppl 2):S5-S12.
Hagen JW, Swoger JM, Grandinetti LM: Cutaneous Manifestations of Crohn Disease. Dermatol Clin 2015, 33(3):417-431.
Santacroce G, Lenti MV, Di Sabatino A: Therapeutic Targeting of Intestinal Fibrosis in Crohn’s Disease. Cells 2022, 11(3).
Rao N, Kumar S, Taylor S, Plumb A: Diagnostic pathways in Crohn's disease. Clin Radiol 2019, 74(8):578-591.
Chen J, Wei X, Yi X, Jiang DS: RNA Modification by m(6)A Methylation in Cardiovascular Disease. Oxid Med Cell Longev 2021, 2021:8813909.
Li D, Zhu X, Li Y, Zeng X: Novel insights into the roles of RNA N(6)-methyladenosine modification in regulating gene expression during environmental exposures. Chemosphere 2020, 261:127757.
Shi H, Wei J, He C: Where, When, and How: Context-Dependent Functions of RNA Methylation Writers, Readers, and Erasers. Mol Cell 2019, 74(4):640-650.
Jang KH, Heras CR, Lee G: m(6)A in the Signal Transduction Network. Mol Cells 2022, 45(7):435-443.
Li X, Ma S, Deng Y, Yi P, Yu J: Targeting the RNA m(6)A modification for cancer immunotherapy. Mol Cancer 2022, 21(1):76.
Rana AK, Ankri S: Reviving the RNA World: An Insight into the Appearance of RNA Methyltransferases. Front Genet 2016, 7:99.
Rajbhandari R, Blakemore S, Gupta N, Mannan S, Nikolli K, Yih A, Drown L, Bukhman G: Crohn's Disease Among the Poorest Billion: Burden of Crohn's Disease in Low- and Lower-Middle-Income Countries. Dig Dis Sci 2022.
Zhou Y, Yang J, Tian Z, Zeng J, Shen W: Research progress concerning m(6)A methylation and cancer. Oncol Lett 2021, 22(5):775.
Cui J, Wang L, Ren X, Zhang Y, Zhang H: LRPPRC: A Multifunctional Protein Involved in Energy Metabolism and Human Disease. Front Physiol 2019, 10:595.
Wang L, Luo J, Li Y, Lu Y, Zhang Y, Tian B, Zhao Z, Hu QY: Mitochondrial-Associated Protein LRPPRC is Related With Poor Prognosis Potentially and Exerts as an Oncogene Via Maintaining Mitochondrial Function in Pancreatic Cancer. Front Genet 2021, 12:817672.
Fan Y, Li X, Sun H, Gao Z, Zhu Z, Yuan K: Role of WTAP in Cancer: From Mechanisms to the Therapeutic Potential. Biomolecules 2022, 12(9).
Garcias Morales D, Reyes JL: A birds'-eye view of the activity and specificity of the mRNA m(6) A methyltransferase complex. Wiley Interdiscip Rev RNA 2021, 12(1):e1618.
Yuan W, Chen S, Li B, Han X, Meng B, Zou Y, Chang S: The N6-methyladenosine reader protein YTHDC2 promotes gastric cancer progression via enhancing YAP mRNA translation. Transl Oncol 2022, 16:101308.
Liu T, Tang W, Chen Y, Liu Y, Xu D, Jiang Y, Zhou S, Qin X, Ren L, Chang W et al: The m6A RNA Modification Quantity and the Prognostic Effect of Reader YTHDC2 in Colorectal Cancer. Clin Med Insights Oncol 2022, 16:11795549221104441.
Chang G, Shi L, Ye Y, Shi H, Zeng L, Tiwary S, Huse JT, Huo L, Ma L, Ma Y et al: YTHDF3 Induces the Translation of m(6)A-Enriched Gene Transcripts to Promote Breast Cancer Brain Metastasis. Cancer Cell 2020, 38(6):857-871 e857.
Lin Y, Jin X, Nie Q, Chen M, Guo W, Chen L, Li Y, Chen X, Zhang W, Chen H et al: YTHDF3 facilitates triple-negative breast cancer progression and metastasis by stabilizing ZEB1 mRNA in an m(6)A-dependent manner. Ann Transl Med 2022, 10(2):83.
Frade-Proud'Hon-Clerc S, Smol T, Frenois F, Sand O, Vaillant E, Dhennin V, Bonnefond A, Froguel P, Fumery M, Guillon-Dellac N et al: A Novel Rare Missense Variation of the NOD2 Gene: Evidencesof Implication in Crohn's Disease. Int J Mol Sci 2019, 20(4).
Nelson A, Stewart CJ, Kennedy NA, Lodge JK, Tremelling M, Consortium UIG, Probert CS, Parkes M, Mansfield JC, Smith DL et al: The Impact of NOD2 Genetic Variants on the Gut Mycobiota in Crohn's Disease Patients in Remission and in Individuals Without Gastrointestinal Inflammation. J Crohns Colitis 2021, 15(5):800-812.
Sidiq T, Yoshihama S, Downs I, Kobayashi KS: Nod2: A Critical Regulator of Ileal Microbiota and Crohn's Disease. Front Immunol 2016, 7:367.
Clough JN, Omer OS, Tasker S, Lord GM, Irving PM: Regulatory T-cell therapy in Crohn's disease: challenges and advances. Gut 2020, 69(5):942-952.
Rosati E, Rios Martini G, Pogorelyy MV, Minervina AA, Degenhardt F, Wendorff M, Sari S, Mayr G, Fazio A, Dowds CM et al: A novel unconventional T cell population enriched in Crohn's disease. Gut 2022.

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

M6A Methylation Modification–Mediated Mucosal Immune Microenvironment in Crohn's Disease

Status:

Version 1

Abstract

Objective

Methods

Results

Conclusion

Figures

Introduction

Method

1. DA (Data Acquisition)

2. Selection of models

3. The nomogram models

4. Construction of the two molecular subtypes of CD

5. Identification and functional analysis of differentially expressed genes among different m6A molecular subtypes

6. Scores of m6A

7. Immune infiltration analysis

Results

2. Construction Of The Rf Model And The Svm Model

3. Construction Of Nomogram Model

4. Differences In Connections Between Different Molecular Subtypes

5.identification Of Two Different M6a Gene Patterns And The Generation Of M6a Signature Genes

6. The Role Of The M6a Molecular Subtype In Differentiating Crohn'S Disease

Discussion

Declarations

References

Additional Declarations

Status:

Version 1