Network analysis reveals a stress-affected common gene module among seven stress-related diseases/systems which provides potential targets for mechanism research

Guo, Liyuan; Du, Yang; Wang, Jing

doi:10.1038/srep12939

Download PDF

Article
Open access
Published: 06 August 2015

Network analysis reveals a stress-affected common gene module among seven stress-related diseases/systems which provides potential targets for mechanism research

Liyuan Guo¹,
Yang Du^1,2 &
Jing Wang¹

Scientific Reports volume 5, Article number: 12939 (2015) Cite this article

1934 Accesses
11 Citations
Metrics details

Subjects

Abstract

Chronic stress (CS) was reported to associate with many complex diseases and stress-related diseases show strong comorbidity; however, molecular analyses have not been performed to date to evaluate common stress-induced biological processes across these diseases. We utilized networks constructed by genes from seven genetic databases of stress-related diseases or systems to explore the common mechanisms. Genes were connected based on the interaction information of proteins they encode. A common sub-network constructed by 561 overlapping genes and 8863 overlapping edges among seven networks was identified and it provides a common gene module among seven stress-related diseases/systems. This module is significantly overlapped with network that constructed by genes from the CS gene database. 36 genes with high connectivity (hub genes) were identified from seven networks as potential key genes in those diseases/systems, 33 of hub genes were included in the common module. Genes in the common module were enriched in 190 interactive gene ontology (GO) functional clusters which provide potential disease mechanism. In conclusion, by analyzing gene networks we revealed a stress-affected common gene module among seven stress-related diseases/systems which provides insight into the process of stress induction of disease and suggests potential gene and pathway candidates for further research.

Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Article Open access 09 April 2024

Genome-wide association studies

Article 26 August 2021

The JAK/STAT signaling pathway: from bench to clinic

Article Open access 26 November 2021

Introduction

Chronic stress (CS) influences multiple systems and affects the generation and development of numerous complex disorders^1,2, such as infectious and autoimmune disorders^3,4,5, cardiovascular events^6,7, cancers^8,9, mental disorders^10,11,12 and obesity¹³. Results from epidemiological literature show strongly that there is comorbidity among stress-related diseases^14,15 and studies of molecular mechanisms also imply a tight relevance across these diseases¹⁶. Additionally, recent clinical tests suggest that psychological interventions can affect patients with other stress-related diseases^17,18. Although increasing evidence has hinted at a strong association among different stress-related diseases, it remains unclear whether there is a common stress-induced biological process across these diseases.

In recent years, genetic and expressional studies have identified a significant number of disease-related genes and the information has been organized into specific data resources^19,20,21. Moreover, many gene-based bioinformatic approaches, such as gene network analysis that based on the interaction of proteins encoded by genes, so called protein-protein interaction network analysis²² and gene co-expression module analysis²³, have been employed to explore the biological processes that underlie stress-related diseases. Several key genes, biological pathways and functional modules have been identified in bioinformatics studies.

In this study, we used genes from seven stress-related disease/system databases and one CS database to construct networks based on the interaction information of proteins they encode. These networks were analyzed as follows: 1) identify nodes with high connectivity (hub genes) of these diseases/systems to obtain key genes in the interactive system; 2) reveal a common gene module among different stress-related diseases/systems to provide molecules potentially related to disease comorbidity; and 3) determine the relationship between CS and disease/system common module. Based on the results of the network analysis, a pathway enrichment analysis was performed to determine potential biological mechanisms through which CS induces disease.

Materials and Methods

Gene sets of stress-related disease/system

Genes from genetic and expressional databases of neurodegeneration disease, mental disorders and other stress related diseases or systems were obtained and utilized for network analyses. Genes from Alzgene²⁴, BDgene²¹, MK4MDD¹⁹, CADgene²⁰ and NCG4.0²⁵ were selected to form gene sets of five stress-related diseases: Alzheimer’s disease(Alz), bipolar disorder(BD), major depressive disorder(MDD), coronary artery disease(CAD)and cancer. The BDgene and MK4MDD databases include genetic factors that are linked to BD and MDD and have positive and negative results; only genes with at least one positive result were selected for the corresponding gene sets. Genes from the Obesity Gene Atlas²⁶ and Immunome²⁷ were also selected for inclusion in gene sets for fatty metabolism and immune responses. Human CS genes in the database CS-DEGs²⁸ were selected to form a gene set that is affected by CS environments. Overlaps were compared among the stress-related diseases and system gene sets.

Protein-protein interaction networks

The database STRING 9.1²⁹ provides a comprehensive protein interactome that includes known and predicted protein-protein interactions scored according to their confidence. Information in this database was utilized to construct the disease/system and CS gene networks. Genes in the stress-related disease/system or CS datasets were considered seed nodes and used to obtain protein-protein interactions with the highest confidence (score >0.9). As shown in Fig. 1a, an extended network that included seed nodes, first neighbor nodes and the highest confident interactions between these nodes was constructed for each gene set. All of the networks were visualized and analyzed with the visualization software Cytoscape 3.0.2³⁰. The node properties, such as the betweenness centrality (BC) and degree, were calculated using the plug-in “Network Analyzer”³¹ in the Cytoscape software. Hub genes were identified according to the following thresholds: BC > 0.05 and degree >50²². The statistical significant difference between properties of nodes in disease/system networks and the entire interactome was examined by T-test.

Common gene module of stress-related diseases/systems

To examine whether a common gene module exists among different stress-related diseases/systems, nodes and edges were compared among the seven stress-related disease and system networks and a common sub-network was constructed using the overlapping nodes and edges. These interactive nodes in the sub-network constructed the common gene module. Three properties of the module were analyzed: 1) network topological parameters; 2) overlapping genes shared between the module and hub genes; and 3) overlapping genes shared between the module and the CS gene set and network. The statistical significance of the overlap between common module and the CS gene set and network was determined by the Fisher’s exact test.

Gene ontology (GO) pathway cluster enrichment analysis

To identify common biological processes underlying stress-related diseases, a GO pathway cluster enrichment analysis was performed on nodes in the disease/system common module using the online analysis tool DAVID³². As recommended in DAVID, the cutoff for pathway cluster enrichment was set at a score >1.3. The representative biological terms associated with significant clusters were manually selected. Because these clusters reflect interactive functional systems, the GO term network of genes in common module was also deciphered using the Cytoscape plug-in ClueGO³³ to provide a system-wide view.

Results

Summary of genes and networks

The seven stress-related disease/system gene sets included 4637 genes (summarized in Supplementary Table S1). The seven gene sets overlapped; however, there were no genes that occurred in all seven sets. The genes that occupied by more than four gene sets are shown in Supplementary Table S2. The CS gene set included 2606 genes (see in Supplementary Table S1). A total of 3941 disease/system or CS genes were found in STRING v9.1 and 8429 nodes were included in the disease/system or CS networks (see in Supplementary Table S1).

Hub genes

Node properties in each network were analyzed. As shown in Supplementary Table S3, the average degrees of the disease/system nodes were all significantly higher than the average degree of the entire STRING network. With a threshold degree >50 and BC > 0.05, as shown in Table 1, 36 genes were identified as hub genes for seven diseases/systems and the genes ESR1, TP53, FOS, AKT1 and FRN were hub genes for more than one disease/system.

Table 1 Hub genes in seven stress-related disease/system networks.

Full size table

Common gene module among stress-related diseases/systems

To explore the common biological modules underlying stress-related diseases/systems, the nodes and edges of the seven disease/system networks were compared. A common sub-network including 561 genes and 8863 edges was observed in the network of all seven diseases/systems (Fig. 1b). The 561 interactive common genes (as shown in Table 2) constructed the common gene module among stress-related diseases/systems, they include 180 members of the CS gene set and all genes in this module can be found in the CS network. Nodes of the CS network significantly overlapped with the common gene module (Fisher’s Exact Test, p < 2.2E-16). The average degrees of genes in common module were significantly higher than other genes in disease/system networks (as shown in Supplementary Table S4). 33 hub genes were included in the common module; hub genes in Table 1 that were not included in the common module were ACTN2, CDC42 and OR6A2.

Table 2 Genes in diseases/systems common module.

Full size table

Functional pathways enriched by genes in common module

Using the recommended threshold enrichment score (>1.3), 190 GO functional pathway clusters were enriched and categorized into three types: Cellular Component (see in Supplementary Table S5), Biological Process (see in Supplementary Table S6) and Molecular Function (see in Supplementary Table S7). Table 3 shows the top 10 enriched clusters and the detailed information of all enriched clusters was shown in Supplementary Table S8. Fifty-four interactive pathway groups were identified with network connectivity (Kappa score) 0.5. The largest group is shown in Fig. 1c and all groups are shown in Supplementary Figure S1.

Table 3 Top 10 pathway clusters enriched by genes of disease/system common module.

Full size table

Discussion

In this study we constructed seven stress-related disease/system gene networks based on the interaction information of gene-encoded proteins. The average degrees of the disease/system genes are significantly higher than the average degrees of the entire human interactome, suggesting that disease/system genes and their first neighbors are more highly connected in the human interactome than random genes, so they may play roles in a tighter and more complex manner. The result also supports the hypothesis that disease genes tend to have higher degrees^34,35. A total of 36 disease/system genes were identified as hub genes that occupy central positions in disease/system networks and may possess important biological functions.

Although common genes were not identified among the stress-related diseases/systems compared in this study, a common sub-network was identified among the seven disease/systems. Genes in this sub-network were most enriched in GO pathways that related to chemical homeostasis (as shown in Table 3). This result may imply that there is a common interactive gene module that maintains homeostasis which is related to all stress-related diseases/systems, so this common module provides potential molecular fundaments of the comorbidity. Because most hub genes are included in the common module, the dysfunction of this module may play an important role in disease generation and development. The imbalance of homeostasis induced by aberrant expression of genes in the common module may trigger a pre-disease state³⁶ with the potential to develop into different pathological processes because of additional disease/system genes that were not found in the common module. In each disease/system network, the average degrees of nodes in common module were significantly higher than other nodes. Considering the reports that that disease genes tend to have higher degrees^34,35, genes in common module may be more strongly associated to pathological processes than other genes in disease/system networks.

The CS gene set includes human genes whose rodent homologs were differentially expressed in CS rodent models. The significant overlap between the common gene module of seven human diseases/systems and CS network suggests that the stress environment may induce disease by influencing a common homeostasis system. Consequently, a pre-disease state progresses to different disease states as genetic factors and/or other environmental factors are stimulated. This potential mechanism may explain the concomitant strong association between stress and disease and high heterogeneity of pathological processes associated with stress-related diseases. Genes in the common module (as shown in Table 2) could be useful candidates for subsequent experimental study.

The GO pathway clusters enriched by genes in the common module indicate the biological systems that are influenced by stress environments and abnormal in diseases, so they may imply the biological mechanisms by which stress environments induce disease. Beyond that, these pathway clusters also provide specific potential targets for relevant research. As shown in Supplementary Table S5, most of the common nodes are located in the extracellular space and plasma membrane-related cellular components, which suggests candidate targets for disease intervention. The biological processes associated with common module (see in Supplementary Table S6) provide a series of candidates for mechanism research, such as processes related to response, metabolism, cell differentiation and migration, transport and signaling transduction. The enriched pathway clusters of molecular function, such as peptide receptor activity and phospholipase activity, suggest potential drug targets (Supplementary Table S7). These functional pathways are interactive systems and could be enriched in several groups (as shown in Supplementary Figure S1). Figure 1c shows the largest enriched interactive group that constructed by pathways of response, regulation, cell migration, transport and signaling transduction. Besides of the system views, certain enriched biological process clusters provide detailed biological hypotheses for specific diseases. For example, the dysfunction of 37 genes in the function cluster “response to bacterium”(Supplementary Table S6 and Table S8) may directly mediate the process by which stress stimulates infectious disease. This function cluster also provides a potential explanation for the comorbidity among infectious diseases and other stress-related diseases.

In conclusion, we utilized stress-related disease/system genes to construct interactive networks. By analyzing these networks, we identified hub genes which may play roles in the pathological processes of stress-related diseases. We also identified a common sub-network among diseases/systems and the sub-network is significantly overlapped with the CS network. The common sub-network implies that different stress-related diseases/systems share a common gene module that may be influenced by stress environments. By analyzing this common gene module, the potential mechanism underlying the process by which stress induces diseases could be partially revealed.

In spite of above results, this study also has some limitations. First, we constructed network based on existing annotations database which are limited by our current knowledge of biology. Second, limited by the lack of data resource, only seven stress-related diseases or systems were selected to analyze. Third, the CS genes were obtained via homologous analysis on differentially expressed genes from CS rodent models, so result based on these genes need to be further validated in human study.

Additional Information

How to cite this article: Guo, L. et al. Network analysis reveals a stress-affected common gene module among seven stress-related diseases/systems which provides potential targets for mechanism research. Sci. Rep. 5, 12939; doi: 10.1038/srep12939 (2015).

References

McEwen, B. S. & Stellar, E. Stress and the individual. Mechanisms leading to disease. Arch Intern Med 153, 2093–2101 (1993).
Article CAS Google Scholar
Schmidt, M. V., Sterlemann, V. & Muller, M. B. Chronic stress and individual vulnerability. Ann NY Acad Sci 1148, 174–183 (2008).
Article ADS Google Scholar
Dhabhar, F. S. Effects of stress on immune function: the good, the bad and the beautiful. Immunol Res 58, 193–210 (2014).
Article CAS Google Scholar
Stojanovich, L. Stress and autoimmunity. Autoimmun Rev 9, A271–276 (2010).
Article CAS Google Scholar
Stojanovich, L. & Marisavljevich, D. Stress as a trigger of autoimmune disease. Autoimmun Rev 7, 209–213 (2008).
Article Google Scholar
Brotman, D. J., Golden, S. H. & Wittstein, I. S. The cardiovascular toll of stress. Lancet 370, 1089–1100 (2007).
Article Google Scholar
Hanna, R. N. & Hedrick, C. C. Stressing out stem cells: linking stress and hematopoiesis in cardiovascular disease. Nat Med 20, 707–708 (2014).
Article CAS Google Scholar
Thaker, P. H. et al. Chronic stress promotes tumor growth and angiogenesis in a mouse model of ovarian carcinoma. Nat Med 12, 939–944 (2006).
Article CAS Google Scholar
Todd, B. L., Moskowitz, M. C., Ottati, A. & Feuerstein, M. Stressors, stress response and cancer recurrence: a systematic review. Cancer Nurs 37, 114–125 (2014).
Article Google Scholar
Miklowitz, D. J. & Johnson, S. L. Social and Familial Factors in the Course of Bipolar Disorder: Basic Processes and Relevant Interventions. Clin Psychol-Sci Pr 16, 281–296 (2009).
Article Google Scholar
Wang, J. Work stress as a risk factor for major depressive episode(s). Psychol Med 35, 865–871 (2005).
Article Google Scholar
Wilson, R. S. et al. Proneness to psychological distress is associated with risk of Alzheimer’s disease. Neurology 61, 1479–1485 (2003).
Article CAS Google Scholar
Shively, C. A., Register, T. C. & Clarkson, T. B. Social stress, visceral obesity and coronary artery atherosclerosis in female primates. Obesity 17, 1513–1520 (2009).
Article Google Scholar
Jenny-Avital, E. R. Obesity and the risk of heart failure. New Engl J Med 347, 1887–1889 (2002).
Article Google Scholar
Pearce, B. D., Kruszon-Moran, D. & Jones, J. L. The relationship between Toxoplasma gondii infection and mood disorders in the third National Health and Nutrition Survey. Biol Psychiat 72, 290–295 (2012).
Article Google Scholar
Martin, C., Tansey, K. E., Schalkwyk, L. C. & Powell, T. R. The inflammatory cytokines: molecular biomarkers for major depressive disorder? Biomark Med 9, 169–180 (2014).
Article Google Scholar
Andersen, B. L. et al. Psychologic intervention improves survival for breast cancer patients: a randomized clinical trial. Cancer 113, 3450–3458 (2008).
Article Google Scholar
Tong, G. et al. Effects of psycho-behavioral interventions on immune functioning in cancer patients: a systematic review. J Cancer Res Clin 140, 15–33 (2014).
Article CAS Google Scholar
Guo, L. et al. MK4MDD: a multi-level knowledge base and analysis platform for major depressive disorder. PloS One 7, e46335 (2012).
Article ADS CAS Google Scholar
Liu, H. et al. CADgene: a comprehensive database for coronary artery disease genes. Nucleic Acids Res 39, D991–996 (2011).
Article CAS Google Scholar
Su-Hua Chang, L. G., Li, Z., Zhang, W.-N., Du, Y. & Wang, J. BDgene: a genetic database for bipolar disorder and its overlap with schizophrenia and major depressive disorder. Biol Psychiat 74, 727–733 (2013).
Article Google Scholar
Nair, J., Ghatge, M., Kakkar, V. V. & Shanker, J. Network analysis of inflammatory genes and their transcriptional regulators in coronary artery disease. PloS One 9, e94328 (2014).
Article ADS Google Scholar
Chen, C. et al. Two gene co-expression modules differentiate psychotics and controls. Mol psychiatr 18, 1308–1314 (2013).
Article CAS Google Scholar
Bertram, L., McQueen, M. B., Mullin, K., Blacker, D. & Tanzi, R. E. Systematic meta-analyses of Alzheimer disease genetic association studies: the AlzGene database. Nat Genet 39, 17–23 (2007).
Article CAS Google Scholar
An, O. et al. NCG 4.0: the network of cancer genes in the era of massive mutational screenings of cancer genomes. Database-Oxford 2014, bau015 (2014).
PubMed PubMed Central Google Scholar
Kunej T, J. S. D., Zorz, M., Ogrinc, A., Michal, J. J., Kovac, M. & Jiang, Z. Obesity Gene Atlas in Mammals. J Genomics 1, 11 (2012).
Google Scholar
Ortutay, C. & Vihinen, M. Immunome: a reference set of genes and proteins for systems biology of the human immune system. Cell Immunol 244, 87–89 (2006).
Article CAS Google Scholar
Guo, L., Du, Y., Chang, S., Zhang, W. & Wang, J. Applying differentially expressed genes from rodent models of chronic stress to research of stress-related disease: an online database. Psychosom Med 76, 644–649 (2014).
Article CAS Google Scholar
Franceschini, A. et al. STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res 41, D808–815 (2013).
Article CAS Google Scholar
Kohl, M., Wiese, S. & Warscheid, B. Cytoscape: software for visualization and analysis of biological networks. Methods Mol Biol 696, 291–303 (2011).
Article CAS Google Scholar
Assenov, Y., Ramirez, F., Schelhorn, S. E., Lengauer, T. & Albrecht, M. Computing topological parameters of biological networks. Bioinformatics 24, 282–284 (2008).
Article CAS Google Scholar
Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature protoc 4, 44–57 (2009).
Article Google Scholar
Bindea, G. et al. ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25, 1091–1093 (2009).
Article CAS Google Scholar
Jonsson, P. F. & Bates, P. A. Global topological features of cancer proteins in the human interactome. Bioinformatics 22, 2291–2297 (2006).
Article CAS Google Scholar
Sun, J. et al. Schizophrenia gene networks and pathways and their applications for novel candidate gene selection. PloS One 5, e11351 (2010).
Article ADS Google Scholar
Liu, R., Wang, X., Aihara, K. & Chen, L. Early diagnosis of complex diseases by molecular biomarkers, network biomarkers and dynamical network biomarkers. Med Res Rev 34, 455–478 (2014).
Article Google Scholar

Download references

Acknowledgements

This work was supported by: the Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences; the CAS/SAFEA International Partnership Program for Creative Research Teams (Y2CX131003); the Knowledge Innovation Program of the Chinese Academy of Sciences (KSCX2-EW-J-8); and the National Natural Science Foundation of China (81201046).

Author information

Authors and Affiliations

Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing, China
Liyuan Guo, Yang Du & Jing Wang
University of Chinese Academy of Sciences, Beijing, China
Yang Du

Authors

Liyuan Guo
View author publications
You can also search for this author in PubMed Google Scholar
Yang Du
View author publications
You can also search for this author in PubMed Google Scholar
Jing Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.G. designed the study, L.G. and Y.D. preformed analyses in this study, L.G. and J.W. wrote the main manuscript text. All authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Guo, L., Du, Y. & Wang, J. Network analysis reveals a stress-affected common gene module among seven stress-related diseases/systems which provides potential targets for mechanism research. Sci Rep 5, 12939 (2015). https://doi.org/10.1038/srep12939

Download citation

Received: 20 April 2015
Accepted: 30 June 2015
Published: 06 August 2015
DOI: https://doi.org/10.1038/srep12939

This article is cited by

An interaction network driven approach for identifying biomarkers for progressing cervical intraepithelial neoplasia
- Shikha Suman
- Ashutosh Mishra
Scientific Reports (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.