Abstract
Chronic stress (CS) was reported to associate with many complex diseases and stress-related diseases show strong comorbidity; however, molecular analyses have not been performed to date to evaluate common stress-induced biological processes across these diseases. We utilized networks constructed by genes from seven genetic databases of stress-related diseases or systems to explore the common mechanisms. Genes were connected based on the interaction information of proteins they encode. A common sub-network constructed by 561 overlapping genes and 8863 overlapping edges among seven networks was identified and it provides a common gene module among seven stress-related diseases/systems. This module is significantly overlapped with network that constructed by genes from the CS gene database. 36 genes with high connectivity (hub genes) were identified from seven networks as potential key genes in those diseases/systems, 33 of hub genes were included in the common module. Genes in the common module were enriched in 190 interactive gene ontology (GO) functional clusters which provide potential disease mechanism. In conclusion, by analyzing gene networks we revealed a stress-affected common gene module among seven stress-related diseases/systems which provides insight into the process of stress induction of disease and suggests potential gene and pathway candidates for further research.
Similar content being viewed by others
Introduction
Chronic stress (CS) influences multiple systems and affects the generation and development of numerous complex disorders1,2, such as infectious and autoimmune disorders3,4,5, cardiovascular events6,7, cancers8,9, mental disorders10,11,12 and obesity13. Results from epidemiological literature show strongly that there is comorbidity among stress-related diseases14,15 and studies of molecular mechanisms also imply a tight relevance across these diseases16. Additionally, recent clinical tests suggest that psychological interventions can affect patients with other stress-related diseases17,18. Although increasing evidence has hinted at a strong association among different stress-related diseases, it remains unclear whether there is a common stress-induced biological process across these diseases.
In recent years, genetic and expressional studies have identified a significant number of disease-related genes and the information has been organized into specific data resources19,20,21. Moreover, many gene-based bioinformatic approaches, such as gene network analysis that based on the interaction of proteins encoded by genes, so called protein-protein interaction network analysis22 and gene co-expression module analysis23, have been employed to explore the biological processes that underlie stress-related diseases. Several key genes, biological pathways and functional modules have been identified in bioinformatics studies.
In this study, we used genes from seven stress-related disease/system databases and one CS database to construct networks based on the interaction information of proteins they encode. These networks were analyzed as follows: 1) identify nodes with high connectivity (hub genes) of these diseases/systems to obtain key genes in the interactive system; 2) reveal a common gene module among different stress-related diseases/systems to provide molecules potentially related to disease comorbidity; and 3) determine the relationship between CS and disease/system common module. Based on the results of the network analysis, a pathway enrichment analysis was performed to determine potential biological mechanisms through which CS induces disease.
Materials and Methods
Gene sets of stress-related disease/system
Genes from genetic and expressional databases of neurodegeneration disease, mental disorders and other stress related diseases or systems were obtained and utilized for network analyses. Genes from Alzgene24, BDgene21, MK4MDD19, CADgene20 and NCG4.025 were selected to form gene sets of five stress-related diseases: Alzheimer’s disease(Alz), bipolar disorder(BD), major depressive disorder(MDD), coronary artery disease(CAD)and cancer. The BDgene and MK4MDD databases include genetic factors that are linked to BD and MDD and have positive and negative results; only genes with at least one positive result were selected for the corresponding gene sets. Genes from the Obesity Gene Atlas26 and Immunome27 were also selected for inclusion in gene sets for fatty metabolism and immune responses. Human CS genes in the database CS-DEGs28 were selected to form a gene set that is affected by CS environments. Overlaps were compared among the stress-related diseases and system gene sets.
Protein-protein interaction networks
The database STRING 9.129 provides a comprehensive protein interactome that includes known and predicted protein-protein interactions scored according to their confidence. Information in this database was utilized to construct the disease/system and CS gene networks. Genes in the stress-related disease/system or CS datasets were considered seed nodes and used to obtain protein-protein interactions with the highest confidence (score >0.9). As shown in Fig. 1a, an extended network that included seed nodes, first neighbor nodes and the highest confident interactions between these nodes was constructed for each gene set. All of the networks were visualized and analyzed with the visualization software Cytoscape 3.0.230. The node properties, such as the betweenness centrality (BC) and degree, were calculated using the plug-in “Network Analyzer”31 in the Cytoscape software. Hub genes were identified according to the following thresholds: BC > 0.05 and degree >5022. The statistical significant difference between properties of nodes in disease/system networks and the entire interactome was examined by T-test.
Common gene module of stress-related diseases/systems
To examine whether a common gene module exists among different stress-related diseases/systems, nodes and edges were compared among the seven stress-related disease and system networks and a common sub-network was constructed using the overlapping nodes and edges. These interactive nodes in the sub-network constructed the common gene module. Three properties of the module were analyzed: 1) network topological parameters; 2) overlapping genes shared between the module and hub genes; and 3) overlapping genes shared between the module and the CS gene set and network. The statistical significance of the overlap between common module and the CS gene set and network was determined by the Fisher’s exact test.
Gene ontology (GO) pathway cluster enrichment analysis
To identify common biological processes underlying stress-related diseases, a GO pathway cluster enrichment analysis was performed on nodes in the disease/system common module using the online analysis tool DAVID32. As recommended in DAVID, the cutoff for pathway cluster enrichment was set at a score >1.3. The representative biological terms associated with significant clusters were manually selected. Because these clusters reflect interactive functional systems, the GO term network of genes in common module was also deciphered using the Cytoscape plug-in ClueGO33 to provide a system-wide view.
Results
Summary of genes and networks
The seven stress-related disease/system gene sets included 4637 genes (summarized in Supplementary Table S1). The seven gene sets overlapped; however, there were no genes that occurred in all seven sets. The genes that occupied by more than four gene sets are shown in Supplementary Table S2. The CS gene set included 2606 genes (see in Supplementary Table S1). A total of 3941 disease/system or CS genes were found in STRING v9.1 and 8429 nodes were included in the disease/system or CS networks (see in Supplementary Table S1).
Hub genes
Node properties in each network were analyzed. As shown in Supplementary Table S3, the average degrees of the disease/system nodes were all significantly higher than the average degree of the entire STRING network. With a threshold degree >50 and BC > 0.05, as shown in Table 1, 36 genes were identified as hub genes for seven diseases/systems and the genes ESR1, TP53, FOS, AKT1 and FRN were hub genes for more than one disease/system.
Common gene module among stress-related diseases/systems
To explore the common biological modules underlying stress-related diseases/systems, the nodes and edges of the seven disease/system networks were compared. A common sub-network including 561 genes and 8863 edges was observed in the network of all seven diseases/systems (Fig. 1b). The 561 interactive common genes (as shown in Table 2) constructed the common gene module among stress-related diseases/systems, they include 180 members of the CS gene set and all genes in this module can be found in the CS network. Nodes of the CS network significantly overlapped with the common gene module (Fisher’s Exact Test, p < 2.2E-16). The average degrees of genes in common module were significantly higher than other genes in disease/system networks (as shown in Supplementary Table S4). 33 hub genes were included in the common module; hub genes in Table 1 that were not included in the common module were ACTN2, CDC42 and OR6A2.
Functional pathways enriched by genes in common module
Using the recommended threshold enrichment score (>1.3), 190 GO functional pathway clusters were enriched and categorized into three types: Cellular Component (see in Supplementary Table S5), Biological Process (see in Supplementary Table S6) and Molecular Function (see in Supplementary Table S7). Table 3 shows the top 10 enriched clusters and the detailed information of all enriched clusters was shown in Supplementary Table S8. Fifty-four interactive pathway groups were identified with network connectivity (Kappa score) 0.5. The largest group is shown in Fig. 1c and all groups are shown in Supplementary Figure S1.
Discussion
In this study we constructed seven stress-related disease/system gene networks based on the interaction information of gene-encoded proteins. The average degrees of the disease/system genes are significantly higher than the average degrees of the entire human interactome, suggesting that disease/system genes and their first neighbors are more highly connected in the human interactome than random genes, so they may play roles in a tighter and more complex manner. The result also supports the hypothesis that disease genes tend to have higher degrees34,35. A total of 36 disease/system genes were identified as hub genes that occupy central positions in disease/system networks and may possess important biological functions.
Although common genes were not identified among the stress-related diseases/systems compared in this study, a common sub-network was identified among the seven disease/systems. Genes in this sub-network were most enriched in GO pathways that related to chemical homeostasis (as shown in Table 3). This result may imply that there is a common interactive gene module that maintains homeostasis which is related to all stress-related diseases/systems, so this common module provides potential molecular fundaments of the comorbidity. Because most hub genes are included in the common module, the dysfunction of this module may play an important role in disease generation and development. The imbalance of homeostasis induced by aberrant expression of genes in the common module may trigger a pre-disease state36 with the potential to develop into different pathological processes because of additional disease/system genes that were not found in the common module. In each disease/system network, the average degrees of nodes in common module were significantly higher than other nodes. Considering the reports that that disease genes tend to have higher degrees34,35, genes in common module may be more strongly associated to pathological processes than other genes in disease/system networks.
The CS gene set includes human genes whose rodent homologs were differentially expressed in CS rodent models. The significant overlap between the common gene module of seven human diseases/systems and CS network suggests that the stress environment may induce disease by influencing a common homeostasis system. Consequently, a pre-disease state progresses to different disease states as genetic factors and/or other environmental factors are stimulated. This potential mechanism may explain the concomitant strong association between stress and disease and high heterogeneity of pathological processes associated with stress-related diseases. Genes in the common module (as shown in Table 2) could be useful candidates for subsequent experimental study.
The GO pathway clusters enriched by genes in the common module indicate the biological systems that are influenced by stress environments and abnormal in diseases, so they may imply the biological mechanisms by which stress environments induce disease. Beyond that, these pathway clusters also provide specific potential targets for relevant research. As shown in Supplementary Table S5, most of the common nodes are located in the extracellular space and plasma membrane-related cellular components, which suggests candidate targets for disease intervention. The biological processes associated with common module (see in Supplementary Table S6) provide a series of candidates for mechanism research, such as processes related to response, metabolism, cell differentiation and migration, transport and signaling transduction. The enriched pathway clusters of molecular function, such as peptide receptor activity and phospholipase activity, suggest potential drug targets (Supplementary Table S7). These functional pathways are interactive systems and could be enriched in several groups (as shown in Supplementary Figure S1). Figure 1c shows the largest enriched interactive group that constructed by pathways of response, regulation, cell migration, transport and signaling transduction. Besides of the system views, certain enriched biological process clusters provide detailed biological hypotheses for specific diseases. For example, the dysfunction of 37 genes in the function cluster “response to bacterium”(Supplementary Table S6 and Table S8) may directly mediate the process by which stress stimulates infectious disease. This function cluster also provides a potential explanation for the comorbidity among infectious diseases and other stress-related diseases.
In conclusion, we utilized stress-related disease/system genes to construct interactive networks. By analyzing these networks, we identified hub genes which may play roles in the pathological processes of stress-related diseases. We also identified a common sub-network among diseases/systems and the sub-network is significantly overlapped with the CS network. The common sub-network implies that different stress-related diseases/systems share a common gene module that may be influenced by stress environments. By analyzing this common gene module, the potential mechanism underlying the process by which stress induces diseases could be partially revealed.
In spite of above results, this study also has some limitations. First, we constructed network based on existing annotations database which are limited by our current knowledge of biology. Second, limited by the lack of data resource, only seven stress-related diseases or systems were selected to analyze. Third, the CS genes were obtained via homologous analysis on differentially expressed genes from CS rodent models, so result based on these genes need to be further validated in human study.
Additional Information
How to cite this article: Guo, L. et al. Network analysis reveals a stress-affected common gene module among seven stress-related diseases/systems which provides potential targets for mechanism research. Sci. Rep. 5, 12939; doi: 10.1038/srep12939 (2015).
References
McEwen, B. S. & Stellar, E. Stress and the individual. Mechanisms leading to disease. Arch Intern Med 153, 2093–2101 (1993).
Schmidt, M. V., Sterlemann, V. & Muller, M. B. Chronic stress and individual vulnerability. Ann NY Acad Sci 1148, 174–183 (2008).
Dhabhar, F. S. Effects of stress on immune function: the good, the bad and the beautiful. Immunol Res 58, 193–210 (2014).
Stojanovich, L. Stress and autoimmunity. Autoimmun Rev 9, A271–276 (2010).
Stojanovich, L. & Marisavljevich, D. Stress as a trigger of autoimmune disease. Autoimmun Rev 7, 209–213 (2008).
Brotman, D. J., Golden, S. H. & Wittstein, I. S. The cardiovascular toll of stress. Lancet 370, 1089–1100 (2007).
Hanna, R. N. & Hedrick, C. C. Stressing out stem cells: linking stress and hematopoiesis in cardiovascular disease. Nat Med 20, 707–708 (2014).
Thaker, P. H. et al. Chronic stress promotes tumor growth and angiogenesis in a mouse model of ovarian carcinoma. Nat Med 12, 939–944 (2006).
Todd, B. L., Moskowitz, M. C., Ottati, A. & Feuerstein, M. Stressors, stress response and cancer recurrence: a systematic review. Cancer Nurs 37, 114–125 (2014).
Miklowitz, D. J. & Johnson, S. L. Social and Familial Factors in the Course of Bipolar Disorder: Basic Processes and Relevant Interventions. Clin Psychol-Sci Pr 16, 281–296 (2009).
Wang, J. Work stress as a risk factor for major depressive episode(s). Psychol Med 35, 865–871 (2005).
Wilson, R. S. et al. Proneness to psychological distress is associated with risk of Alzheimer’s disease. Neurology 61, 1479–1485 (2003).
Shively, C. A., Register, T. C. & Clarkson, T. B. Social stress, visceral obesity and coronary artery atherosclerosis in female primates. Obesity 17, 1513–1520 (2009).
Jenny-Avital, E. R. Obesity and the risk of heart failure. New Engl J Med 347, 1887–1889 (2002).
Pearce, B. D., Kruszon-Moran, D. & Jones, J. L. The relationship between Toxoplasma gondii infection and mood disorders in the third National Health and Nutrition Survey. Biol Psychiat 72, 290–295 (2012).
Martin, C., Tansey, K. E., Schalkwyk, L. C. & Powell, T. R. The inflammatory cytokines: molecular biomarkers for major depressive disorder? Biomark Med 9, 169–180 (2014).
Andersen, B. L. et al. Psychologic intervention improves survival for breast cancer patients: a randomized clinical trial. Cancer 113, 3450–3458 (2008).
Tong, G. et al. Effects of psycho-behavioral interventions on immune functioning in cancer patients: a systematic review. J Cancer Res Clin 140, 15–33 (2014).
Guo, L. et al. MK4MDD: a multi-level knowledge base and analysis platform for major depressive disorder. PloS One 7, e46335 (2012).
Liu, H. et al. CADgene: a comprehensive database for coronary artery disease genes. Nucleic Acids Res 39, D991–996 (2011).
Su-Hua Chang, L. G., Li, Z., Zhang, W.-N., Du, Y. & Wang, J. BDgene: a genetic database for bipolar disorder and its overlap with schizophrenia and major depressive disorder. Biol Psychiat 74, 727–733 (2013).
Nair, J., Ghatge, M., Kakkar, V. V. & Shanker, J. Network analysis of inflammatory genes and their transcriptional regulators in coronary artery disease. PloS One 9, e94328 (2014).
Chen, C. et al. Two gene co-expression modules differentiate psychotics and controls. Mol psychiatr 18, 1308–1314 (2013).
Bertram, L., McQueen, M. B., Mullin, K., Blacker, D. & Tanzi, R. E. Systematic meta-analyses of Alzheimer disease genetic association studies: the AlzGene database. Nat Genet 39, 17–23 (2007).
An, O. et al. NCG 4.0: the network of cancer genes in the era of massive mutational screenings of cancer genomes. Database-Oxford 2014, bau015 (2014).
Kunej T, J. S. D., Zorz, M., Ogrinc, A., Michal, J. J., Kovac, M. & Jiang, Z. Obesity Gene Atlas in Mammals. J Genomics 1, 11 (2012).
Ortutay, C. & Vihinen, M. Immunome: a reference set of genes and proteins for systems biology of the human immune system. Cell Immunol 244, 87–89 (2006).
Guo, L., Du, Y., Chang, S., Zhang, W. & Wang, J. Applying differentially expressed genes from rodent models of chronic stress to research of stress-related disease: an online database. Psychosom Med 76, 644–649 (2014).
Franceschini, A. et al. STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res 41, D808–815 (2013).
Kohl, M., Wiese, S. & Warscheid, B. Cytoscape: software for visualization and analysis of biological networks. Methods Mol Biol 696, 291–303 (2011).
Assenov, Y., Ramirez, F., Schelhorn, S. E., Lengauer, T. & Albrecht, M. Computing topological parameters of biological networks. Bioinformatics 24, 282–284 (2008).
Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature protoc 4, 44–57 (2009).
Bindea, G. et al. ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25, 1091–1093 (2009).
Jonsson, P. F. & Bates, P. A. Global topological features of cancer proteins in the human interactome. Bioinformatics 22, 2291–2297 (2006).
Sun, J. et al. Schizophrenia gene networks and pathways and their applications for novel candidate gene selection. PloS One 5, e11351 (2010).
Liu, R., Wang, X., Aihara, K. & Chen, L. Early diagnosis of complex diseases by molecular biomarkers, network biomarkers and dynamical network biomarkers. Med Res Rev 34, 455–478 (2014).
Acknowledgements
This work was supported by: the Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences; the CAS/SAFEA International Partnership Program for Creative Research Teams (Y2CX131003); the Knowledge Innovation Program of the Chinese Academy of Sciences (KSCX2-EW-J-8); and the National Natural Science Foundation of China (81201046).
Author information
Authors and Affiliations
Contributions
L.G. designed the study, L.G. and Y.D. preformed analyses in this study, L.G. and J.W. wrote the main manuscript text. All authors reviewed the manuscript.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Electronic supplementary material
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Guo, L., Du, Y. & Wang, J. Network analysis reveals a stress-affected common gene module among seven stress-related diseases/systems which provides potential targets for mechanism research. Sci Rep 5, 12939 (2015). https://doi.org/10.1038/srep12939
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep12939
This article is cited by
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.