Editorial overview
Autoimmunity: insights from human genomics

https://doi.org/10.1016/j.coi.2012.09.006Get rights and content

Introduction

Early efforts to localize and identify genes that contribute to the occurrence of human disease often used a limited subset of variants in candidate genes, either in case–control association analyses or family-based linkage studies. These studies typically focused on candidate genes selected based upon a perceived biological pathway, and variants that were selected on the basis of putative functional effects. At that time, however, our knowledge of human genomic variation and the nature of variants within specific genes were still lacking. This resulted in the publication of many findings with low statistical power and, often, with little potential for replication. As genotyping strategies and technologies evolved, there was a shift from candidate gene studies to genome-wide linkage scans in families. While linkage studies provided more coverage of the genome, they were often limited by the availability of large collections of families. Also, the primary focus of linkage scans was on multiple affected individuals in a family, thereby implicitly testing for a model where a handful of alleles had large effects on disease risk.

Recently, genome-wide association scans (GWAS) have profited from both increased genomic coverage and cost reduction in genotyping. As a result it has become possible to genotype large numbers of common variants in collections of large sample size. These GWAS studies have succeeded in identifying and replicating novel common genetic variation associated with the spectrum of common autoimmune diseases, including type 1 diabetes [1], ulcerative colitis [2], Crohn's disease [3], systemic lupus erythematosus (SLE) [4], alopecia areata [5], rheumatoid arthritis [6], and multiple sclerosis [7]. Unlike the family-based linkage and candidate gene studies, the individual allelic effect sizes detected by GWAS studies (excluding the MHC) are relatively small, consistent with the common variant hypothesis, with each allele accounting for only a small portion of genetic variance. Described by Voight and Cotsapas, in this issue, there has also been confirmation that many loci are shared across multiple autoimmune diseases [8]. Even though many loci have been identified by GWAS, the goal remains not only the discovery of individual loci or genes, but also the discovery of new biology. For the vast majority of GWAS findings, however, the functional variants remain yet to be determined; however, there has been progress recently in Crohn's disease (e.g. [9, 10]).

The findings in the GWAS era have recently been criticized, on the basis of the difficulty in pinpointing specific genes and causal variants, the limited amount of genetic risk burden that has been explained (missing heritability) and the incomplete genomic coverage. However, the use of GWAS technologies has provided the locations for further characterization of the genes and variants in a genomic region that statistically will contain genes and variants contributing to disease risk. Gene and causal variant discovery will continue as technical advances allow for denser genotyping and sequencing [11] as well as access to integrated health care systems that will permit rapid recruitment of patients from electronic medical records [12].

With these advances in technology, however, there are new challenges to understanding the genetic basis of autoimmunity and disease. To date, experimental strategies to understand the function of disease susceptibility alleles have been tailored toward those with highly penetrant effects that typically cause familial diseases. In contrast, the vast majority of discoveries in autoimmune diseases have resulted from GWAS studies of common variants [13]. Since these alleles are common they may play a role in influencing disease susceptibility in the general population  but with only a small effect on an individual. Common variant alleles represent mutational events that have survived the pressures of evolution, escaped purifying negative selection, and have therefore been able to propagate within a population [14]. Thus, within an individual, a common risk variant may have only modest and very specific influence on immune function. As Buckner and Nepom describe in their article, different variants for the same disease can influence disease progression at distinct steps in immune activation, expansion, and regulation. Their functional influences are likely to be cell-specific and subtle. Indeed, common risk variants may alter function differently in different cell-types (e.g. the same common variant can have disparate effects on gene expression in lymphoid and myeloid cell-types [15]). Traditionally, the evaluation of rare penetrant mutations has generated a standard classification that includes ‘loss-of-function’ or ‘gain-of-function’; however, with common variants, a more appropriate paradigm might be ‘altered function’.

Once an association is identified, determining which of many alleles may be functional and may account for the observed association with disease can be challenging. In many instances, due to linkage disequilibrium, an entire set of variants may demonstrate equivalent levels of association. Genetic evidence alone may not be able to identify which specific variant is contributing biologically to the association [13]. One rare instance where an observed SNP association was fine-mapped successfully to causal variants was the IRF5 locus in SLE. The initial association of SLE to a common SNP ultimately was explained as an exonic splice site variant, a 30-bp in-frame insertion/deletion, and a variant in a conserved polyA+ signal sequence that alters the length of the 3′ UTR and stability of IRF5 mRNAs [16]. For other loci, the path from GWAS to fine-mapping to detection of causal variation has yet to occur.

In cases where common variants have been shown to alter function, it is possible to associate disease with specific biological mechanisms. In this issue, Gardet and Xavier describe how common variants in Crohn's disease alter autophagy, normally thought to be a homeostatic response that ensures the recycling of cellular materials to promote cell survival. This unanticipated finding has established a new research direction for autophagy mucosal homeostasis. Even in the absence of molecular evidence of disease mechanism, it is possible to link common variants to specific disease pathways. In this issue, Bronson et al. present compelling evidence linking interferon abnormalities to SLE using genetics and functional genomics. These results are both influencing drug discovery and the mechanisms involved in autoimmune disease.

In order to make progress in understanding the genetic basis of autoimmunity, several approaches need to be pursued in parallel. The first is continued gene and causal variant discovery. As additional novel alleles are discovered they will continue to contribute to help identify novel pathways and potential therapeutic targets. A second approach would implement targeted DNA sequencing in regions of susceptibility. Sequencing will identify rare and highly penetrant alleles, some of which may be particularly useful in designing biomarkers and interventions. These rare variants, especially if they reside in coding regions of genes, will not only identify the gene as causal but may have more obvious function. This approach has already identified likely pathogenic genes in Crohn's disease and type 1 diabetes [17, 18, 19].

A third and potentially informative approach for understanding the role of common variants in autoimmunity is to broadly examine functional quantitative traits, for example expression quantitative traits, reviewed by Stranger and de Jaeger, and immune quantitative traits, described by Gregersen et al. Utilizing large populations with robust experimental and analytical approaches to examine expression and immune phenotypes could identify those variants that strongly influence immune responses. One obvious challenge is to identify those assays that are most valuable and relevant to look at from the perspective of autoimmunity and human disease. As massive parallel profiling becomes feasible, this approach may emerge as a powerful parallel method to augment genomic studies [20].

For human immunologists, the discoveries from genetics should be viewed as a great success, creating a platform for countless avenues of investigation into the immune system and how host genomic variation modifies this response. This is a particularly exciting time with the convergence of human genetic studies, human immunologic studies and an ever-increasing platform of reagents and model systems for probing immune biology. As Hu and Daly note, integrative approaches examining many alleles and their functions in concert might yield insight into disease mechanisms. Ermann and Glimcher write how, in certain key instances, traditional mouse models may present an opportunity to investigate specific allelic functions, and to connect to disease phenotypes. The integration of epigenetic data with genetic data, as reviewed by Leung et al. offers another promising avenue for progress, particularly in light of recent observation within data produced by the ENCODE consortium of the overlap between disease alleles and regulatory chromatin marks [21, 22]. The breadth and quantity of human risk alleles has given investigators many leads to follow on in order to understand the molecular underpinnings of autoimmune diseases.

Section snippets

Soumya Raychaudhuri is an Assistant Professor of Medicine at Harvard Medical School within the Divisions of Rheumatology and Genetics at Brigham and Women's Hospital. In addition, he is a Professor in Genetics at the University of Manchester and an affiliate of the Broad Institute. His research leverages computational and bioinformatics approaches to interpret genetic, genomic, epigenetic, and functional genomics data to better understand autoimmune diseases. He has worked collaboratively with

References (22)

  • S. Raychaudhuri

    Mapping rare and common causal alleles for complex human diseases

    Cell

    (2011)
  • J.C. Barrett et al.

    Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes

    Nat Genet

    (2009)
  • C.A. Anderson et al.

    Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47

    Nat Genet

    (2011)
  • A. Franke et al.

    Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci

    Nat Genet

    (2010)
  • V. Gateva et al.

    A large-scale replication study identifies tnip1, prdm1, jazf1, uhrf1bp1 and il10 as risk loci for systemic lupus erythematosus

    Nat Genet

    (2009)
  • L. Petukhova et al.

    Genome-wide association study in alopecia areata implicates both innate and adaptive immunity

    Nature

    (2010)
  • E.A. Stahl et al.

    Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci

    Nat Genet

    (2010)
  • N.A. Patsopoulos et al.

    Genome-wide meta-analysis identifies novel multiple sclerosis susceptibility loci

    Ann Neurol

    (2011)
  • E.J. Rossin et al.

    Proteins encoded in genomic regions associated to immune-mediated disease physically interact and suggest underlying biology

    PLoS Genet

    (2011)
  • T. Haritunians et al.

    Variants in znf365 isoform d are associated with Crohn's disease

    Gut

    (2011)
  • K.S. Michelsen et al.

    Ibd-associated tl1a gene (tnfsf15) haplotypes determine increased expression of tl1a protein

    PLoS One

    (2009)
  • Cited by (1)

    • Expression level of risk genes of MHC class II is a susceptibility factor for autoimmunity: New insights

      2018, Journal of Autoimmunity
      Citation Excerpt :

      These approaches contribute to obtain a finest mapping of risk-associated MHC genetic variants showing a larger involvement of MHC locus in autoimmunity. SNP2HLA is an imputation tool that, using GWAS datasets from European populations, allowed fine–mapping associations not only with classical alleles but also contributed to highlight the role of specific amino acid variants that structurally, or functionally, affects the interaction of HLA molecules with the autoantigen [25–27]. A recent paper, based on this approach, confirmed strong associations of CD with HLA-DQA1 and HLA-DQB1 genes.

    Soumya Raychaudhuri is an Assistant Professor of Medicine at Harvard Medical School within the Divisions of Rheumatology and Genetics at Brigham and Women's Hospital. In addition, he is a Professor in Genetics at the University of Manchester and an affiliate of the Broad Institute. His research leverages computational and bioinformatics approaches to interpret genetic, genomic, epigenetic, and functional genomics data to better understand autoimmune diseases. He has worked collaboratively with the international consortia of rheumatoid arthritis investigators to identify novel risk alleles and fine-map the MHC association. He has devised computational text mining methods to interpret genomic data, methods to analyze large-scale gene expression data, and statistical methods to interpret common variant alleles in the context of epigenetic data. He is also a practicing rheumatologist at Brigham and Women's Hospital.

    Stephen Rich is the Harrison Professor and Vice-Chair in the Department of Public Health Sciences at the University of Virginia. He has conducted research in the genetic basis of type 1 diabetes and its complications, serving as the Principal Investigator of the Type 1 Diabetes Genetics Consortium. His research in type 1 diabetes has included studies of candidate genes, genome-wide linkage as well as genome-wide association scans and, more recently, evaluation of structural variants, rare variants, and targeted functional studies. He has been one of the Principal Investigators of the NHLBI Exome Sequencing Project, leading a consortium of large, deeply phenotyped cohorts for examination of the role of rare coding variants in heart disease. He is now the Director of the Center for Public Health Genomics at the University of Virginia.

    View full text