The origin, global distribution, and functional impact of the human 8p23 inversion polymorphism

  1. Carol C. Shoulders1,9
  1. 1Centre for Endocrinology, Barts & the London School of Medicine & Dentistry, Queen Mary University of London, London EC1M 6BQ, United Kingdom;
  2. 2Bioinformatics & Biostatistics Group, Cancer Research UK London Research Institute, London WC2A 3LY, United Kingdom;
  3. 3Department of Systems Biology, Sage Bionetworks, Seattle, Washington 98109, USA;
  4. 4Department of Epidemiology, Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts 02115, USA;
  5. 5National Heart & Lung Institute, Imperial College London, London SW3 6LY, United Kingdom;
  6. 6Guys & St. Thomas Hospital, King's College London, London SE1 2PR, United Kingdom;
  7. 7Institute of Clinical Science, Imperial College London, London W12 0NN, United Kingdom
    • 8 Present address: Bioinformatics & Biostatistics group, Cancer Research UK London Research Institute, London WC2A 3LY, United Kingdom.

    Abstract

    Genomic inversions are an increasingly recognized source of genetic variation. However, a lack of reliable high-throughput genotyping assays for these structures has precluded a full understanding of an inversion's phylogenetic, phenotypic, and population genetic properties. We characterize these properties for one of the largest polymorphic inversions in man (the ∼4.5-Mb 8p23.1 inversion), a structure that encompasses numerous signals of natural selection and disease association. We developed and validated a flexible bioinformatics tool that utilizes SNP data to enable accurate, high-throughput genotyping of the 8p23.1 inversion. This tool was applied retrospectively to diverse genome-wide data sets, revealing significant population stratification that largely follows a clinal “serial founder effect” distribution model. Phylogenetic analyses establish the inversion's ancestral origin within the Homo lineage, indicating that 8p23.1 inversion has occurred independently in the Pan lineage. The human inversion breakpoint was localized to an inverted pair of human endogenous retrovirus elements within the large, flanking low-copy repeats; experimental validation of this breakpoint confirmed these elements as the likely intermediary substrates that sponsored inversion formation. In five data sets, mRNA levels of disease-associated genes were robustly associated with inversion genotype. Moreover, a haplotype associated with systemic lupus erythematosus was restricted to the derived inversion state. We conclude that the 8p23.1 inversion is an evolutionarily dynamic structure that can now be accommodated into the understanding of human genetic and phenotypic diversity.

    Footnotes

    • 9 Corresponding authors.

      E-mail max.salm05{at}imperial.ac.uk.

      E-mail c.shoulders{at}qmul.ac.uk.

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.126037.111.

    • Received May 9, 2011.
    • Accepted March 6, 2012.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/.

    | Table of Contents

    Preprint Server