Genome-wide analysis of retrogene polymorphisms in Drosophila melanogaster

  1. Matthew W. Hahn1,2
  1. 1Department of Biology, Indiana University, Bloomington, Indiana 47405, USA;
  2. 2School of Informatics and Computing, Indiana University, Bloomington, Indiana 47405, USA;
  3. 3Department of Evolution and Ecology, University of California, Davis, California 95616, USA;
  4. 4Center for Population Biology, University of California, Davis, California 95616, USA

    Abstract

    Gene duplication via retrotransposition has been shown to be an important mechanism in evolution, affecting gene dosage and allowing for the acquisition of new gene functions. Although fixed retrotransposed genes have been found in a variety of species, very little effort has been made to identify retrogene polymorphisms. Here, we examine 37 Illumina-sequenced North American Drosophila melanogaster inbred lines and present the first ever data set and analysis of polymorphic retrogenes in Drosophila. We show that this type of polymorphism is quite common, with any two gametes in the North American population differing in the presence or absence of six retrogenes, accounting for ∼13% of gene copy-number heterozygosity. These retrogenes were identified by a straightforward method that can be applied using any type of DNA sequencing data. We also use a variant of this method to conduct a genome-wide scan for intron presence/absence polymorphisms, and show that any two chromosomes in the population likely differ in the presence of multiple introns. We show that these polymorphisms are all in fact deletions rather than intron gain events present in the reference genome. Finally, by leveraging the known location of the parental genes that give rise to the retrogene polymorphisms, we provide direct evidence that natural selection is responsible for the excess of fixations of retrogenes moving off of the X chromosome in Drosophila. Further efforts to identify retrogene and intron presence/absence polymorphisms will undoubtedly improve our understanding of the evolution of gene copy number and gene structure.

    Footnotes

    • Received October 9, 2010.
    • Accepted March 10, 2011.
    | Table of Contents

    Preprint Server