Comparison of Human Chromosome 21 Conserved Nongenic Sequences (CNGs) With the Mouse and Dog Genomes Shows That Their Selective Constraint Is Independent of Their Genic Environment

  1. Emmanouil T. Dermitzakis1,5,
  2. Ewen Kirkness2,
  3. Scott Schwarz3,
  4. Ewan Birney4,
  5. Alexandre Reymond1, and
  6. Stylianos E. Antonarakis1,5
  1. 1 Division of Medical Genetics, University of Geneva Medical School, CH-1211 Geneva, Switzerland
  2. 2 The Institute for Genomic Research (TIGR), Rockville, Maryland 20850, USA
  3. 3 Department of Computer Sciences and Engineering, Pennsylvania State University, Pennsylvania 16802, USA
  4. 4 European Bioinformatics Institute, Hinxton CB10 1SD, UK

Abstract

The analysis of conservation between the human and mouse genomes resulted in the identification of a large number of conserved nongenic sequences (CNGs). The functional significance of this nongenic conservation remains unknown, however. The availability of the sequence of a third mammalian genome, the dog, allows for a large-scale analysis of evolutionary attributes of CNGs in mammals. We have aligned 1638 previously identified CNGs and 976 conserved exons (CODs) from human chromosome 21 (Hsa21) with their orthologous sequences in mouse and dog. Attributes of selective constraint, such as sequence conservation, clustering, and direction of substitutions were compared between CNGs and CODs, showing a clear distinction between the two classes. We subsequently performed a chromosome-wide analysis of CNGs by correlating selective constraint metrics with their position on the chromosome and relative to their distance from genes. We found that CNGs appear to be randomly arranged in intergenic regions, with no bias to be closer or farther from genes. Moreover, conservation and clustering of substitutions of CNGs appear to be completely independent of their distance from genes. These results suggest that the majority of CNGs are not typical of previously described regulatory elements in terms of their location. We propose models for a global role of CNGs in genome function and regulation, through long-distance cis or trans chromosomal interactions.

Footnotes

  • [Supplemental material is available online at www.genome.org.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1934904. Article published online before print in April 2004.

  • 5 Corresponding authors. E-MAIL Emmanouil.Dermitzakis{at}medecine.unige.ch; FAX 0041-22-379-5706.

    5 E-MAIL Stylianos.Antonarakis{at}medecine.unige.ch; FAX 0041-22-379-5706.

    • Accepted December 28, 2003.
    • Received September 3, 2003.
| Table of Contents

Preprint Server