EAnnot: A genome annotation tool using experimental evidence

  1. Li Ding1,
  2. Aniko Sabo,
  3. Nicolas Berkowicz,
  4. Rekha R. Meyer,
  5. Yoram Shotland,
  6. Mark R. Johnson,
  7. Kymberlie H. Pepin,
  8. Richard K. Wilson, and
  9. John Spieth
  1. Genome Sequencing Center, Washington University School of Medicine, St. Louis, Missouri 63110, USA

Abstract

The sequence of any genome becomes most useful for biological experimentation when a complete and accurate gene set is available. Gene prediction programs offer an efficient way to generate an automated gene set. Manual annotation, when performed by experienced annotators, is more accurate and complete than automated annotation. However, it is a laborious and expensive process, and by its nature, introduces a degree of variability not found with automated annotation. EAnnot (Electronic Annotation) is a program originally developed for manually annotating the human genome. It combines the latest bioinformatics tools to extract and analyze a wide range of publicly available data in order to achieve fast and reliable automatic gene prediction and annotation. EAnnot builds gene models based on mRNA, EST, and protein alignments to genomic sequence, attaches supporting evidence to the corresponding genes, identifies pseudogenes, and locates poly(A) sites and signals. Here, we compare manual annotation of human chromosome 6 with annotation performed by EAnnot in order to assess the latter's accuracy. EAnnot can readily be applied to manual annotation of other eukaryotic genomes and can be used to rapidly obtain an automated gene set.

Footnotes

  • [Supplemental material available online at www.genome.org and http://genome.wustl.edu/analysis/EAnnot.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.3152604.

  • 1 Corresponding author. E-mail lding{at}watson.wustl.edu; fax (314) 286-1810.

    • Accepted October 4, 2004.
    • Received August 13, 2004.
| Table of Contents

Preprint Server