Rapid whole-genome mutational profiling using next-generation sequencing technologies

  1. Douglas R. Smith1,7,9,
  2. Aaron R. Quinlan2,7,
  3. Heather E. Peckham3,7,
  4. Kathryn Makowsky1,
  5. Wei Tao1,
  6. Betty Woolf1,
  7. Lei Shen1,
  8. William F. Donahue1,
  9. Nadeem Tusneem1,
  10. Michael P. Stromberg2,
  11. Donald A. Stewart2,
  12. Lu Zhang2,
  13. Swati S. Ranade3,
  14. Jason B. Warner3,
  15. Clarence C. Lee3,
  16. Brittney E. Coleman3,
  17. Zheng Zhang3,4,
  18. Stephen F. McLaughlin3,
  19. Joel A. Malek3,
  20. Jon M. Sorenson3,4,
  21. Alan P. Blanchard3,
  22. Jarrod Chapman5,
  23. David Hillman5,
  24. Feng Chen5,
  25. Daniel S. Rokhsar5,
  26. Kevin J. McKernan3,
  27. Thomas W. Jeffries6,
  28. Gabor T. Marth2,9, and
  29. Paul M. Richardson5,8,9
  1. 1 Agencourt Bioscience Corporation, Beverly, Massachusetts 01915, USA;
  2. 2 Boston College Biology Department, Higgins Hall, Chestnut Hill, Massachusetts 02467, USA;
  3. 3 Applied Biosystems, Beverly, Massachusetts 01915, USA;
  4. 4 Appplied Biosystems, Foster City, California 94404, USA;
  5. 5 US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA;
  6. 6 Institute for Microbial and Biochemical Technology, US Forest Products Laboratory, Madison, Wisconsin 53726, USA
  1. 7 These authors contributed equally to this work.

Abstract

Forward genetic mutational studies, adaptive evolution, and phenotypic screening are powerful tools for creating new variant organisms with desirable traits. However, mutations generated in the process cannot be easily identified with traditional genetic tools. We show that new high-throughput, massively parallel sequencing technologies can completely and accurately characterize a mutant genome relative to a previously sequenced parental (reference) strain. We studied a mutant strain of Pichia stipitis, a yeast capable of converting xylose to ethanol. This unusually efficient mutant strain was developed through repeated rounds of chemical mutagenesis, strain selection, transformation, and genetic manipulation over a period of seven years. We resequenced this strain on three different sequencing platforms. Surprisingly, we found fewer than a dozen mutations in open reading frames. All three sequencing technologies were able to identify each single nucleotide mutation given at least 10–15-fold nominal sequence coverage. Our results show that detecting mutations in evolved and engineered organisms is rapid and cost-effective at the whole-genome level using new sequencing technologies. Identification of specific mutations in strains with altered phenotypes will add insight into specific gene functions and guide further metabolic engineering efforts.

Footnotes

  • 8 Present address: Progentech Limited, 5885 Hollis St., Suite 155, Emeryville, CA 94608, USA.

  • 9 Corresponding authors.

    9 E-mail PaulRichardson{at}Progentech.com; fax (510) 655-5840.

    9 E-mail douglas.smith{at}agencourt.com; fax (978) 867-2601.

    9 E-mail marth{at}bc.edu; fax (617) 552-2011.

  • [Supplemental material is available online at www.genome.org. Complete data sets are available at the NCBI Short Read Archive under accession no. SRA 001158 (ftp://ftp.ncbi.nih.gov/pub/TraceDB/ShortRead).]

  • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.077776.108.

    • Received February 22, 2008.
    • Accepted July 10, 2008.
  • Freely available online through the Genome Research Open Access option.

| Table of Contents
OPEN ACCESS ARTICLE

Preprint Server