Abstract
Rice is an important crop and a model system for monocot genomics, and is a target for whole genome sequencing by the International Rice Genome Sequencing Project (IRGSP). The IRGSP is using a clone by clone approach to sequence rice based on minimum tiles of BAC or PAC clones. For chromosomes 10 and 3 we are using an integrated physical map based on two fingerprinted and end-sequenced BAC libraries to identifying a minimum tiling path of clones. In this study we constructed and tested two rice genomic libraries with an average insert size of 10 kb (10-kb library) to support the gap closure and finishing phases of the rice genome sequencing project. The HaeIII library contains 166,752 clones covering approximately 4.6× rice genome equivalents with an average insert size of 10.5 kb. The Sau3AI library contains 138,960 clones covering 4.2× genome equivalents with an average insert size of 11.6 kb. Both libraries were gridded in duplicate onto 11 high-density filters in a 5 × 5 pattern to facilitate screening by hybridization. The libraries contain an unbiased coverage of the rice genome with less than 5% contamination by clones containing organelle DNA or no insert. An efficient method was developed, consisting of pooled overgo hybridization, the selection of 10-kb gap spanning clones using end sequences, transposon sequencing and utilization of in silico draft sequence, to close relatively small gaps between sequenced BAC clones. Using this method we were able to close a majority of the gaps (up to approximately 50 kb) identified during the finishing phase of chromosome-10 sequencing. This method represents a useful way to close clone gaps and thus to complete the entire rice genome.
Similar content being viewed by others
References
Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408:796–815
Atrazhev AM, Elliott JF (1996) Simplified de-salting of ligation reactions immediately prior to electroporation into E. coli. Biotechniques 21:1024
Barry G (2001) The use of the Monsanto draft rice genome sequence in research. Plant Physiol 125:1164–1165
Batzoglou S, Berger B, Mesirov J, Lander ES (1999) Sequencing a genome by walking with clone-end sequences: a mathematical analysis. Genome Res 9:1163–1174
Budiman MA, Mao L, Wood T, Wing RA (2000) A deep-coverage tomato BAC library and prospects toward development of an STC framework for genome sequencing. Genome Res 10:129–136
Chen M, Presting G, Barbazuk WB, Goicoechea JL, Blackmon B, Fang G, Kim H, Frisch D, Yu Y, Sun S, et al. (2002) An integrated physical and genetic map of the rice genome. Plant Cell 14:1–10
Devine SE, Chissoe SL, Eby Y, Wilson RK, Boeke JD (1997) A transposon-based strategy for sequencing repetitive DNA in eukaryotic genomes. Genome Res 7:551–563
Ewing B, Green P (1998) Base-calling of automated sequencer traces using PhredII. Error probabilities. Genome Res 8:186–194
Ewing B, Hillier L, Wendl MC, Green P (1998) Base-calling of automated sequencer traces using PHRED. I. Accuracy assessment. Genome Res 8:175–185
Feng Q, Zhang Y, Hao P, Wang S, Fu G, Huang Y, Li Y, Zhu J, Liu Y, Hu X, et al. (2002) Sequence and analysis of rice chromosome 4. Nature 420:316–320
Frohme M, Camargo AA, Czink C, Matsukuma AY, Simpson AJG, Hoheisel JD, Verjovski-Almeida S (2001) Directed gap closure in large-scale sequencing projects. Genome Res 11:901–903
Goff SG, Ricke D, Lan TH, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296:92–100
Gordon D, Abajian C, Green P (1998) Consed: a graphical tool for sequence finishing. Genome Res 8:195–202
Green P (1997) Against a whole-genome shotgun. Genome Res 7:410–417
International Human Genome Sequencing Consortium (2001) Initial sequencing and analysis of the human genome. Nature 409:860–941
Luo M, Wang YH, Frisch D, Joobeur T, Wing RA, Ralph AD (2001) Melon BAC library construction using improved methods and identification of clones linked to the locus conferring resistance to melon Fusarium Wilt (Fom-2). Genome 44:154–162
McMurray AA, Sulston, JE, Quail MA (1998) Short-insert libraries as a method of problem solving in genome sequencing. Genome Res 8:562–566
Ohmido N, Kijima K, Akiyama Y, de Jong JH, Fukui K (2000) Quantification of total genomic DNA and selected repetitive sequences reveals concurrent changes in different DNA families in indica and jopanica rice. Mol Gen Genet 263:388–394
Roach JC, Thorsson V, Siegel AF (2000) Parking strategies for genome sequencing. Genome Res 10:1020–1030
Sasaki T, Burr B (2000) International rice genome sequencing project: the effort to completely sequence the rice genome. Curr Opin Plant Biol 3:138–141
Sasaki T, Matsumoto T, Yamamoto K, Sakata K, Baba T, Katayose Y, Wu J, Niimura Y, Cheng Z, Nagamura Y, et al. (2002) The genome sequence and structure of rice chromosome 1. Nature 420:312–316
Siegel AF, Trask B, Roach JD, Mahairas GG, Hood L, van den Engh G (1999) Analysis of sequence-tagged-connector strategies for DNA sequencing. Genome Res 9:297–307
Soderlund C, Humphray S, Dunham A, French L (2000) Contigs built with fingerprints, markers and FPC V4.7. Genome Res 10:1772–1787
Tomkins JP, Peterson DG, Yang TJ, Main D, Wilkins TA, Paterson AH, Wing RA (2001) Development of genomic resources for cotton (Gosypium hirsutuum): BAC library construction, preliminary STC analysis, and identification of clones associated with fiber development. Mol Breed 8:255–261
Venter JD, Smith HO, Hood L (1996) A new strategy for genome sequencing. Nature 381:364–366
Venter JD, Adams MD, Myers EW, Li PW, Mural RJ, Sutton SS, Smith HO, Yandell M, Evans CA, Holt RA, et al. (2001) The sequence of the human genome. Science 291:1304–1351
Wendl MC, Marra MA, Hillier LW, Chinwalla AT, Wilson RK, Waterston RH (2001) Theories and applications for sequencing randomly selected clones. Genome Res 11:274–280
Yang TJ, Yu Y, Frisch D, Wing RA (2002) Two series of plasmid vectors for the shotgun library. Plant and Animal Genome Conference X, p 24
Yu J, Hu S, Wang J, Wong GK, Li S, Liu B, Deng Y, Dai L, Zhou Y, Zhang X, et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 296:79–91
Yu Y, Tomkins JP, Waugh R, Frisch D, Kudrna D, Kleinhofs A, Brueggeman RS, Muehlbauer GJ, Wise RP, Wing RA (2000) A bacterial artificial chromosome library for barley (Hordeum vulgare L.) and the identification of clones containing putative resistance genes. Theor Appl Genet 101:1093–1099
Yu Y, Rambo T, Currie J, Sasaki C, Kim HR, Collura K, Thompson S, Simmons J, Yang TJ, Park GN, Patel AJ, Thurmond S, Henry D, Oates R, Palmer M, Pries G, Gibson J, Anderson H, Paradkar M, Crane L, Dale J, Carver MB, Wood T, Frisch D, Engler F, Soderlund C, Palmer LE, Tetylman L, Nascimento L, Bastide M de la, Spiegel L, Ware D, O'Shaughnessy A, Dike S, Dedhia N, Preston R, Huang E, Ferraro K, Kuit K, Miller B, Zutavern T, Katzenberger F, Muller S, Balija V, Martienssen RA, Stein L, Minx P, Johnson D, Cordum H, Mardis E, Cheng Z, Jiang J, Wilson R, McCombie WR, Wing RA, Yuan Q, Ouyang S, Liu J, Jones KM, Gansberger K, Moffat K, Hill J, Tsitrin T, Overton L, Bera J, Kim M, Jin S, Tallon L, Ciecko A, Pai G, Aken SV, Utterback T, Reidmuller S, Bormann J, Feldblyum T, Hsiao J, Zismann V, Blunt S, Vazeilles A de, Shaffer T, Koo H, Suh B, Yang Q, Haas B, Peterson J, Pertea M, Volfovsky N, Worman J, White O, Salzberg SL, Fraser CV, Buell CR, Messing J, Song R, Fuks G, Llaca V, Kovchak S, Young S, Bowers JE, Paterson AH, Johns MA, Mao L, Pan H, Dean RA (2003) In-depth view of structure, activity, and evolution of rice chromosome 10. Science (in press)
Zhang HB, Zhao X, Ding X, Paterson AH, Wing RA (1995) Preparation of megabase-size DNA from plant nuclei. Plant J 7:175–184
Acknowledgements
This work was supported by grants from the U.S. Department of Agriculture CREES, the National Science Foundation, and the Department of Energy and postdoctoral fellowship program from Korea Science and Engineering Foundation (KOSEF). Any opinions, findings, and conclusions or recommendations expressed herein are those of the authors and do not necessarily reflect the views of the U.S. Department of Agriculture, the National Science Foundation or the Department of Energy. We thank Monsanto for providing access to their draft sequence data.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Q. Zhang
Rights and permissions
About this article
Cite this article
Yang, TJ., Yu, Y., Nah, G. et al. Construction and utility of 10-kb libraries for efficient clone-gap closure for rice genome sequencing. Theor Appl Genet 107, 652–660 (2003). https://doi.org/10.1007/s00122-003-1302-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00122-003-1302-4