Ultralow-input single-tube linked-read library method enables short-read second-generation sequencing systems to routinely generate highly accurate and economical long-range sequencing information

  1. Ming Lei7
  1. 1Universal Sequencing Technology Corporation, Carlsbad, California 92011, USA;
  2. 2Bioturing Incorporated, San Diego, California 92121, USA;
  3. 3Faculty of Information Technology, University of Science, Vietnam National University, Ho Chi Minh City, 700 000 Vietnam;
  4. 4Department of Pediatrics, University of California San Diego, La Jolla, California 92161, USA;
  5. 5Center for Microbiome Innovation and Departments of Pediatrics, Bioengineering, and Computer Science and Engineering, University of California San Diego, La Jolla, California 92093, USA;
  6. 6Department of Computer Science and Engineering, University of California San Diego, La Jolla, California 92093, USA;
  7. 7Universal Sequencing Technology Corporation, Canton, Massachusetts 02021, USA
  • Corresponding author: tchen{at}universalsequencing.com
  • Abstract

    Long-range sequencing information is required for haplotype phasing, de novo assembly, and structural variation detection. Current long-read sequencing technologies can provide valuable long-range information but at a high cost with low accuracy and high DNA input requirements. We have developed a single-tube Transposase Enzyme Linked Long-read Sequencing (TELL-seq) technology, which enables a low-cost, high-accuracy, and high-throughput short-read second-generation sequencer to generate over 100 kb of long-range sequencing information with as little as 0.1 ng input material. In a PCR tube, millions of clonally barcoded beads are used to uniquely barcode long DNA molecules in an open bulk reaction without dilution and compartmentation. The barcoded linked-reads are used to successfully assemble genomes ranging from microbes to human. These linked-reads also generate megabase-long phased blocks and provide a cost-effective tool for detecting structural variants in a genome, which are important to identify compound heterozygosity in recessive Mendelian diseases and discover genetic drivers and diagnostic biomarkers in cancers.

    Footnotes

    • Received December 19, 2019.
    • Accepted June 10, 2020.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    | Table of Contents

    Preprint Server