Abstract
For high-throughput sequencing and quantification of immunoglobulin repertoires, most methodologies use RNA. However, output varies enormously between recombined genes due to different promoter strengths and differential activation of lymphocyte subsets, precluding quantitation of recombinants on a per-cell basis. To date, DNA-based approaches have used V gene primer cocktails, with substantial inherent biases. Here, we describe VDJ sequencing (VDJ-seq), which accurately quantitates immunoglobulin diversity at the DNA level in an unbiased manner. This is accomplished with a single primer-extension step using biotinylated J gene primers. By addition of unique molecular identifiers (UMIs) before primer extension, we reliably remove duplicate sequences and correct for sequencing and PCR errors. Furthermore, VDJ-seq captures productive and nonproductive VDJ and DJ recombination events on a per-cell basis. Library preparation takes 3 d, with 2 d of sequencing and 1 d of data processing and analysis.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Accession codes
References
Bolland, D.J. et al. Two mutually exclusive local chromatin states drive efficient V(D)J recombination. Cell Rep. 15, 2475–2487 (2016).
Robinson, W.H. Sequencing the functional antibody repertoire—diagnostic and therapeutic discovery. Nat. Rev. Rheumatol. 11, 171–182 (2015).
Galson, J.D., Pollard, A.J., Trück, J. & Kelly, D.F. Studying the antibody repertoire after vaccination: practical applications. Trends Immunol. 35, 319–331 (2014).
Hou, D., Chen, C., Seely, E.J., Chen, S. & Song, Y. High-throughput sequencing-based immune repertoire study during infectious disease. Front. Immunol. 7, 336 (2016).
Matheson, L.S. et al. Local chromatin features including PU.1 and IKAROS binding and H3K4 methylation shape the repertoire of immunoglobulin kappa genes chosen for V(D)J recombination. Front. Immunol. 8, 1550 (2017).
Wardemann, H. & Busse, C.E. Novel approaches to analyze immunoglobulin repertoires. Trends Immunol. 38, 471–482 (2017).
Georgiou, G. et al. The promise and challenge of high-throughput sequencing of the antibody repertoire. Nat. Biotechnol. 32, 158–168 (2014).
Stubbington, M.J. et al. T cell fate and clonality inference from single-cell transcriptomes. Nat. Methods 13, 329–332 (2016).
DeKosky, B.J. et al. High-throughput sequencing of the paired human immunoglobulin heavy and light chain repertoire. Nat. Biotechnol. 31, 166–169 (2013).
McDaniel, J.R., DeKosky, B.J., Tanno, H., Ellington, A.D. & Georgiou, G. Ultra-high-throughput sequencing of the immune receptor repertoire from millions of lymphocytes. Nat. Protoc. 11, 429–442 (2016).
Turchaninova, M. et al. High-quality full-length immunoglobulin profiling with unique molecular barcoding. Nat. Protoc. 11, 1599–1616 (2016).
Khan, T.A. et al. Accurate and predictive antibody repertoire profiling by molecular amplification fingerprinting. Sci. Adv. 2, e1501371 (2016).
Greiff, V. et al. Systems analysis reveals high genetic and antigen-driven predetermination of antibody repertoires throughout B cell development. Cell Rep. 19, 1467–1478 (2017).
Love, V.A., Lugo, G., Merz, D. & Feeney, A.J. Individual VH promoters vary in strength, but the frequency of rearrangement of those VH genes does not correlate with promoter strength nor enhancer-independence. Mol. Immunol. 37, 29–39 (2000).
Choi, N.M. et al. Deep sequencing of the murine IgH repertoire reveals complex regulation of nonrandom V gene rearrangement frequencies. J. Immunol. Baltim. 191, 2393–2402 (2013).
Kaplinsky, J. et al. Antibody repertoire deep sequencing reveals antigen-independent selection in maturing B cells. Proc. Natl. Acad. Sci. USA 111, E2622–E2629 (2014).
Linnemann, C. et al. High-throughput identification of antigen-specific TCRs by TCR gene capture. Nat. Med. 19, 1534–1541 (2013).
Shugay, M. et al. Towards error-free profiling of immune repertoires. Nat. Methods 11, 653–655 (2014).
Hu, J. et al. Detecting DNA double-stranded breaks in mammalian genomes by linear amplification-mediated high-throughput genome-wide translocation sequencing. Nat. Protoc. 11, 853–871 (2016).
Lin, S.G. et al. Highly sensitive and unbiased approach for elucidating antibody repertoires. Proc. Natl. Acad. Sci. USA 113, 7846–7851 (2016).
Patel, R. et al. Formation of chimeric DNA primer extension products by template switching onto an annealed downstream oligonucleotide. Proc. Natl. Acad. Sci. USA 93, 2969–74 (1996).
Kivioja, T. et al. Counting absolute numbers of molecules using unique molecular identifiers. Nat. Methods 9, 72–74 (2012).
Smith, T.S., Heger, A. & Sudbery, I. UMI-tools: modelling sequencing errors in unique molecular identifiers to improve quantification accuracy. Genome Res. 27, 491–499 (2017).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal 17, 10–12 (2011).
Langmead, B. & Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Zhang, J., Kobert, K., Flouri, T. & Stamatakis, A. PEAR: a fast and accurate Illumina paired-end reAd mergeR. Bioinformatics 30, 614–620 (2014).
Lassmann, T., Frings, O. & Sonnhammer, E.L.L. Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features. Nucleic Acids Res. 37, 858–865 (2009).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Ye, J., Ma, N., Madden, T.L. & Ostell, J.M. IgBLAST: an immunoglobulin variable domain sequence analysis tool. Nucleic Acids Res. 41, W34–W40 (2013).
Heine, G., Sims, G.P., Worm, M., Lipsky, P.E. & Radbruch, A. Isolation of Human B Cell Populations. in Current Protocols in Immunology. Wiley, 2001.
Rumfelt, L.L., Zhou, Y., Rowley, B.M., Shinton, S.A. & Hardy, R.R. Lineage specification and plasticity in CD19 early B cell precursors. J. Exp. Med. 203, 675–687 (2006).
Bolotin, D.A. et al. MiXCR: software for comprehensive adaptive immunity profiling. Nat. Methods 12, 380–381 (2015).
Gaëta, B.A. et al. iHMMune-align: hidden Markov model-based alignment and identification of germline genes in rearranged immunoglobulin gene sequences. Bioinformatics 23, 1580–1587 (2007).
Ralph, D.K. & Matsen, F.A. Consistency of VDJ rearrangement and substitution parameters enables accurate B cell receptor sequence annotation. PLOS Comput. Biol. 12, e1004409 (2016).
Munshaw, S. & Kepler, T.B. SoDA2: a hidden Markov model approach for identification of immunoglobulin rearrangements. Bioinformatics 26, 867–872 (2010).
Alamyar, E., Giudicelli, V., Duroux, P. & Lefranc, M.-P. Antibody V and C domain sequence, structure, and interaction analysis with special reference to IMGT. Methods Mol. Biol. 1131, 337–381 (2014).
Lefranc, M.-P. et al. IMGT, the international ImMunoGeneTics information system. Nucleic Acids Res. 37, D1006–D1012 (2009).
Gupta, N.T. et al. Change-O: a toolkit for analyzing large-scale B cell immunoglobulin repertoire sequencing data. Bioinformatics 31, 3356–3358 (2015).
Deng, C., Daley, T. & Smith, A. Applications of species accumulation curves in large-scale biological data analysis. Quant. Biol. 3, 135–144 (2015).
Yaari, G. & Kleinstein, S.H. Practical guidelines for B-cell receptor repertoire sequencing analysis. Genome Med. 7, 121 (2015).
Greiff, V., Miho, E., Menzel, U. & Reddy, S.T. Bioinformatic and statistical analysis of adaptive immune repertoires. Trends Immunol. 36, 738–749 (2015).
Acknowledgements
We are grateful for invaluable assistance provided by K. Tabbada, Babraham Sequencing Facility; A. Davis, Babraham Flow Facility; and staff at the Babraham Biological Support Unit. This work was supported by the Biotechnology and Biological Sciences Research Council (BBS/E/B/000C0404), including a BBSRC Industrial CASE PhD studentship award to P.C. (1520397).
Author information
Authors and Affiliations
Contributions
The original version of the VDJ-seq assay was devised for mouse Igh by A.L.W. and D.J.B., and was adapted for human Igh and mouse Igδ by L.S.M. The improvements reported here were developed by P.C. and D.J.B. The original Babraham LinkON pipeline was developed by F.K. and S.A., and improvements and add-ons reported here were devised by P.C. Assay QC and supporting analyses were performed by P.C. P.C., D.J.B. and A.E.C. wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
A.L.W., D.J.B., L.S.M. and A.E.C. are named inventors on a patent, “Covering the VDJ-seq technique: Method of identifying VDJ recombination products” (UK Patent Application No. GB1203720.6, filed March 2, 2012; PCT Patent Applic. No. PCT/GB2013/05056, published September 6, 2013). National applications have been filed in Europe, the United States and Japan. US Publication number: 20150031042, publication date: January 29, 2015. The other authors declare no competing interests.
Integrated supplementary information
Supplementary Figure 1 VDJ-seq V gene usage across the mouse IgH locus compared to two other DNA-based methods.
Double width bars highlight V-genes that have been captured by VDJ-seq, but were not present using the other methods. (a) V-J primer cocktail method (Kaplinsky et al.1) compared to VDJ-seq. (b) HTGTS-Rep-seq compared to VDJ-seq. IGHV1-62-2 is missing in the HTGTS-Rep-seq data, but we believe this is because of IGHV1-62-2 and IGHV1-71 having identical sequence and filtering of multi-V gene calls would have removed these. All mice used for data generation were maintained in accordance with Babraham Institute Animal Welfare and Ethical Review Body and Home office rules under Project Licence 80/2529. Appropriate ARRIVE guidelines have been followed.
Supplementary Figure 2 Further comparison of VDJ-seq with the V-J primer cocktail method and HTGTS-Rep-seq.
(a) HTGTS-Rep-seq has a high correlation (r2=0.89-0.93) of V gene usage to our VDJ-seq method when comparing similar starting material. (b) Next, we chose to compare the number of clonotypes captured. We chose to compare clonotypes due to the high duplicate rate found in HTGTS-Rep-seq data, which we interpreted as PCR duplicates, as naïve spleen B-cells should not have high clonality. We found that VDJ-seq captured 2-5 times more clonotypes than HTGTS-Rep-seq. (c) Finally, we compared our method to using the classical cocktail of V-J primers from Kaplinsky et al.1. The correlation between the V gene usage from our method and using the V-J primer cocktail method is low (r2=0.58). All mice used for data generation were maintained in accordance with Babraham Institute Animal Welfare and Ethical Review Body and Home office rules under Project Licence 80/2529. Appropriate ARRIVE reporting guidelines have been followed.
Supplementary Figure 3 Impact of chew back on J gene mispriming correction.
The seqlogos illustrate the extent of chew back experienced by the J genes from NHEJ. Bases highlighted in bold capitals are the 5bp used for mispriming correction. It is possible to use these 5bp for mispriming correction as they are either after the primer sequence in the case of J2 and J3 or the end of the original primer sequence in the case of J1 and J4 remains after other J primers misprime. The chew back is most prominent at the site of cleavage (at the heptamer sequence (blue) of the recombination signal sequence (RSS)) and decreases in effect as a function of distance. This is illustrated by the gradual transition of J1 and J4. J2 and J3 have a much sharper transition due to the proximity of the 5bp to the cleavage site (5bp for J2 and J3 compared to 13bp for J1 and 11bp for J4). This means the 5bp used for mispriming correction will be conserved less often for J2 and J3 then they will be for J1 and J4. The J gene sequence underneath the seqlogo's illustrates the position of the J primers (red) within the J gene along with the 5bp used for mispriming correction. All mice used for data generation were maintained in accordance with Babraham Institute Animal Welfare and Ethical Review Body and Home office rules under Project Licence 80/2529. Appropriate ARRIVE reporting guidelines have been followed.
1. Kaplinsky, J. et al. Antibody repertoire deep sequencing reveals antigen-independent selection in maturing B cells. Proc. Natl. Acad. Sci. 111, E2622–E2629 (2014).
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–3 and Supplementary Table 1. (PDF 952 kb)
Rights and permissions
About this article
Cite this article
Chovanec, P., Bolland, D., Matheson, L. et al. Unbiased quantification of immunoglobulin diversity at the DNA level with VDJ-seq. Nat Protoc 13, 1232–1252 (2018). https://doi.org/10.1038/nprot.2018.021
Published:
Issue Date:
DOI: https://doi.org/10.1038/nprot.2018.021
This article is cited by
-
Adaptive immune receptor repertoire analysis
Nature Reviews Methods Primers (2024)
-
Igh and Igk loci use different folding principles for V gene recombination due to distinct chromosomal architectures of pro-B and pre-B cells
Nature Communications (2023)
-
Single-cell gene and isoform expression analysis reveals signatures of ageing in haematopoietic stem and progenitor cells
Communications Biology (2023)
-
Single-cell RNA sequencing to identify cellular heterogeneity and targets in cardiovascular diseases: from bench to bedside
Basic Research in Cardiology (2023)
-
Integrated single-cell analysis revealed immune dynamics during Ad5-nCoV immunization
Cell Discovery (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.