Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Protocol
  • Published:

Unbiased quantification of immunoglobulin diversity at the DNA level with VDJ-seq

Abstract

For high-throughput sequencing and quantification of immunoglobulin repertoires, most methodologies use RNA. However, output varies enormously between recombined genes due to different promoter strengths and differential activation of lymphocyte subsets, precluding quantitation of recombinants on a per-cell basis. To date, DNA-based approaches have used V gene primer cocktails, with substantial inherent biases. Here, we describe VDJ sequencing (VDJ-seq), which accurately quantitates immunoglobulin diversity at the DNA level in an unbiased manner. This is accomplished with a single primer-extension step using biotinylated J gene primers. By addition of unique molecular identifiers (UMIs) before primer extension, we reliably remove duplicate sequences and correct for sequencing and PCR errors. Furthermore, VDJ-seq captures productive and nonproductive VDJ and DJ recombination events on a per-cell basis. Library preparation takes 3 d, with 2 d of sequencing and 1 d of data processing and analysis.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: A systematic overview of VDJ-seq.
Figure 2: Overview of the BabrahamLinkON pipeline.
Figure 3: Reproducibility of the VDJ-seq technique.
Figure 4: Experimental design considerations.
Figure 5: Example fragment size distributions of a VDJ-seq library.
Figure 6: Anticipated results from a successful VDJ-seq library.

Similar content being viewed by others

Accession codes

Primary accessions

Gene Expression Omnibus

References

  1. Bolland, D.J. et al. Two mutually exclusive local chromatin states drive efficient V(D)J recombination. Cell Rep. 15, 2475–2487 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Robinson, W.H. Sequencing the functional antibody repertoire—diagnostic and therapeutic discovery. Nat. Rev. Rheumatol. 11, 171–182 (2015).

    Article  CAS  PubMed  Google Scholar 

  3. Galson, J.D., Pollard, A.J., Trück, J. & Kelly, D.F. Studying the antibody repertoire after vaccination: practical applications. Trends Immunol. 35, 319–331 (2014).

    Article  CAS  PubMed  Google Scholar 

  4. Hou, D., Chen, C., Seely, E.J., Chen, S. & Song, Y. High-throughput sequencing-based immune repertoire study during infectious disease. Front. Immunol. 7, 336 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  5. Matheson, L.S. et al. Local chromatin features including PU.1 and IKAROS binding and H3K4 methylation shape the repertoire of immunoglobulin kappa genes chosen for V(D)J recombination. Front. Immunol. 8, 1550 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  6. Wardemann, H. & Busse, C.E. Novel approaches to analyze immunoglobulin repertoires. Trends Immunol. 38, 471–482 (2017).

    Article  CAS  PubMed  Google Scholar 

  7. Georgiou, G. et al. The promise and challenge of high-throughput sequencing of the antibody repertoire. Nat. Biotechnol. 32, 158–168 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Stubbington, M.J. et al. T cell fate and clonality inference from single-cell transcriptomes. Nat. Methods 13, 329–332 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  9. DeKosky, B.J. et al. High-throughput sequencing of the paired human immunoglobulin heavy and light chain repertoire. Nat. Biotechnol. 31, 166–169 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. McDaniel, J.R., DeKosky, B.J., Tanno, H., Ellington, A.D. & Georgiou, G. Ultra-high-throughput sequencing of the immune receptor repertoire from millions of lymphocytes. Nat. Protoc. 11, 429–442 (2016).

    Article  CAS  PubMed  Google Scholar 

  11. Turchaninova, M. et al. High-quality full-length immunoglobulin profiling with unique molecular barcoding. Nat. Protoc. 11, 1599–1616 (2016).

    Article  CAS  PubMed  Google Scholar 

  12. Khan, T.A. et al. Accurate and predictive antibody repertoire profiling by molecular amplification fingerprinting. Sci. Adv. 2, e1501371 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  13. Greiff, V. et al. Systems analysis reveals high genetic and antigen-driven predetermination of antibody repertoires throughout B cell development. Cell Rep. 19, 1467–1478 (2017).

    Article  CAS  PubMed  Google Scholar 

  14. Love, V.A., Lugo, G., Merz, D. & Feeney, A.J. Individual VH promoters vary in strength, but the frequency of rearrangement of those VH genes does not correlate with promoter strength nor enhancer-independence. Mol. Immunol. 37, 29–39 (2000).

    Article  CAS  PubMed  Google Scholar 

  15. Choi, N.M. et al. Deep sequencing of the murine IgH repertoire reveals complex regulation of nonrandom V gene rearrangement frequencies. J. Immunol. Baltim. 191, 2393–2402 (2013).

    Article  CAS  Google Scholar 

  16. Kaplinsky, J. et al. Antibody repertoire deep sequencing reveals antigen-independent selection in maturing B cells. Proc. Natl. Acad. Sci. USA 111, E2622–E2629 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Linnemann, C. et al. High-throughput identification of antigen-specific TCRs by TCR gene capture. Nat. Med. 19, 1534–1541 (2013).

    Article  CAS  PubMed  Google Scholar 

  18. Shugay, M. et al. Towards error-free profiling of immune repertoires. Nat. Methods 11, 653–655 (2014).

    Article  CAS  PubMed  Google Scholar 

  19. Hu, J. et al. Detecting DNA double-stranded breaks in mammalian genomes by linear amplification-mediated high-throughput genome-wide translocation sequencing. Nat. Protoc. 11, 853–871 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Lin, S.G. et al. Highly sensitive and unbiased approach for elucidating antibody repertoires. Proc. Natl. Acad. Sci. USA 113, 7846–7851 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Patel, R. et al. Formation of chimeric DNA primer extension products by template switching onto an annealed downstream oligonucleotide. Proc. Natl. Acad. Sci. USA 93, 2969–74 (1996).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Kivioja, T. et al. Counting absolute numbers of molecules using unique molecular identifiers. Nat. Methods 9, 72–74 (2012).

    Article  CAS  Google Scholar 

  23. Smith, T.S., Heger, A. & Sudbery, I. UMI-tools: modelling sequencing errors in unique molecular identifiers to improve quantification accuracy. Genome Res. 27, 491–499 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal 17, 10–12 (2011).

    Article  Google Scholar 

  25. Langmead, B. & Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Zhang, J., Kobert, K., Flouri, T. & Stamatakis, A. PEAR: a fast and accurate Illumina paired-end reAd mergeR. Bioinformatics 30, 614–620 (2014).

    Article  CAS  PubMed  Google Scholar 

  27. Lassmann, T., Frings, O. & Sonnhammer, E.L.L. Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features. Nucleic Acids Res. 37, 858–865 (2009).

    Article  CAS  PubMed  Google Scholar 

  28. Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).

    Article  PubMed  PubMed Central  Google Scholar 

  29. Ye, J., Ma, N., Madden, T.L. & Ostell, J.M. IgBLAST: an immunoglobulin variable domain sequence analysis tool. Nucleic Acids Res. 41, W34–W40 (2013).

    Article  PubMed  PubMed Central  Google Scholar 

  30. Heine, G., Sims, G.P., Worm, M., Lipsky, P.E. & Radbruch, A. Isolation of Human B Cell Populations. in Current Protocols in Immunology. Wiley, 2001.

  31. Rumfelt, L.L., Zhou, Y., Rowley, B.M., Shinton, S.A. & Hardy, R.R. Lineage specification and plasticity in CD19 early B cell precursors. J. Exp. Med. 203, 675–687 (2006).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Bolotin, D.A. et al. MiXCR: software for comprehensive adaptive immunity profiling. Nat. Methods 12, 380–381 (2015).

    Article  CAS  PubMed  Google Scholar 

  33. Gaëta, B.A. et al. iHMMune-align: hidden Markov model-based alignment and identification of germline genes in rearranged immunoglobulin gene sequences. Bioinformatics 23, 1580–1587 (2007).

    Article  PubMed  Google Scholar 

  34. Ralph, D.K. & Matsen, F.A. Consistency of VDJ rearrangement and substitution parameters enables accurate B cell receptor sequence annotation. PLOS Comput. Biol. 12, e1004409 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  35. Munshaw, S. & Kepler, T.B. SoDA2: a hidden Markov model approach for identification of immunoglobulin rearrangements. Bioinformatics 26, 867–872 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Alamyar, E., Giudicelli, V., Duroux, P. & Lefranc, M.-P. Antibody V and C domain sequence, structure, and interaction analysis with special reference to IMGT. Methods Mol. Biol. 1131, 337–381 (2014).

    Article  CAS  PubMed  Google Scholar 

  37. Lefranc, M.-P. et al. IMGT, the international ImMunoGeneTics information system. Nucleic Acids Res. 37, D1006–D1012 (2009).

    Article  CAS  PubMed  Google Scholar 

  38. Gupta, N.T. et al. Change-O: a toolkit for analyzing large-scale B cell immunoglobulin repertoire sequencing data. Bioinformatics 31, 3356–3358 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Deng, C., Daley, T. & Smith, A. Applications of species accumulation curves in large-scale biological data analysis. Quant. Biol. 3, 135–144 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Yaari, G. & Kleinstein, S.H. Practical guidelines for B-cell receptor repertoire sequencing analysis. Genome Med. 7, 121 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  41. Greiff, V., Miho, E., Menzel, U. & Reddy, S.T. Bioinformatic and statistical analysis of adaptive immune repertoires. Trends Immunol. 36, 738–749 (2015).

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We are grateful for invaluable assistance provided by K. Tabbada, Babraham Sequencing Facility; A. Davis, Babraham Flow Facility; and staff at the Babraham Biological Support Unit. This work was supported by the Biotechnology and Biological Sciences Research Council (BBS/E/B/000C0404), including a BBSRC Industrial CASE PhD studentship award to P.C. (1520397).

Author information

Authors and Affiliations

Authors

Contributions

The original version of the VDJ-seq assay was devised for mouse Igh by A.L.W. and D.J.B., and was adapted for human Igh and mouse Igδ by L.S.M. The improvements reported here were developed by P.C. and D.J.B. The original Babraham LinkON pipeline was developed by F.K. and S.A., and improvements and add-ons reported here were devised by P.C. Assay QC and supporting analyses were performed by P.C. P.C., D.J.B. and A.E.C. wrote the manuscript.

Corresponding author

Correspondence to Anne E Corcoran.

Ethics declarations

Competing interests

A.L.W., D.J.B., L.S.M. and A.E.C. are named inventors on a patent, “Covering the VDJ-seq technique: Method of identifying VDJ recombination products” (UK Patent Application No. GB1203720.6, filed March 2, 2012; PCT Patent Applic. No. PCT/GB2013/05056, published September 6, 2013). National applications have been filed in Europe, the United States and Japan. US Publication number: 20150031042, publication date: January 29, 2015. The other authors declare no competing interests.

Integrated supplementary information

Supplementary Figure 1 VDJ-seq V gene usage across the mouse IgH locus compared to two other DNA-based methods.

Double width bars highlight V-genes that have been captured by VDJ-seq, but were not present using the other methods. (a) V-J primer cocktail method (Kaplinsky et al.1) compared to VDJ-seq. (b) HTGTS-Rep-seq compared to VDJ-seq. IGHV1-62-2 is missing in the HTGTS-Rep-seq data, but we believe this is because of IGHV1-62-2 and IGHV1-71 having identical sequence and filtering of multi-V gene calls would have removed these. All mice used for data generation were maintained in accordance with Babraham Institute Animal Welfare and Ethical Review Body and Home office rules under Project Licence 80/2529. Appropriate ARRIVE guidelines have been followed.

Supplementary Figure 2 Further comparison of VDJ-seq with the V-J primer cocktail method and HTGTS-Rep-seq.

(a) HTGTS-Rep-seq has a high correlation (r2=0.89-0.93) of V gene usage to our VDJ-seq method when comparing similar starting material. (b) Next, we chose to compare the number of clonotypes captured. We chose to compare clonotypes due to the high duplicate rate found in HTGTS-Rep-seq data, which we interpreted as PCR duplicates, as naïve spleen B-cells should not have high clonality. We found that VDJ-seq captured 2-5 times more clonotypes than HTGTS-Rep-seq. (c) Finally, we compared our method to using the classical cocktail of V-J primers from Kaplinsky et al.1. The correlation between the V gene usage from our method and using the V-J primer cocktail method is low (r2=0.58). All mice used for data generation were maintained in accordance with Babraham Institute Animal Welfare and Ethical Review Body and Home office rules under Project Licence 80/2529. Appropriate ARRIVE reporting guidelines have been followed.

Supplementary Figure 3 Impact of chew back on J gene mispriming correction.

The seqlogos illustrate the extent of chew back experienced by the J genes from NHEJ. Bases highlighted in bold capitals are the 5bp used for mispriming correction. It is possible to use these 5bp for mispriming correction as they are either after the primer sequence in the case of J2 and J3 or the end of the original primer sequence in the case of J1 and J4 remains after other J primers misprime. The chew back is most prominent at the site of cleavage (at the heptamer sequence (blue) of the recombination signal sequence (RSS)) and decreases in effect as a function of distance. This is illustrated by the gradual transition of J1 and J4. J2 and J3 have a much sharper transition due to the proximity of the 5bp to the cleavage site (5bp for J2 and J3 compared to 13bp for J1 and 11bp for J4). This means the 5bp used for mispriming correction will be conserved less often for J2 and J3 then they will be for J1 and J4. The J gene sequence underneath the seqlogo's illustrates the position of the J primers (red) within the J gene along with the 5bp used for mispriming correction. All mice used for data generation were maintained in accordance with Babraham Institute Animal Welfare and Ethical Review Body and Home office rules under Project Licence 80/2529. Appropriate ARRIVE reporting guidelines have been followed.

1. Kaplinsky, J. et al. Antibody repertoire deep sequencing reveals antigen-independent selection in maturing B cells. Proc. Natl. Acad. Sci. 111, E2622–E2629 (2014).

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–3 and Supplementary Table 1. (PDF 952 kb)

Life Science Reporting Summary (PDF 132 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chovanec, P., Bolland, D., Matheson, L. et al. Unbiased quantification of immunoglobulin diversity at the DNA level with VDJ-seq. Nat Protoc 13, 1232–1252 (2018). https://doi.org/10.1038/nprot.2018.021

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nprot.2018.021

This article is cited by

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing