Variant antigen repertoires in Trypanosoma congolense populations and experimental infections can be profiled from deep sequence data using universal protein motifs

  1. Andrew P. Jackson1
  1. 1Department of Infection Biology, Institute of Infection and Global Health, University of Liverpool, Liverpool L3 5RF, United Kingdom;
  2. 2Department of Parasitology, Liverpool School of Tropical Medicine, Liverpool L3 5QA, United Kingdom;
  3. 3Department of Vector Biology, Liverpool School of Tropical Medicine, Liverpool L3 5QA, United Kingdom;
  4. 4International Livestock Research Institute, Nairobi 00100, Kenya;
  5. 5Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom;
  6. 6Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
  • Corresponding author: sara.silva-pereira{at}liverpool.ac.uk
  • Abstract

    African trypanosomes are vector-borne hemoparasites of humans and animals. In the mammal, parasites evade the immune response through antigenic variation. Periodic switching of the variant surface glycoprotein (VSG) coat covering their cell surface allows sequential expansion of serologically distinct parasite clones. Trypanosome genomes contain many hundreds of VSG genes, subject to rapid changes in nucleotide sequence, copy number, and chromosomal position. Thus, analyzing, or even quantifying, VSG diversity over space and time presents an enormous challenge to conventional techniques. Indeed, previous population genomic studies have overlooked this vital aspect of pathogen biology for lack of analytical tools. Here we present a method for analyzing population-scale VSG diversity in Trypanosoma congolense from deep sequencing data. Previously, we suggested that T. congolense VSGs segregate into defined “phylotypes” that do not recombine. In our data set comprising 41 T. congolense genome sequences from across Africa, these phylotypes are universal and exhaustive. Screening sequence contigs with diagnostic protein motifs accurately quantifies relative phylotype frequencies, providing a metric of VSG diversity, called the “variant antigen profile.” We applied our metric to VSG expression in the tsetse fly, showing that certain, rare VSG phylotypes may be preferentially expressed in infective, metacyclic-stage parasites. Hence, variant antigen profiling accurately and rapidly determines the T. congolense VSG gene and transcript repertoire from sequence data, without need for manual curation or highly contiguous sequences. It offers a tractable approach to measuring VSG diversity across strains and during infections, which is imperative to understanding the host–parasite interaction at population and individual scales.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.234146.118.

    • Freely available online through the Genome Research Open Access option.

    • Received January 3, 2018.
    • Accepted July 12, 2018.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server