Portable and cost-effective genetic detection and characterization of Plasmodium falciparum hrp2 using the MinION sequencer

doi:10.21203/rs.3.rs-1836842/v1

Article

This work is licensed under a CC BY 4.0 License

Journal Publication

published 18 Feb, 2023

Read the published version in Scientific Reports →

The prevalence of Plasmodium falciparum hrp2 (pfhrp2)-deleted parasites threatens the efficacy of the most used and sensitive malaria rapid diagnostic tests and highlights the need for continued surveillance for this gene deletion. While PCR methods are generally adequate for determining pfhrp2 presence or absence, they offer a limited view of its genetic diversity. Here, we present a portable sequencing method using the MinION. Pfhrp2 amplicons were generated from individual samples, barcoded, and pooled for sequencing. To overcome potential crosstalk between barcodes, we implemented a coverage-based threshold for pfhrp2 deletion confirmation. Amino acid repeat types were then counted and visualized with custom Python scripts following de novo assembly. We evaluated this assay using well-characterized reference strains and 152 field isolates with and without pfhrp2 deletions, of which 38 were also sequenced on the PacBio platform to provide a standard for comparison. Of 152 field samples, 93 surpassed the positivity threshold, and of those samples, 62/93 had a dominant pfhrp2 repeat type. PacBio-sequenced samples with a dominant repeat-type profile from the MinION sequencing data matched the PacBio profile. This field-deployable assay may supplement the World Health Organization’s existing protocol for surveilling pfhrp2 deletions and facilitate timely implementation of diagnostic policy change when needed.

Malaria rapid diagnostic tests (RDTs) offer a relatively cheap, fast, and simple alternative to traditional microscopy. The use of RDTs led to an increase in testing of suspected malaria cases from 36% to 84% in sub-Saharan Africa from 2013 to 2018 (Aidoo & Incardona, 2022). RDTs detecting the histidine-rich protein 2 (HRP2) are more sensitive and generally more heat-stable than alternative tests for Plasmodium falciparum (World Health Organization, 2020). P. falciparum accounts for over 90% of malaria cases in sub-Saharan Africa, the region with the highest malaria burden. HRP2-detecting RDTs thus constitute the majority of RDTs sold each year (World Health Organization, 2021). However, in 2010, parasites lacking pfhrp2 (the gene encoding HRP2) were identified in Peru (Gamboa, et al., 2010), indicating a threat to the efficacy of HRP2-based RDTs. Since then, numerous regional surveys have assessed the prevalence of pfhrp2 and pfhrp3 deletions, particularly in regions that depend heavily on HRP2 RDTs for malaria diagnosis. HRP3 is a paralog of HRP2 and can cross-react with HRP2-based RDTs at high parasitemia levels of > 1000 parasites/mL (Kong, et al., 2021). The World Health Organization (WHO) now includes pfhrp2/3 deletions in its list of biological threats to malaria control and tracks regional deletion prevalence surveys in its global threat map (Global Malaria Programme, 2022). In addition, the WHO provided the “Master protocol for surveillance of pfhrp2/3 deletions and biobanking to support future research” as a formal mechanism by which municipalities can assess the local prevalence of pfhrp2/3-deleted P. falciparum (World Health Organization, 2020). If this prevalence surpasses 5% in the population of clinical cases studied, replacing the HRP2 RDT is recommended.
Simulation and empirical studies have both suggested that while the use of HRP2 RDTs cannot be solely responsible for the spread of pfhrp2 deletions, their continued use does exert some selective pressure in favor of deletions (as they mask malaria diagnosis and therefore limit access to or speed of treatment) and would ultimately lead to the deletion’s fixation in the population (Feleke, et al., 2021; Chaudhry, et al., 2022). An immediate global transition away from HRP2-based RDTs, however, is neither logistically feasible nor necessary in regions where pfhrp2 deletions have not yet emerged. Public health decisions regarding the continuation or cessation of HRP2-based RDT use must be made at the regional level in response to regional prevalence.

Methods used to assess pfhrp2 deletion and diversity have included antigen detection (Rogier, et al., 2022), PCR (Abdallah, et al., 2015; Akinyi, et al., 2013), amplicon sequencing (Nderu, et al., 2019), high-throughput whole genome sequencing, and high-throughput targeted deep sequencing (Feleke, et al., 2021). In many studies two or more of these methods are used in conjunction. While PCR methods may allow for indirect forms of genotyping, sequencing methods provide direct access to nucleotide content. However, there are numerous limitations to overcome when applying sequencing methods to pfhrp2 deletion and diversity surveillance.

Sanger sequencing, while not as costly as its next-generation counterparts, is a low-throughput technology that can only yield the dominant sequence in what may be a polyclonal infection. Next-generation sequencing on Illumina platforms addresses this issue but is subject to biases for sequences with GC-content outside of the 45–65% range (Browne, et al., 2020) and produces short reads (75 to 300 bp). Exon 1 and the amino acid (AA) repeat region of exon 2 sit at the cusp of this range at 45% GC-content, and the pfhrp2 gene has long stretches of AA repeat regions, which complicates assembly of the approximately 1,000 base pair gene from short reads. Targeted molecular inversion probe (MIP) deep sequencing supported by selective whole genome sequencing on the Illumina platform was used to reconstruct genomic windows surrounding pfhrp2 and pfhrp3 to identify deletion breakpoints, though this study did not provide repeat-type profiling (the dominant measure of pfhrp2 variation currently), and the genotyping resolution is unclear (Feleke, et al., 2021). An intuitive solution to the pitfalls of short-read sequencing for reconstructing genotypes at the nucleotide level is long-read sequencing as provided by Oxford Nanopore and PacBio sequencing platforms. In addition, Sanger, Illumina, and PacBio sequencers require generous lab space, large initial investment, and on-site expertise for device maintenance and usage. These sequencing platforms can thus be sparse in malaria-endemic countries, and the necessary sequencing reagents difficult to procure.

The WHO pfhrp2/3 deletion surveillance protocol currently requires dried blood spots from false-negative RDTs to be transported to a secondary facility after an appropriate sample size has been collected (World Health Organization, 2020), and the extent of the analysis depends on the resources of that facility. This step adds a potentially costly delay to the discovery of the regional pfhrp2/3 deletion prevalence and limits the timeliness and utility of the resulting data if the resources of the secondary facility are limited.

Here, we present a portable long-read sequencing assay for assessing pfhrp2 deletion and diversity on the MinION sequencing device. Long-read sequencing of amplicon reads largely mitigates issues surrounding de novo assembly that arise from short-read sequencing, and the MinION platform produces high data volumes which are roughly scalable by the researcher in their choice of flow cell and sequencing run duration. The capital investment is minimal in comparison to other sequencing platforms. Starter packs for the MinION Mk1B and Mk1C (a self-contained sequencing and computing device that eliminates the Mk1B’s need for a laptop) are 1,000 USD and 4,500 USD respectively. Although the assay involves MinION-specific library preparation and flow cell loading protocols, it eliminates the need for an expert to be present at each site: the operator can travel with the sequencing device, since both the Mk1B and Mk1C can be easily transported in a backpack. This assay builds on a previously reported one-step PCR assay for pfhrp2 (Jones, et al., 2020) and can be easily incorporated into the WHO surveillance protocol, with the potential of streamlining pfhrp2 deletion surveillance efforts in the future.

2.1 Samples

The pfhrp2 MinION assay was developed using well-characterized reference isolates cultured at the US Centers for Disease Control and Prevention (CDC). 7G8 (Brazil), 3D7 (suspected origin Africa), HB3 (Honduras), D6 (Sierra Leone), and FC27 (Papua New Guinea), which express pfhrp2, were used as proxies for positive samples, and Dd2 (Indo-China), which lacks pfhrp2, was used as a negative control.

The assay was evaluated using three additional sample sets. The first set was a collection of cultured control isolates from the Malaria Research and Reference Reagent Resource Center (MR4) (Wu, et al., 2001) including 7G8, 3D7, HB3, FC27, and Dd2. The MR4 set was utilized for final quality control experiments prior to sequencing field samples. Remaining clinical isolates from Sub-Saharan Africa therapeutic efficacy studies (TES) that were used for opportunistic pfhrp2 surveillance previously (McCaffery, et al., 2021; Rogier, et al., 2022) were also used. Additionally, numerous domestic and international clinical isolates were utilized. Domestic samples represent suspected cases of imported malaria from travelers seeking care at a U.S. medical facility. Whole blood samples were obtained from U.S. domestic, imported malaria cases from the CDC domestic malaria surveillance network and tested in accordance with protocol 2017-309 approved by the Office of the Associate Director of Science, Center for Global Health, Centers for Disease Control and Prevention as a surveillance activity. Informed consent was obtained from all participants. These samples were previously used to evaluate the one-step pfhrp2 PCR protocol (Jones, et al., 2020). A subset of the clinical samples was selected to perform PacBio sequencing against which to compare the resulting pfhrp2 types of this assay. This subset of field samples will be referred to as the PacBio set from this point forward. See Supplementary Table 1 for field sample metadata.

2.2 One-step PCR for pfhrp2

Here, we utilized the one-step PCR protocol for pfhrp2 (Jones, et al., 2020). A 50 µL reaction was prepared consisting of 0.126 µM each of the one-step pfhrp2 forward and reverse primers (Supplementary Table 2), 1× Q5 reaction buffer (New England Biolabs, MA, USA), 0.2 mM dNTPs (New England Biolabs, MA, USA), 0.02 U/µL Q5 high-fidelity polymerase (New England Biolabs, MA, USA), and 5 µL of DNA template. Each reaction batch included a positive control (7G8 or 3D7), negative control (Dd2), and nuclease-free water negative control. PCR conditions were an initial denaturation at 98°C for 3 minutes (mins), a cycle of (98°C for 30 seconds (s), 60°C for 90 s, and 68°C for 2 mins) repeated 30 times, and a final elongation at 68°C for 5 mins. The PCR product was visualized on a 1.0% TBE gel.
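As a worked illustration of the reaction setup above, the following minimal sketch converts the stated final concentrations into per-reaction pipetting volumes and sums the programmed cycling time. The final concentrations, reaction volume, template volume, and cycling steps come from the text; the stock concentrations are assumptions chosen for illustration and should be replaced with the values of the reagents actually in hand.

```python
# Final concentrations and volumes are taken from the text; the stock
# concentrations are ASSUMPTIONS for illustration only.
REACTION_UL = 50.0
TEMPLATE_UL = 5.0

# name: (assumed stock concentration, final concentration), same units per pair
COMPONENTS = {
    "Q5 reaction buffer (x)": (5.0, 1.0),     # assumed 5x stock
    "dNTPs (mM)": (10.0, 0.2),                # assumed 10 mM stock
    "forward primer (uM)": (10.0, 0.126),     # assumed 10 uM stock
    "reverse primer (uM)": (10.0, 0.126),     # assumed 10 uM stock
    "Q5 polymerase (U/uL)": (2.0, 0.02),      # assumed 2 U/uL stock
}


def reaction_volumes(total_ul=REACTION_UL, template_ul=TEMPLATE_UL):
    """Return per-reaction volumes in uL using C1 * V1 = C2 * V2."""
    volumes = {name: final * total_ul / stock
               for name, (stock, final) in COMPONENTS.items()}
    volumes["DNA template"] = template_ul
    # Water makes up whatever volume the other components leave over.
    volumes["nuclease-free water"] = total_ul - sum(volumes.values())
    return volumes


def cycling_minutes(initial=3.0, cycles=30, per_cycle=(0.5, 1.5, 2.0), final=5.0):
    """Total programmed hold time in minutes, ignoring ramp times."""
    return initial + cycles * sum(per_cycle) + final


if __name__ == "__main__":
    for name, ul in reaction_volumes().items():
        print(f"{name}: {ul:.2f} uL")
    print(f"programmed cycling time: {cycling_minutes():.0f} min")
```

Under these assumed stocks the programmed hold time is 128 minutes before ramping; the same function gives the barcoding PCR time, since only the annealing temperature differs between the two programs.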

2.3 Barcoding primer design

We designed barcoding primers based on the PCR Barcoding (96) Amplicons (PBC096) kit and protocol from Oxford Nanopore Technologies (ONT) (Oxford, UK), customized for pfhrp2 (Supplementary Table 2). Primers for barcode sequences 1–60 were assessed in silico for self- and cross-dimers. Twenty-eight barcoding primers were suitable (Supplementary Table 2). We utilized this target-specific barcoding primer method instead of ONT’s formal PBC096 protocol due to pfhrp2-specific issues we encountered with the kit (see Supplementary Information, Supplementary Figs. 1 and 2).
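The in silico screen described above can be approximated with a very crude 3'-end complementarity check. The sketch below is not the tool used in this work (which is not named here); it only flags primer pairs where the last k bases of one primer could base-pair somewhere on the other. The primer names and sequences are made up for illustration, and the window size k = 4 is an arbitrary assumption.

```python
def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def three_prime_anneals(primer, target, k=4):
    """True if the last k bases of `primer` can base-pair somewhere on `target`."""
    return reverse_complement(primer[-k:]) in target


def flag_dimers(primers, k=4):
    """Return (name_a, name_b) pairs whose 3' ends may anneal.

    A pair with name_a == name_b marks a potential self-dimer.
    """
    names = sorted(primers)
    flagged = []
    for i, a in enumerate(names):
        for b in names[i:]:
            if (three_prime_anneals(primers[a], primers[b], k)
                    or three_prime_anneals(primers[b], primers[a], k)):
                flagged.append((a, b))
    return flagged


if __name__ == "__main__":
    # Hypothetical sequences, not the primers from Supplementary Table 2.
    demo = {"demo_1": "GATTACAGGATCC", "demo_2": "AAAAAAAAAAAA"}
    print(flag_dimers(demo))
```

A dedicated primer analysis tool that models melting temperature and free energy would be preferable for real designs; this check only illustrates the kind of pairwise comparison involved.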

2.4 Barcoding PCR

Barcodes were assigned to samples such that no two samples in a sequencing pool shared a barcode. The barcoding reaction was similar to the one-step pfhrp2 PCR reaction, except that each sample received 0.126 µM each of the forward and reverse barcoding primer assigned to it, and 10 µL of pfhrp2 amplicon from the one-step reaction was added. PCR conditions were an initial denaturation at 98°C for 3 mins, a cycle of (98°C for 30 s, 69°C for 90 s, 68°C for 2 mins) repeated 30 times, and a final elongation at 68°C for 5 mins. Each reaction batch included the positive control (7G8 or 3D7), negative control (Dd2), and nuclease-free water negative control from the previous reaction. The PCR product was visualized on a 1.0% TBE gel.
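The barcode assignment rule (no barcode used twice within a pool) can be sketched as follows. This is an illustrative helper, not the procedure used in this work; the function name, the sample identifiers, and the pool size are assumptions, and the barcode list is assumed to contain no duplicates.

```python
def assign_barcodes(sample_ids, barcodes, pool_size):
    """Split samples into pools and give each sample a barcode unique within its pool.

    Returns a list of pools; each pool is a dict {sample_id: barcode}.
    """
    if pool_size < 1:
        raise ValueError("pool_size must be at least 1")
    if pool_size > len(barcodes):
        raise ValueError("pool_size cannot exceed the number of available barcodes")
    pools = []
    for start in range(0, len(sample_ids), pool_size):
        chunk = sample_ids[start:start + pool_size]
        # zip pairs each sample in this pool with a different barcode.
        pools.append(dict(zip(chunk, barcodes)))
    return pools


if __name__ == "__main__":
    samples = [f"sample_{i}" for i in range(1, 8)]  # hypothetical sample IDs
    for n, pool in enumerate(assign_barcodes(samples, ["bc01", "bc02", "bc12"], 3), 1):
        print(n, pool)
```

In practice each pool would also reserve barcodes for the positive, negative, and water controls, which this sketch leaves to the caller.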

2.5 Sample pooling and library preparation

Samples were quantified on a Qubit 4 Fluorometer with the dsDNA High Sensitivity assay (Thermo Fisher Scientific, MA, USA) following the barcoding PCR and prior to pooling, using 1 µL of product. These measurements were used for experiments with normalized pools (Supplementary Information).

Pools were then purified using AMPure XP beads (Beckman Coulter, IN, USA). The beads were added to each pool at a ratio of 0.5. Library preparation for MinION sequencing was performed according to the PCR Barcoding (96) Amplicons ONT protocol sections “DNA repair and end-prep” and “Adapter ligation and sequencing cleanup”, with reagents from the ONT ligation sequencing kit (SQK-LSK109) (Oxford, UK), NEBNext FFPE DNA Repair Mix (New England Biolabs, MA, USA), NEBNext Ultra II End repair / dA-tailing Module (New England Biolabs, MA, USA), and NEBNext Quick Ligation Module (New England Biolabs, MA, USA). One microliter from the final eluate of each of these two steps was taken for quantitation on the Qubit (Thermo Fisher Scientific, MA, USA).
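For normalized pools, the Qubit readings translate into pipetting volumes by simple division. The sketch below assumes that normalization means equal DNA mass per sample (reasonable when all amplicons have similar length) and that the bead ratio of 0.5 is a bead-to-pool volume ratio; both are assumptions, as are the example concentrations and the 100 ng target.

```python
def equal_mass_volumes(conc_ng_per_ul, target_ng):
    """Volume (uL) of each barcoded product that contributes `target_ng` of DNA."""
    volumes = {}
    for sample, conc in conc_ng_per_ul.items():
        if conc <= 0:
            raise ValueError(f"{sample}: concentration must be positive")
        volumes[sample] = target_ng / conc
    return volumes


def bead_volume(pool_volume_ul, ratio=0.5):
    """Bead volume (uL) for a given bead-to-pool volume ratio."""
    return ratio * pool_volume_ul


if __name__ == "__main__":
    # Hypothetical Qubit readings in ng/uL.
    readings = {"7G8": 20.0, "3D7": 40.0, "HB3": 10.0}
    volumes = equal_mass_volumes(readings, target_ng=100.0)
    pool_ul = sum(volumes.values())
    print(volumes, pool_ul, bead_volume(pool_ul))
```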

2.6 MinION Sequencing

Sequencing was performed on MinION Mk1B and Mk1C devices on either standard or Flongle R.9 flow cells (Oxford Nanopore Technologies, Oxford, UK). For standard flow cells the Flow Cell Priming Kit was used, and for Flongle flow cells the Flongle Sequencing Expansion Kit was used. Flow cell priming and loading were performed according to manufacturer instructions. In ONT’s MinKNOW software, barcoding was turned on, the kit was set to EXP-PBC096, and the minimum barcoding quality was set to 60. The minimum read quality score was set to 7, and no minimum read length filter was set. Information on sequencing run lengths can be found in the Supplementary Information (Supplementary Table 3).

2.7 PacBio sequencing

Amplicons from the one-step pfhrp2 reaction (section 2.2) were purified with AMPure XP beads (Beckman Coulter, IN, USA). Libraries were prepared using the 5kb PacBio protocol with the DNA preparation kit (Pacific Biosciences, CA, USA). Samples were pooled together with 100 ng per sample and 8 samples per pool. Finished libraries were bound to P6 polymerase and sequenced on the PacBio RSII with C4 chemistry with one pool per SMRTcell.

2.8 Analytical pipeline

Quality filtered reads were aligned to exon 1 and the repeat region of exon 2 of the pfhrp2 reference sequence (PF3D7_0831800) with minimap2 (v. 2.21-r1071) using its “map-ont” option (Li, 2018; Li, 2021). The SAM file was sorted and converted to a BAM file with samtools sort (v. 1.13), and the BAM file was converted to a FASTQ file using samtools fastq (Li, et al., 2009). Mapping statistics were calculated using Qualimap (v. 2.2.2) (Okonechnikov, et al., 2016) and compiled into reports with MultiQC (v. 1.10.1) (Ewels, et al., 2016). The FASTQ containing successfully aligned sequencing reads was then used as input for de novo assembly with canu (v. 2.1.1) (Koren, et al., 2017) using its “-nanopore” option and “genomeSize” 1.3 kb. We used aligned reads for assembly to eliminate the inclusion of off target reads. De novo assembly was also performed in Geneious 2022.0.2. The canu and Geneious contigs were all converted to consensus sequences, visually explored, and manually edited to fit in the correct translation frame if necessary, based on AA marker sequences (Table 1, Supplementary Information). Up to ten contig consensus sequences per sample were curated based on their having > 10 reads and few low-quality bases. The nucleotide sequences were then translated to AA sequences and exported to FASTA format. Repeat type counts were calculated by a Python (v. 3.8.5) script (Data and Code Availability). Visualizations of repeat-type counts were created using the Python library seaborn (v. 0.11.2) (Waskom, 2021). For PacBio-sequenced data, an analogous analytical pipeline was used, with minimap2’s “map-pb” option and otherwise identical treatment.
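The alignment, extraction, and assembly steps above can be written out as shell commands. The sketch below only builds the command strings and does not run them; the file names are placeholders, and the `-F 0x904` filter (drop unmapped, secondary, and supplementary records) is an assumption about how "successfully aligned" reads were selected. The authors' own Bash and Python code is linked under Data availability and should be treated as the reference.

```python
def pipeline_commands(reads_fastq, reference_fasta, prefix):
    """Return shell command strings for alignment, read extraction, and assembly.

    File names are placeholders; paths containing spaces would need quoting.
    """
    sam = f"{prefix}.sam"
    bam = f"{prefix}.sorted.bam"
    aligned = f"{prefix}.aligned.fastq"
    return [
        # Align quality-filtered reads to the pfhrp2 reference with the ONT preset.
        f"minimap2 -a -x map-ont {reference_fasta} {reads_fastq} > {sam}",
        # Sort the alignments and write them as BAM.
        f"samtools sort -o {bam} {sam}",
        # Export aligned reads as FASTQ (the flag filter is an assumption).
        f"samtools fastq -F 0x904 {bam} > {aligned}",
        # De novo assembly of the aligned reads with the stated genome size.
        f"canu -p {prefix} -d {prefix}_canu genomeSize=1.3k -nanopore {aligned}",
    ]


if __name__ == "__main__":
    for cmd in pipeline_commands("barcode01.fastq", "pfhrp2_ref.fasta", "barcode01"):
        print(cmd)
```

For PacBio data the same sketch would apply with the minimap2 preset changed to `map-pb`, as stated in the text; the corresponding canu read-type option would also need to change.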

Table 1

Amino acid sequence markers for pfhrp2. These markers were used as a guide for correctly framing the pfhrp2 sequences for translation. Not all repeat types are present in all positive samples. The region between amino acid sequences VDD and CLRH was considered the repeat region of exon 2 and inspected for repeat types.
Marker	Amino acid sequence
Exon 1	MVSFSKNKVLSAAVFASVLLDN
Exon 2 start	NNSAFNNNLCSKNA
Beginning of repeat region of exon 2	VDD
End of repeat region of exon 2	CLRH
Repeat types:
Type 1	AHHAHHVAD
Type 2	AHHAHHAAD
Type 3	AHHAHHAAY
Type 4	AHH
Type 5	AHHAHHASD
Type 6	AHHATD
Type 7	AHHAAD
Type 8	AHHAAY
Type 9	AAY
Type 10	AHHAAAHHATD
Type 11	AHN
Type 12	AHHAAAHHEAATH
Type 13	AHHASD
Type 14	AHHAHHATD
Type 30	AHHAVD
Type 31	SHHAAY
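To make the use of Table 1 concrete, here is a minimal sketch of repeat-type counting on a translated contig. It is not the script referenced under Data and Code Availability; it assumes a greedy, longest-motif-first scan from left to right across the region between VDD and CLRH, which is one plausible way to resolve motifs that are prefixes of longer ones (for example, Type 4 AHH inside Type 2 AHHAHHAAD). The example sequence is synthetic.

```python
from collections import Counter

# Repeat types as listed in Table 1.
REPEAT_TYPES = {
    "1": "AHHAHHVAD", "2": "AHHAHHAAD", "3": "AHHAHHAAY", "4": "AHH",
    "5": "AHHAHHASD", "6": "AHHATD", "7": "AHHAAD", "8": "AHHAAY",
    "9": "AAY", "10": "AHHAAAHHATD", "11": "AHN", "12": "AHHAAAHHEAATH",
    "13": "AHHASD", "14": "AHHAHHATD", "30": "AHHAVD", "31": "SHHAAY",
}


def repeat_region(aa_seq, start="VDD", end="CLRH"):
    """Return the residues between the first `start` and the last `end` marker."""
    i = aa_seq.find(start)
    j = aa_seq.rfind(end)
    if i == -1 or j == -1 or j < i + len(start):
        raise ValueError("repeat-region markers not found in the expected order")
    return aa_seq[i + len(start):j]


def count_repeat_types(region):
    """Greedy left-to-right scan, trying the longest motifs first (an assumption)."""
    by_length = sorted(REPEAT_TYPES.items(), key=lambda kv: len(kv[1]), reverse=True)
    counts = Counter()
    pos = 0
    while pos < len(region):
        for name, motif in by_length:
            if region.startswith(motif, pos):
                counts[name] += 1
                pos += len(motif)
                break
        else:
            counts["unmatched"] += 1  # residue not covered by any listed motif
            pos += 1
    return counts


if __name__ == "__main__":
    # Synthetic sequence assembled from the Table 1 markers, for illustration only.
    demo = ("MVSFSKNKVLSAAVFASVLLDN" + "NNSAFNNNLCSKNA" + "VDD"
            + "AHHAHHVAD" + "AHHAHHAAD" * 2 + "AHHAAD" + "AHH" + "CLRH")
    print(dict(count_repeat_types(repeat_region(demo))))
```

A non-zero "unmatched" count in real data would point to a frame error, a sequencing error, or a repeat type not listed in Table 1, and is a cue for the manual inspection described above.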

2.9 Kelch 13 control experiment for barcoding method

To determine if uneven performance between barcodes with normalized DNA input was an issue specific to pfhrp2, we designed a version of our assay for k13 amplicons. Five barcoding primer pairs were designed for k13 and checked for self- and cross-dimers in silico prior to being ordered (Supplementary Table 4). k13 was amplified from a panel of reference strains including 7G8, 3D7, HB3, D6, and Dd2 according to the Malaria Resistance Surveillance Project (MaRS) protocol (Talundzic, et al., 2018). The amplicons were then barcoded with k13-specific barcoding primers under the same PCR conditions described in section 2.4. Two normalized sequencing pools were made, the first containing the reference panel and the second containing 7G8 replicates. Bead purification, library preparation, and sequencing were performed as described in sections 2.5 and 2.6.

2.10 Barcode primer in vitro experiment

Given uneven performance between barcodes, we assessed the performance of each on normalized sequencing pools of 7G8 replicates to determine which performed better than others and should therefore be more heavily utilized. The 28 barcodes were spread across 6 sequencing pools with 3–5 samples each and sequenced on standard flow cells for 3 hours. The following ten barcodes were selected for the evaluation experiments: bc01, bc02, bc12, bc13, bc17, bc18, bc49, bc30, bc52, and bc60 (Supplementary Information, Supplementary Table 5).

2.11 Evaluation

The MR4 and PacBio sets were normalized, and the remaining field samples were not, given the expectation in field-based use of this assay that there will be pfhrp2 positive and negative samples. Otherwise, samples used for evaluation were treated identically according to the methods described in sections 2.2 to 2.6. To confirm pfhrp2 negativity and to assess the viability of a sample for genotyping, we implemented a coverage-based positivity threshold (Supplementary Information, Supplementary Table 6). In a field application, the results of this threshold test are the first of two possible outcomes from this assay (Fig. 1). Putatively positive samples from this step were processed for repeat-typing as described in section 2.8.
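The exact positivity threshold is defined in the Supplementary Information and is not reproduced here. The sketch below only illustrates the general shape of a coverage-based call: it summarizes per-base depth over the reference by its median and compares it with a threshold supplied by the caller. The choice of the median as the statistic, the function name, and the example numbers are assumptions.

```python
from statistics import median


def is_putatively_positive(per_base_depth, threshold):
    """True if median depth over the reference exceeds a caller-supplied threshold.

    `per_base_depth` is a sequence of read depths, one per reference position.
    The threshold value itself must come from the assay's own definition.
    """
    if not per_base_depth:
        raise ValueError("no coverage values supplied")
    return median(per_base_depth) > threshold


if __name__ == "__main__":
    # Hypothetical depths: a crosstalk-level negative control and a clear positive.
    print(is_putatively_positive([0, 2, 3, 1, 0], threshold=10))
    print(is_putatively_positive([120, 150, 130], threshold=10))
```

Samples that fail the check are not thereby shown to be pfhrp2-deleted; they are simply not carried forward to repeat typing.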

3.1 Pfhrp2 inhibits equal performance of individual PCR barcodes from Oxford Nanopore

To further explore the uneven performance of barcodes across normalized control DNA input, we prepared and sequenced two pools of barcoded k13 amplicon to compare against the pfhrp2 results. One pool contained five different reference strains (see Methods) and one pool contained replicates of 7G8. Barcodes across both the k13 pools performed better than their counterparts in comparable pfhrp2 control pools when sequenced for the same amount of time with pools containing the same number of different samples (Fig. 2). Read counts across barcodes were more even for k13 than for pfhrp2 (Supplementary Fig. 5). In the k13 experiments, the correctly classified reads performed consistently better against the quantity of “unclassified” reads (Fig. 2c). The proportion of correctly classified reads to total classified reads would be expected to hover around 0.20 for a 5-sample sequencing pool, and for k13, this was the case. Barcodes for the pfhrp2 experiments performed stochastically, spanning from near 0 to 0.8 (Fig. 2d). These differences in barcode performance across comparable sequencing pools and identical workflows suggest that pfhrp2 created difficulty either for the sequencer or for the demultiplexing program when used with the PBC096 barcoding kit.
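The expected value of 0.20 follows from dividing classified reads evenly among five barcodes. A minimal sketch of that comparison is given below; the read counts are invented to mimic an even pool and a skewed pool and are not data from this study.

```python
def barcode_fractions(read_counts):
    """Fraction of all classified reads assigned to each barcode."""
    total = sum(read_counts.values())
    if total == 0:
        raise ValueError("no classified reads")
    return {bc: n / total for bc, n in read_counts.items()}


def deviation_from_even(read_counts):
    """Observed fraction minus the even-split expectation (1 / number of barcodes)."""
    expected = 1 / len(read_counts)
    return {bc: frac - expected for bc, frac in barcode_fractions(read_counts).items()}


if __name__ == "__main__":
    even = {"bc01": 200, "bc02": 200, "bc12": 200, "bc13": 200, "bc17": 200}
    skewed = {"bc01": 800, "bc02": 50, "bc12": 50, "bc13": 50, "bc17": 50}
    print(barcode_fractions(even))
    print(deviation_from_even(skewed))
```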

3.2 Assessing pfhrp2 negativity with MinION sequencing

Crosstalk between barcodes creates the issue of pfhrp2 negative samples possibly having non-zero coverage when aligned to the reference sequence. We witnessed this phenomenon with our Dd2 negative control, which was sequenced with each field sample pool, and found that median coverage on pfhrp2 in the negative controls scaled roughly with the total passing reads yielded by the sequencing run (Supplementary Fig. 3, Supplementary Tables 3 and 6). Barcode crosstalk in nanopore sequencing has been reported previously (Xu, et al., 2018). The positivity threshold we present here (see Methods) was intended to mitigate false identification of pfhrp2 positive samples due to barcode crosstalk. This method successfully confirmed all pfhrp2 negative samples that were included in the TES sample set, for which pfhrp2 positivity and negativity had been previously established by antigen test and PCR (Fig. 4). Importantly, many pfhrp2 positive samples in this set did not surpass the threshold (Fig. 4). As the TES pools were not normalized, this could be due to the quantity of DNA added, though the stochastic performance of barcodes in this assay hinders our ability to present an exact cause. This issue of potential false negatives (i.e., positive samples with low coverage) highlights the importance of using this assay as an additive and confirmatory companion to inferring pfhrp2 negativity through other methods. However, it also eliminates low coverage and potentially crosstalk-impacted samples from the genotyping stage, minimizing the likelihood of calling pfhrp2 repeat types based on crosstalk. A total of 33/38, 13/32, and 47/82 samples were called as positive for the PacBio, TES, and remaining field sample set, respectively (93/152 total field samples) (Fig. 3, Supplementary Fig. 6).

3.3 MinION assay supports pfhrp2 repeat typing

We were able to successfully type four well-characterized reference strains across four sequencing runs with this assay (Fig. 4, Supplementary Table 7). For the PacBio set, we compared contigs generated with PacBio sequencing to those generated with MinION sequencing. Of 32 putatively positive samples identified using the positivity threshold, 31 had a minimum of one contig in agreement with the PacBio sequence. Eighteen of the samples from the PacBio set had a clear dominant type (≥ 50% of typed contigs), and all dominant repeat types but one matched that of its PacBio-sequenced counterpart (Fig. 5, Supplementary Table 7). This sample, 8d, had a majority repeat type with three out of five curated contigs in agreement, but had one contig that did match the PacBio contig. Two samples had no MinION repeat types that matched the PacBio contig. The TES sample set had 13 pfhrp2 positive samples according to the octile threshold, with 11 of these having a clear majority repeat type. Six of those samples had 100% agreement across contigs. One sample had agreement between 2 out of 10 contigs produced, and one sample, 259K, had no agreement between 4 contigs. For the remaining 47 positive field samples, 33 had a clearly dominant repeat type and an additional 5 had agreement between 2 or more contigs that did not reach 50%. Considering all putatively positive field samples, 62/93 had a dominant repeat type. Additionally, we successfully reconstructed the positive control repeat types (either 3D7 or 7G8) in each sequencing run.
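The dominant-type criterion (at least 50% of typed contigs) can be expressed in a few lines. In the sketch below each curated contig is represented by a hashable repeat-type profile, here a hypothetical label; the handling of an exact tie between two profiles (reported as no dominant type) is an assumption, since the text does not address that case.

```python
from collections import Counter


def dominant_profile(contig_profiles, min_fraction=0.5):
    """Return (profile, fraction) for a dominant profile, or (None, top_fraction).

    A profile is dominant if it accounts for at least `min_fraction` of the
    contigs and no other profile is equally common.
    """
    if not contig_profiles:
        raise ValueError("no contigs supplied")
    ranked = Counter(contig_profiles).most_common()
    top_profile, top_count = ranked[0]
    fraction = top_count / len(contig_profiles)
    tied = len(ranked) > 1 and ranked[1][1] == top_count
    if fraction >= min_fraction and not tied:
        return top_profile, fraction
    return None, fraction


if __name__ == "__main__":
    # Three of five contigs agree, as described for sample 8d; labels are placeholders.
    print(dominant_profile(["A", "A", "A", "B", "C"]))
    print(dominant_profile(["A", "B", "C", "D"]))
```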

4.1 Field utility of MinION assay for identifying pfhrp2 deletion

With the recent increase in pfhrp2 deletions in malaria endemic countries, having a portable sequencing assay for confirmation of pfhrp2 is becoming more critical for routine molecular surveillance. Pfhrp2 deletion threatens the efficacy of the most widely used RDT in Sub-Saharan Africa. Outright replacement of HRP2 RDTs is impractical, but regional substitution of HRP2 RDTs in favor of alternatives is necessary to achieve malaria control in regions with high prevalence of pfhrp2 deleted parasites.

The WHO’s protocol defines its primary output measures by samples which have shown discordance between HRP2 RDTs and non-HRP2 diagnostic tools for malaria (i.e., only samples with suspected pfhrp2/3 deletion) (World Health Organization, 2020). This is designed to reduce the number of samples that must be transported to be assessed for pfhrp2 deletion by PCR. However, as acknowledged in the protocol, this trades completeness for timeliness: restricting further analysis to samples with suspected deletions means that pfhrp2 diversity is not characterized in cases where the gene is present. The MinION pfhrp2 assay presented here would allow scientists and healthcare professionals to characterize pfhrp2 deletion status and pfhrp2 diversity among non-deleted samples at or near point of care by eliminating the need for bulkier, more expensive sequencers and potentially sample transport. The feasibility of eliminating the sample transport step as outlined in the WHO protocol depends on the region being surveyed, the number of survey sites, and other aspects of survey design. A primary benefit of the MinION platform for this application is the flexibility it provides to be used at primary sample collection sites or centralized laboratories, neither of which would require established genomics laboratories. The use of this assay in conjunction with a portable PCR device such as miniPCR (miniPCR bio, MA, USA) would resolve the logistical burden of sample transportation for a lower cost than outfitting regional facilities in malaria-endemic areas with in-house PCR and sequencing systems and expertise. This assay can be performed at any laboratory facility with cold-chain storage and the equipment required for DNA extraction from whole blood or dried blood spots. The sequencer can be transported by backpack.
While this portability may not eliminate the need for sample transport entirely (e.g., in surveys including dozens of collection sites), it maximizes the speed of data generation, allows flexibility in experimental design and logistics and can reduce regional dependence on international partners to run pfhrp2 surveillance programs. This in turn allows a more rapid compilation of the WHO protocol’s primary output measures where pfhrp2 is concerned and the crucial secondary output measure of genetic diversity among pfhrp2 positive samples.

In addition to this assay’s utility in relation to the WHO protocol for pfhrp2/3 surveillance, it may be used for general research on pfhrp2/3 deletions and diversity outside the protocol. The portability of the assay increases geographic and temporal flexibility for research investigating the evolution of this gene.

Here, we generated sequencing data on two R.9.4.1 flow cells available from ONT: the standard flow cell (up to 900 USD per flow cell) and the Flongle (~ 90 USD per flow cell). The optimal flow cell depends on the volume of samples being pooled for a single sequencing run and the desired speed of data recovery. The advantage of the standard flow cell is its capacity and longevity compared to the Flongle. Standard flow cells are considered in good condition with a minimum of 800 active sequencing pores and are under warranty for 3 months after purchase, whereas Flongles perform with approximately 10% of the pores and are under warranty for 1 month. When processing 8 or more samples in parallel with an allowance of 12 hours for sequencing and basecalling/demultiplexing or when processing 5 or more samples with an allowance of 6 hours for sequencing and basecalling/demultiplexing, the standard flow cell would offer superior data. However, if there is a greater time allowance, with up to 24 hours for sequencing and demultiplexing for pools of 6–8 samples, the Flongle would offer a lower cost per sample.
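The flow cell trade-off can be put in rough numbers. The sketch below divides the quoted flow cell prices by the number of samples in a pool; it deliberately ignores library preparation reagents, controls occupying barcodes, and any reuse of a standard flow cell across runs, all of which would change the real cost per sample. The pool size of 8 is taken from the ranges mentioned above.

```python
# Flow cell prices as quoted in the text (upper bound for the standard flow cell).
FLOW_CELL_USD = {"standard": 900.0, "flongle": 90.0}


def flow_cell_cost_per_sample(flow_cell, n_samples):
    """Flow cell cost divided evenly across the samples in one sequencing run."""
    if n_samples < 1:
        raise ValueError("n_samples must be at least 1")
    return FLOW_CELL_USD[flow_cell] / n_samples


if __name__ == "__main__":
    for cell in FLOW_CELL_USD:
        print(cell, flow_cell_cost_per_sample(cell, 8))
```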

4.2 Genotyping and broad pfhrp2 surveillance

When used within or outside of the WHO protocol for pfhrp2 deletion surveillance, data generated using this assay and shared could be used additively for broad surveillance of pfhrp2 diversity. As of this writing, the public health community understands little about why HRP2 is non-essential, its function, or whether and/or to what degree the use of HRP2 RDTs has influenced evolution in the pfhrp2 gene. Researchers have roughly calculated the fitness costs of pfhrp2/3 deletion in vitro to be on par with drug resistance mutations (Nair, et al., 2022). (Yang, et al., 2020).

4.3 Limitations of current study

The foremost limitation of this assay lies in its quantitative ambiguity. We found there to be pfhrp2-specific issues that affected the consistency and degree of barcode performance regardless of whether the pooled DNA was normalized or not.

Another limitation lies in the quality of sequencing reads and the impact of low quality on the de novo assemblies. The relatively low quality of reads from R.9 flow cells and the repetitiveness of pfhrp2 may cause difficulty in resolving assemblies. The unpredictable yield of each barcode exacerbates this issue, as it is difficult to distinguish whether assemblies from single samples with multiple singleton repeat type patterns are a result of sample quality and quantity of input DNA or of poor barcode performance. Though we do not anticipate singleton repeat-type patterns to reflect true intra-sample diversity (i.e., evidence of a polyclonal infection), we have not assessed the performance of this assay with known poly-clonal samples. The read quality limitation may be short-lived due to the release of enhanced chemistry from ONT with R.10 flow cells. We do not anticipate changes to the workflow considering this development, but we do anticipate better resolution in the resulting de novo assemblies.

Though not a limitation of the method described here, our evaluation did not include any exploration of the pfhrp2 diversity found in the field samples. We did not develop a sample pool for the sake of this project with power to determine longitudinal or spatial nucleotide diversity or repeat-type profile diversity.

Lastly, this assay inherits the limitations of the one-step pfhrp2 PCR method presented by Jones et al. (2020). In brief, the size of pfhrp2 deletions that extend beyond exons 1 and 2 cannot be characterized here due to the size of the amplicon; variables such as sample type and PCR efficiency obscure the limit of detection of the PCR method and therefore of the sequencing method; and this assay does not address pfhrp3 (Jones, et al., 2020).

4.4 Future work

As this assay has been evaluated in a laboratory environment, an important next step in applying it will be validation in the field. The assay would make an ideal addition to any regional iteration of the WHO protocol for pfhrp2 surveillance.

Additionally, it is important to note that we present the pfhrp2 MinION assay here as a proof of concept, with multiple potential avenues for improvement and optimization. This might include the evaluation of some of the many barcoding kits offered by ONT, which may mitigate the barcoding (and therefore quantitative) issues we encountered in the development of this assay using the PCR barcoding (96) kit. A recent report found nanopore sequencing to be less error prone for AT-rich sequences in PCR-free approaches (Delahaye & Nicholas, 2021). Utilizing a barcoding kit that would require no further amplification of the pfhrp2 product as the assay does in its current state may increase the reliability and reduce the error rate in the sequencing data. Another parallel avenue of optimization could be the implementation of ONT’s less error prone R.10 chemistry, which may result in fewer errors and more reliable sequence reconstruction. Experimentation with combinations of different filtering parameters such as fragment length and minimum base-calling score is another avenue, unexplored here, that may optimize the assay’s output. Given the value of this method for exploring the population of pfhrp2 copies that may exist within a sample, a controlled in vitro exploration of how poly-clonal samples would present analytically using R.10 chemistry is an important next step in optimizing this assay for broad genotypic surveillance of pfhrp2 diversity.

Disclaimer

The findings and conclusions in this presentation are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention or the Association of Public Health Laboratories.

Data availability

Bash and Python code used for data analysis can be found on GitHub at https://github.com/sjsabin/minion_hrp2_project. Sequencing summaries have been deposited on Zenodo (DOI: 10.5281/zenodo.6780399). We used the following as our pfhrp2 reference sequence: NCBI Gene database PF3D7_0831800. The gene sequences produced and analyzed here have been deposited in the NCBI GenBank database under accession numbers OP186488-OP186871. The complete list of accession numbers is provided in Supplementary Table 7.

Author Contributions

S.S., S.J., M.A., and E.T. conceived of the project. S.S., S.J., G.S., and J.K. performed laboratory work. D.P. and S.J. conceived of and developed the foundation of the analytical pipeline. S.S. developed additions to the analytical pipeline and visualizations of the data. The manuscript was composed by S.S. with input from E.T. and M.A.

Acknowledgements

The following reagents were obtained through BEI Resources, NIAID, NIH: Genomic DNA from Plasmodium falciparum, Strain 7G8, MRA-152G, contributed by David Walliker; Genomic DNA from Plasmodium falciparum, Strain Dd2, MRA-150G, contributed by David Walliker; Genomic DNA from Plasmodium falciparum, Strain HB3, MRA-155G, contributed by Thomas E. Wellems; Genomic DNA from Plasmodium falciparum, Strain FC27/PNG, MRA-914G, contributed by Tobili Y. Sam-Yellowe; and Genomic DNA from Plasmodium falciparum, Strain 3D7, MRA-102G, contributed by Daniel J. Carucci. We would like to thank Jessica McCaffery and Eric Rogier for providing us with DNA from their survey of TES samples for pfhrp2 deletions. We would also like to thank Lucy Impoinvil and Malania Wilson for feedback and support regarding MinION sequencing; Subin Park and Swarnali Louha for assistance with troubleshooting the analytical pipeline; and the CDC Genome Sequencing Lab for providing PacBio sequencing services and technological support for MinION experiments.

References

Abdallah, J. F. et al., 2015. Prevalence of pfhrp2 and pfhrp3 gene deletions in Puerto Lempira, Honduras. Malaria Journal, 14(19).
Aidoo, M. & Incardona, S., 2022. Ten Years of Universal Testing: How the Rapid Diagnostic Test Became a Game Changer for Malaria Case Management and Improved Disease Reporting. American Journal of Tropical Medicine and Hygiene, 106(1), pp. 29–32.
Akinyi, S. et al., 2013. Multiple genetic origins of histidine-rich protein 2 gene deletion in Plasmodium falciparum parasites from Peru. Scientific Reports, 3(2797).
Atroosh, W. M. et al., 2022. Plasmodium falciparum histidine rich protein 2 (pfhrp2): an additional genetic marker suitable for anti-malarial drug efficacy trials. Malaria Journal, 21(2).
Browne, P. D. et al., 2020. GC bias affects genomic and metagenomic reconstructions, underrepresenting GC-poor organisms. GigaScience, Volume 9, pp. 1–14.
Chaudhry, A., Cunningham, J., Cheng, Q. & Gatton, M. L., 2022. Modelling the epidemiology of malaria and spread of HRP2-negative Plasmodium falciparum following the replacement of HRP2-detecting rapid diagnostic tests. PLoS Global Public Health, 2(1), p. e0000106.
Delahaye, C. & Nicholas, J., 2021. Sequencing DNA with nanopores: Troubles and biases. PLoS ONE, 16(10), p. e0257521.
Ewels, P., Magnusson, M., Lundin, S. & Käller, M., 2016. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics, 32(19), pp. 3047–3048.
Feleke, S. M. et al., 2021. Plasmodium falciparum is evolving to escape malaria rapid diagnostic tests in Ethiopia. Nature Microbiology, Volume 6, pp. 1289–1299.
Gamboa, D. et al., 2010. A Large Proportion of P. falciparum Isolates in the Amazon Region of Peru Lack pfhrp2 and pfhrp3: Implications for Malaria Rapid Diagnostic Tests. PLoS ONE, 5(1), p. e8091.
Jones, S. et al., 2020. One-step PCR: A novel protocol for determination of pfhrp2 deletion status in Plasmodium falciparum. PLOS ONE, 15(7), p. e0236369.
Kluyver, T. et al., 2016. Jupyter Notebooks - a publishing format for reproducible computational workflows. In: F. Loizides & B. Schmidt, eds. Positioning and Power in Academic Publishing: Players, Agents and Agendas. s.l.:IOS Press, pp. 87–90.
Kong, A. et al., 2021. HRP2 and HRP3 cross-reactivity and implications for HRP2-based RDT use in regions with Plasmodium falciparum hrp2 gene deletions. Malaria Journal, Volume 20.
Koren, S. et al., 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Research, Volume 27, pp. 722–736.
Li, H., 2018. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics, Volume 34, pp. 3094–3100.
Li, H., 2021. New strategies to improve minimap2 alignment accuracy. Bioinformatics, Volume 37, pp. 4572–4574.
Li, H. et al., 2009. The Sequence Alignment/Map format and SAMtools. Bioinformatics, 25(16), pp. 2078–2079.
McCaffery, J. N. et al., 2021. Plasmodium falciparum pfhrp2 and pfhrp3 gene deletions among patients in the DRC enrolled from 2017 to 2018. Scientific Reports, Volume 11, p. 22979.
Nair, S., Li, X., Nkhoma, S. C. & Anderson, T., 2022. Fitness costs of pfhrp2 and pfhrp3 deletions underlying diagnostic evasion in malaria parasites. bioRxiv.
Nderu, D. et al., 2019. Plasmodium falciparum histidine-rich protein (PfHRP2 and 3) diversity in Western and Coastal Kenya. Scientific Reports, 9(1709).
Okonechnikov, K., Conesa, A. & García-Alcalde, F., 2016. Qualimap 2: advanced multi-sample quality control for high-throughput sequencing data. Bioinformatics, 32(2), pp. 292–294.
Rogier, E. et al., 2022. Plasmodium falciparum pfhrp2 and pfhrp3 Gene Deletions from Persons with Symptomatic Malaria Infection in Ethiopia, Kenya, Madagascar, and Rwanda. Emerging Infectious Diseases, 28(3), pp. 608–616.
Talundzic, E. et al., 2018. Next-Generation Sequencing and Bioinformatics Protocol for Malaria Drug Resistance Marker Surveillance. Antimicrobial Agents and Chemotherapy, 62(4), pp. e02474-17.
Waskom, M. L., 2021. seaborn: statistical data visualization. Journal of Open Source Software, 6(60), p. 3021.
Wick, R. R., Judd, L. M., Gorrie, C. L. & Holt, K. E., 2017. Completing bacterial genome assemblies with multiplex MinION sequencing. Microbial Genomics, Volume 3.
World Health Organization, 2020. Master protocol for surveillance of pfhrp2/3 deletions and biobanking to support future research, Geneva: World Health Organization.
World Health Organization, 2021. World malaria report 2021, Geneva: World Health Organization.
Wu, Y. et al., 2001. The Malaria Research and Reference Reagent Resource (MR4) Center -- creating African opportunities. Afr J Med Sci, Issue 30, pp. 52–54.
Xu, Y. et al., 2018. Detection of Viral Pathogens With Multiplex Nanopore MinION Sequencing: Be Careful With Cross-Talk. Frontiers in Microbiology, Volume 9.
Yang, Y. et al., 2020. Disruption of Plasmodium falciparum histidine-rich protein 2 may affect haem metabolism in the blood stage. Parasites & Vectors, 13(1), p. 611.

No competing interests reported.


Editorial decision: Major revision (27 Sep, 2022)
Reviews received at journal (12 Sep, 2022)
Reviewers agreed at journal (29 Aug, 2022)
Reviewers agreed at journal (26 Aug, 2022)
Reviewers invited by journal (26 Aug, 2022)
Editor assigned by journal (26 Aug, 2022)
Editor invited by journal (26 Aug, 2022)
Submission checks completed at journal (26 Aug, 2022)
First submitted to journal (07 Jul, 2022)
