A cross-sectional study to characterize local HIV-1 dynamics in Washington, DC using next-generation sequencing

Gibson, Keylie M.; Jair, Kamwing; Castel, Amanda D.; Bendall, Matthew L.; Wilbourn, Brittany; Jordan, Jeanne A.; Crandall, Keith A.; Pérez-Losada, Marcos

doi:10.1038/s41598-020-58410-y

Download PDF

Article
Open access
Published: 06 February 2020

A cross-sectional study to characterize local HIV-1 dynamics in Washington, DC using next-generation sequencing

Scientific Reports volume 10, Article number: 1989 (2020) Cite this article

2489 Accesses
9 Citations
26 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 22 April 2020

This article has been updated

Abstract

Washington, DC continues to experience a generalized HIV-1 epidemic. We characterized the local phylodynamics of HIV-1 in DC using next-generation sequencing (NGS) data. Viral samples from 68 participants from 2016 through 2017 were sequenced and paired with epidemiological data. Phylogenetic and network inferences, drug resistant mutations (DRMs), subtypes and HIV-1 diversity estimations were completed. Haplotypes were reconstructed to infer transmission clusters. Phylodynamic inferences based on the HIV-1 polymerase (pol) and envelope genes (env) were compared. Higher HIV-1 diversity (n.s.) was seen in men who have sex with men, heterosexual, and male participants in DC. 54.0% of the participants contained at least one DRM. The 40–49 year-olds showed the highest prevalence of DRMs (22.9%). Phylogenetic analysis of pol and env sequences grouped 31.9–33.8% of the participants into clusters. HIV-TRACE grouped 2.9–12.8% of participants when using consensus sequences and 9.0–64.2% when using haplotypes. NGS allowed us to characterize the local phylodynamics of HIV-1 in DC more broadly and accurately, given a better representation of its diversity and dynamics. Reconstructed haplotypes provided novel and deeper phylodynamic insights, which led to networks linking a higher number of participants. Our understanding of the HIV-1 epidemic was expanded with the powerful coupling of HIV-1 NGS data with epidemiological data.

HIV-1 molecular diversity in Brazil unveiled by 10 years of sampling by the national genotyping network

Article Open access 04 August 2021

Molecular epidemiology and HIV-1 variant evolution in Poland between 2015 and 2019

Article Open access 16 August 2021

Characterisation of HIV-1 Molecular Epidemiology in Nigeria: Origin, Diversity, Demography and Geographic Spread

Article Open access 26 February 2020

Introduction

Despite recent reductions in HIV-1 prevalence in Washington, DC from 2.5% in 2013¹ to 1.8% in 2018, the United States (US) capital is still experiencing a generalized HIV-1 epidemic – as defined by the World Health Organization^2,3,4. There were 340 newly diagnosed cases in DC in 2018, and the DC rate is five times higher than the national rate³. Blacks, men, men who have sex with men (MSM), and heterosexuals (HRH) account for the majority of people living with HIV-1 (PLWH) in DC^2,3. However, ~20% of the newly diagnosed persons had an unknown risk for transmission in both 2016 and 2017^2,3. Furthermore, the leading group (33.3%) of newly diagnosed cases was between the ages of 20–29 years old³. This same age group had the highest percentage (27.5%) of drug resistance mutations (DRM) at diagnosis, suggesting broader spread of HIV-1 drug resistant variants and potential concern for future therapeutic options, especially if these mutations are against first line antiretroviral (ART) drugs for newly infected individuals. With blacks and young adults being the most impacted groups of individuals for HIV-1 in DC, understanding the current HIV-1 phylodynamics can provide informative data to guide programs that prevent and reduce the incidence of HIV-1. Moreover, identifying potential transmission clusters amongst individuals in DC and their associated epidemiological features may help infer otherwise ‘unknown’ transmission modes and provide insight for more targeted prevention and intervention strategies.

In 2011, the DC Cohort, a longitudinal observational NIH-funded cohort study of PLWH who are receiving care at clinical sites in DC, began enrollment. As of 2018, the Cohort has enrolled approximately 10,000 PLWH². By capitalizing on the longitudinal study of the DC Cohort, phylodynamics (i.e., the study of how epidemiological, immunological, and evolutionary processes act and potentially interact to shape viral phylogenies⁵) can provide insights into HIV-1 infection in DC. Analyzing sequence data can detect new variants within the population, identify population structuring and associations with risk factors, and, in combination with demographic information, predict areas of interest to direct public health efforts.

The great power and resolution of Next-Generation Sequencing (NGS) technologies are changing phylodynamics research. NGS is used for active infectious disease surveillance⁶, detection of circulating drug resistant variants^7,8, and inference of HIV-1 transmission clusters. High variation among viral strains of RNA viruses such as HIV-1 are a result of high mutation rates, large population sizes, and short generation times⁹. NGS can detect mutations present in less abundant strains (<1%)⁸. Such rare mutations are particularly relevant in the context of the evolution of drug resistance, since they may facilitate viral adaptation leading to treatment failure^10,11. Moreover, sequence variants (or haplotypes) can be reconstructed from NGS sequencing reads. Viral populations may contain a pool of different variants that are resistant to different antiretroviral drugs^12,13 and also help the virus to evade the immune system¹⁴. Reconstructing the haplotypes present in a viral sample and assessing their phylodynamics may show additional or different transmission clusters present between individuals or identify a few HIV-1 strains that are dominating the HIV-1 viral population^15,16,17. The use of powerful NGS technologies to study the HIV-1 epidemic at local levels (e.g., Washington, DC) may generate deeper insights into the ongoing HIV-1 dynamics. Near full length sequences and amplicon sequences that span entire HIV-1 genes are becoming more prevalent with this advanced technology^18,19. Some studies have indicated that pol is less informative than env for phylogenetic resolution²⁰. As env evolves at a faster rate than pol^19,21, env has shown to be useful in determining the recency of HIV-1 acquisition²² and could provide more resolution to infer active or recent transmission clusters than pol.

This study applies NGS to a subset of newly and previously diagnosed participants in the DC Cohort to characterize the recent (2016 and 2017) local phylodynamics of HIV-1 in Washington, DC. Towards this general aim, we 1) estimate the diversity of HIV-1 in Washington, DC, 2) determine the circulating drug resistant mutations, 3) identify and evaluate potential transmission clusters with consensus sequences and their association with epidemiologic and clinical factors, and 4) predict HIV-1 haplotypes for each sample and assess their potential for detecting transmission clusters. The number and size of transmission clusters may vary across HIV-1 gene regions^{12,15,23,24,25}, hence in this study we also compared phylodynamic inferences based on the polymerase and envelope HIV-1 genes.

Results

Sample and phenotypic characterization

Our sampling included PCR products from 68 participants in the DC Cohort. Most of the study participants resided in Washington, DC (Table 1). The majority were non-Hispanic black (82.4%) and male (69.1%), with 52.9% of participants being non-Hispanic black males. The majority of participants were infected through heterosexual sexual contact (39.7%) followed closely by MSM sexual contact (35.3%). A total of 76.4% of the patients were on an ART drug regimen at the time of blood sample collection. The demographics of our subsample of DC Cohort participants reflects a similar composition of PLWH in DC^3,4 but includes slightly more participants infected through heterosexual contact and non-Hispanic Blacks, and a lower proportion of participants on ART than the overall Cohort sample.

Table 1 Demographic and clinical characteristics for DC Cohort participants whose samples were sequenced and passed filtering criteria.

Full size table

HIV-1 diversity in DC

We performed in-depth phylodynamic profiling of HIV sequences from 171 PCR gene products that passed quality thresholds, including 62 PR/RT, 62 int, and 47 env amplicons. The subtyping analyses showed that all of the participants belonged to subtype B; therefore, subsequent analyses included data from all participants. Our participants were dispersed amongst and showed a star-like pattern with other DC HIV PR/RT sequences (see Supplementary Fig. S1 online). The env(c) data showed higher nucleotide diversity (π) and Watterson genetic diversity (θ) than pol(c) (Table 2). Males had a higher diversity, though not significant, than females (haplotype diversity: PR/RT: p = 0.2039, int: p = 0.9571, env: p = 0.3404). Participants whose risk factor was IDU (n = 6) had 50% less diversity than those with MSM and HRH risk, though again not significant (haplotype diversity: PR/RT: p = 0.7323, int: p = 0.7861, env: p = 0.6560). Non-Hispanic black participants had a higher genetic diversity for the pol(c) gene than for the env(c) gene (Table 2). The average haplotype diversity when calculated with the number of reconstructed haplotypes by PredictHaplo showed that env had more haplotype diversity compared to PRRT and int (Table 3). The average number of haplotypes per participant was the same for PR/RT and int (2 haplotypes) and slightly higher for env (4). Four of the six participants that had a higher number of haplotypes reconstructed (7–12 haplotypes in one or more gene regions) also had a higher average haplotype diversity estimate of 0.634 (range: 0.392–0.777), while the other two had a very low average haplotype diversity estimate (0.032, range: 0.027–0.038). Participants with HRH and MSM risk factors were found to have an average of 3 reconstructed haplotypes, with haplotype diversities of 0.338 and 0.335, respectively.

Table 2 Nucleotide diversity between the pol and env concatenated consensus sequences.

Full size table

Table 3 Haplotype diversity estimates from PredictHaplo results.

Full size table

Drug resistant mutations

The consensus concatenated gene regions (pol(c) and env(c); see Methods for definition of “(c)”, which in short stands for consensus) for each participant were used to evaluate the presence of Drug Resistant Mutations (DRMs) (Table 4). The PR gene from participants had the fewest DRMs (1), compared to RT (25) and int (11) genes. The majority of DRMs were NRTI, NNRTI, and RT surveillance DRMS (SDRMs). Together, they included 12 to 24 resistant participants and 15 to 44 total DRMs (9 to 19 unique DRMs). Finally, 34 participants (50.0%) showed at least one mutation and 24 (35.3%) showed at least two different DRMs. Of the ARV treatment naïve participants, none were found to have a DRM present in the PR and RT genes, and only a single participant was found to have an IN Accessory DRM at amino acid 157. This treatment naïve individual with a DRM was a part of a transmission cluster for all genes except the haplotype V1V2 gene region. An overall DRM prevalence of 1.4%, 35.7%, and 15.7% was estimated for PR, RT, and int, respectively. The 40–49 year-olds in our study had the highest prevalence of DRMs (22.9%), while the 20–29 year-olds in our study did not show any DRMs. Overall, DRMs caused amino acid changes in only one codon position in PR, while 22 and 9 different codons positions changed in RT and int, respectively, when analyzing the consensus sequences. However, more codons were affected by an amino acid change in PR (5 codons) and RT (27 codons) when analyzing the haplotype sequences. On average, more DRMs and more unique DRMs were identified in the haplotype sequences (Table 4). Also, one young adult (20–29 years old) contained DRMs in RT, and one treatment naïve participant contained DRMs in RT. Slightly more participants had a haplotype sequence that showed at least one DRM (37 participants; 54.0%) and at least two different DRMs (30; 44.1%).

Table 4 Drug Resistant Mutations.

Full size table

FUBAR analysis, which identifies nucleotide positions under positive selection, identified inferred two, three, and four codons under positive selection in the PR gene, RT gene, and int gene, respectively, when analyzing the consensus sequences (Table 4). Positively selected sites 37 and 57 were inferred by FUBAR for PR and sites 35, 83, and 162 for RT. Amino acid positions 201, 216, 265, and 283 were found to be under positive selection for int. No codons overlapped between the FUBAR analysis and DRM analysis for any gene. None of the sites predicted by FUBAR are known resistance sites²⁶, suggesting DRMs are fixed in the population. Additionally, FUBAR also found four codons under positive selection for the V1V2 and V3 genes (Table 4). These codon sites were 8, 15, 34, 53 for V1V2 and 82, 90, 93, 106 for V3. Every IDU participant had a mutation in at least one of the sites predicted by FUBAR in the V1V2 and V3 genes. HRH (55.6%) and MSM (72.0%) participants had a high prevalence of sites under selection as well. Additionally, a total of 51.4% and 17.1% of the male and female participants, respectively, contained a mutation at one of these predicted sites. Furthermore, FUBAR analysis inferred four codons under positive selection in the PR and RT genes and nine codons for int from the haplotype sequences. These sites were 64, 72, 77, and 93 for PR; 35, 85, 102, and 200 for RT; and 201, 206, 211, 218, 230, 256, 265, 283, and 285 for int. The only codon position predicted by FUBAR in the haplotypes was 230 for int, and it is associated with reduced susceptibility to integrase inhibitors (INSTI)²⁷. Seven (4, 34, 41, 43, 44, 45, 64) and eight (22, 40, 84, 92, 93, 95, 105, 111) codons were inferred to be under positive selection for V1V2 and V3 haplotype sequences, respectively. Interestingly, only a handful of inferred positively selected codons overlapped between the haplotype and consensus sequences, those were at position 35 for RT; positions 201, 265, and 283 for int; 34 for V1V2; and 93 for V3.

Transmission clusters

Transmission clusters were assessed by phylogenetic methods and HIV-TRACE, a genetic-distance based clustering method. Phylogenetic methods found support (>70% bootstrap or >0.95 posterior probability) for 33.8% of the sequences associated with six transmission clusters in pol(c) and for 31.9% of the sequences associated with seven clusters in env(c) (highlighted in Fig. 1). All of the clusters were comprised of two to three sequences, except one cluster in pol(c) which had twelve sequences. Ten of the twelve sequences in the large pol(c) cluster contained a DRM within RT. Furthermore, 11 of the 12 sequences in this large cluster were included on the same sequencing run; therefore, we were unable to undoubtfully discriminate between laboratory artifacts or batch effects and HIV infection to interpret this transmission cluster. However, there were other samples from that same sequencing run that did not cluster with these twelve sequences and formed a dyad cluster. The most common DRM was T215C, which does not reduce NRTI susceptibility, and was found in eight of the twelve sequences.

HIV-TRACE grouped only a few sequences into two clusters for pol(c) (12.8%) and into a single cluster for env(c) (2.9%) (Fig. 2I). More transmission clusters were estimated with the haplotypes reconstructed from PredictHaplo (Fig. 2II, Table 5). For PR, RT, and int (genes also used in past DC HIV studies^28,29), 82.1% of our participants were incorporated into transmission clusters. Unique to our dataset was the use of envelope to predict transmission clusters; 35.8% of our participants were included in V1V2 and V3 transmission clusters. Haplotypes were not concatenated for this analysis, but little overlap of cluster composition was found between gene regions. Often one participant’s haplotype would cluster with another participant in one gene region, but it would cluster with a different participant in a different gene region. Thus different gene regions displayed different transmission clusters that would not have been detected if one only analyzes a single gene. Clusters predicted in all gene regions were mainly composed of two participants (42 of the 55 total clusters). Likewise, the majority of the transmission clusters predicted by past DC HIV studies^28,29 were comprised of two or three participants, regardless of the gene region, clustering estimation method, or haplotype/consensus construction approach.

Table 5 Characteristics of transmission clusters with HIV-TRACE and comparison between different genes.

Full size table

Discussion

Collectively, our study aimed to characterize the local and recent phylodynamics of a subset of DC Cohort participants from Washington, DC metro area living with HIV. By combining clinical and behavioral data with NGS data, we were able to identify transmission clusters across groups with different demographics and risk behaviors. Additionally, this study reconstructed sequence variants present within a participant (i.e., intra-host) and investigated associations between the reported participant characteristics and transmission clusters.

Our population genetic estimators indicate that HIV-1 env(c) is more genetically diverse than pol(c). Similar diversity estimates were found in other studies^19,29,30,31. Our estimate of genetic diversity for pol(c) in DC (θ = 0.079) was lower than those reported for subtype B in Pérez-Losada et al.²⁹ (θ = 0.084 and 0.090), but higher than those currently reported for the US subtype B sequences in Los Alamos HIV-1 database (θ = 0.075 for int; θ = 0.067 for PR/RT). This same trend was seen in env(c), where our DC HIV-1 genetic diversity (θ = 0.202) was greater than that reported for V1-V3 from Los Alamos HIV-1 database (θ = 0.168) across the US. Haplotype diversity estimated from the PredictHaplo results also found env genes more genetically diverse than PR/RT and int.

Notably, there were interesting differences in diversity estimates among risk groups. Participants who were infected through injection drug use (IDU) were found to have about half of the diversity of MSM and HRH participants. This low diversity, potentially associated with low multiplicity of HIV-1 infection, is not unusual within IDU individuals^32,33. In both pol(c) and env(c), males also showed higher genetic diversity, which could be attributed to half (51%) of the males in this study being infected through sex with other men (16% of the males had an unknown risk factor). Our measurements of HIV-1 diversity in HRH were also high and similar to those of MSM; however, we did not observe differences in HIV genetic diversity by race or ethnicity.

The HIV-1 subtype B epidemic in DC is highly diverse, and our results here agree with previous conclusions suggesting a mature epidemic^29,34. High genetic diversity could result from risk groups intermingling and viral strains being exchanged and the transient nature of the DC metro area population. DC is an international stop for some, a temporary residence for others, and home for many. This constant influx of incomers could have a boosting effect on the DC HIV-1 population by consistently introducing new viral strains into the pool. Treatment and vaccine development can be compromised by high genetic diversity. As HIV-1 continues to evolve and, as seen here, high genetic diversity levels are kept constant over time³⁴, resistance to vaccines and ART drugs may increase, which could ultimately lead to treatment and prevention failures³⁵.

A Drug Resistant Mutation (DRM) prevalence of 48.6–54.0% was detected in this subset of the DC Cohort, depending on use of consensus sequence or haplotypes. Lower DRM prevalence rates were previously reported for the DC area^28,29,34,36 (17.3–37.9% between 1994 and 2016). A much higher rate (66%) was reported in a smaller study of ART treatment-naïve and experienced pediatric patients in Rhode Island³⁷. DRM rates over 50% are also seen in large sequence databases in the UK and Switzerland²⁹. Our study found fewer codons affected by a DRM and fewer DRMs in our participants than previous studies^28,29 of the DC epidemic. We only found 32 and 38 codons to be affected when analyzing consensus sequences and haplotypes, respectively, whereas Pérez-Losada et al.²⁹ found 83 codons affected for subtype B, however that study contained 20 times more sequences than ours. Moreover, our study found three novel DRM sites that were not identified in either previous study: P145PAST (IN Major) and A128APST, Q146QH (IN Accessory).

More recently, a study by Kuhnert and colleagues³⁸ reported on the fitness of fourteen HIV-1 resistance mutations, of which seven were detected in RT in our study. Three of the seven DRMs (codons: 41 L, 67 N, 184 V) were NRTI-related and the other four DRMs (codons: 103 N, 108I, 138 A, 181 C) were NNRTI-related. Six DC Cohort participants contained the 184 V DRM, which was found by Kuhnert et al.³⁸ to have the highest transmission cost – i.e., the success (low transmission cost) or lack thereof (high transmission cost) of transmission of hosts infected by drug resistant strains. Because of this high cost to the virus, the mutation resulted in very short transmission chains despite evolving frequently under treatment failure³⁸. Of the six DC Cohort participants, all were included in a cluster when using the haplotypes, but none of the participants clustered with each other. Additionally, this mutation was found to be persistent in the DC HIV-1 viral population since 2005³⁴. Previous studies showed that the most important NNRTI mutation currently is 103 N because of its connection to first-line treatment failure^39,40,41; a quarter of our DC Cohort participants with one or more DRMs contained this mutation, which was also found to be at a low frequency in the DC HIV-1 viral population since 2005³⁴.

In treatment-naïve participants, both when using consensus sequences and haplotypes, we estimated a low prevalence rate of DRMs (14.3%). Similarly, low prevalence rates have been seen in the past in the US (⁴²; 15% between 1999 and 2011) and even lower in treatment-naïve individuals in Europe (^41,43,44,45; 10% between 2001 and 2013). Moreover, we also found fewer DRMs in treatment-naïve participants compared to a recent study of PLWH in DC²⁸ (22.5% between 1994 and 2013). Kassaye et al.²⁸ also observed a downward trend of overall prevalence of DRMs over time in treatment-naïve individuals. Our results suggest a further decrease in overall prevalence of DRMs in the current DC HIV-1 epidemic. A lower prevalence of DRMs in surveillance versus targeted treatment-naïve studies could result from sampling design. A surveillance study of DC would likely provide the more accurate picture of DRM trends in the population.

Novel sites that continue to evolve in the DC epidemic and have not become fixed in the population are of serious concern for future drug therapy and conferring resistance to these drugs. More and different codons were predicted to be under positive selection in the haplotype sequences than the consensus sequences. FUBAR predicted sites in V1V2 are likely being impacted by the immune system, which the virus is actively trying to evade (diversifying selection). V3 is associated with co-receptor binding⁴⁶; codons predicted here could be rising advantageous mutations by HIV-1 to adapt to the host cells’ response against the virus. Five codons (PR: 37, RT: 35, and int: 201, 265, 283) were identified by both our study, in both haplotypes and consensus sequences, and Pérez-Losada et al.²⁹ as sites under selection for subtype B. Since none of these sites corresponded to any known Stanford DRMs, these may be newly evolving resistance mutations in the DC HIV-1 epidemic. Given that all of our participants were on dual- or multiple-drug regimens, these sites may also be indicative of potential escape mechanisms by the virus in response to multiple-drugs treatments. Thus, these amino acid replacements are candidates for fitness testing with and without associated drugs to infer their ability to confer drug resistance, their relative fitness status in different environments, and their transmissibility across individuals.

Additionally, identifying transmission clusters is critical to recognizing groups who may be at risk of contracting HIV-1 or who may already be infected but are not yet aware of their diagnosis. Phylogenetic studies suggest that transmission clusters greatly contribute to the spread of HIV-1 within the population⁴⁷; therefore, identifying high-risk groups, whether that is based on risk behavior or geographical location⁴⁸, can help public health officials to better target prevention efforts and treatment options. The spread of infection is often associated with early HIV-1 infection⁴⁷, consequently, molecular surveillance of the DC epidemic should continue in order to identify potential areas or clusters of transmission and, thus, help lower the HIV-1 incidence.

Towards this goal, we combined NGS sequence data with clinical characteristics to obtain a dynamic picture of the evolution of HIV-1 in the DC metro area. We detected high levels of clustering using haplotypes (32.8–64.2%, not including the V1V2 region), as also seen in other HIV-1 cohorts (24–65%)^{29,35,49,50,51,52,53,54,55,56,57,58,59,60,61}. Other studies, including one completed with Sanger sequencing in DC^{28,31,36,52,62}, however, have found lower levels of clustering (7–17%), in agreement with those reported here for the consensus sequence phylogenetic clusters (pol(c) and env(c)) and the V1V2 region for haplotypes (9.0%). Therefore, a more comprehensive understanding of HIV-1 transmission events in DC has been achieved when evaluating multiple genes together, rather than primarily focusing on polymerase genes that are typically screened for DRMs in clinical settings or used in investigations at the local health department level. By excluding envelope genes, informative transmission events can be missed, which could hinder community health prevention and intervention efforts. In an ideal setting, using all the genetic information available would be most favorable when investigating local HIV-1 phylodynamics.

In agreement with a recent study of HIV-1 transmission clusters in Chicago⁵⁹, we also found association of risk factors within clusters. More HRH participants fell in our haplotype transmission networks compared to MSM, IDU, and participants with unknown risk (HRH = 23, MSM = 18, UNK = 11 each & IDU = 6). A total of 58.3% of the clusters that included an HRH participant also had an MSM participant. Likewise, a US study that included 12 major US cities⁶³ found transmission clusters that contained overlap between participants who were MSM and HRH. Mixing of risk types in HIV-1 subtype B transmission clusters has also been observed in Switzerland, Iceland, and Nordic European countries^60,64,65. Contrarily, Kouyos et al.⁶⁵ found segregation based on location among individuals who were included in a transmission cluster despite having overlapping risk factors. Risk groups may be mixing due to underreporting of risk behaviors or bisexual behavior^65,66,67. This heterogeneity of risk groups in transmission clusters suggests that focusing on individuals within city areas (e.g., wards in Washington, DC) to concentrate resources and information may help in addressing the HIV-1 epidemic.

Otherwise, we were unable to determine the mode of transmission for the “unknown modes of transmission” group (16.2% of our sample). Nonetheless, our results suggest that the mode of transmission may not be as important for prevention and intervention efforts as the location where transmission events are occurring. Likewise, Morgan et al.⁵⁹ suggested not targeting efforts towards risk groups, but rather age groups, particularly younger people, in Chicago. The average age of the DC participants included in a haplotype transmission cluster was 46.6 years of age. If DC’s younger population is being the most affected, as suggested by the new cases identified by the DC DOH in 2016 and 2017^1,3, taking a spatial dynamic approach to intervention with continued surveillance may help. Through surveillance studies, further adapted location-based prevention efforts can be employed.

Notably, our analysis has some limitations. We included only 68 participants of the approximately 10,000 people enrolled in the DC Cohort³, whereas past studies conducted in DC included 700 (Kassaye et al.) and 1,500 participants (Pérez-Losada et al.). However, the demographics in our sample size are similar to PLWH in DC^3,4. As a prospective study conducted as part of an ongoing HIV-1 surveillance program associated with the DC Cohort, we capitalized on all the current cases that met our inclusion criteria (see Matierals and Methods: DC cohort). These past studies of HIV-1 diversity in Washington, DC were historical in nature and, therefore, had larger sample sizes available. Our study also applied a powerful next-generation sequencing approach instead of Sanger sequencing (previous DC studies), to characterize the current HIV-1 epidemic. With the implementation of NGS, mapping very diverse short reads to a reference genome poses alignment issues¹⁴, which can add difficulty to downstream analyses. We circumnavigated this alignment issue by using HAPHPIPE, where the reads were mapped against a tailored reference genome, thus resulting in higher alignment rates and fewer errors. Nonetheless, aligning very diverse reads still remains an issue. Although new sites under selection were identified using NGS, their clinical relevance as potential DRMs requires further validation. We also recognize that we used conservative genetic distance cutoff values for determining transmission clusters, which could result in lower numbers of transmission clusters⁶⁸; however, this conservative estimate reduced the number of false positive transmission clusters. Finally, due to the nature of predicting transmission clusters and the potential for missing individuals, we are not able to determine the direction of infection within the transmission clusters. We were also unable to rule out batch effects or laboratory artifacts accounting for any transmission cluster which participants were included in the same sequencing run.

Conclusions

This study showed that NGS and epidemiological data can be used to characterize the current phylodynamics of a subset of people living with HIV, enhancing our understanding of the diversity and local dynamics of the HIV-1 epidemic in the DC area. HIV-1 diversity in DC is high and seems to remain stable over time. Furthermore, NGS of the envelope gene provided sufficient coverage to compare transmission cluster inference across HIV-1 gene regions^14,20. Additional transmission clusters were identified when using HIV-1 intra-host haplotypes instead of consensus sequences, which led to networks linking a higher number of participants. Moreover, transmission clusters varied across genes, with each gene suggesting a different transmission story. Hence, using multiple HIV-1 genes or whole genomes is recommended to infer more reliable transmission clusters. Inferred clusters should then be linked to locations in DC to target transmission intervention efforts. Additionally, HIV-1 drug resistance was only found when using haplotypes in a single young adult in our cross-sectional sample of the cohort. Future studies should also focus on age groups and geographic regions rather than only risk factors. As the DC area maintains significant rates of HIV-1 infection, integrating present and past molecular data from previous studies conducted in DC in 2017²⁹ and 2013²⁸ will help to paint a comprehensive picture of the HIV-1 transmission and evolution of drug resistance in this high prevalence urban U.S. city. Future HIV-1 phylodynamic studies should also include more participants, particularly young adults, and newly diagnosed persons to provide a comprehensive view of DRM prevalence in treatment-naïve individuals in the DC area. Studies revealing the severity of transmitted drug resistance in the DC population may provide physicians and public health workers with additional information to design more effective treatment plans for newly diagnosed individuals and intervention strategies for targeted key populations.

Materials and Methods

Ethics

Institutional Review Board (IRB071029) approval was obtained from The George Washington University IRB (which serves as the IRB of Record for eight of the participating sites), the DC DOH IRB, and the remaining site IRBs. Informed consent was obtained and documented prior to conducting study procedures. Sample collections from participants were performed in accordance with relevant guidelines and regulations.

DC cohort

Participants from the DC Cohort were recruited for this molecular epidemiology sub-study from January 2016 through May 2017. Eligibility criteria included current DC Cohort enrollment, ≥18 years of age, HIV-1 diagnosis within prior 12 months of enrollment or detectable HIV-1 viral load of ≥1,500 copies/mL, ability to provide written informed consent, and completion of a behavioral survey; a total of 104 participants met the eligibility criteria. Blood samples were collected at the clinical sites and transported to George Washington University for processing, targeted amplification, library preparation and NGS. Sample sequences were paired with clinical and demographic data retrieved from the database from the DC Cohort (Table 1). Clinical and demographic characteristics collected included age, race/ethnicity, sex at birth, gender, country of birth, state of residence, zip code, HIV-1 risk factor, presence of co-infections (e.g., chlamydia, gonorrhea, syphilis, trichomoniasis, and Hepatitis B and C), duration of infection, CD4 count, viral load, ART exposure, ART regimen type, date of sample, and date of HIV-1 diagnosis. The paired data were de-identified and analyzed using the approaches described below.

Next-Generation sequencing

Total RNA was extracted from each patient’s plasma sample, and cDNA synthesis followed. The QIAamp Viral RNA Mini Kit (Cat. #52904, Qiagen, Gaithersburg, MD) and the SuperScript^™ IV First-Strand Synthesis System (Cat. # 18091050, Invitrogen, Carlsbad, CA) were used respectively and according to manufacturers’ instructions. Multiple sets of HIV-1 specific primer pairs were used to target and amplify using polymerase-chain-reaction (PCR) the protease (PR), reverse transcriptase (RT), integrase (int), and envelope (env) HIV-1 genes (~43% of genome)³⁶. Library preparation was completed with Nextera XT Library Prep (Cat. # 15032350, Illumina, Dan Diego, CA). Samples were then sequenced on eight runs on an Illumina MiSeq platform using the MiSeq v2 (300 cycles) chemistry (Cat. # MS-102–2002, Illumina). Both library prep and sequencing were completed according to the manufacturer’s instructions. All DNA sequence files are available from the GenBank database under SRA accession: PRJNA517147.

Sequence analyses

The raw sequence data for each patient were processed through HAPHPIPE (https://github.com/gwcbi/haphpipe), a HAplotype reconstruction and PHylodynamics PIPEline for genome-wide assembly of viral consensus sequences and haplotypes⁶⁹. Briefly, HAPHPIPE includes modules for quality trimming, error correction, assembly, and haplotype reconstruction. We put the raw sequencing FASTQ files through quality control and quality trimming with Trimmomatic⁷⁰. Error correction of the reads was completed with an earlier version of HAPHPIPE that used BLESS⁷¹, and the cleaned reads were mapped against the current HIV-1 subtype B reference sequence HXB2 (Genbank accession: K03455)⁷². Through iterative refinement, the cleaned reads were then mapped back to the reference sequence generated in the mapping step with Bowtie2⁷³. This iterative refinement step was completed twice, first using only a random subsampling of the reads (25% subsampling) and the fast-local mapping option to speed up the computational time, and second using all of the sequence reads and the very sensitive mapping option to further refine the individually crafted reference sequence. A consensus sequence was generated from the refined reference sequence, and a final refinement step was concluded with BLAST⁷⁴ against this refined consensus sequence. The resulting sequences were filtered to include participants that contained a passing amplicon, defined as having > 95% of the amplicon covered by 10x or greater read coverage. Amplicons that did not pass this filter were removed, and this subset was then used for subsequent phylodynamic analyses (Table 1).

Sequence data for each PCR amplicon (PR/RT, int, and env) were aligned individually using MAFFT with the L-INS-i algorithm⁷⁵ in Geneious (ver. 9.1.6)⁷⁶. Protease (PR) and reverse transcriptase (RT) were extracted from the PR/RT amplicon, and env was further divided into the variable regions: gp120 V1V2 (HXB2 coordinates: 6615–6812) and gp120 V3 (HXB2: 6984–7349). PR, RT, and int were concatenated into the pol(c) gene region, and V1V2 and V3 were concatenated into the env(c) gene region. Concatenated gene regions will be distinguished from whole gene regions by adding “(c)” to the end of the gene name. Each gene (PR, RT, int, V1V2, and V3) was extracted from the amplicon data to fulfill different purposes: (1) remove any nucleotides belonging to other genes, for example the env PCR amplicon contained primer sequences, and therefore nucleotides belonging to the vpu gene region; (2) simulate amplicon sizes that could be covered end to end by paired-end reads to be consistent and comparable to future NGS studies with HIV-1 when using PrimerID or other local haplotype phasing techniques^8,77; and (3) account for differences in PCR performance between and within samples by extracting a common, high-coverage region. Therefore, missing data in this dataset were low, and often only due to amplification failure of an entire amplicon. Concatenating the genes into their respective gene regions (pol and env) retained variants in genes that are often studied, such as protease and reverse transcriptase for drug resistant mutations. It also allowed comparisons to past studies based on Sanger sequencing that used either parts of genes or whole genes. Our overall goal was to keep the integrity of the individual genes while using as much of the NGS data as possible.

Identification of subtypes and drug resistant mutations

HIV-1 subtype identification was completed for each concatenated gene region (pol(c) and env(c)) using the REGA subtyping tool (version 3)^78,79. A total of 170 subtype reference sequences from the Los Alamos HIV-1 database (LANL; http://www.hiv.lanl.gov/) were included to assign the patient sequences to a particular subtype clade and validate the findings from REGA using phylogenetic methods described below. Drug resistant mutations were identified aligning the consensus concatenated gene nucleotide sequences with reference strains in the Stanford HIV Drug Resistance Database (https://hivdb.stanford.edu) using the HIVdb program⁸⁰. Nucleotide positions under positive selection were identified using Fast Unconstrained Bayesian AppRoximation (FUBAR)⁸¹ in HyPhy⁸². Recombination in our HIV-1 data was accounted for with GARD^83,84.

Phylogenetic analyses

Phylogenetic estimations were completed for each concatenated gene region. The best-fit model of molecular evolution⁸⁵ was estimated for each pol(c) and env(c) from the data using jModelTest2⁸⁶ in CIPRES Science Gateway⁸⁷. Amino acid positions corresponding to identified DRMs described above were removed prior to phylogenetic estimations to avoid potential bias due to selection. A maximum likelihood phylogenetic estimate using RAxML⁸⁸ was made for each region with the 3 codon-position partitions⁸⁹. The branch support for the RAxML phylogenetic trees was estimated with a bootstrap approach with 1,000 replicates⁹⁰. Bayesian trees were inferred using MrBayes⁹¹. Four Markov chains (one cold and three heated) were run for 8 × 10⁸ generations sampling every 2,000 steps for each gene region, and each run was repeated twice. The output was analyzed in Tracer⁹² to assess convergence and mixing of the chains. Subtypes references for subtype D (GenBank accessions: K03454, AY371157, AY253311, U88824) and circulating recombinant forms CRF28,42-BF (GenBank accessions: FJ213781, FJ358521, FJ670529) and CRF10-CD (GenBank accessions: AF289548, AF289549, AF289550) were pulled from LANL and used as proper outgroups for the phylogenetic analyses⁹³. Additional RT sequences from DC²⁹ were included to observe how our data related to other DC sequences. We visualized the epidemiological data on the resulting trees with the Interactive Tree of Life (iTOL).

Haplotype reconstruction

For the identification of transmission clusters and testing for associations of clinical variables to transmission clusters, it is ideal to characterize within patient viral variation as individual sequence variants (haplotypes) instead of combining all of the individual reads into a single consensus sequence^94,95. Therefore, haplotypes for each patient were predicted from the sequence data using HAPHPIPE’s haplotype stages. Haplotype reconstruction was performed on each PR/RT, int, and env targeted PCR amplicons using PredictHaplo⁹⁶. Each gene region (PR, RT, V1V2, and V3) was then extracted from the corresponding targeted amplicon and, using the methods described below, transmission clusters were estimated using the predicted haplotypes. No concatenation of the individual genes to form the regions pol(c) and env(c) was done with the haplotypes.

Identification of transmission clusters

Transmission clusters were assessed for each pol(c) and env(c), as well as for each of the gene regions with the haplotypes, using phylogenetic methods^89,91 and the genetic-distance based clustering method HIV-TRACE⁹⁷. Phylogenetic transmission networks were defined as clades with bootstrap proportions ≥70 or posterior probabilities ≥95%. Genetic distance thresholds of 0.01^29,62 and 0.02⁹⁸ substitutions/site were used for pol and env in HIV-TRACE, respectively, to identify potential transmission events. Ambiguities were handled with the HIV-TRACE option “average” to avoid biases and false positives and minimum overlap was 1/genetic distance threshold and adjusted for size of amplicon, as recommended. Default settings were used for the remaining parameters. Transmission clusters were compared between gene regions.

Diversity estimation

Haplotype diversity (h), the number of segregating sites (S), nucleotide diversity (π), and Watterson’s genetic diversity (θ) (see⁹⁹) were estimated for both the consensus pol(c) and env(c) regions per patient using DnaSP (ver. 6.11.01)¹⁰⁰. Haplotype diversity (h), which takes into account the number of haplotypes and their relative frequencies, was also estimated from PredictHaplo results according to Nei and Tajima¹⁰¹. Both diversity estimates were used as the number of haplotypes estimated from DnaSP is representative of inter-patient diversity, whereas the number of haplotypes and haplotype diversity estimates from the PredictHaplo results are representative of intra-patient diversity. Significance for haplotype diversity between clinical variables was measured with the Wilcoxon rank sum test or Kruskal-Wallis test in R v 3.6.0¹⁰² using RStudio v 1.2.1335¹⁰³.

Ethical approval

Written informed consent was obtained from all participants prior to enrollment in the DC Cohort and the molecular epidemiology sub-study. The DC Cohort and molecular epidemiology studies were approved by the Institutional Review Board at The George Washington University, which serves as the IRB of record for Whitman-Walker Health, La Clinica del Pueblo, Family and Medical Counseling Service, Unity Health Care, The GW Medical Faculty Associates, MetroHealth, and Children’s National Health System (pediatric and adolescent clinics). The study was independently approved by the IRBs of record at Howard University Hospital (adult and pediatric clinics), MedStar Washington Hospital Center, Georgetown University, and the Veterans Affairs Medical Center.

Change history

22 April 2020
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

District of Columbia Department of Health HIV/AIDS, H., STD and TB Administration (HAHSTA),. Annual Epidemiology & Surveillance Report: Surveillance Data Through December 2015. Washinton, DC Department of Health, (2016).
District of Columbia Department of Health HIV/AIDS, H., STD and TB Administration (HAHSTA),. Annual Epidemiology & Surveillance Report: Surveillance Data Through December 2016. Washinton, DC Department of Health, (2017).
District of Columbia Department of Health HIV/AIDS, H., STD and TB Administration (HAHSTA),. Annual Epidemiology & Surveillance Report: Surveillance Data Through December 2017. Washinton, DC Department of Health, (2018).
District of Columbia Department of Health HIV/AIDS, H., STD and TB Administration (HAHSTA),. Annual Epidemiology & Surveillance Report: Surveillance Data Through December 2018. (2019).
Volz, E. M., Koelle, K. & Bedford, T. Viral phylodynamics. PLoS Comput. Biol. 9, e1002947, https://doi.org/10.1371/journal.pcbi.1002947 (2013).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Pérez-Losada, M., Arenas, M. & Castro-Nallar, E. Microbial sequence typing in the genomic era. Infection, Genet. Evolution. https://doi.org/10.1016/j.meegid.2017.09.022 (2017).
Article Google Scholar
Alidjinou, E. K. et al. RNA and DNA Sanger sequencing versus next-generation sequencing for HIV-1 drug resistance testing in treatment-naive patients. J. Antimicrob. Chemother. 72, 2823–2830, https://doi.org/10.1093/jac/dkx232 (2017).
Article CAS PubMed Google Scholar
Posada-Cespedes, S., Seifert, D. & Beerenwinkel, N. Recent advances in inferring viral diversity from high-throughput sequencing data. Virus Res. 239, 17–32, https://doi.org/10.1016/j.virusres.2016.09.016 (2017).
Article CAS PubMed Google Scholar
Duffy, S., Shackelton, L. A. & Holmes, E. C. Rates of evolutionary change in viruses: patterns and determinants. Nat. Rev. Genet. 9, 267–276, https://doi.org/10.1038/nrg2323 (2008).
Article CAS PubMed Google Scholar
Kyeyune, F. et al. Low-Frequency Drug Resistance in HIV-Infected Ugandans on Antiretroviral Treatment Is Associated with Regimen Failure. Antimicrob. Agents Chemother. 60, 3380–3397, https://doi.org/10.1128/AAC.00038-16 (2016).
Article CAS PubMed PubMed Central Google Scholar
Simen, B. B. et al. Low-abundance drug-resistant viral variants in chronically HIV-infected, antiretroviral treatment-naive patients significantly impact treatment outcomes. J. Infect. Dis. 199, 693–701, https://doi.org/10.1086/596736 (2009).
Article PubMed Google Scholar
Vandenhende, M.-A. et al. Prevalence and Evolution of Low Frequency HIV Drug Resistance Mutations Detected by Ultra Deep Sequencing in Patients Experiencing First Line Antiretroviral Therapy Failure. PLoS One 9, e86771, https://doi.org/10.1371/journal.pone.0086771.t001 (2014).
Article ADS PubMed PubMed Central Google Scholar
Lapointe, H. R. et al. HIV drug resistance testing by high-multiplex “wide” sequencing on the MiSeq instrument. Antimicrob. Agents Chemother. 59, 6824–6833, https://doi.org/10.1128/AAC.01490-15 (2015).
Article CAS PubMed PubMed Central Google Scholar
Maldarelli, F. et al. HIV populations are large and accumulate high genetic diversity in a nonlinear fashion. J. Virol. 87, 10313–10323, https://doi.org/10.1128/JVI.01225-12 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hassan, A. S., Pybus, O. G., Sanders, E. J., Albert, J. & Esbjornsson, J. Defining HIV-1 transmission clusters based on sequence data. AIDS 31, 1211–1222, https://doi.org/10.1097/QAD.0000000000001470 (2017).
Article PubMed Google Scholar
Lemey, P., Rambaut, A. & Pybus, O. HIV evolutionary dynamics within and among hosts. AIDS Rev. 8, 125–140 (2006).
PubMed Google Scholar
Grabowski, M. & Redd, A. Molecular tools for studying HIV transmission in sexual networks. Curr. Opin. HIV. AIDS 9, 126–133, https://doi.org/10.1097/COH.0000000000000040 (2014).
Article PubMed PubMed Central Google Scholar
Boltz, V. F. et al. Ultrasensitive single-genome sequencing: accurate, targeted, next generation sequencing of HIV-1 RNA. Retrovirology 13, 87, https://doi.org/10.1186/s12977-016-0321-6 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zanini, F. et al. Population genomics of intrapatient HIV-1 evolution. Elife 4, https://doi.org/10.7554/eLife.11282 (2015).
Lemey, P. et al. Molecular footprint of drug-selective pressure in a human immunodeficiency virus transmission chain. J. Virol. 79, 11981–11989, https://doi.org/10.1128/JVI.79.18.11981-11989.2005 (2005).
Article CAS PubMed PubMed Central Google Scholar
Haim, H., Salas, I. & Sodroski, J. Proteolytic processing of the human immunodeficiency virus envelope glycoprotein precursor decreases conformational flexibility. J. Virol. 87, 1884–1889, https://doi.org/10.1128/JVI.02765-12 (2013).
Article CAS PubMed PubMed Central Google Scholar
Mammano, F. et al. HIV-1 envelope sequence-based diversity measures for identifying recent infections. Plos One 12, https://doi.org/10.1371/journal.pone.0189999 (2017).
Eshleman, S. H. et al. Analysis of genetic linkage of HIV from couples enrolled in the HIV Prevention Trials Network 052 trial. J. Infect. Dis. 204, 1918–1926, https://doi.org/10.1093/infdis/jir651 (2011).
Article CAS PubMed PubMed Central Google Scholar
Vrancken, B. et al. The genealogical population dynamics of HIV-1 in a large transmission chain: bridging within and among host evolutionary rates. PLoS Comput. Biol. 10, e1003505, https://doi.org/10.1371/journal.pcbi.1003505 (2014).
Article CAS PubMed PubMed Central Google Scholar
Novitsky, V., Moyo, S. & Essex, M. Phylogenetic Inference of HIV Transmission Clusters. Infect. Dis. Transl. Med. 3, 51–59, https://doi.org/10.11979/idtm.201702007 (2017).
Article Google Scholar
Wensing, A. M. et al. 2017 Update of the Drug Resistance Mutations in HIV-1. Topics in Antiviral Medicine 24 (2017).
Pham, H. T. et al. The S230R Integrase Substitution Associated With Virus Load Rebound During Dolutegravir Monotherapy Confers Low-Level Resistance to Integrase Strand-Transfer Inhibitors. J. Infect. Dis. 218, 698–706, https://doi.org/10.1093/infdis/jiy175 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kassaye, S. G. et al. Transmitted HIV Drug Resistance Is High and Longstanding in Metropolitan Washington, DC. Clin. Infect. Dis. 63, 836–843, https://doi.org/10.1093/cid/ciw382 (2016).
Article PubMed PubMed Central Google Scholar
Pérez-Losada, M. et al. Characterization of HIV diversity, phylodynamics and drug resistance in Washington, DC. PLoS One 12, e0185644, https://doi.org/10.1371/journal.pone.0185644 (2017).
Article CAS PubMed PubMed Central Google Scholar
Santoro, M. M. & Perno, C. F. HIV-1 Genetic Variability and Clinical Implications. ISRN Microbiol. 2013, 481314, https://doi.org/10.1155/2013/481314 (2013).
Article PubMed PubMed Central Google Scholar
Pérez-Losada, M. et al. Phylodynamics of HIV-1 from a phase-III AIDS vaccine trial in North America. Mol. Biol. Evol. 27, 417–425, https://doi.org/10.1093/molbev/msp254 (2010).
Article CAS PubMed Google Scholar
Sterrett, S. et al. Low Multiplicity of HIV-1 Infection and No Vaccine Enhancement in VAX003 Injection Drug Users. Open. Forum Infect. Dis. 1, ofu056, https://doi.org/10.1093/ofid/ofu056 (2014).
Article PubMed PubMed Central Google Scholar
Masharsky, A. E. et al. A substantial transmission bottleneck among newly and recently HIV-1-infected injection drug users in St Petersburg, Russia. J. Infect. Dis. 201, 1697–1702, https://doi.org/10.1086/652702 (2010).
Article PubMed Google Scholar
Gibson, K. M. et al. A 28-Year History of HIV-1 Drug Resistance and Transmission in Washington, DC. Front. Microbiology 10, 369 (2019).
Article Google Scholar
Pérez-Losada, M. et al. Phylodynamics of HIV-1 from a phase III AIDS vaccine trial in Bangkok, Thailand. PLoS One 6, e16902, https://doi.org/10.1371/journal.pone.0016902 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Jair, K. et al. Validation of publicly-available software used in analyzing NGS data for HIV-1 drug resistance mutations and transmission networks in a Washington, DC, Cohort. PLOS ONE 14, e0214820, https://doi.org/10.1371/journal.pone.0214820 (2019).
Article CAS PubMed PubMed Central Google Scholar
Rogo, T., DeLong, A. K., Chan, P. & Kantor, R. Antiretroviral treatment failure, drug resistance, and subtype diversity in the only pediatric HIV clinic in Rhode Island. Clin. Infect. Dis. 60, 1426–1435, https://doi.org/10.1093/cid/civ058 (2015).
Article PubMed PubMed Central Google Scholar
Kuhnert, D. et al. Quantifying the fitness cost of HIV-1 drug resistance mutations through phylodynamics. PLoS Pathog. 14, e1006895, https://doi.org/10.1371/journal.ppat.1006895 (2018).
Article CAS PubMed PubMed Central Google Scholar
Gupta, R. K. et al. HIV-1 drug resistance before initiation or re-initiation of first-line antiretroviral therapy in low-income and middle-income countries: a systematic review and meta-regression analysis. Lancet Infect. Dis. 18, 346–355, https://doi.org/10.1016/s1473-3099(17)30702-8 (2018).
Article PubMed PubMed Central Google Scholar
Wittkop, L. et al. Effect of transmitted drug resistance on virological and immunological response to initial combination antiretroviral therapy for HIV (EuroCoord-CHAIN joint project): a European multicohort study. Lancet Infect. Dis. 11, 363–371, https://doi.org/10.1016/S1473-3099(11)70032-9 (2011).
Article PubMed Google Scholar
Tostevin, A. et al. Recent trends and patterns in HIV-1 transmitted drug resistance in the United Kingdom. HIV. Med. 18, 204–213, https://doi.org/10.1111/hiv.12414 (2017).
Article CAS PubMed Google Scholar
Buchacz, K. et al. Trends in use of genotypic resistance testing and frequency of major drug resistance among antiretroviral-naive persons in the HIV Outpatient Study, 1999–2011. J. Antimicrob. Chemother. 70, 2337–2346, https://doi.org/10.1093/jac/dkv120 (2015).
Article CAS PubMed Google Scholar
Schmidt, D. et al. Estimating trends in the proportion of transmitted and acquired HIV drug resistance in a long term observational cohort in Germany. PLoS One 9, e104474, https://doi.org/10.1371/journal.pone.0104474 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Frange, P. et al. HIV-1 subtype B-infected MSM may have driven the spread of transmitted resistant strains in France in 2007–12: impact on susceptibility to first-line strategies. J. Antimicrob. Chemother. 70, 2084–2089, https://doi.org/10.1093/jac/dkv049 (2015).
Article CAS PubMed Google Scholar
Hofstra, L. M. et al. Transmission of HIV Drug Resistance and the Predicted Effect on Current First-line Regimens in Europe. Clin. Infect. Dis. 62, 655–663, https://doi.org/10.1093/cid/civ963 (2016).
Article CAS PubMed Google Scholar
Wilen, C. B., Tilton, J. C. & Doms, R. W. HIV: cell binding and entry. Cold Spring Harb Perspect Med 2, https://doi.org/10.1101/cshperspect.a006866 (2012).
Yerly, S. et al. The impact of transmission clusters on primary drug resistance in newly diagnosed HIV-1 infection. AIDS 23, 1415–1423, https://doi.org/10.1097/QAD.0b013e32832d40ad (2009).
Article PubMed Google Scholar
Zulu, L. C., Kalipeni, E. & Johannes, E. Analyzing spatial clustering and the spatiotemporal nature and trends of HIV/AIDS prevalence using GIS: the case of Malawi, 1994–2010. BMC Infectious Diseases 14 (2014).
Brenner, B. G. et al. Transmission networks of drug resistance acquired in primary/early stage HIV infection. AIDS 22, 2509–2515, https://doi.org/10.1097/QAD.0b013e3283121c90 (2008).
Article PubMed Google Scholar
Brenner, B. G. et al. High rates of forward transmission events after acute/early HIV-1 infection. J. Infect. Dis. 195, 951–959, https://doi.org/10.1086/512088 (2007).
Article CAS PubMed Google Scholar
Bezemer, D. et al. Transmission networks of HIV-1 among men having sex with men in the Netherlands. AIDS 24, 271–282, https://doi.org/10.1097/QAD.0b013e328333ddee (2010).
Article PubMed Google Scholar
Nguyen, L. et al. Genetic Analysis of Incident HIV-1 Strains Among Injection Drug Users in Bangkok: Evidence for Multiple Transmission Clusters During a Period of High Incidence. J. Acquir. Immune Defic. Syndr. 30, 248–256 (2002).
Article PubMed Google Scholar
Thomson, M. et al. Molecular epidemiology of HIV-1 in St Petersburg, Russia: predominance of subtype A, former Soviet Union variant, and identification of intrasubtype subclusters. J. Acquir. Immune Defic. Syndr. 51, 332–339, https://doi.org/10.1097/QAI.0b013e31819c1757 (2009).
Article CAS PubMed Google Scholar
Chalmet, K. et al. Epidemiological study of phylogenetic transmission clusters in a local HIV-1 epidemic reveals distinct differences between subtype B and non-B infections. BMC Infectious Diseases 10, http://www.biomedcentral.com/1471-2334/10/262 (2010).
Ambrosioni, J. et al. Impact of highly active antiretroviral therapy on the molecular epidemiology of newly diagnosed HIV infections. AIDS 26, 2079–2086, https://doi.org/10.1097/QAD.0b013e32835805b6 (2012).
Article CAS PubMed Google Scholar
Cuevas, M. T. et al. HIV-1 transmission cluster with T215D revertant mutation among newly diagnosed patients from the Basque Country, Spain. J. Acquir. Immune Defic. Syndr. 51, 99–103, https://doi.org/10.1097/QAI.0b013e318199063e (2009).
Article PubMed Google Scholar
Pao, D. et al. Transmission of HIV-1 during primary infection: relationship to sexual risk and sexually transmitted infections. AIDS 19, 85–90, 00002030-200501030-00010 (2005).
Ahumada-Ruiz, S., Flores-Figueroa, D., Toala-Gonzalez, I. & Thomson, M. M. Analysis of HIV-1 pol sequences from Panama: identification of phylogenetic clusters within subtype B and detection of antiretroviral drug resistance mutations. Infect. Genet. Evol. 9, 933–940, https://doi.org/10.1016/j.meegid.2009.06.013 (2009).
Article CAS PubMed Google Scholar
Morgan, E. et al. HIV-1 Infection and Transmission Networks of Younger People in Chicago, Illinois, 2005–2011. Public. Health Rep. 132, 48–55, https://doi.org/10.1177/0033354916679988 (2017).
Article PubMed Google Scholar
Sallam, M. et al. Molecular epidemiology of HIV-1 in Iceland: Early introductions, transmission dynamics and recent outbreaks among injection drug users. Infect. Genet. Evol. 49, 157–163, https://doi.org/10.1016/j.meegid.2017.01.004 (2017).
Article PubMed Google Scholar
Hakre, S. et al. Characteristics of HIV-infected USArmy soldiers linked in molecular transmission clusters, 2001–2012. PLoS One 12, e0182376, https://doi.org/10.1371/journal.pone.0182376 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wertheim, J. O. et al. Social and Genetic Networks of HIV-1 Transmission in New York City. PLoS Pathog. 13, e1006000, https://doi.org/10.1371/journal.ppat.1006000 (2017).
Article CAS PubMed PubMed Central Google Scholar
Raymond, H. F. et al. HIV Among MSM and Heterosexual Women in the United States: An Ecologic Analysis. J. Acquir. Immune Defic. Syndr. 75, S276–S280 (2017).
Article PubMed Google Scholar
Esbjornsson, J. et al. HIV-1 transmission between MSM and heterosexuals, and increasing proportions of circulating recombinant forms in the Nordic Countries. Virus Evol. 2, vew010, https://doi.org/10.1093/ve/vew010 (2016).
Article PubMed PubMed Central Google Scholar
Kouyos, R. D. et al. Molecular epidemiology reveals long-term changes in HIV type 1 subtype B transmission in Switzerland. J. Infect. Dis. 201, 1488–1497, https://doi.org/10.1086/651951 (2010).
Article PubMed Google Scholar
German, D., Grabowski, M. K. & Beyrer, C. Enhanced use of phylogenetic data to inform public health approaches to HIV among men who have sex with men. Sex. Health 14, 89–96, https://doi.org/10.1071/SH16056 (2017).
Article PubMed PubMed Central Google Scholar
Dennis, A. M. et al. Phylogenetic insights into regional HIV transmission. AIDS 26, 1813–1822, https://doi.org/10.1097/QAD.0b013e3283573244 (2012).
Article PubMed Google Scholar
Hightower, G. K. et al. HIV-1 clade B pol evolution following primary infection. PLoS One 8, e68188, https://doi.org/10.1371/journal.pone.0068188 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Bendall, M. L., Gibson, K. M., Steiner, M. C., Pérez-Losada, M. & Crandall, K. A. HAPHPIPE: Haplotype reconstruction and real-time phylodynamics for deep sequencing of intra-host viral populations. Submitted to Molecular Biology and Evolution (2019).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinforma. 30, 2114–2120, https://doi.org/10.1093/bioinformatics/btu170 (2014).
Article CAS Google Scholar
Heo, Y., Ramachandran, A., Hwu, W.-M., Ma, J. & Chen, D. BLESS 2: accurate, memory-efficient and fast error correction method. Bioinformatics 32 (2016).
Korber, B. T., Foley, B. T., Kuiken, C. L., Pillai, S. K. & Sodroski, J. G. Numbering Positions in HIV Relative to HXB2CG, https://www.hiv.lanl.gov/content/sequence/HIV/REVIEWS/HXB2.html (2014).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359, https://doi.org/10.1038/nmeth.1923 (2012).
Article CAS PubMed PubMed Central Google Scholar
Boratyn, G. M. et al. BLAST: a more efficient report with usability improvements. Nucleic Acids Res. 41, W29–33, https://doi.org/10.1093/nar/gkt282 (2013).
Article PubMed PubMed Central Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780, https://doi.org/10.1093/molbev/mst010 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kearse, M. et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinforma. 28, 1647–1649, https://doi.org/10.1093/bioinformatics/bts199 (2012).
Article Google Scholar
Zhou, S., Jones, C., Mieczkowski, P., Swanstrom, R. & Primer, I. D. Validates Template Sampling Depth and Greatly Reduces the Error Rate of Next-Generation Sequencing of HIV-1 Genomic RNA Populations. J. Virol. 89, 8540–8555, https://doi.org/10.1128/JVI.00522-15 (2015).
Article CAS PubMed PubMed Central Google Scholar
de Oliveira, T. et al. An automated genotyping system for analysis of HIV-1 and other microbial sequences. Bioinforma. 21, 3797–3800, https://doi.org/10.1093/bioinformatics/bti607 (2005).
Article CAS Google Scholar
Alcantara, L. C. et al. A standardized framework for accurate, high-throughput genotyping of recombinant and non-recombinant viral sequences. Nucleic Acids Res. 37, W634–642, https://doi.org/10.1093/nar/gkp455 (2009).
Article CAS PubMed PubMed Central Google Scholar
Rhee, S. Y. Human immunodeficiency virus reverse transcriptase and protease sequence database. Nucleic Acids Res. 31, 298–303, https://doi.org/10.1093/nar/gkg100 (2003).
Article CAS PubMed PubMed Central Google Scholar
Murrell, B. et al. FUBAR: a fast, unconstrained bayesian approximation for inferring selection. Mol. Biol. Evol. 30, 1196–1205, https://doi.org/10.1093/molbev/mst030 (2013).
Article CAS PubMed PubMed Central Google Scholar
Pond, S. L., Frost, S. D. & Muse, S. V. HyPhy: hypothesis testing using phylogenies. Bioinforma. 21, 676–679, https://doi.org/10.1093/bioinformatics/bti079 (2005).
Article CAS Google Scholar
Kosakovsky Pond, S. L., Posada, D., Gravenor, M. B., Woelk, C. H. & Frost, S. D. GARD: a genetic algorithm for recombination detection. Bioinforma. 22, 3096–3098, https://doi.org/10.1093/bioinformatics/btl474 (2006).
Article CAS Google Scholar
Kosakovsky Pond, S. L., Posada, D., Gravenor, M. B., Woelk, C. H. & Frost, S. D. Automated phylogenetic detection of recombination using a genetic algorithm. Mol. Biol. Evol. 23, 1891–1901, https://doi.org/10.1093/molbev/msl051 (2006).
Article PubMed Google Scholar
Posada, D. & Crandall, K. A. Selecting models of nucleotide substitution: An application to Human Immuno- deficiency Virus 1 (HIV-1). Mol. Biol. evolution 18, 897–906 (2001).
Article CAS Google Scholar
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. jModelTest 2: more models, new heuristics and parallel computing. Nat. Methods 9, 772, https://doi.org/10.1038/nmeth.2109 (2012).
Article CAS PubMed PubMed Central Google Scholar
Miller, M., Pfeiffer, W. & Schwartz, T. In Proceedings of the Gateway Computing Environments Workshop (GCE). 1–8.
Felsenstein, J. Evolutionary Trees from DNA Sequences: A Maximum Likelihood Approach. J. Mol. Evolution 17, 368–376 (1981).
Article ADS CAS Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinforma. 30, 1312–1313, https://doi.org/10.1093/bioinformatics/btu033 (2014).
Article CAS Google Scholar
Felsenstein, J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39, 783–791, https://doi.org/10.1111/j.1558-5646.1985.tb00420.x (1985).
Article PubMed Google Scholar
Huelsenbeck, J. P. & Ronquist, F. MRBAYES: Bayesian inference of phylogenetic trees. Bioinforma. 17, 754–755 (2001).
Article CAS Google Scholar
Rambaut, A., Drummond, A., D, Z., G, B. & MA, S. Tracer v1.7, http://tree.bio.ed.ac.uk/software/tracer/ (2018).
Castro-Nallar, E., Perez-Losada, M., Burton, G. F. & Crandall, K. A. The evolution of HIV: inferences using phylogenetics. Mol. Phylogenet Evol. 62, 777–792, https://doi.org/10.1016/j.ympev.2011.11.019 (2012).
Article CAS PubMed Google Scholar
Berg, M. G. et al. A Pan-HIV Strategy for Complete Genome Sequencing. J. Clin. Microbiol. 54, 868–882, https://doi.org/10.1128/JCM.02479-15 (2016).
Article CAS PubMed PubMed Central Google Scholar
Aralaguppe, S. G. et al. Multiplexed next-generation sequencing and de novo assembly to obtain near full-length HIV-1 genome from plasma virus. J. Virol. Methods 236, 98–104, https://doi.org/10.1016/j.jviromet.2016.07.010 (2016).
Article CAS PubMed Google Scholar
Prabhakaran, S., Rey, M., Zagordi, O., Beerenwinkel, N. & Roth, V. HIV Haplotype Inference Using a Propagating Dirichlet Process Mixture Model. IEEE/ACM Trans. Comput. Biol. Bioinform 11, 182–191, https://doi.org/10.1109/TCBB.2013.145 (2014).
Article PubMed Google Scholar
Pond, S. L. K., Weaver, S., Brown, A. J. L. & Wertheim, J. O. HIV-TRACE (Transmission Cluster Engine): a tool for large scale molecular epidemiology of HIV-1 and other rapidly evolving pathogens. Molecular Biology and Evolution, msy016, https://doi.org/10.1093/molbev/msy016 (2018).
Rose, R. et al. Identifying Transmission Clusters with Cluster Picker and HIV-TRACE. AIDS Res. Hum. Retroviruses 33, 211–218, https://doi.org/10.1089/AID.2016.0205 (2017).
Article CAS PubMed PubMed Central Google Scholar
Castro-Nallar, E., Crandall, K. A. & Pérez-Losada, M. Genetic diversity and molecular epidemiology of HIV transmission. Future Virology 7, https://doi.org/10.2217/fvl.12.4 (2012).
Rozas, J. et al. DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets. Mol. Biol. evolution 34, 3299–3302, https://doi.org/10.1093/molbev/msx248 (2017).
Article CAS Google Scholar
Nei, M. & Tajima, F. DNA Polymorphism Detectable By Restriction Endonucleases. Genet. 97, 145–163 (1981).
CAS Google Scholar
R Core Team. R: A language and environment for statistical computing., (R Foundation for Statistical Computing, 2014).
RStudio Team. RStudio: Integrated Development for R. (RStudio, Inc., 2018).

Download references

Acknowledgements

First and foremost, we thank the participants who are involved with the DC Cohort. DC Cohort data in this manuscript were collected by the DC Cohort Study Group with investigators and research staff located at: Cerner Corporation (Thilakavathy Subramanian, Jeffery Binkley, Rob Taylor, Nabil Rayeed, Cheryl Akridge, Stacey Purinton, Jeff Naughton); Children’s National Medical Center Adolescent (Lawrence D’Angelo) and Pediatric (Natella Rakhmanina) clinics; The Senior Deputy Director of the DC Department of Health HAHSTA (Michael Kharfen); Family and Medical Counseling Service (Angela Wood, Michael Serlin); Georgetown University (Princy Kumar); George Washington University Medical Faculty Associates (David Parenti); George Washington University Department of Epidemiology and Biostatistics (Alan Greenberg, Anne Monroe, Lindsey Powers Happ, Maria Jaurretche, Brittany Lewis, James Peterson); Howard University Adult Infectious Disease Clinic (Ronald Wilcox), and Pediatric Clinic (Sohail Rana); Kaiser Permanente Mid-Atlantic States (Michael Horberg); La Clinica Del Pueblo, (Ricardo Fernandez); MetroHealth (Annick Hebou); National Institutes of Health (Carl Dieffenbach, Henry Masur); Providence Hospital (Jose Bordon); Unity Health Care (Gebeyehu Teferi); Veterans Affairs Medical Center (Debra Benator); Washington Hospital Center (Maria Elena Ruiz); and Whitman-Walker Health (Deborah Goldstein, David Hardy). We would also like to acknowledge the DC Cohort Community Advisory Board, the DC Cohort Executive Committee Members, and staff of the DC Department of Health HIV/AIDS, Hepatitis, STD, TB Administration Strategic Information Division. We would also like to thank the site PIs, RAs, the DC Department of Health, the DC CFAR, and the National Institutes of Health for their numerous contributions to the DC Cohort. Finally, we thank reviewers for their time and constructive comments to help refine this manuscript. This study was supported by the DC Cohort Study (U01 AI69503–03S2), a supplement from the Women’s Interagency Study for HIV-1 (410722_GR410708), a DC D-CFAR pilot award, and a 2015 HIV-1 Phylodynamics Supplement award from the District of Columbia for AIDS Research, an NIH funded program (AI117970), which is supported by the following NIH Co-Funding and Participating Institutes and Centers: NIAID, NCI, NICHD, NHLBI, NIDA, NIMH, NIA, FIC, NIGMS, NIDDK and OAR. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Author information

Authors and Affiliations

Computational Biology Institute, The Milken Institute School of Public Health, The George Washington University, Washington, DC, 20052, USA
Keylie M. Gibson, Matthew L. Bendall, Keith A. Crandall & Marcos Pérez-Losada
Department of Epidemiology, The Milken Institute School of Public Health, The George Washington University, Washington, DC, 20052, USA
Kamwing Jair, Amanda D. Castel, Brittany Wilbourn & Jeanne A. Jordan
Department of Biostatistics and Bioinformatics, The Milken Institute School of Public Health, The George Washington University, Washington, DC, 20052, USA
Keith A. Crandall, Marcos Pérez-Losada, Alan Greenberg, Anne Monroe, Lindsey Powers Happ, Maria Jaurretche & James Peterson
CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Campus Agrário de Vairão, Vairão, Portugal
Marcos Pérez-Losada
Cerner Corporation, Arlington, VA, 22209, USA
Thilakavathy Subramanian, Jeffery Binkley, Rob Taylor, Nabil Rayeed, Cheryl Akridge, Stacey Purinton & Jeff Naughton
Children’s National Medical Center Pediatric Clinic, Washington, DC, 20010, USA
Natella Rakhmanina
Children’s National Medical Center Adolescent Clinic, Washington, DC, 20010, USA
Larry D’Angelo
DC Department of Health HAHSTA, Washington, DC, 20002, USA
Michael Kharfen
Family and Medical Counseling Service, Washington, DC, 20020, USA
Angela Wood & Michael Serlin
Georgetown University, Washington, DC, 20007, USA
Princy Kumar
George Washington Medical Faculty Associates, Washington, DC, 20037, USA
David Parenti
Howard University Hospital Adult Clinic, Washington, DC, 20059, USA
Ronald D Wilcox
Howard University Hospital Pediatric Clinic, Washington, DC, 20060, USA
Sohail Rana
Kaiser Permanente, Rockville, MD, 20852, USA
Michael A Horberg
La Clinica Del Pueblo, Washington, DC, 20009, USA
Ricardo Fernández
MetroHealth, Washington, DC, 20005, USA
Annick Hebou
National Institutes of Health, Bethesda, Maryland, 20892, USA
Carl Dieffenbach & Henry Masur
Washington Health Institute, Washington, DC, 20017, USA
Jose Bordon
Unity Health Care, Washington, DC, 20019, USA
Gebeyehu Teferi
Veterans Affairs Medical Center, Washington, DC, 20422, USA
Debra Benator
Washington Hospital Center, Washington, DC, 20010, USA
Maria Elena Ruiz
Whitman-Walker Health, Washington, DC, 20036, USA
Deborah Goldstein & David Hardy

Author notes

A comprehensive list of consortium members appears at the end of the paper

Authors

Keylie M. Gibson
View author publications
You can also search for this author in PubMed Google Scholar
Kamwing Jair
View author publications
You can also search for this author in PubMed Google Scholar
Amanda D. Castel
View author publications
You can also search for this author in PubMed Google Scholar
Matthew L. Bendall
View author publications
You can also search for this author in PubMed Google Scholar
Brittany Wilbourn
View author publications
You can also search for this author in PubMed Google Scholar
Jeanne A. Jordan
View author publications
You can also search for this author in PubMed Google Scholar
Keith A. Crandall
View author publications
You can also search for this author in PubMed Google Scholar
Marcos Pérez-Losada
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

the DC Cohort Executive Committee

Thilakavathy Subramanian
, Jeffery Binkley
, Rob Taylor
, Nabil Rayeed
, Cheryl Akridge
, Stacey Purinton
, Jeff Naughton
, Natella Rakhmanina
, Larry D’Angelo
, Michael Kharfen
, Angela Wood
, Michael Serlin
, Princy Kumar
, David Parenti
, Alan Greenberg
, Anne Monroe
, Lindsey Powers Happ
, Maria Jaurretche
, James Peterson
, Ronald D Wilcox
, Sohail Rana
, Michael A Horberg
, Ricardo Fernández
, Annick Hebou
, Carl Dieffenbach
, Henry Masur
, Jose Bordon
, Gebeyehu Teferi
, Debra Benator
, Maria Elena Ruiz
, Deborah Goldstein
& David Hardy

Contributions

K.A.C., M.P.-L. and A.D.C. conceptualized this study and acquired funding, while the project was administered by A.D.C., B.W. and M.P.-L. Sample collection was organized by B.W. and A.D.C. and RNA extraction and DNA sequencing, along with all laboratory analyses, from samples were completed by K.J. and J.A.J. Bioinformatic analysis was completed by K.M.G. and M.L.B. with validation by M.P.-L. K.M.G., M.P.-L. and K.A.C. wrote the original manuscript draft and all authors contributed to revisions and approved the final version.

Corresponding author

Correspondence to Keylie M. Gibson.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Figure 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gibson, K.M., Jair, K., Castel, A.D. et al. A cross-sectional study to characterize local HIV-1 dynamics in Washington, DC using next-generation sequencing. Sci Rep 10, 1989 (2020). https://doi.org/10.1038/s41598-020-58410-y

Download citation

Received: 23 August 2019
Accepted: 31 December 2019
Published: 06 February 2020
DOI: https://doi.org/10.1038/s41598-020-58410-y

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

HIV-1 molecular diversity in Brazil unveiled by 10 years of sampling by the national genotyping network

Molecular epidemiology and HIV-1 variant evolution in Poland between 2015 and 2019

Characterisation of HIV-1 Molecular Epidemiology in Nigeria: Origin, Diversity, Demography and Geographic Spread

Introduction

Results

Sample and phenotypic characterization

HIV-1 diversity in DC

Drug resistant mutations

Transmission clusters

Discussion

Conclusions

Materials and Methods

Ethics

DC cohort

Next-Generation sequencing

Sequence analyses

Identification of subtypes and drug resistant mutations

Phylogenetic analyses

Haplotype reconstruction

Identification of transmission clusters

Diversity estimation

Ethical approval

Change history

22 April 2020

References

Acknowledgements

Author information

Authors and Affiliations

Author notes

A comprehensive list of consortium members appears at the end of the paper

Consortia

the DC Cohort Executive Committee

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplemental Figure 1.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links