Humans Are Selectively Exposed to Pneumocystis jirovecii.

Environmental exposure has a significant impact on human health. While some airborne fungi can cause life-threatening infections, the impact of environment on fungal spore dispersal and transmission is poorly understood. The democratization of shotgun metagenomics allows us to explore important questions about fungal propagation. We focus on Pneumocystis, a genus of host-specific fungi that infect mammals via airborne particles. In humans, Pneumocystis jirovecii causes lethal infections in immunocompromised patients if untreated, although its environmental reservoir and transmission route remain unclear.

in the environment (see reference 2), it remains unclear if and for how long a Pneumocystis cell can remain viable outside its host.
The epidemiology of Pneumocystis infection is poorly understood and has been almost exclusively studied in humans in the context of drug resistance or outbreaks. Most of the studies utilize a multilocus sequencing approach targeting a few mitochondrial and nuclear genes. The structure of P. jirovecii populations is ambiguous. Some studies have reported a strong local population structure mediated by clusters of infected patients (10); however, this is not supported by other studies (11). These conflicting results could be explained by the presence of viable ascii in the environment over extended periods of time. If Pneumocystis ascii are able to survive in the environment, spores could disperse over large geographical distances. In such a scenario, individuals would be constantly infected with new strains, therefore creating highly heterogeneous populations, which would be consistent with the observed high prevalence of coinfections with multiple P. jirovecii strains (11,12). Pneumocystis ascii share similarities with other long-distance fungal dispersers, for example, the presence of glucans in the cell wall, which likely promote survival in a harsh environment. The scenario of a long-distance dispersal of Pneumocystis ascii has not been explored to date. If Pneumocystis ascii are ubiquitous, then humans should simultaneously be exposed to different Pneumocystis species, but only P. jirovecii cells would be able to replicate. Under this model, we should expect a widespread coexistence of ascii from different Pneumocystis species in the environment.
Although patients with PCP have high organism burdens which likely facilitate transmission, healthy carriers also potentially play a role. The transmission of Pneumocystis through immunocompetent individuals has been demonstrated in animal models (13). In this situation, Pneumocystis organisms are able to actively infect new hosts and complete their life cycle with the ability to release newly formed ascii before the host immune system clears the infection. Assuming that recurrent infections are frequent, healthy individuals would be the primary source of ascii. In that case, we should expect a high density of P. jirovecii ascii in the vicinity of humans, irrespective of the presence of immunocompromised individuals.
A recent study published in Cell (14) exploring the dynamics of the human environmental exposome provided an opportunity to investigate our hypotheses. The authors collected and analyzed metagenomic sequences from air filter devices attached to 15 self-declared healthy adults. These individuals were followed for up to 890 days over 66 different geographical locations. The data set is available as 594 SRA files in the NCBI SRA database (ϳ5 TB of raw data). We also utilized data from two additional metagenomics studies from surface swabs across New York City (NYC) subway systems, the Gowanus Canal, and public parks (1,573 SRA files) (15) and the collection of 63 uncultured hospital air samples collected over a 6-month period (6,888 SRA files) (16). Here, we show clear evidence of P. jirovecii, but no other Pneumocystis species, in air filters from 4 out of 15 individuals in the human exposome study (14), whereas no evidence of Pneumocystis was identified in the other two studies, possibly because the sampling methods were not suitable for Pneumocystis detection. The significance of this finding is discussed in the context of epidemiology and host species specificity. biotics extraction. Participants were instructed to either carry the monitor on their arm or place the monitor near them within a 2-m radius at all times during the sampling period. At the end of the sampling and monitoring period, filters and cartridges were removed from the monitor and stored at -80°C until analysis. None of the individuals were health care workers or involved in animal models of PCP.
Mapped reads were assembled using metaSPades (22). Small-size contigs (Ͻ500 nucleotides) and/or those with a high GC content (Ͼ50%; because Pneumocystis genomes have a GC content of ϳ30%) were separated. Analysis of small-size and/or high-GC content contigs indicated that they were heavily enriched in ambiguous ribosomal DNA (rDNA) sequences and low complexity repeats, the vast majority of which could not be reliably categorized. The remaining contigs were aligned to the reference genome sequences using BLAT (23) with a minimum nucleotide alignment score of 200 as the threshold. Ribosomal DNA and low-complexity sequences were excluded because of their high rates of false positivity. A total of 1,711 contigs were retained for further investigation. Contigs were annotated with AUGUSTUS (24) using Pneumocystis built-in models (25). Orthology was inferred using the reciprocal best BLASTn hit (26) with an E value of 10 Ϫ10 as the cutoff. For each orthologous group, sequence orientations were assessed using Revseq from the EMBOSS package (27), aligned using MUSCLE (28), and visually inspected for alignment inconsistencies using AliView (29). Maximum likelihood phylogenies were inferred using RAxML-NG (30) with 100 replicates as support values. Pairwise nucleotide identities were calculated using Water from EMBOSS.
Pneumocystis organisms are present in the human exposome. We identified Pneumocystis reads in 24 of the 594 SRA files provided by the human exposome study (14). These 24 SRA files represent air filters from 4 of 15 individuals (Table 1). We found no conclusive evidence of Pneumocystis presence in the metagenomic data sets from the surface swabs of the NYC subway system (15) or air samples from hospitals (16). The latter study used filters to capture particles between 0.1 and 10 m in size, which should capture Pneumocystis ascii if they are present. However, the depth of sequencing appears lower than for those of other data sets. Only data from the human exposome study (14) were used for subsequent analyses.
Humans seem to be selectively exposed to Pneumocystis jirovecii. The de novo assembly of Pneumocystis reads retrieved from 24 SRA files yielded 45 contigs larger than 500 nucleotides (Table 1). Maximum likelihood phylogeny and pairwise alignment identity scores for 37 of 45 contigs with their one-to-one ortholog unambiguously identified all contigs as being from P. jirovecii and clearly distinct from all other Pneumocystis species analyzed ( Fig. 1 A and B). The remaining eight contigs, which lack homologs that are present in other Pneumocystis species, were clearly identified as P. jirovecii based on pairwise alignments. This was a somewhat surprising finding given that multiple Pneumocystis species are distributed in the ambient air surrounding mammals. While Pneumocystis species show a strict host species specificity for productive infection, air sampling does not depend on replication of the organism but should randomly sample the proximal ambient air. Given that Pneumocystis species appear unable to replicate outside the mammalian host, including in air filters, the only source of aerosolized organisms appears to be an infected host of the same species. While Pneumocystis can cause pneumonia in immunosuppressed hosts, it can also lead to subclinical infection or colonization in immunocompetent hosts, and sampling of, e.g., wild rodents demonstrates the presence of Pneumocystis DNA in ϳ30 to 90% of animals (40,41). None of the individuals involved in the study were reported to have an immunodeficiency, although we cannot rule out undisclosed medical conditions or contact with an infected immunocompromised patient. We have information that at least some individuals included in the study had close contact with animals such as pets, laboratory animals, or zoo animals (see https://www.technologynetworks.com/ proteomics/news/exposing-the-exposome-309789).
Dog and cats cannot be infected by P. jirovecii, but they can be infected by genetically distinct Pneumocystis species. Pneumocystis pneumonia is rare in dogs (42), although severe cases with development of spontaneous pneumomediastinum have been reported (43,44). The agent Pneumocystis sp. canis is phylogenetically related but  (Continued on next page) Opinion/Hypothesis ® clearly distinct from P. jirovecii and other Pneumocystis species (45). The Pneumocystis species that infects cats is referred as Pneumocystis sp. catus and constitutes another distinct species (46). While spontaneous PCP has not been observed in cats, PCP is frequently found in feline immunodeficiency virus-infected (47) or corticoid-treated cats (48). The low occurrence of PCP in healthy dogs and cats suggests that their immune system clears or maintains Pneumocystis organism loads at a low level. Therefore, healthy animals are expected to be colonized or infected subclinically, similar to healthy humans. Given that the phylogenetic placement of both Pneumocystis sp. canis and Pneumocystis sp. catus is known and that the nucleotide divergence of P. jirovecii with these two species exceeds 16% (46,49), our unbiased approach would have recovered sequences from these species if present, even in the absence of complete genome sequences. If present, Pneumocystis sp. canis and Pneumocystis sp. catus sequences would be easy to identify in a phylogenetic tree because they will not cluster with any of the species used here and, instead, have their own branches located between primate Pneumocystis species (P. jirovecii and Pneumocystis sp. macacae) and rodent Pneumocystis species (P. carinii and P. murina). Unlike other fungi that can replicate in the environment (e.g., in soil) and for which even a single organism has the potential to replicate wherever it lands, a Pneumocystis cyst, the infective form of the organism, must be inhaled and presumably deposited into the alveoli to initiate productive infection. We found high levels of mammal DNA, including that of humans and animals, in the air metagenomes where P. jirovecii DNA was detected (Table 1), which suggest that other Pneumocystis species would have been detected if present. The absence of non-P. jirovecii sequences in the human exposome suggests that Pneumocystis cysts can travel for only a limited range from the infected host, or if they travel long distances, the likelihood of being inhaled (or sampled in the current study) diminishes very rapidly. Thus, efficient transmission likely requires close proximity to an infected host (50). The fact that none of the filters contained other species of Pneumocystis strengthens the likelihood of a selective exposure of humans to P. jirovecii, though the limited number of species included in our database represents a potential caveat.
Participants were instructed to either carry the monitor on their arm or place the monitor near them within a 2-m radius at all times during the sampling period. Assuming that the filter was worn on the upper arm at all times, it is unlikely that this would create a significant barrier to the deposition of Pneumocystis from four-legged mammals, particularly from smaller animals. The reason is that experimental data from animal models suggest that Pneumocystis ascii remain in suspension in the ambient air. Noninfected rats acquired P. carinii and developed PCP when exposed to the exhaled air from infected rats (51,52). In addition, Pneumocystis ascii from different species are of similar size, 3.5 to 8 m (reviewed in reference 53), which suggests similar ascii dispersion capabilities, although interspecies differences cannot be ruled out. At the time of this writing, we are not aware of any study that has compared ascii diffusion properties in multiple Pneumocystis species. Another confounding factor is that P. jirovecii populations would be expected to outnumber those of other Pneumocystis populations in human-dense areas. This would allow a higher recovery yield for P. jirovecii versus other Pneumocystis species but should not influence the recovery of other species.
To investigate whether the air filters were seeded with a single strain or multiple strains of P. jirovecii, we aligned raw reads to P. jirovecii reference genome assembly Opinion/Hypothesis ® strain RU7 (19) and identified single nucleotide polymorphisms (SNP). We filtered out singletons and analyzed 1,852 SNP positions. SNP profiles from different air filters appear to be heterogeneous, which indicates a wide dispersion of samples from individuals (Fig. 1C) and could reflect the differences in sampling times or exposure to different infected hosts. In addition, 115 of 1,852 positions overlap genomic regions encoding highly polymorphic major surface glycoproteins (Msg), which would presumably further enhance the genetic heterogenicity among isolates. In fact, the hypervariability of msg genes has been extensively used to study strain variation of P. jirovecii (54,55). Overall, the source of the Pneumocystis that was sampled cannot be determined, but it presumably was either the volunteer or someone else in close proximity, which would be consistent with the dynamics of Pneumocystis air shedding (8,56). Whatever the source, the frequent detection of Pneumocystis DNA suggests frequent shedding of organisms, which is consistent with serologic data suggesting that most humans have been infected by Pneumocystis at an early age (57,58).
Concluding remarks. Our observations highlight the potential impact of metagenomics to improve our understanding of fungal pathogen transmission and epidemiology. Our results support a short-range transmission model and suggest that closerange exposure to infected hosts may play a significant role in transmission of Pneumocystis. Analyses of publicly available metagenomic data sets provide new opportunities for modeling and exploring human exposure to airborne infectious pathogens.