Next Article in Journal
Comparative Performance in the Detection of Four Coronavirus Genera from Human, Animal, and Environmental Specimens
Next Article in Special Issue
Community Structure, Drivers, and Potential Functions of Different Lifestyle Viruses in Chaohu Lake
Previous Article in Journal
Prevalence of Respiratory Viral Infections in Deceased Persons during the COVID-19 Pandemic Season 2021–2022: A Population-Based Observational Study
Previous Article in Special Issue
A Novel Podophage StenR_269 Suggests a New Family in the Class Caudoviricetes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Metagenomic Investigation of Potential Health Risks and Element Cycling Functions of Bacteria and Viruses in Wastewater Treatment Plants

1
Yunnan Institute of Microbiology, School of Life Sciences, Yunnan University, Kunming 650091, China
2
College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
*
Authors to whom correspondence should be addressed.
Viruses 2024, 16(4), 535; https://doi.org/10.3390/v16040535
Submission received: 22 January 2024 / Revised: 27 March 2024 / Accepted: 28 March 2024 / Published: 29 March 2024
(This article belongs to the Special Issue Diversity and Evolution of Viruses in Ecosystem)

Abstract

:
The concentration of viruses in sewage sludge is significantly higher (10–1000-fold) than that found in natural environments, posing a potential risk for human and animal health. However, the composition of these viruses and their role in the transfer of pathogenic factors, as well as their role in the carbon, nitrogen, and phosphorus cycles remain poorly understood. In this study, we employed a shotgun metagenomic approach to investigate the pathogenic bacteria and viral composition and function in two wastewater treatment plants located on a campus. Our analysis revealed the presence of 1334 amplicon sequence variants (ASVs) across six sludge samples, with 242 ASVs (41.22% of total reads) identified as pathogenic bacteria. Arcobacter was found to be the most dominant pathogen accounting for 6.79% of total reads. The virome analysis identified 613 viral genera with Aorunvirus being the most abundant genus at 41.85%. Approximately 0.66% of these viruses were associated with human and animal diseases. More than 60% of the virome consisted of lytic phages. Host prediction analysis revealed that the phages primarily infected Lactobacillus (37.11%), Streptococcus (21.11%), and Staphylococcus (7.11%). Furthermore, our investigation revealed an abundance of auxiliary metabolic genes (AMGs) involved in carbon, nitrogen, and phosphorus cycling within the virome. We also detected a total of 113 antibiotic resistance genes (ARGs), covering major classes of antibiotics across all samples analyzed. Additionally, our findings indicated the presence of virulence factors including the clpP gene accounting for approximately 4.78%, along with toxin genes such as the RecT gene representing approximately 73.48% of all detected virulence factors and toxin genes among all samples analyzed. This study expands our understanding regarding both pathogenic bacteria and viruses present within sewage sludge while providing valuable insights into their ecological functions.

1. Introduction

Viruses are the most abundant entities on earth, and the number of viruses is estimated to be as high as 1031 [1,2]. The rapid development of metaviromic technology in the past decade has promoted the study of viruses in a variety of environments, including marine, freshwater, and intestinal tract environments, revealing many new viral species and a large amount of functional information. Among these viruses, studies on the marine virome have revealed that viruses play an indispensable role in the regulation of host communities, biogeochemical cycles, global climate change, and horizontal gene transfer [3]. Sewage sludge harbours enormous quantities of viruses, with 10–1000-fold higher concentrations than in natural environments, and some of these viruses can cause severe human and animal diseases. Furthermore, a limited number of studies to date have investigated sewage treatment plants. Despite the increasing application of high-throughput sequencing techniques for viral metagenomics, their use in identifying viruses in sewage has not been well explored [4,5,6]. However, the few existing studies have demonstrated that viruses are key factors in nutrient removal and that sewage sludge, as a reservoir of viruses, may pose threats to human health. The sewage virosphere has been monitored worldwide using molecular techniques, such as PCR and quantitative PCR [5,7]. These methods can only provide information about the presence and abundance of known and characterized viruses because there is no universal marker for viruses, such as the 16S rRNA gene for bacteria [4,5].
Antibiotic resistance is a growing global public health challenge and is expected to cause a difference in the burden on low- and middle-income countries in the next few decades [8]. Wastewater and wastewater treatment systems are important reservoirs of antibiotic-resistant bacterial populations, which contribute to regular outbreaks of drug-resistant infections [9]. Metaviromic technology can not only reveal new viruses but also provides the possibility for understanding the ecological functions of viruses. Metaviromic methods can be used to effectively detect and monitor the transfer of harmful genes, such as ARGs, virulence factors, and toxin genes. Previous studies have focused more on the function of bacteria in wastewater treatment systems and have ignored the regulatory role of viruses in the function of bacteria. In fact, the auxiliary metabolic genes contained in viruses are important regulators of host metabolism and regulate the metabolic function of the host during interaction with the host, thus affecting biogeochemical cycle processes [10].
Considering the potential importance of viruses in natural environments, understanding what role viruses play in the C, N, and P cycles and the impact of pathogenic genes in the genome of viruses on public safety is important in campus wastewater treatment plants. This study used metagenomic technology to analyse in depth the human bacterial and viral pathogens that may be present in sludge, along with the ARGs, virulence factors, and toxins located on viruses, and analysed the potential functions of bacteria and viruses in the element cycle. Our study demonstrates that a large number of pathogenic bacteria and unknown viruses occur in campus wastewater treatment plants (WWTPs), and the ARGs, virulence factors, and toxins in the genome of viruses pose potential health risks. The auxiliary metabolic genes involved in the C, N, and P cycles located on viruses in the sludge are beneficial to the cycling of elements in the sludge. In contrast to previous studies that primarily concentrated on bacteria and archaea, this research endeavors to redirect attention towards elucidating the potential ecological functions, and discerning the concomitant public health risks posed by viruses and bacteria within wastewater treatment plants.

2. Materials and Methods

2.1. Sample Collection

The activated sludge samples were collected on 22 October 2021 from two wastewater treatment plants located 360 m apart at a university in China. These plants primarily receive domestic wastewater from dormitories and canteens. A total of six 10 L activated sludge samples (WTP1-1, WTP1-2, and WTP1-3 from Plant 1; WTP2-1, WTP2-2, and WTP2-3 from Plant 2) were obtained using acid-washed polycarbonate bottles. Subsequently, the samples were promptly transported to the laboratory within a time frame of four hours at room temperature.

2.2. DNA Extraction

The sludge sample was initially filtered through an 800-mesh nylon filter followed by a 0.22 μm filter membrane (Millipore, Burlington, MA, USA) to separate viruses from prokaryotes. Subsequently, the membranes containing prokaryotic cells were harvested, frozen with liquid nitrogen, pulverized, and subjected to extraction of total prokaryotic DNA using the DNeasy Power Soil Pro Kit (QIAGEN, Hilden, Germany). The virus-containing filtrate was mixed with PEG 8000 and NaCl to achieve final concentrations of 10% (w/v) and 0.5 M respectively before being stored at 4 °C overnight. After centrifugation at 10,000× g for 30 min, the supernatant was discarded and the pellet was resuspended using SM buffer solution. DNase I (final concentration of 10 U/mL, TaKaRa, Beijing, China) and RNase A (final concentration of 200 μg/mL, TaKaRa, China) were added, and incubated at 37 °C for 2 h. After treatment at 80 °C for 5 min, the mixture was supplemented with proteinase K, EDTA, and SDS, and then incubated at 55 °C for 3 h. Viral DNA was extracted using a phenol-chloroform-isoamyl alcohol method as described by Che in 2022 [11].

2.3. 16S rRNA Gene Amplicon and Virome Sequencing

The prokaryotic 16S rRNA gene was amplified using 341F/806R primers and sequenced on the Illumina Miseq PE300 platform at Shanghai Majorbio Bio-pharm Technology Co., Ltd. The dada2 plugin in Qiime2 was utilized for denoising paired-end sequencing data of 16S rRNA gene sequences, resulting in the generation of ASV (Amplicon Sequence Variant) tables and feature sequences. The q2-feature-classifier plugin and Silva 16S rRNA database (v138) [12] were employed to train a Naive Bayes classifier specifically for V3–V4 sequences, enabling the classification of prokaryotic species. Chloroplast and mitochondrial sequences were eliminated, along with ASVs having a total count < 10. Multiple bacterial pathogen detection pipeline (MBPD) [13] was applied to detect zoonotic pathogens in animals, plants, and humans. It employs the mothur software (v1.45.3) for sequence alignment, with an identity threshold established at 0.8. Viral DNA underwent sequencing on the Illumina PE150 platform. Briefly, viral DNA was randomly fragmented using an ultrasonic homogenizer and used for constructing the sequencing library; subsequent sequencing occurred after passing quality control measures. Trimmomatic (v0.32) [14] was utilized to eliminate paired-end reads containing adapters. Corresponding paired reads were also discarded if the number of low-quality bases (SQ ≤ 20) exceeded 20% of the total read bases in single-ended sequencing data. Duplicate reads generated by PCR amplification as well as polyX sequences were removed.

2.4. Identification of Viral Contigs

The clean reads were assembled into contigs using the metaSPAdes software (v3.15.4) [15]. Subsequently, contigs with a length greater than 2500 bp were selected for further analysis. The contigs underwent initial filtering and screening using VirSorter2 [16] with a threshold of 0.5 to select ssDNA and dsDNA viral sequences. False-positive viral sequences were subsequently eliminated using CheckV [17]. The potential viral sequences were then de-replicated utilizing CD-HIT-est [18] with a threshold of 90% coverage and 95% confidence, resulting in non-redundant sequence sets for each sample.

2.5. Annotation of Virome and Host Prediction

The clean reads underwent annotation for viral species employing the Kraken2 software and the corresponding viral database [19] based on the k-mer algorithm. The annotation results were filtered through Bracken [20] and visualized via Pavian [21]. Human viruses were identified by screening against the Virus–Host DB [22]. Phage lifestyles were determined using PhaTYP [23]. Contigs were classified and annotated utilizing PhaGCN [24], while host prediction for viral sequences was performed using CHERRY [25].

2.6. Viral Gene Functional Prediction

The Prokka software [26] was utilized to predict genes in the obtained non-redundant viral sequence set based on viral models. Subsequently, CG-HIT was employed with 90% coverage and 95% identity as thresholds to eliminate redundant predicted protein sequences, resulting in a non-redundant protein sequence set. Carbohydrate active enzymes were annotated using the run_dbcan script of dbCAN2 [27]. To annotate nitrogen metabolism genes and phosphorus metabolism genes, the protein sequence sets were compared to the NCycDB [28] database and PCycDB [29] database, respectively, employing Diamond [30] with default parameters. PathoFact [31] was applied for predicting pathogenic factors such as ARGs, toxin genes, and virulence factors. Furthermore, virulence factors were annotated by comparing the dataset to VFDB’s core dataset using Diamond. [32].

3. Results

3.1. Pathogenic Bacteria within WWTP

A total of 654,766 original 16S rRNA gene reads were obtained from six samples collected from two WWTPs; these reads included 25 phyla, 49 classes, 113 orders, 188 families, 308 genera, and 1334 ASVs (Figure 1).
MBPD was used to analyse the pathogenic bacteria in the samples (Figure 1). A total of 242 ASVs were identified as pathogenic bacteria, with animal pathogens accounting for 41.22% of the total number of sequences. All the pathogenic bacteria were distributed in 13 phyla, of which Proteobacteria, Campylobacterota, Bacteroidota, Firmicutes, Patescibacteria, and Actinobacteriota were the dominant phyla, accounting for 18.61%, 6.79%, 5.41%, 3.64%, 3.21%, and 3.20% of the total number of sequences, respectively.
The pathogenic Proteobacteria included Candidatus symbiobacter (4.36%), Candidatus accumulibacter (3.11%), Paracoccus (2.50%), Laribacter (2.07%), Sphingobium (2.00%), and Aeromonas (1.46%). The pathogenic bacteria in the phylum Campylobacterota were all Arcobacter and had the highest relative abundance (6.79%). The phylum Bacteroidota included the genera Bacteroides (1.51%), Sphingobacterium (1.50%), and Parabacteroides (1.20%). The phylum Firmicutes included genera such as Granulicatella (1.05%). Pathogenic cases of Granulicatella species are relatively rare, but they can cause diseases such as bacteremia, sepsis, and infective endocarditis [33]. Mycobacterium (2.08%) was the dominant pathogen in the phylum Actinobacteria.
In addition to bacteria, a small number of archaea were also identified in the samples, accounting for 0.18% of the total number of sequences. They were all classified into three genera, Methanosaeta, Methanobrevibacter, and Methanomethylovorans, under Methanosarcinaceae.

3.2. Viral Community Composition within WWTP

After metavirome sequencing and quality control, 270,227,598 reads were obtained. After Kraken annotation, a total of 455,846 reads were annotated as viral sequences; these reads belonged to 8 phyla, 12 classes, 20 orders, 63 families, and 613 genera.
Aorunvirus (41.85%), Muminvirus (6.20%), Plateaulakevirus (6.00%), Burrovirus (3.63%), Lillamyvirus (3.61%), Fromanvirus (1.91%), Kehishuvirus (1.63%), and Nipunavirus (1.14%) were the dominant viral genera in the samples (Figure 2A), and all belonged to the class Caudoviricetes. There were considerable differences in the compositions of the viral communities among the two WWTPs, as shown by the fact that Aorunvirus accounted for only 0.19%, 0.23%, and 0.07% of the number of sequences in the WTP1-1, WTP1-2, and WTP1-3 samples but 49.65%, 73.63%, and 39.65% of the total number of sequences in the WTP2-1, WTP2-2, and WTP2-3 samples, respectively. However, the distribution pattern of Plateaulakevirus was opposite that of Aorunvirus; that is, it accounted for a relatively high proportion of the total number of sequences in the WTP1-1, WTP1-2, and WTP1-3 samples, i.e., 28.99%, 7.78%, and 17.13%, respectively, but a relatively small proportion in the other three samples, accounting for 0.14% of the total number of their respective sample sequences. All the other viral genera were found with similar content and without significant differences among the six samples.
The annotation results were screened using the Virus-Host Database (V-H DB). A total of ten viral families that can infect humans, including Orthoherpesviridae, Poxviridae, and Adenoviridae, were identified, accounting for 0.66% of the total number of viruses. Among them, Orthoherpesviridae accounted for 0.25% of the total number of species (Figure 2B).

3.3. Viral Lifestyle and Host Prediction

Phage lifestyle prediction was performed using PhaTYP. There were 25,770 contigs with scores higher than 0.9, in which virulent phages accounted for 60.94% of the total number of contigs and temperate phages accounted for 39.06%, indicating that virulent phages are dominant in sludge.
Using PhaGCN and CHERRY to predict viral hosts, we found that in the samples, the phage hosts were mainly Lactobacillus, Streptococcus and Staphylococcus, accounting for 37.11%, 21.11%, and 7.11%, respectively (Figure 3).
Lactobacillus fermentum was the host for 18 phage species in the samples, among which Casjensviridae, Schitoviridae, and Mesyanzhinovviridae were the most abundant, accounting for 21.14%, 15.77%, and 15.44%, respectively. Notably, the abundances of Schitoviridae and Mesyanzhinovviridae viruses were also high (ranking first and sixth in abundance, respectively) according to the viral species annotation results generated via Kraken for metavirome sequencing.
Using an integrity greater than 95% as the threshold, a total of 241 complete viruses were obtained from the six samples, accounting for 0.51% of the total number of contigs. Among them, 72 sequences were longer than 50 kb in length, accounting for 29.99% of the complete viruses. Annotation and host prediction were performed for 72 complete viruses, and 23 sequences had a PhaGCN annotation score greater than 0.9. These 23 sequences belonged to six families, including Schitoviridae, Mesyanzhinovviridae, and Zobellviridae, and included bacteria from eight genera, including Synechococcus, Streptococcus, and Lactobacillus, as hosts. The annotation scores of the remaining 49 sequences were low; therefore, they may be new viruses that have not yet been discovered. The two sequences with the lowest scores in species annotation were selected and compared to the NCBI database using BLASTn. The results showed that the query cover values were both between 0% and 4%, suggesting that there are abundant novel viruses in the sludge.

3.4. Viral Genes Related to Carbon, Nitrogen, and Phosphorus Metabolism

Five types of carbon metabolism genes, i.e., carbohydrate-binding module family, carbohydrate esterase family, glycoside hydrolase family, glycosyltransferase family, and polysaccharide lyase family genes, were detected, for a total of 34 types of genes related to the metabolism of 1207 carbohydrate-active enzymes (Figure 4A). The glycoside hydrolase family and the glycosyltransferase family were most abundant, with 900 and 295 genes, respectively.
The prediction results for genes involved in nitrogen metabolism showed that the viruses contained 379 nitrogen metabolism genes in 13 species (Figure 4B), including gs_K00264, nmo, asnB, gdh_K00261, nifH, gdh_K00262, nirA, and gs_K00265. Among these genes, gs_K00264 accounted for the highest proportion (62.96%) of the total number of genes involved in nitrogen metabolism. gs_K00264 is a component of the organic degradation and synthesis pathway and encodes glutamate synthase (NADPH/NADH), which is a complex iron-sulfur protein involved in the assimilation of ammonia [34].
A total of 2123 genes of 60 types were obtained from the prediction of viral genes involved in phosphorus metabolism (Figure 4C) and were involved in metabolic pathways such as pyrimidine metabolism, purine metabolism, the pentose phosphate pathway, and oxidative phosphorylation, with dut, dcd and rtpR being the three most abundant genes. In addition, these genes included those encoding adenylate kinase (adk) and guanylate kinase (gmk), which are involved in purine metabolism. Moreover, phosphatases such as phosphatase R (phoR) and phosphatase B (phoB) are also predicted to participate in the removal of phosphate.

3.5. Antibiotic Resistance Genes, Virulence Factors, and Toxin Genes Located on Viruses

Three pathogenic factors: antibiotic-resistance genes, virulence factor-encoding genes, and toxin genes, were analysed using PathoFact. A total of 217 ARGs (113 types) were detected, 942 virulence factor-encoding genes (180 types) were detected, and 788 toxin genes (32 types) were detected.
Among the 113 detected ARGs, 16 major classes and 39 subclasses of antibiotics developed resistance, helping the carrier bacteria resist antibiotics mainly through mechanisms such as antibiotic efflux or inactivation (Figure 5). ARGs resistant to beta-lactam and macrolide, lincosamide, and streptogramin (MLS) antibiotics had the highest relative abundances, accounting for 21.6% and 17.1%, respectively, of the total number of ARGs. The predicted 217 ARGs were distributed on 216 viral contigs. These viruses infect 21 genera of bacteria, mainly Lactobacillus (34.88%), Streptococcus (12.80%), and Staphylococcus (11.63%), which is consistent with the distribution pattern of previous host prediction results for all the samples. These results suggest that these bacteria may acquire and transmit ARGs through horizontal gene transfer, posing hidden dangers to public health security.
The virome contained 942 genes encoding 180 virulence factors (Figure 6). Regardless of the genes encoding potential virulence factors or the determined virulence factors, the number of non-secretable virulence factor-encoding genes was relatively large, with a total of 909 genes. A total of 96.50% of the genes were predicted to be virulence factors. These included clpP (4.78%), pfbA (4.67%), gmd (3.72%), clpC (3.72%), rpoS (3.40%), sigA/rpoV (3.40%), groEL (3.18%), csrA (2.44%), essC (2.12%), and htpB (2.02%). clpP encodes the ATP-dependent Clp protease proteolytic subunit, a proteolytic enzyme that helps pathogens escape from the phagocytosis of macrophages in the host, thereby improving the survival ability of pathogens [35]. The pfbA gene encodes a surface adhesin that binds to human fibronectin and plasminogen in the host extracellular matrix. Its function is mainly related to the adhesion and invasion of the host by pathogenic bacteria, which are mediated by adhesin and invasion [36].
The virome contained a total of 32 toxin genes, including RecT (73.48%), Phage_holin_4 (4.95%), PT-HINT (3.17%), Peptidase_S24 (3.05%), Zeta_toxin (2.41%), ADPrib_exo_Tox (1.40%), holin_tox_secr (1.40%), PLDc_2 (1.02%), and entD (1.02%) (Figure 7). RecT is a DNA single-strand annealing protein (SSAPs) that belongs to the RecT family and plays a role in the process of DNA recombination [37].

4. Discussion

Previous studies on the function of microorganisms in wastewater treatment systems have focused mainly on bacteria and archaea. Viruses are the most abundant entities in sludge, yet very little is known about the assemblage composition, both in terms of taxonomy as well as genetic makeup. This study used metagenomic technology to elucidate the potential ecological functions and public health risks of bacteria and viruses in the WWTPs.
This study has shown that bacteria such as Proteobacteria, Campylobacterota, Bacteroidota and Firmicutes are relatively highly abundant in WWTP. These findings are consistent with several previous result [38,39,40]. In this study, up to 41% of the bacterial sequences were potential animal pathogens, with the most abundant pathogen being the genus Arcobacter in the phylum Campylobacterota, accounting for 6.79% of the total number of sequences. This strain has been reported in poultry, livestock, pork and other meat products, and vegetables and can cause enteritis, severe diarrhoea, septicemia, and miscarriage in humans [41,42,43]. Moreover, it has been reported that this bacterium is significantly associated with various ARGs, such as tetA, sul1, and int1, and is a potential multidrug-resistant pathogenic bacterium [39]. The results of viral host prediction revealed that the hosts with the most viral infections were all Firmicutes, not Proteobacteria, which accounted for the greatest proportion. It is speculated that the phages of the Caudoviricetes class were generally the most abundant.
Compared with that of bacteria, the composition of the viral communities in WWTPs in different areas varies greatly, but as the results of this study show, phages of the Caudoviricetes class generally occupy a dominant position. We found that the sludge contained 0.66% human pathogenic viruses, with members of the families Orthoherpesviridae and Poxviridae being the two most common pathogenic viruses. This result was consistent with the findings of other studies on lakes or wastewater [11,44]. Orthoherpesviridae viruses are double-stranded DNA viruses that often infecting amniotes. Infection can cause various diseases, such as shingles, chickenpox, and genital herpes [45]. Viruses of the family Poxviridae are also a type of double-stranded DNA virus that can cause purulent lesions on the skin throughout the body after infecting humans [46]. Although existing research has shown that the proportion of pathogenic viruses in sludge is less than 1% [47], it is generally believed that viruses have a greater impact on public health and safety compared with pathogenic bacteria and protozoa in wastewater [48,49,50]. Additionally, safety concerns about the application of reclaimed water have been raised [51,52].
The ARGs located on phages can be transmitted between hosts through horizontal gene transfer, posing a risk to public health [53]. We detected various ARGs belonging to the OXA family, including OXA-42, OXA-46, and OXA-119, most of which are plasmid-mediated β-lactamases that can hydrolyse related antistaphylococcal penicillins, oxacillin, and other antibiotics. Due to possible contamination by faeces or hospital wastewater, β-Lactam and MLS antibiotics are also ubiquitously found in wastewater treatment systems worldwide [54]. Virus-mediated horizontal transfer of ARGs can enhance the viability of microbial communities in wastewater treatment systems, affecting the treatment effect of wastewater treatment systems on pathogenic bacteria. Moreover, various virulence factors and toxins encoded and expressed by viruses are involved in the host infection process. The lysozyme encoded by clpP can help the pathogen evade the host immune response, and DNA single-strand binding proteins of the RecT family promote viral replication. The holin proteins produced by phages are involved in host cell lysis. Phages accumulate the holin protein at a certain concentration, causing rupture of the host cell membrane and the release of endolysin, which hydrolyses the cell wall, thus leading to bacterial infection [55]. The presence of these factors enhances the ability of viruses to infect bacteria and promotes gene flow between microbial communities. This process facilitates the spread of functional genes required for pollutant degradation and nutrient transformation among microbial communities but may also accelerate the spread of related disease-causing genes.
Viruses are involved in the metabolism of key nutrients, such as carbon, nitrogen, and phosphorus in wastewater. Viral carbon-metabolizing enzymes, such as glycoside hydrolases and glycosyltransferases, as well as nitrogen- and phosphorus-metabolizing enzymes, such as glutamate synthase and dUTPase, are involved in processes such as the degradation of sugars and nucleotide metabolism and are closely related to viral replication and transmission. The glycoside hydrolase located on viruses helps the host decompose complex carbohydrates and generate energy, it promotes the growth and reproduction of the host, and improves the phage’s living conditions [56]; moreover, this hydrolase is conducive to the migration of organic matter in wastewater [57]. We found that the most abundant gene involved in nitrogen metabolism located on viruses was glutamate synthase, which is related to biofilm formation in bacteria [58]. In sludge, phage infection may enhance the ability of the host to form biofilms.
Phages also carry numerous phosphorus metabolism genes involved in the synthesis of progeny viruses. Nucleic acid synthesis is an important step in the production of progeny viruses. Within the viral genomes, we detected many genes involved in purine and pyrimidine metabolism. rtpR can reduce ribonucleoside triphosphate to deoxyribonucleoside triphosphate, which, after being catalysed by dcd, can deaminate dCTP to form dUTP. After catalysis by dut, dUTP removes a molecule of phosphoric acid to form dUMP. In addition, thyA and tmk are involved in this phosphorus metabolic pathway. The enzymes encoded by several genes involved in phosphorus metabolism form a pyrimidine synthesis pathway from CTP to dTDP, mediate the production of various nucleotides, and facilitate the transmission of viral genetic information and the generation of progeny viruses. In addition, several phosphate degradation- and biological respiration-related enzymes in this network play a role in the degradation of organic matter in wastewater. The presence of these genes is important for the stable and efficient degradation of substances in WWTPs. In addition to encoding auxiliary metabolic genes, viruses also affect wastewater treatment by lysing microorganisms involved in nutrient transport and element cycling [57]. In conclusion, viruses significantly regulate the function of microorganisms in sludge by transporting a variety of auxiliary metabolic genes and lysing their hosts.

5. Conclusions

We used metagenomic technologies to analyse the community composition and potential functions of bacteria and viruses in two WWTPs. The results revealed an abundance of pathogenic bacteria and pathogenic viruses in the sludge. The diverse pathogenic genes, such as antibiotic-resistance genes, virulence factors, and toxin genes, contained in the viruses have nonnegligible health risks for water reuse. Additionally, the diverse auxiliary metabolic genes contained in the viruses can regulate the metabolism of C, N, and P in the hosts, indirectly affecting wastewater treatment efficacy. Bacteriophages in sludge are not only carriers of harmful genes, but also an important resource bank, from which bacteriophages infected with pathogens can be isolated. One of the key challenges for future research on sewage is how to effectively reduce safety hazards and increase the utilization efficiency of reclaimed water and its microbial resources.

Author Contributions

Conceptualization, W.X.; Methodology, H.Z., M.Y., X.F., Q.G. and H.Y.; Investigation, H.Z., M.Y. and X.F.; Data Curation, H.Z., M.Y., X.F., Q.G. and Y.T.; Writing—Original Draft Preparation, H.Z., M.Y., X.F. and Q.G.; Writing—Review and Editing, W.X. and Y.T.; Supervision, W.X.; Project Administration, W.X.; Funding Acquisition, W.X. and Y.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant number 32071570) and the Science and Technology Planning Project in Key Areas of Yunnan Province (grant number 202303AP140004).

Institutional Review Board Statement

This work did not involve studies of humans or animals.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is deposited in National Microbiology Data Center (NMDC) with accession numbers NMDC40053725-NMDC40053736.

Acknowledgments

The authors thank the National Natural Science Foundation of China and the Yunnan Province Sciences and Technology Department for their support with this work.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Breitbart, M.; Rohwer, F. Here a Virus, There a Virus, Everywhere the Same Virus? Trends Microbiol. 2005, 13, 278–284. [Google Scholar] [CrossRef] [PubMed]
  2. Mushegian, A.R. Are There 1031 Virus Particles on Earth, or More, or Fewer? J. Bacteriol. 2020, 202, e00052-20. [Google Scholar] [CrossRef] [PubMed]
  3. López-Bueno, A.; Tamames, J.; Velázquez, D.; Moya, A.; Quesada, A.; Alcamí, A. High Diversity of the Viral Community from an Antarctic Lake. Science 2009, 326, 858–861. [Google Scholar] [CrossRef]
  4. Martinez-Puchol, S.; Rusinol, M.; Fernandez-Cassi, X.; Timoneda, N.; Itarte, M.; Andres, C.; Anton, A.; Abril, J.F.; Girones, R.; Bofill-Mas, S. Characterisation of the Sewage Virome: Comparison of NGS Tools and Occurrence of Significant Pathogens. Sci. Total Environ. 2020, 713, 136604. [Google Scholar] [CrossRef] [PubMed]
  5. Fernandez-Cassi, X.; Timoneda, N.; Martínez-Puchol, S.; Rusiñol, M.; Rodriguez-Manzano, J.; Figuerola, N.; Bofill-Mas, S.; Abril, J.F.; Girones, R. Metagenomics for the Study of Viruses in Urban Sewage as a Tool for Public Health Surveillance. Sci. Total Environ. 2018, 618, 870–880. [Google Scholar] [CrossRef] [PubMed]
  6. Guerrero-Latorre, L.; Romero, B.; Bonifaz, E.; Timoneda, N.; Rusiñol, M.; Girones, R.; Rios-Touma, B. Quito’s Virome: Metagenomic Analysis of Viral Diversity in Urban Streams of Ecuador’s Capital City. Sci. Total Environ. 2018, 645, 1334–1343. [Google Scholar] [CrossRef] [PubMed]
  7. Adriaenssens, E.M.; Farkas, K.; Harrison, C.; Jones, D.L.; Allison, H.E.; McCarthy, A.J. Viromic Analysis of Wastewater Input to a River Catchment Reveals a Diverse Assemblage of RNA Viruses. mSystems 2018, 3, e00025-18. [Google Scholar] [CrossRef] [PubMed]
  8. Nadimpalli, M.L.; Marks, S.J.; Montealegre, M.C.; Gilman, R.H.; Pajuelo, M.J.; Saito, M.; Tsukayama, P.; Njenga, S.M.; Kiiru, J.; Swarthout, J.; et al. Urban Informal Settlements as Hotspots of Antimicrobial Resistance and the Need to Curb Environmental Transmission. Nat. Microbiol. 2020, 5, 787–795. [Google Scholar] [CrossRef] [PubMed]
  9. Sutradhar, I.; Ching, C.; Desai, D.; Suprenant, M.; Briars, E.; Heins, Z.; Khalil, A.S.; Zaman, M.H. Computational Model To Quantify the Growth of Antibiotic-Resistant Bacteria in Wastewater. mSystems 2021, 6, e0036021. [Google Scholar] [CrossRef]
  10. Roux, S.; Brum, J.R.; Dutilh, B.E.; Sunagawa, S.; Duhaime, M.B.; Loy, A.; Poulos, B.T.; Solonenko, N.; Lara, E.; Poulain, J.; et al. Ecogenomics and Potential Biogeochemical Impacts of Globally Abundant Ocean Viruses. Nature 2016, 537, 689–693. [Google Scholar] [CrossRef]
  11. Che, R.; Bai, M.; Xiao, W.; Zhang, S.; Wang, Y.; Cui, X. Nutrient Levels and Prokaryotes Affect Viral Communities in Plateau Lakes. Sci. Total Environ. 2022, 839, 156033. [Google Scholar] [CrossRef] [PubMed]
  12. Quast, C.; Pruesse, E.; Yilmaz, P.; Gerken, J.; Schweer, T.; Yarza, P.; Peplies, J.; Glöckner, F.O. The SILVA Ribosomal RNA Gene Database Project: Improved Data Processing and Web-Based Tools. Nucleic Acids Res. 2013, 41, D590–D596. [Google Scholar] [CrossRef]
  13. Yang, X.; Jiang, G.; Zhang, Y.; Wang, N.; Zhang, Y.; Wang, X.; Zhao, F.-J.; Xu, Y.; Shen, Q.; Wei, Z. MBPD: A Multiple Bacterial Pathogen Detection Pipeline for One Health Practices. iMeta 2023, 2, e82. [Google Scholar] [CrossRef]
  14. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
  15. Bankevich, A.; Nurk, S.; Antipov, D.; Gurevich, A.A.; Dvorkin, M.; Kulikov, A.S.; Lesin, V.M.; Nikolenko, S.I.; Pham, S.; Prjibelski, A.D.; et al. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. J. Comput. Biol. 2012, 19, 455–477. [Google Scholar] [CrossRef]
  16. Guo, J.R.; Bolduc, B.; Zayed, A.A.; Varsani, A.; Dominguez-Huerta, G.; Delmont, T.O.; Pratama, A.A.; Gazitua, M.C.; Vik, D.; Sullivan, M.B.; et al. VirSorter2: A Multi-Classifier, Expert-Guided Approach to Detect Diverse DNA and RNA Viruses. Microbiome 2021, 9, 37. [Google Scholar] [CrossRef]
  17. Nayfach, S.; Camargo, A.P.; Schulz, F.; Eloe-Fadrosh, E.; Roux, S.; Kyrpides, N.C. CheckV Assesses the Quality and Completeness of Metagenome-Assembled Viral Genomes. Nat. Biotechnol. 2021, 39, 578–585. [Google Scholar] [CrossRef] [PubMed]
  18. Li, W.; Godzik, A. Cd-Hit: A Fast Program for Clustering and Comparing Large Sets of Protein or Nucleotide Sequences. Bioinformatics 2006, 22, 1658–1659. [Google Scholar] [CrossRef]
  19. Wood, D.E.; Lu, J.; Langmead, B. Improved Metagenomic Analysis with Kraken 2. Genome Biol. 2019, 20, 257. [Google Scholar] [CrossRef]
  20. Lu, J.; Breitwieser, F.P.; Thielen, P.; Salzberg, S.L. Bracken: Estimating Species Abundance in Metagenomics Data. Peerj Comput. Sci. 2017, 3, e104. [Google Scholar] [CrossRef]
  21. Breitwieser, F.P.; Salzberg, S.L. Pavian: Interactive Analysis of Metagenomics Data for Microbiome Studies and Pathogen Identification. Bioinformatics 2020, 36, 1303–1304. [Google Scholar] [CrossRef]
  22. Mihara, T.; Nishimura, Y.; Shimizu, Y.; Nishiyama, H.; Yoshikawa, G.; Uehara, H.; Hingamp, P.; Goto, S.; Ogata, H. Linking Virus Genomes with Host Taxonomy. Viruses 2016, 8, 66. [Google Scholar] [CrossRef]
  23. Shang, J.; Tang, X.; Sun, Y. PhaTYP: Predicting the Lifestyle for Bacteriophages Using BERT. Brief. Bioinform. 2023, 24, bbac487. [Google Scholar] [CrossRef] [PubMed]
  24. Shang, J.; Jiang, J.; Sun, Y. Bacteriophage Classification for Assembled Contigs Using Graph Convolutional Network. Bioinformatics 2021, 37, i25–i33. [Google Scholar] [CrossRef]
  25. Shang, J.; Sun, Y. CHERRY: A Computational metHod for accuratE pRediction of Virus-pRokarYotic Interactions Using a Graph Encoder-Decoder Model. Brief. Bioinform. 2022, 23, bbac182. [Google Scholar] [CrossRef] [PubMed]
  26. Seemann, T. Prokka: Rapid Prokaryotic Genome Annotation. Bioinformatics 2014, 30, 2068–2069. [Google Scholar] [CrossRef]
  27. Zhang, H.; Yohe, T.; Huang, L.; Entwistle, S.; Wu, P.; Yang, Z.; Busk, P.K.; Xu, Y.; Yin, Y. dbCAN2: A Meta Server for Automated Carbohydrate-Active Enzyme Annotation. Nucleic Acids Res. 2018, 46, W95–W101. [Google Scholar] [CrossRef] [PubMed]
  28. Tu, Q.; Lin, L.; Cheng, L.; Deng, Y.; He, Z. NCycDB: A Curated Integrative Database for Fast and Accurate Metagenomic Profiling of Nitrogen Cycling Genes. Bioinformatics 2019, 35, 1040–1048. [Google Scholar] [CrossRef]
  29. Zeng, J.; Tu, Q.; Yu, X.; Qian, L.; Wang, C.; Shu, L.; Liu, F.; Liu, S.; Huang, Z.; He, J.; et al. PCycDB: A Comprehensive and Accurate Database for Fast Analysis of Phosphorus Cycling Genes. Microbiome 2022, 10, 101. [Google Scholar] [CrossRef]
  30. Buchfink, B.; Reuter, K.; Drost, H.-G. Sensitive Protein Alignments at Tree-of-Life Scale Using DIAMOND. Nat. Methods 2021, 18, 366–368. [Google Scholar] [CrossRef]
  31. de Nies, L.; Lopes, S.; Busi, S.B.; Galata, V.; Heintz-Buschart, A.; Laczny, C.C.; May, P.; Wilmes, P. PathoFact: A Pipeline for the Prediction of Virulence Factors and Antimicrobial Resistance Genes in Metagenomic Data. Microbiome 2021, 9, 49. [Google Scholar] [CrossRef] [PubMed]
  32. Liu, B.; Zheng, D.; Zhou, S.; Chen, L.; Yang, J. VFDB 2022: A General Classification Scheme for Bacterial Virulence Factors. Nucleic Acids Res. 2022, 50, D912–D917. [Google Scholar] [CrossRef] [PubMed]
  33. Albano, C.; Bagarello, S.; Giordano, S.; Sanfilippo, M.F.; Comparato, C.; Scardino, G.; Garbo, V.; Boncori, G.; Condemi, A.; Cascio, A.; et al. Granulicatella Spp., a Causative Agent of Infective Endocarditis in Children. Pathogens 2022, 11, 1431. [Google Scholar] [CrossRef] [PubMed]
  34. Vanoni, M.A.; Curti, B. Glutamate Synthase: A Complex Iron-Sulfur Flavoprotein. Cell Mol. Life Sci. 1999, 55, 617–638. [Google Scholar] [CrossRef] [PubMed]
  35. Gaillot, O.; Pellegrini, E.; Bregenholt, S.; Nair, S.; Berche, P. The ClpP Serine Protease Is Essential for the Intracellular Parasitism and Virulence of Listeria Monocytogenes. Mol. Microbiol. 2000, 35, 1286–1294. [Google Scholar] [CrossRef]
  36. Yamaguchi, M.; Terao, Y.; Mori, Y.; Hamada, S.; Kawabata, S. PfbA, a Novel Plasmin- and Fibronectin-Binding Protein of Streptococcus Pneumoniae, Contributes to Fibronectin-Dependent Adhesion and Antiphagocytosis. J. Biol. Chem. 2008, 283, 36272–36279. [Google Scholar] [CrossRef]
  37. Iyer, L.M.; Koonin, E.V.; Aravind, L. Classification and Evolutionary History of the Single-Strand Annealing Proteins, RecT, Redβ, ERF and RAD52. BMC Genom. 2002, 3, 8. [Google Scholar] [CrossRef]
  38. Tang, Y.; Liang, Z.; Li, G.; Zhao, H.; An, T. Metagenomic Profiles and Health Risks of Pathogens and Antibiotic Resistance Genes in Various Industrial Wastewaters and the Associated Receiving Surface Water. Chemosphere 2021, 283, 131224. [Google Scholar] [CrossRef] [PubMed]
  39. Leroy-Freitas, D.; Machado, E.C.; Torres-Franco, A.F.; Dias, M.F.; Leal, C.D.; Araújo, J.C. Exploring the Microbiome, Antibiotic Resistance Genes, Mobile Genetic Element, and Potential Resistant Pathogens in Municipal Wastewater Treatment Plants in Brazil. Sci. Total Environ. 2022, 842, 156773. [Google Scholar] [CrossRef]
  40. Fierer, N.; Holland-Moritz, H.; Alexiev, A.; Batther, H.; Dragone, N.B.; Friar, L.; Gebert, M.J.; Gering, S.; Henley, J.B.; Jech, S.; et al. A Metagenomic Investigation of Spatial and Temporal Changes in Sewage Microbiomes across a University Campus. mSystems 2022, 7, e0065122. [Google Scholar] [CrossRef]
  41. Ramees, T.P.; Dhama, K.; Karthik, K.; Rathore, R.S.; Kumar, A.; Saminathan, M.; Tiwari, R.; Malik, Y.S.; Singh, R.K. Arcobacter: An Emerging Food-Borne Zoonotic Pathogen, Its Public Health Concerns and Advances in Diagnosis and Control—A Comprehensive Review. Vet. Q. 2017, 37, 136–161. [Google Scholar] [CrossRef]
  42. Chieffi, D.; Fanelli, F.; Fusco, V. Arcobacter Butzleri: Up-to-Date Taxonomy, Ecology, and Pathogenicity of an Emerging Pathogen. Compr. Rev. Food Sci. Food Saf. 2020, 19, 2071–2109. [Google Scholar] [CrossRef] [PubMed]
  43. Ferreira, S.; Queiroz, J.A.; Oleastro, M.; Domingues, F.C. Insights in the Pathogenesis and Resistance of Arcobacter: A Review. Crit. Rev. Microbiol. 2016, 42, 364–383. [Google Scholar] [CrossRef] [PubMed]
  44. Bibby, K.; Peccia, J. Identification of Viral Pathogen Diversity in Sewage Sludge by Metagenome Analysis. Environ. Sci. Technol. 2013, 47, 1945–1951. [Google Scholar] [CrossRef]
  45. Roizman, B.; Baines, J. The Diversity and Unity of Herpesviridae. Comp. Immunol. Microbiol. Infect. Dis. 1991, 14, 63–79. [Google Scholar] [CrossRef]
  46. McFadden, G. Poxvirus Tropism. Nat. Rev. Microbiol. 2005, 3, 201–213. [Google Scholar] [CrossRef] [PubMed]
  47. Kuo, H.-W.D.; Zure, D.; Lin, C.-R. Occurrences of Similar Viral Diversity in Campus Wastewater and Reclaimed Water of a University Dormitory. Chemosphere 2023, 330, 138713. [Google Scholar] [CrossRef] [PubMed]
  48. Farhadkhani, M.; Nikaeen, M.; Yadegarfar, G.; Hatamzadeh, M.; Pourmohammadbagher, H.; Sahbaei, Z.; Rahmani, H.R. Effects of Irrigation with Secondary Treated Wastewater on Physicochemical and Microbial Properties of Soil and Produce Safety in a Semi-Arid Area. Water Res. 2018, 144, 356–364. [Google Scholar] [CrossRef] [PubMed]
  49. Gonzales-Gustavson, E.; Rusiñol, M.; Medema, G.; Calvo, M.; Girones, R. Quantitative Risk Assessment of Norovirus and Adenovirus for the Use of Reclaimed Water to Irrigate Lettuce in Catalonia. Water Res. 2019, 153, 91–99. [Google Scholar] [CrossRef]
  50. Moazeni, M.; Nikaeen, M.; Hadi, M.; Moghim, S.; Mouhebat, L.; Hatamzadeh, M.; Hassanzadeh, A. Estimation of Health Risks Caused by Exposure to Enteroviruses from Agricultural Application of Wastewater Effluents. Water Res. 2017, 125, 104–113. [Google Scholar] [CrossRef]
  51. Savchenko, O.M.; Kecinski, M.; Li, T.; Messer, K.D. Reclaimed Water and Food Production: Cautionary Tales from Consumer Research. Environ. Res. 2019, 170, 320–331. [Google Scholar] [CrossRef] [PubMed]
  52. Deng, S.; Yan, X.; Zhu, Q.; Liao, C. The Utilization of Reclaimed Water: Possible Risks Arising from Waterborne Contaminants. Environ. Pollut. 2019, 254, 113020. [Google Scholar] [CrossRef] [PubMed]
  53. Soucy, S.M.; Huang, J.; Gogarten, J.P. Horizontal Gene Transfer: Building the Web of Life. Nat. Rev. Genet. 2015, 16, 472–482. [Google Scholar] [CrossRef] [PubMed]
  54. Strange, J.E.S.; Leekitcharoenphon, P.; Møller, F.D.; Aarestrup, F.M. Metagenomics Analysis of Bacteriophages and Antimicrobial Resistance from Global Urban Sewage. Sci. Rep. 2021, 11, 1600. [Google Scholar] [CrossRef]
  55. Saier, M.H.; Reddy, B.L. Holins in Bacteria, Eukaryotes, and Archaea: Multifunctional Xenologues with Potential Biotechnological and Biomedical Applications. J. Bacteriol. 2015, 197, 7–17. [Google Scholar] [CrossRef] [PubMed]
  56. Anderson, C.L.; Sullivan, M.B.; Fernando, S.C. Dietary Energy Drives the Dynamic Response of Bovine Rumen Viral Communities. Microbiome 2017, 5, 155. [Google Scholar] [CrossRef] [PubMed]
  57. Chen, Y.; Wang, Y.; Paez-Espino, D.; Polz, M.F.; Zhang, T. Prokaryotic Viruses Impact Functional Microorganisms in Nutrient Removal and Carbon Cycle in Wastewater Treatment Plants. Nat. Commun. 2021, 12, 5398. [Google Scholar] [CrossRef]
  58. Kimura, T.; Kobayashi, K. Role of Glutamate Synthase in Biofilm Formation by Bacillus subtilis. J. Bacteriol. 2020, 202, e00120-20. [Google Scholar] [CrossRef]
Figure 1. The relative abundance of bacterial genera in all samples and the bacterial pathogen in each sample.
Figure 1. The relative abundance of bacterial genera in all samples and the bacterial pathogen in each sample.
Viruses 16 00535 g001
Figure 2. Viral composition and the relative abundance of pathogenic viruses. (A) Viral composition of all samples. Number means the number of reads classified in each level. (B) Relative abundance of the dominant pathogenic viral families among all viral sequences.
Figure 2. Viral composition and the relative abundance of pathogenic viruses. (A) Viral composition of all samples. Number means the number of reads classified in each level. (B) Relative abundance of the dominant pathogenic viral families among all viral sequences.
Viruses 16 00535 g002
Figure 3. Predicted hosts of viral contigs from WWTPs.
Figure 3. Predicted hosts of viral contigs from WWTPs.
Viruses 16 00535 g003
Figure 4. The number of viral AMGs that participate in the carbon (A), nitrogen (B), and phosphorus (C) cycles.
Figure 4. The number of viral AMGs that participate in the carbon (A), nitrogen (B), and phosphorus (C) cycles.
Viruses 16 00535 g004
Figure 5. Composition (above) and number (below) of detected ARGs in the virome within WWTPs.
Figure 5. Composition (above) and number (below) of detected ARGs in the virome within WWTPs.
Viruses 16 00535 g005
Figure 6. Heatmap representing the abundance of viral genes assigned to the virulence factor count of each sample. The red font indicates dominant genes.
Figure 6. Heatmap representing the abundance of viral genes assigned to the virulence factor count of each sample. The red font indicates dominant genes.
Viruses 16 00535 g006
Figure 7. Relative abundance of viral genes assigned to toxin within all samples.
Figure 7. Relative abundance of viral genes assigned to toxin within all samples.
Viruses 16 00535 g007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, H.; Yang, M.; Fan, X.; Gui, Q.; Yi, H.; Tong, Y.; Xiao, W. A Metagenomic Investigation of Potential Health Risks and Element Cycling Functions of Bacteria and Viruses in Wastewater Treatment Plants. Viruses 2024, 16, 535. https://doi.org/10.3390/v16040535

AMA Style

Zhao H, Yang M, Fan X, Gui Q, Yi H, Tong Y, Xiao W. A Metagenomic Investigation of Potential Health Risks and Element Cycling Functions of Bacteria and Viruses in Wastewater Treatment Plants. Viruses. 2024; 16(4):535. https://doi.org/10.3390/v16040535

Chicago/Turabian Style

Zhao, Haozhe, Mingfei Yang, Xiang Fan, Qian Gui, Hao Yi, Yigang Tong, and Wei Xiao. 2024. "A Metagenomic Investigation of Potential Health Risks and Element Cycling Functions of Bacteria and Viruses in Wastewater Treatment Plants" Viruses 16, no. 4: 535. https://doi.org/10.3390/v16040535

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop