Metagenomic Methods for Addressing NASA's Planetary Protection Policy Requirements on Future Missions: A Workshop Report

Molecular biology methods and technologies have advanced substantially over the past decade. These new molecular methods should be incorporated among the standard tools of planetary protection (PP) and could be validated for incorporation by 2026. To address the feasibility of applying modern molecular techniques to such an application, NASA conducted a technology workshop with private industry partners, academics, and government agency stakeholders, along with NASA staff and contractors. The technical discussions and presentations of the Multi-Mission Metagenomics Technology Development Workshop focused on modernizing and supplementing the current PP assays. The goals of the workshop were to assess the state of metagenomics and other advanced molecular techniques in the context of providing a validated framework to supplement the bacterial endospore-based NASA Standard Assay and to identify knowledge and technology gaps. In particular, workshop participants were tasked with discussing metagenomics as a stand-alone technology to provide rapid and comprehensive analysis of total nucleic acids and viable microorganisms on spacecraft surfaces, thereby allowing for the development of tailored and cost-effective microbial reduction plans for each hardware item on a spacecraft. Workshop participants recommended metagenomics approaches as the only data source that can adequately feed into quantitative microbial risk assessment models for evaluating the risk of forward (exploring extraterrestrial planet) and back (Earth harmful biological) contamination. Participants were unanimous that a metagenomics workflow, in tandem with rapid targeted quantitative (digital) PCR, represents a revolutionary advance over existing methods for the assessment of microbial bioburden on spacecraft surfaces. The workshop highlighted low biomass sampling, reagent contamination, and inconsistent bioinformatics data analysis as key areas for technology development. Finally, it was concluded that implementing metagenomics as an additional workflow for addressing concerns of NASA's robotic mission will represent a dramatic improvement in technology advancement for PP and will benefit future missions where mission success is affected by backward and forward contamination.


Introduction
S ince the beginning of extraterrestrial exploration by NASA, planetary protection (PP) has been an important effort to prevent biological forward contamination of non-Earth environments. The Committee on Space Research (COSPAR) has formulated a Planetary Protection Policy with associated implementation requirements as an international standard to protect against interplanetary biological and organic contamination (UNOOSA, 2017).
Bacterial endospores, due to their ability to survive extreme conditions, are considered worst-case biological indicators for verifying spacecraft cleanliness (COSPAR, 2011). The current NASA Standard Assay (NSA) is designed to enumerate microorganisms that have formed endospores at the time of sampling, respire aerobically, survive a heat treatment at 80°C for 15 min, and can be cultivated on Tryptic Soy Agar (NASA, 2011).
Although the NSA method, developed to assess PP risk for spacecraft bound for Mars, has been suitable for some flight missions (e.g., COSPAR Category IVa, IVb and IVc to Mars), the NSA method alone is insufficient for missions with sample return or life detection. The inability of the NSA method to detect all PP-relevant taxa or dead organisms or environmental DNA (eDNA) could potentially lead to false negative results for undetected biology. Although the NSA may serve as an indicator of biological cleanliness, it does not identify PPrelevant microbial taxa capable of flight survival, nor can it address the issue of microbial diversity on spacecraft surfaces.
Bacterial endospore counts are neither representative of total spacecraft bioburden nor the only microorganisms of PP concern (e.g., La Duc et al., 2007;Cooper et al., 2011;Hendrickson et al., 2021) and not all recovered organisms, viruses, or virus like particles (VLP), recovered from extreme environments require sporulation to survive. To assess total bioburden and to identify microorganisms of concern, as well as to understand their true potential for forward contamination during robotic spaceflight missions, cultivationindependent molecular methods capable of detecting all microorganisms are needed (NASA, 2019).
Although PP requirements have evolved to meet the needs of increasingly sophisticated missions, nucleic acidsbased technologies have not yet been vetted and adopted into standard PP practices except as research tools (La Duc et al., 2014). Molecular biology approaches targeting nucleic acids enable broad-based detection of all microbial taxa and deliver information needed for life detection protocols. Such information can be used to develop tailored microbial reduction strategies, to communicate biological contamination and function risk assessments, and to facilitate options for end-of-mission strategies.
For example, in Mars sample return missions, detailed inventories of spacecraft bioburden are essential when comparing outbound and inbound samples. Such analyses are needed to fully and confidently assess the authenticity of returned and in situ analyzed samples (i.e., bio-signatures of extraterrestrial origin vs. terrestrial contamination) (Venkateswaran et al., 2012;Minich et al., 2018;Hendrickson et al., 2021). For Ocean Worlds missions, there is an increase in the cleanliness standard for PP compliance, and more complete spacecraft bioburden data will be needed for probabilistic risk assessments for landing and sampling.
Addressing the Space Studies Board's (SSB) earlier recommendations, research performed in FY19 demonstrated that sample processing has advanced sufficiently, and NASA's future missions can now implement molecular biology methods. Appropriate and stringent validation will still be necessary for effective implementation (Venkateswaran et al., 2012). The high informational content of metagenomic sequence data for species-and strain-level taxonomic assignment and for functional gene identification allows for an informed risk-based assessment of spacecraft biological contamination.
In addition, taxon-or gene-specific bioburden can be quantitatively determined using quantitative PCR techniques (e.g., Schnetzinger et al., 2013). Implementation of metagenomics from the current research-focused applications to routine PP practice will benefit all current and planned future missions. It will increase confidence in the ability to reduce forward contamination threats during exploration of planets, and it will lead to tailored, more efficient, and costeffective microbial reduction plans for each hardware item on a spacecraft. Worst-case scenario modeling approaches (Singh et al., 2018) will be avoided by characterizing the total microbial community on each piece of hardware.
Although remarkable strides have been made in molecular biology applications, there are still components of the overall workflow that require tailoring for spacecraft monitoring. These include: (1) sample collection; (2) sample processing (extraction, library preparation, and sequencing); (3) sequence data processing and bioinformatics; and (4) establishing a curated list of microorganisms and genes relevant to each mission.
To address the evolving technologies and to support the needs of the PP program, the workshop was held to facilitate a state-of-the-art assessment of metagenomics and to provide NASA with recommendations for a modern nucleic acids-based spacecraft cleanliness verification approach.

Objectives
The primary objective of this article is to summarize the outcomes of the Multi-Mission Metagenomics Technology Development Workshop held on January 19, 2022. The workshop goals were to (1) assess the state of metagenomics and other advanced molecular biology techniques in the context of providing a validated framework to supplement the NSA, (2) identify knowledge and technology gaps, and (3) determine what steps are needed to certify molecular techniques as new PP methods.
Although shotgun metagenomic sequencing and other cultivation-independent techniques have been widely adopted in academia, industry, and government agencies, there is a requirement to validate these methods to address the SSB recommendations and to modernize NASA's PP practices (National Academies of Sciences, Engineering, and Medicine, 2021). The steps in certification include sample collection and processing, nucleic acids-based molecular analyses, data analysis pipeline development, and quantitative risk assessment for PP-relevant microorganisms.
The workshop highlighted low biomass sampling, reagent contamination, and inconsistent bioinformatic data analysis as critical areas for further knowledge base and technology development.
The PP implementation and spacecraft cleaning workflows and the resulting high cleanliness make molecular characterization of low biomass spacecraft surfaces a challenge. Nonetheless, experts at this workshop suggested that nucleic acids-based technologies have matured, and steps should be taken to initiate and adopt such techniques for spacecraft bioburden taxonomic assessment, functional gene/functional potential assessment, and quantitative analyses of taxa and genes of interest.
The potential benefits of achieving the workshop goals for current and planned future NASA missions include: (1) the ability to perform rapid assessment of multi-domain (i.e., Bacteria, Archaea, and Eukarya) spacecraft-associated microbial burden; (2) increased throughput by automation; (3) data that will contribute to risk analysis and modeling based on functional profiles rather than conservative guidelines; and (4) the design of highly tailored microbial reduction strategies used to reduce relevant viable microorganisms present on hardware.

Current Technologies for Bioburden Assessment
Historically, bioburden assessment and microbial reduction strategies were adopted in part from food and pharmaceutical practices. The current NSA technique was established in the 1960s (NASA, 1967), though subsequently it was reported that the microorganisms cultured by the NSA account for <0.1% of the overall microbial population present on a given spacecraft surface (Ghosh et al., 2010).
Further, not all endospore-forming bacteria tolerate spaceflight conditions, whereas other microorganisms of PP relevance, such as fungi, will be missed due to the restrictive cultivation conditions of the NSA (e.g., La Duc et al., 2007;Cooper et al., 2011). As spacecraft size and complexity increases, including surfaces that are difficult to clean, along with the use of probabilistic risk assessment-based requirements (e.g., Mars 2020 sample return and Europa Lander), the assumption that potentially all microorganisms may tolerate spaceflight is untenable from biological, engineering, and mathematical perspectives.
Thus, taxonomic and functional gene content information derived from molecular methods (e.g., metagenomic sequencing) is the only way to populate the kinds of risk models available for identification and quantitation of PP-relevant (i.e., contamination threat) microorganisms.
In addition to the NSA, NASA has certified other bioburden assessments, including two rapid assays approved in 2005 as alternative measures of microbial contamination (Table 1). The adenosine triphosphate (ATP) assay and the Limulus Amoebocyte Lysate (LAL) assay (Benardini and Venkateswaran, 2016) were implemented principally to not only provide rapid analysis to circumvent the 3-day incubation period of the NSA, but also offer a means to screen spacecraft hardware for microbial contamination (Kern et al., 2005;Morris et al., 2010).
However, the LAL and ATP assays are unable to provide any taxonomic information, to distinguish between microorganisms of PP concern and those of no concern, or to provide quantitative cell abundance for microorganisms of concern (Table 1). Thus, the lack of a rapid quantitative or semi-quantitative method for assessing microbial diversity and taxonomy is recognized as a capability gap that still needs to be addressed to characterize total and PP-relevant microorganisms in cleanrooms and on spacecraft. Analyses of nucleic acids, including metagenomic sequencing and quantitative digital PCR analysis, are the most appropriate technologies to address this gap.

Developing and Implementing Molecular Biology Techniques for Microbial Inventory of Low Biomass Environments
In 1992, the SSB document ''Biological Contamination of Mars: Issues and Recommendations'' (NRC, 1992) asked for the immediate development of methods beyond the NSA for determining microbial bioburden. In 2006, in a document titled ''Preventing the Forward Contamination of Mars'' (NRC, 2006), the SSB recommended that NASA require the systematic collection of phylogenetic data to assess microbial communities in the assembly, test, and launch operation (ATLO) environments associated with spacecraft to be sent to Mars.
This 2006 report also recommended the implementation of broad-based assessments of microbial bioburden such as the ATP and LAL assays. Further, in 2019, the NASA Planetary Protection Independent Review Board (PPIRB) prepared a final report to NASA/Science Mission Directorate (SMD) that indicated that NASA PP policy should move beyond exclusive adherence to bacterial endospore counts. The use of proven modern techniques and wellestablished genomic tools for monitoring and characterization of bioburden of cleanroom facilities and flight hardware was encouraged (Space Science Board, 2018, 2020).
The Mars Program Office at JPL and NASA's Office of Planetary Protection funded several activities to test molecular methods and the feasibility of such methods for measuring bioburden and biodiversity of spacecraft associated surfaces. Despite these recommendations, however, the NSA remains the only fully validated and approved method of spacecraft bioburden assessment for verifying compliance with COSPAR. Over the past two decades, the rate of molecular tool development for non-cultivation-based characterization of microbial communities has increased dramatically. These techniques have included low-throughput 16S ribosomal RNA (rRNA) gene amplicon cloning (late 1990s), 16S rRNA microarray analyses (early 2000s; e.g., Loy et al., 2002), high-throughput 16S rRNA gene amplicon and fungal internal transcribed spacer region amplicon next-generation sequencing (e.g., Huse et al., 2007;Buee et al., 2009), and shotgun metagenomic sequencing in late 2000s and 2010s (e.g., Bashir et al., 2016;Danko et al., 2021).
High-throughput 16S rRNA gene amplicon sequencing has been employed for a large number of studies of microbial communities but the utility of the method is negatively impacted by incomplete taxonomic coverage/lower diversity (e.g., Klindworth et al., 2013;Hug et al., 2016), limited phylogenetic resolution (e.g., Poretsky et al., 2014), and biases introduced by PCR amplification leading to a distortion of the true microbial community structure (e.g., Green et al., 2015;Naqib et al., 2019).
Further, no single primer set is capable of targeting all domains of life. Conversely, non-targeted shotgun metagenomic sequencing provides the best opportunity for lowbias preparation of low biomass samples. The non-selective nature of metagenomics allows it to serve as a discovery tool with potential for detecting and identifying PP-relevant microbial taxa across all three domains of life and from some viruses, regardless of prior identification or ability to cultivate.
Although some metagenome sequencing approaches are relatively rapid, the overall turn-around time for nucleic acids extraction, library preparation, sequencing, and anal-ysis is not rapid enough for routine spacecraft bioburden assessment. A comparison of relevant characteristics of realtime PCR, digital PCR, amplicon sequencing, and shotgun sequencing is shown in Table 2.
To overcome the turnaround time limitations of sequencing methods, the participating workshop scientists identified quantitative PCR as a viable option, including both real-time PCR and digital PCR. Digital PCR has several advantages relative to real-time PCR, including absolute quantification without need for standards, relative insensitivity to enzymatic inhibitors, and robust resolution at low template copy numbers (e.g., Tan et al., 2021).
For samples with low levels of biomass, severe patchiness can occur, and numerous replicate extractions and analyses can be employed to detect contamination. Although digital PCR has been suggested for low yield samples, its use has not yet been widely adopted by NASA researchers.
Singly or in combination, the use of these molecular approaches could enable whole genome data acquisition and phylogenetically informed quantitative results more rapidly than the three days of cultivation needed for NSA. Digital PCR performed with targeted molecular assays could provide quantitative data relevant to PP cleanliness concerns within a single day. Assays targeting highly conserved genes (e.g., 16S rRNA genes in bacteria and archaea) are likely to be more sensitive (e.g., due to the presence of multiple copies of rRNA gene operons in most microbial cells).
However, varying rRNA gene copy numbers cannot be converted exactly into cell numbers, though lineage-specific conversions have been attempted (e.g., Kembel et al., 2012;Angly et al., 2014). For narrower taxon-specific analyses, single copy genes (e.g., rpoB, gyrB, etc.) are easier to convert to cell numbers and may also be relatively insensitive to reagent contamination issues that afflict non-targeted metagenomics approaches. Quantitative analyses should be employed in tandem with metagenomic sequencing to provide high-resolution functional gene content and allow for discovery of novel microorganisms of PP concern. These combined datasets will provide quantitative and semi-quantitative taxonomic and functional information about spacecraft-associated microbiomes, enabling decision making on a comprehensive data set and thereby lowering contamination risk.
Although the focus of committee members was on PP rather than Astrobiology, the committee did discuss the need to differentiate between live cells (affecting forward contamination) and dead cells and viruses, VLP and eDNA molecules (affecting life detection science missions). Nucleic acids-based characterization of microbial communities can permit the differentiation of live and dead organisms (e.g., Vaishampayan et al., 2013), and this can be critically important, depending on mission criteria.
Although microbial growth and activity has been inferred from metagenomic sequence data using bioinformatic strategies (Emiola and Oh, 2018;Danko et al., 2021), more commonly, a selective degradation or inactivation of free nucleic acid is performed through enzymatic (e.g., DNase; Marotz et al., 2018) or chemical (e.g., propidium monoazide [PMA]; Nocker et al., 2006) treatment. However, whether these methods are viable at the low nucleic acid levels in cleanroom and spacecraft samples is not yet known.
Further, prior studies have indicated that the application of PMA treatment may not generate quantitative results in complex ecosystems . Alternatively, interrogation of RNA instead of DNA focuses sequencing efforts on viable organisms, as RNA molecules degrade rapidly after cell death (e.g., Li et al., 2017). Targeting RNA instead of DNA could also increase the limit of detection due to higher numbers of ribosomes per cell relative to copies of rRNA gene operons per cell (e.g., Nardello et al., 2020), though ribosome copy numbers cannot be readily converted into cell numbers.

Process Requirements for Validation of Metagenomic Sequence Technology
Although there are clear advantages to using molecular tools for evaluating spacecraft bioburden, these tools have specific limitations that must be addressed before a robust protocol can be implemented by NASA. The most serious of these limitations is derived from the extremely low nucleic acid yields from spacecraft swab samples.
These DNA recoveries are often in the femtogram (fg) to picogram (pg) range, which are levels at or below the tolerance threshold for established sequencing detection (e.g., Rinke et al., 2016). Further, the presence of reagent nucleic acid contamination must be considered when analyzing ultra-low bioburden samples. Ascertainment of the true signal from the background noise, as well as establishing reproducibility from multiple extractions (e.g., de Goffau et al., 2018), will be necessary.
Such low concentrations of recovered nucleic acids may require additional DNA amplification strategies (Silander and Saarela, 2008;Ahsanuddin et al., 2017), which may lead to further complications from the amplification of the contaminating DNA within reagents (i.e., the ''kitome'') and other background signals as well as the potential of added noise from erroneous amplifications.
Reagent contamination issues can only be controlled through use of proper controls and ultrapure reagents (e.g., Salter et al., 2014;Eisenhofer et al., 2019). In addition, although library synthesis in protocols can be largely biasfree, the nucleic acid extraction process is known to introduce bias, and microorganisms undergoing incomplete lysis can be undercounted or missed entirely in molecular analyses. Further, by itself, sequencing performed on extracted DNA cannot distinguish between DNA derived from living cells and remnant or contaminating DNA.
Thus, careful consideration is needed in the application of metagenomics to ensure that the appropriate protocols are developed to capture microbial signatures exclusively from living cells, as well as proper controls to account for background reagent-contaminating DNA ''noise.'' Additional challenges for deriving absolute quantification from sequencing may be overcome by implementing spike-ins during sample processing or by implementing quantitative rapid molecular methods such as digital quantitative PCR.
Nonetheless, in addition to extraction methods with complete microbial cell lysis, all methods will need ultra-clean reagents (i.e., DNA free) across all stages of sample processing (i.e., collection, extraction, pre-amplification, and library synthesis), proper controls, and stable bioinformatics pipelines.
Thus, a molecular verification process to supplement or replace the NSA bioburden assessment will have four key features: (1) rapid quantification of biological contaminants above background noise; (2) characterization of taxonomic and functional gene content of microorganisms; (3) viability assessment of detected microorganisms; and (4) accurate determination of the functional potential and risk profile of contaminants.
The first two features can be addressed with a molecular biology workflow, the third with chemical pretreatment followed by molecular analysis or direct interrogation of RNA in place of DNA, and the fourth can be addressed with bioinformatics. To achieve these goals, method development and validation is needed in four areas, including: (1) sample collection, (2) sample processing and sequencing, (3) bioinformatics, and (4) risk assessment.

Sample collection
The certification goal is to secure flight-approved sampling devices for large ( ‡0.1 m 2 , wipe) and small (*25 cm 2 , swab) surface areas that will have (1) repeatable and high yield microbial recovery from spacecraft surfaces; (2) optimized yield for release of biological materials from the sampling device; (3) molecular cleanliness with low noise/ background nucleic acids and contact residue (<0.02 mg/ cm 2 ); (4) a predictable dynamic range of collection in high and low biomass systems (i.e., those producing £10 ng of Recommendation: Flight-certified swab and wipe methods for molecular assays are needed. Sampling device qualification should include a comparison of accuracy, precision, specificity, limit of detection, robustness, linearity, and background nucleic acid levels. A standard operating protocol for sample collection should be developed. DNA and those producing >10 ng of DNA); (5) compliance with ISO5 to 8.5 cleanroom particulate count [foreign object debris (AIA/NAS 412)/ISO 14664-1:2015(E)]; and (6) compliance with electrostatic discharge demands (<50 V sensitivity; ANSI/ESDA/JEDEC JS-002-2014).
Prior studies have shown cotton swabs to be superior to Copan swabs for smaller (25 cm 2 ) spacecraft surfaces for collecting endospore-forming bacteria (e.g., Kwan et al., 2011). Likewise, DNA-free macrofoam-based device collected more biomolecules when compared with the polyester wipes for larger (1 m 2 ) surfaces. These studies employed quantitative PCR to measure microbial burden (Kwan et al., 2011;Bargoma et al., 2013).
If the goals of the metagenomics workshop are achieved, standardized sampling devices and an operating protocol for the collection of microorganisms and other bio-signatures from spacecraft surfaces will be developed along with a comprehensive background nucleic acids profile for the sampling devices. Efficiency of nucleic acids recovery and limits of detection will need to be evaluated with reference materials such as mock microbial communities. Procedures for approval testing and validation of new sampling devices should also be established.

Sample processing
Prior evaluation of sample processing techniques revealed that filtration devices increased the yield of the biomolecules of interest when low-biomass samples such as spacecraftassociated surfaces were examined (La Duc et al., 2009. In addition, analysis of microbial communities using targeted amplicon sequencing suggested that microbial profiles were dominated by the kitome (Minich et al., 2018). To gain information on the functional characteristics of spacecraft and cleanroom microbiomes, shotgun metagenome sequencing was also studied, including on samples with extremely low nucleic acid yields (e.g., Danko et al., 2021).
Based on prior results and current technologies, experts at the workshop identified a hybrid molecular approach, combining nucleic acid extraction, quantitative digital PCR, and metagenomic sequencing as a strategy to both meet rapid enumeration goals and provide taxonomic and functional gene sequence data needed for sophisticated risk assessment. Using rapid nucleic acid extraction followed by digital PCR with domain-level and taxon-specific digital PCR assays, spacecraft-relevant microorganisms can be enumerated on the scale of 6 h (Fig. 1).
A slower metagenomic sequencing approach can be used for broad detection of microbial taxa and genes of concern and provide data for risk modeling. Overall, the desired outcome is a protocol that will be adopted into a NASA PP handbook and will include SOPs for nucleic acid extraction, decision-making protocol for selection of digital PCR assays, viable organism enumeration, and nucleic acid sequencing relevant to spacecraft bioburden analysis.
However, there is a need to invest resources and fund commercial development of methods to increase the yield of biomolecules during extractions and reduce reagent contamination (Minich et al., 2018). Workshop experts also identified a need for DNA library synthesis protocols for long-read sequencers from ultra-low bioburden samples to improve identification of microbial taxa and genes.

Computational and bioinformatics analysis
Although the study of metagenomes is more than 20 years old (Handelsman et al., 1998), relatively little effort has been made to standardize or assess the accuracy of the numerous software packages that perform quality control or FIG. 1. A proposed hybrid molecular biology workflow for PP bioburden assessment. A combination of rapid, multiplexed, sequence-guided quantitative digital PCR assays is proposed together with longer-term knowledge-building next-generation sequence-acquisition workflows. Digital PCR analysis, based on assays built on a priori knowledge of taxa and genes of interest, can be used for functional gene or rRNA gene targets. To address reproducibility issues and DNA extraction efficiency limitations, mixtures of microbial cells at known abundances can be spiked into DNA extractions (1). For viability testing, samples can be treated using PMA before DNA extraction (2). For estimation of library preparation efficiency and to address background contamination, nucleic acid spike-ins can be performed before molecular processing (3). PMA, propidium monoazide; PP, planetary protection; rRNA, ribosomal RNA.

Recommendation:
The sample processing protocol should meet the following requirements: (1) lyse open cells and permit nucleic acid isolation from all domains of life including hardy microorganisms; (2) differentiate between viable and non-viable taxa; (3) have the ability to process nucleic acid samples from both high and low biomass samples (e.g., 1-10 and >10 pg); and (4) define robust sampling and library preparation negative and positive controls.

902
GREEN ET AL.
annotation of sequence data, especially with low-biomass samples such as those collected from cleanroom surfaces (e.g., McIntyre et al., 2017;Vollmers et al., 2017;Ye et al., 2019). When comparing results from metagenomic analyses performed by different bioinformatics groups on the same dataset sampled from a cleanroom, Wood et al. (2021) demonstrated that there was poor agreement of detected species, and relative abundance of overlapping species. Although each of the bioinformatics groups in the study detected species that have the potential for forward contamination, the disagreement in species-level annotation hampers the ability of metagenomic sequencing to provide actionable information to NASA. For example, falsely detected species might require extra cleaning of spacecraft, which could negatively affect the schedule of a spacecraft assembly and delay launch.
Conversely, problematic species that are present but not properly annotated could enable forward contamination. Most concerning was that none of the tested bioinformatics pipelines could annotate more than 15% of the sequence data, due to reference databases lacking the breadth of microbial diversity present in cleanroom environments or being of PP-relevance.
To address these concerns, significant effort is needed to test for accuracy (i.e., specificity and sensitivity) and to standardize metagenomics pipelines, potentially using artificially generated metagenomes containing PP relevant taxa. To further increase the accuracy of these pipelines, public reference databases must be fortified with genomes of microbial species found in cleanroom environments, cleanroomadjacent facilities without strict cleaning regimes (to capture hitchhikers), and from environments that resemble those on extraterrestrial planetary bodies (e.g., radionucleotide dumping sites, nuclear accidents, and hot and cold arid biomes, etc.), before they are used for PP analysis.
In addition, database curation to remove poorly sequenced entries and/or to increase the stringency and accuracy of metagenomically assembly genomes (MAGs) is paramount to ensure accurate results. Efforts must be taken to maintain and expand functional gene databases relevant to classifying ''contaminant risk'' microorganisms. This could include maintaining a database of genes extracted from microorganisms that survive in high-risk environments and the construction of ortholog gene catalogs for target threat genes.
For example, there has been extensive research on physiological traits from Earth environments that are relevant for specific mission types, such as cold tolerance, endospore formation, UV tolerance, heat tolerance, radiation tolerance, etc., but PP-relevant data from these studies are curated in institutional databases and online repositories and have not been linked to PP compliance verification.
Current bioinformatic pipelines exist for spacecraft sequence data processing but require skilled computational biologists to operate. Streamlined reporting does exist for some metagenomic applications and could be adapted to analyze and report PP data. In addition, all workflows will need to be open access and transparent to allow for evaluation and reproducibility across a broad international scientific community.
In addition, sequence technologies that produce longer read lengths, such as Oxford Nanopore, should be qualified for use in low-biomass cleanroom environments since they are more likely to encode phylogenetically resolvable DNA (e.g., Pearman et al., 2020), and they may be less likely to represent background reagent contamination (e.g., Olm et al., 2017).
Deliverables for achieving this goal include a comprehensive molecular analysis pipeline to enable real-time implementation on NASA missions. This includes an endto-end systems level test with an easily executable pipeline and a graphical reporting interface. This system will depend on a curated database with known spacecraft assembly microorganisms and will be iteratively updated with de novo generated nucleic acid sequence data similar to the Catalog of Earth's Microbiomes (Nayfach et al., 2021).
For each mission, the reference database will have to be fixed to ensure standardization of protocols. The bioinformatics must enable the generation of metagenome-assembled genomes (MAGs) and resolve novel sequences. The computational analysis also requires background noise assessment, including full characterization of reagent negative controls and molecular analyses of reagent lot numbers to characterize the kitomes used in the analysis. A reagent contamination database may be needed to fully address kitome issues.

PP-relevant microorganism assessment
For quantitative microbial risk assessment modeling, profiling of genetic traits of microbial communities is more valuable than phylogenetic identification. For example, identifying the presence of radiation resistance traits is critical, either through identification of specific organisms and inferring survival potential or through identification of resistance-relevant genes that can exist in multiple organisms. Similarly, genes encoding physiological capabilities of interest (e.g., sporulation, anaerobiosis, desiccation resistance, extreme temperature resistance, and tolerance of other anticipated hostile spaceflight-relevant conditions) should be identified and included as PP-relevant genes in a risk analysis model. Danko et al. (2021) noted, however, that although certain gene functions are critical for survival during spaceflight, these genes are not necessarily restricted to organisms that are PP-relevant (e.g., DNA repair genes), and that bioinformatic-or culture-dependent analyses (where possible) should be performed to assess any given microorganism's resistance to radiation, desiccation, or other forces present in spaceflight.

RECOMMENDATION:
The computational resources need to include easy-to-use and revisable bioinformatics pipelines and output interfaces that meet the following requirements: (1) one-click bioinformatics analysis software solution for rapid assessment of PP-risk from metagenomic sequence data; (2) broad spectrum of PP-relevant microorganisms identified with defined confidence level analytics; (3) broad spectrum of space exploration-relevant functional gene content identified with defined confidence level analytics; (4) ability to generate metagenome-assembled genomes (MAGs) and enhance reference databases; (5) resolution framework for novel sequences; and (6) integrated noise reductions or background subtraction to compensate for contaminating nucleic acids from reagents.
Thus, although identification of PP-relevant genetic traits is critical, the detection of microbial taxa that are frequently identified in space programs should also be included in the relevant genetic feature list.
The abundance of genetic traits of concern is another important parameter for risk quantification, since risk scales with concentration. Despite the low biomass in these sample types and the elevated potential for background contamination, the effect of trace contamination in molecular analyses may be mitigated through highly targeted quantitative digital PCR assays and by demonstrating the potentially low impact on risk estimation of known reagent contaminants that do not have traits of PP relevance.
Workshop panelists noted that the International Space Station (ISS) microbial monitoring workflows, including molecular processing and cultivation already developed, can benefit PP policy and practices. Deliverables for the ISS monitoring goal include spacecraft PP risk models that can target relevant microorganisms (not all viable microorganisms) and an Earth-based extreme environment database of microorganisms, genomes, and functional genes. Similar databases highlighting microbial taxa relevant to planetary missions are also needed.

Recommendations and Outcomes from the Workshop
Panel experts agreed that there is a clear opportunity to incorporate molecular assays into the NASA PP efforts and identified a viable workflow to provide an unbiased assessment of the composition and gene content of microbial communities and quantitative analysis of bioburden. For missions in which forward and backward contamination are of concern, metagenome sequencing can provide allinclusive phylogenetic and functional characteristics of the microorganisms present and supplement current cultivationbased approaches.
When rapid and quantitative measurements are needed, quantitative digital PCR was strongly recommended. The two methods can work in tandem, with metagenomics informing the design and application of digital PCR assays (Fig. 1). NASA's Office of Planetary Protection should establish a set of requirements that adhere to and expand upon established policies for microbial detection, microbial load assessment, and risk. The current suite of assays (NSA, ATP, LAL) has established quantitative requirements in specific applications that define detection levels/limits and PP risk.
The same approach is needed to introduce new molecular assays. Of particular importance is establishing efficacy and limits of detections for each step, evaluating and miti-gating background nucleic acids reagent contamination. Collaboration with industry partners through NASA RFPs must be initiated to produce ultra-clean DNA-free reagents for DNA extraction, digital PCR, and library synthesis (e.g., sample collection devices, water, nucleic acid extraction reagents, oligonucleotides, PCR master mixes, preamplification reagents, library synthesis reagents, etc.). To control for and mitigate signal noise, the development of required workflows should incorporate PP-relevant whole cell microbial reference controls at appropriate cell numbers. DNA controls at low input levels are also needed. 1. Generate high-quality metagenome-assembled genomes (MAGs) from PP-relevant environmental samples. 2. Generate microbial genomes from isolated extremophilic microorganisms. 3. Establish reference databases of PP-relevant microorganisms and functional genes. 4. Establish a digital PCR assay library for PP-relevant microbial taxa and functional genes. 5. Develop a hybrid risk assessment model using metagenomics and quantitative digital PCR data.
It is important to consider timing when implementing new technologies into the NASA systems since there are many technical groups working in concert to accomplish critical tasks. The validation of new assays and workflows can take 4-5 years and will require a structured effort and adequate resources. To accomplish this, it will be necessary to act on the recommendations of the Committee on the Review of Planetary Protection Policy Development Processes of the SSB.
NASA should adequately fund both the Office of Planetary Protection and the research necessary to determine appropriate requirements for planetary bodies and to enable state-of-the-art PP techniques for monitoring and verifying compliance with these requirements. This should include periodic workshops to continuously evaluate technology development and implementation of new methods for bioburden assessment.
A significant amount of research has already been conducted by NASA in molecular and metagenomics analysis (Venkateswaran et al., 2012;Danko et al., 2021;Wood et al., 2021) and building on those existing initiatives could validate this novel PP molecular approach by 2026. The development of metagenomic technologies from the current research-based environmental applications to a mainstream spacecraft application would be a step forward in modernizing NASA's PP program. This would benefit all future missions to Mars and the Icy Worlds by increasing the detection of microbial taxa with forward contamination potential and by providing comprehensive data reliability leading to reduced false positives in sample return, and ultimately, increased chance of scientific mission success.
In collaboration with the various groups and programs already leveraging sequencing or PCR-based approaches, NASA's Office of Planetary Protection needs to establish a set of requirements that adhere to and expand upon established policies for microbial detection, microbial load assessment, and risk calculation, and incorporate them in NASA PP Policy documents.
The current suite of assays (NSA, ATP, LAL) has established set requirements that define detection levels/ limits and PP risk, and the same approach is needed to introduce a new molecular workflow and should not be delayed. The committee also recommends online maintenance of PP SOPs (i.e., wiki format) that allows for global access and transparency, version tracking, topic queries, and focused community involvement.