The Caenorhabditis elegans Female-Like State: Decoupling the Transcriptomic Effects of Aging and Sperm Status

Understanding genome and gene function in a whole organism requires us to fully comprehend the life cycle and the physiology of the organism in question. Caenorhabditis elegans XX animals are hermaphrodites that exhaust their sperm after 3 d of egg-laying. Even though C. elegans can live for many days after cessation of egg-laying, the molecular physiology of this state has not been as intensely studied as other parts of the life cycle, despite documented changes in behavior and metabolism. To study the effects of sperm depletion and aging of C. elegans during the first 6 d of adulthood, we measured the transcriptomes of first-day adult hermaphrodites and sixth-day sperm-depleted adults, and, at the same time points, mutant fog-2(lf) worms that have a feminized germline phenotype. We found that we could separate the effects of biological aging from sperm depletion. For a large subset of genes, young adult fog-2(lf) animals had the same gene expression changes as sperm-depleted sixth-day wild-type hermaphrodites, and these genes did not change expression when fog-2(lf) females reached the sixth day of adulthood. Taken together, this indicates that changing sperm status causes a change in the internal state of the worm, which we call the female-like state. Our data provide a high-quality picture of the changes that happen in global gene expression throughout the period of early aging in the worm.

microarrays, which are less accurate than RNA-seq, especially for genes expressed at low levels .
Here, we investigate what we argue is a distinct state in the C. elegans life cycle. Although C. elegans hermaphrodites emerge into adulthood replete with sperm, after $3 d of egg-laying the animals become spermdepleted and can only reproduce by mating. This marks a transition into what we define as the endogenous female-like state. This state is behaviorally distinguished by increased male-mating success (Garcia et al. 2007), which may be due to an increased attractiveness to males (Morsci et al. 2011). This increased attractiveness acts at least partially through production of volatile chemical cues (Leighton et al. 2014). These behavioral changes are also coincident with functional deterioration of the germline (Andux and Ellis 2008), muscle (Herndon et al. 2002), intestine (McGee et al. 2011), and nervous system (Liu et al. 2013), changes traditionally attributed to the aging process (Golden and Melov 2007).
To decouple the effects of aging and sperm loss, we devised a two-factor experiment. We examined wild-type XX animals at the beginning of adulthood (before worms contained embryos, referred to as first-day adults) and after sperm depletion (6 d after the last molt, which we term sixth-day adults). Second, we examined feminized XX animals that fail to produce sperm but are fully fertile if supplied with sperm by mating with males (see Figure 1). We used fog-2 null mutants to obtain feminized animals. fog-2 is involved in germ-cell sex determination in the hermaphrodite worm and is required for sperm production (Schedl and Kimble 1988;Clifford et al. 2000). C. elegans defective in sperm formation will emerge from the larval stage as female adults. As time moves forward, these spermless worms only exhibit changes related to biological aging. As a result, fog-2(lf) mutants should show fewer gene changes during the first 6 d of adulthood compared with their egg-laying counterparts that age and also transition from egg-laying into a sperm-depleted stage.
Here, we show that we can detect a transcriptional signature associated with loss of hermaphroditic sperm marking entrance into the endogenous female-like state. We can also detect changes associated specifically with biological aging. Biological aging causes transcriptomic changes consisting of 5592 genes in C. elegans. 4552 of these changes occurred in both genotypes we studied, indicating they do not depend on sperm status. To facilitate exploration of the data, we have generated a website where we have deposited additional graphics, as well as all of the code used to generate these analyses: https://wormlabcaltech. github.io/Angeles_Leighton_2016.

Strains
Strains were grown at 20°on NGM plates containing E. coli OP50. We used the laboratory C. elegans strain N2 as our wild-type strain (Sulston and Brenner 1974). We also used the N2 mutant strain JK574, which contains the fog-2(q71) allele, for our experiments.

RNA extraction
Synchronized worms were grown to either young adulthood or the sixth day of adulthood prior to RNA extraction. Synchronization and aging were carried out according to protocols described previously (Leighton et al. 2014). 1000-5000 worms from each replicate were rinsed into a microcentrifuge tube in S basal (5.85 g/liter NaCl, 1 g/liter K 2 HPO 4 ; 6 g/liter KH 2 PO 4 ), and then spun down at 14,000 rpm for 30 sec. The supernatant was removed and 1 ml of TRIzol was added. Worms were lysed by vortexing for 30 sec at room temperature and then 20 min at 4°. The TRIzol lysate was then spun down at 14,000 rpm for 10 min at 4°to allow removal of insoluble materials. Thereafter, the Ambion TRIzol protocol was followed to finish the RNA extraction (MAN0001271 rev. date: 13 Dec 2012). Three biological replicates were obtained for each genotype and each time point.
RNA-seq RNA integrity was assessed using an RNA 6000 Pico Kit for Bio-Analyzer (Agilent Technologies #5067-1513) and mRNA was isolated using a NEBNext Poly(A) mRNA Magnetic Isolation Module (New England Biolabs, NEB, #E7490). RNA-seq libraries were constructed using the NEBNext Ultra RNA Library Prep Kit for Illumina (NEB #E7530), following manufacturer's instructions. Briefly, mRNA isolated from $1 mg of total RNA was fragmented to the average size of 200 nt by incubating at 94°for 15 min in first-strand buffer, cDNA was synthesized using random primers and ProtoScript II Reverse Transcriptase, followed by second-strand synthesis using Second Strand Synthesis Enzyme Mix (NEB). Resulting DNA fragments were end-repaired, dA-tailed and ligated to NEBNext hairpin adaptors (NEB #E7335). After ligation, adaptors were converted to the 'Y' shape by treating with USER enzyme, and DNA fragments were size-selected using Agencourt AMPure XP beads (Beckman Coulter #A63880) to generate fragment sizes between 250 and 350 bp. Adaptor-ligated DNA was PCR amplified, followed by AMPure XP bead clean-up. Libraries were quantified with Qubit dsDNA HS Kit (ThermoFisher Scientific #Q32854) and the size distribution was confirmed with High Sensitivity DNA Kit for Bioanalyzer (Agilent Technologies #5067-4626). Libraries were sequenced on Illumina HiSeq2500 in single-read mode with a read length of 50 nt, following manufacturer's instructions. Base calls were performed with RTA 1.13.48.0 followed by conversion to FASTQ with bcl2fastq 1.8.4.

Statistical analysis
RNA-seq analysis: RNA-seq alignment was performed using Kallisto (Bray et al. 2016) with 200 bootstraps. Kallisto was run in single-end Figure 1 Experimental design to identify genes associated with sperm loss and aging. Studying the wild-type worm alone would measure time-and sperm-related changes at the same time, without allowing us to separate these changes. Studying the wild-type worm and a fog-2(lf) mutant would enable us to measure sperm-related changes but not time-related changes. By mixing both designs, we can measure and separate both modules. read mode, setting the average fragment length of 200bp, and a standard deviation of 60bp for all samples. Differential expression analysis was performed using Sleuth (Pimentel et al. 2016). The following general linear model (GLM) was fitted: where y i is the TPM count for the ith gene; b 0;i is the intercept for the ith gene; b X;i is the regression coefficient for variable X for the ith gene; A is a binary age variable indicating first-day adult (0) or sixthday adult (1); G is the genotype variable indicating wild type (0) or fog-2(lf) (1); and b A::G;i refers to the regression coefficient accounting for the interaction between the age and genotype variables in the ith gene. Genes were called significant if the FDR-adjusted q-value for any regression coefficient was ,0.1. Our script for differential analysis is available on GitHub.
Regression coefficients and TPM counts were processed using Python 3.5 in a Jupyter Notebook (Pérez and Granger 2007). Data analysis was performed using the Pandas, NumPy and SciPy libraries (McKinney 2011;Van Der Walt et al. 2011;Oliphant 2007). Graphics were created using the Matplotlib and Seaborn libraries (Waskom et al. 2016;Hunter 2007). Interactive graphics were generated using Bokeh (Bokeh Development Team 2014).
Tissue, phenotype, and gene ontology enrichment analyses (TEA, PEA, and GEA, respectively) were performed using the WormBase Enrichment Suite for Python (Angeles-Albores et al. 2016, 2017a. Briefly, the WormBase Enrichment Suite accepts a list of genes and identifies the terms with which these genes are annotated. Terms are annotated by frequency of occurrence, and the probability that a term appears at this frequency under random sampling is calculated using a hypergeometric probability distribution. The hypergeometric probability distribution is extremely sensitive to deviations from the null Figure 2 (A) Differentially expressed isoforms in the aging category. We identified a common aging expression signature between N2 and fog-2(lf) animals, consisting of 6193 differentially expressed isoforms totaling 5592 genes. The volcano plot is randomly down-sampled 30% for ease of viewing. Each point represents an individual isoform. b Aging is the regression coefficient. Larger magnitudes of b indicate a larger log-fold change. The y-axis shows the negative logarithm of the q-values for each point. Green points are differentially expressed isoforms; orange points are differentially expressed isoforms of predicted transcription factor genes (Reece-Hoyes et al. 2005). An interactive version of this graph can be found on our website. (B) Enriched tissues in agingassociated genes. Tissue enrichment analysis (Angeles-Albores et al. 2016) showed that genes associated with muscle tissues and the nervous system are enriched in aging-related genes. Only statistically significantly enriched tissues are shown. Enrichment fold change is defined as Observed=Expected: hmc, head mesodermal cell. distribution, which allows it to identify even small deviations from the null.

Data availability
Strains are available from the Caenorhabditis Genetics Center. All of the data and scripts pertinent to this project, except the raw reads, can be found on our Github repository https://github.com/WormLabCaltech/ Angeles_Leighton_2016. Supplementary Material, File S1 contains the list of genes that were altered in aging regardless of genotype. File S2 contains the list of genes and their associations with the fog-2(lf) phenotype. File S3 contains genes associated with the female-like state. Raw reads were deposited to the Sequence Read Archive under the accession code SUB2457229.

RESULTS AND DISCUSSION
Decoupling time-dependent effects from sperm-status via GLMs In order to decouple time-dependent effects from changes associated with loss of hermaphroditic sperm, we measured wild-type and fog-2(lf) adults at the first-day adult stage (before visible embryos were present) and sixth-day adult stage, when all wild-type hermaphrodites have laid all their eggs (see Figure 1), but mortality is still low (,10%) (Stroustrup et al. 2013). We obtained 16-19 million reads mappable to the C. elegans genome per biological replicate, which enabled us to identify 14,702 individual genes totaling 21,143 isoforms (see Figure 2A).
One way to analyze the data from this two-factor design is by pairwise comparison of the distinct states. However, such an analysis would not make full use of all the statistical power afforded by this experiment. Another method that makes full use of the information in our experiment is to perform a linear regression in three dimensions (two independent variables, age and genotype, and one output). A linear regression with one parameter (age, for example) would fit a line between expression data for young and old animals. When a second parameter is added to the linear regression, said parameter can be visualized as altering the y-intercept, but not the slope, of the first line (see Figure 3A). When the variables are categorical, as in this case, this model is called a General Linear Model (GLM).
Although a simple linear model is often useful, sometimes it is not appropriate to assume that the two variables under study are entirely independent. For example, in our case, three out of the four time point/ genotype combinations we studied did not have sperm, and sperm status is associated with both the fog-2(lf) self-sterile phenotype and biological age of the wild-type animal. One way to statistically model such correlation between variables is to add an interaction term to the linear regression. This interaction term allows extra flexibility in describing how changes occur between conditions. For example, suppose a given theoretical gene X has expression levels that increase in a fog-2-dependent manner, but also increase in an age-dependent manner. However, aged fog-2(lf) animals do not have the expression levels of X that would be expected from adding the effect of the two perturbations; instead, the expression levels of X in this animal are considerably above what is expected. In this case, we could add a positive interaction coefficient to the model to explain the effect of genotype on the y-intercept as well as the slope (see Figure 3B). When the two perturbations affect a single genetic pathway, these interactions can be interpreted as epistatic interactions.
For these reasons, we used a GLM with interactions to identify a transcriptomic profile associated with the fog-2(lf) genotype independently of age, as well as a transcriptomic profile of C. elegans aging common to both genotypes. The change associated with each variable is referred to as b; this number, although related to the natural logarithm of the fold change, is not equal to it. However, it is true that larger magnitudes of b indicate greater change. Thus, for each gene we performed a linear regression, and we evaluated whether the b values associated with each coefficient were significantly different from 0 via a Wald test corrected for multiple hypothesis testing. A coefficient was considered to be significantly different from 0 if the q-value associated with it was ,0.1. The expression level of a hypothetical gene increases by the same amount as worms age, regardless of genotype. However, fog-2(lf) has higher expression of this gene than the wild type at all stages (blue arrow). (B) A linear regression with two variables and an interaction term. In this example, the expression level of this hypothetical gene is different between wild-type worms and fog-2(lf) (blue arrow). Although the expression level of this gene increases with age, the slope is different between wild type and fog-2(lf). The difference in the slope can be accounted for through an interaction coefficient (red arrow).
A quarter of all genes change expression between the 1st day of adulthood and the 6th day of adulthood in C. elegans We identified a transcriptomic signature consisting of 5592 genes that were differentially expressed in sixth-day adult animals of either genotype relative to first-day adult animals (see File S2). This constitutes slightly less than a quarter of the genes in C. elegans. TEA (Angeles-Albores et al. 2016) showed that nervous tissues, including the "nerve ring", "dorsal nerve cord", "PVD", and "labial sensillum", were enriched in genes that become differentially expressed through aging. Likewise, certain muscle groups ("anal depressor muscle" and "intestinal muscle") were enriched (see Figure 2B). GEA (Angeles-Albores et al. 2017a) revealed that genes that were differentially expressed during the course of aging were enriched in terms involving respiration ("respiratory chain" and "oxoacid metabolic process"); translation ("cytosolic large ribosomal subunit"); and nucleotide metabolism ("purine nucleotide", "nucleoside phosphate", and "ribose phosphate"). PEA (Angeles-Albores et al. 2017a) showed that this gene list was associated with phenotypes that affect the C. elegans gonad, including "gonad vesiculated", "gonad small", "oocytes lack nucleus", and "rachis narrow".
To verify the quality of our dataset, we generated a list of 1056 goldstandard genes expected to be altered in sixth-day adult worms using previous literature reports, including downstream genes of daf-12, daf-16, and aging and lifespan extension datasets (Murphy et al. 2003;Halaschek-Wiener et al. 2005;Lund et al. 2002;McCormick et al. 2012;Eckley et al. 2013). Of 1056 standard genes, we found 506 genes in our time-responsive dataset. This result was statistically significant, with a p-value ,10 -38 .
Next, we used a published compendium (Reece-Hoyes et al. 2005) to search for known or predicted transcription factors. We found 145 transcription factors in the set of genes with differential expression in aging nematodes. We subjected this list of transcription factors to TEA to understand their expression patterns. Six of these transcription factors were expressed in the "hermaphrodite specific neuron", a neuron physiologically relevant for egg-laying (hlh-14, sem-4, ceh-20, egl-46, ceh-13, hlh-3), which represented a statistically significant twofold enrichment Figure 4 fog-2(lf) partially phenocopies early aging in C. elegans. The b on each axis is the regression coefficient from the GLM, and can be loosely interpreted as an estimator of the log-fold change. Loss of fog-2 is associated with a transcriptomic phenotype involving 1881 genes. 1040/1881 of these genes are also altered in wild-type worms as they progress from young adulthood to old adulthood, and 905 change in the same direction. However, progression from young to old adulthood in a fog-2(lf) background results in no change in the expression level of these genes. (A) We identified genes that change similarly during feminization and aging. The correlation between feminization and aging is almost 1:1. (B) Epistasis plot of aging vs. feminization. Epistasis plots indicate whether two genes (or perturbations) act on the same pathway. When two effects act on the same pathway, this is reflected by a slope of -0.5. The measured slope was -0.51 6 0.01. of this tissue (q ,10 -1 ). The term "head muscle" was also overrepresented at twice the expected level (q ,10 -1 , 13 genes).
The fog-2(lf) expression signature overlaps significantly with the aging signature Of the 1881 genes that we identified in the fog-2(lf) signature, 1040 genes were also identified in our aging set. Moreover, of these 1040 genes, 905 genes changed in the same direction in response to either aging or germline feminization. The overlap between these signatures suggests an interplay between sperm status and age. The nature of the interplay should be captured by the interaction coefficients in our model. There are four possibilities. First, the fog-2(lf) worms may have a fast-aging phenotype, in which case the interaction coefficients should match the sign of the aging coefficient. Second, the fog-2(lf) worms may have a slow-aging phenotype, in which case the interaction coefficients should have an interaction coefficient that is of opposite sign, but not greater in magnitude than the aging coefficient (if a gene increases in aging in a wild-type worm, it should still increase in a fog-2(lf) worm, albeit to a lesser extent). Third, the fog-2(lf) worms may exhibit a rejuvenation phenotype. If this is the case, then these genes should have an interaction coefficient that is of opposite sign and greater magnitude than their aging coefficient, such that the change of these genes in fog-2(lf) mutant worms is reversed relative to the wild type. Finally, if these genes are indicative of a female-like state, then these genes should not change with age in fog-2(lf) animals, since these animals do not exit this state during the course of the experiment. Moreover, because wild-type worms become female as they age, a further requirement for a transcriptomic signature of the female-like state is that aging coefficients for genes in this signature should have genotype coefficients of equal sign and magnitude. In other words, entrance into the female-like state should be not be path-dependent.
To evaluate which of these possibilities was most likely, we selected the 1040 genes that had aging, genotype, and interaction coefficients significantly different from zero, and we plotted their temporal coefficients against their genotype coefficients (see Figure 4A and Figure 5). We observed that the aging coefficients were strongly predictive of the genotype coefficients. Most of these genes fell near the line y = x, suggesting that these genes define a female-like state.
We considered how loss-of-function of fog-2 and aging could both interact to cause entry into this state. We reasoned that a plausible mechanism is that fog-2 promotes sperm production, and aging promotes sperm depletion. This simple pathway model suggests that a double perturbation consisting of aging and loss of function of fog-2 should show nonadditivity of phenotypes (generalized epistasis). To test whether these two perturbations deviate from additivity, we generated an epistasis plot using this gene set. We have previously used epistasis plots to measure transcriptome-wide epistasis between genes in a pathway (Angeles-Albores et al. 2017b). Briefly, an epistasis plot shows the expected expression of a double perturbation under an additive model (null model) on the x-axis, and the deviation from this null model in the y-axis. In other words, we calculated the x-coordinates for each point by adding b Genotype þ b Aging ; and the y-coordinates are equal to b Interaction for each isoform. We have previously shown that if two genes or perturbations act within a linear pathway, an epistasis plot will generate a line with slope equal to -0.5. When we generated an epistasis plot and found the line of best fit, we observed a slope of -0.51 6 0.01, which suggests that the fog-2 gene and time are acting to generate a single transcriptomic phenotype along a single pathway. Overall, we identified 405 genes that changed in the same direction through age or mutation of the fog-2(lf) gene, and that had an interaction coefficient of opposite sign to the aging or genotype coefficient (see File S4). Taken together, these observations suggest that these 405 genes define a female-like state in C. elegans.
Analysis of the female-like state expression signature To better understand the changes that happen after sperm loss, we performed TEA, PEA, and GEA on the set of 405 genes that we associated with the female-like state (see Fig. 5). TEA showed no tissue enrichment using this gene set. GEA showed that this gene list was enriched in constituents of the ribosomal subunits almost four times above background (q ,10 -5 , 17 genes). The enrichment of ribosomal constituents in this gene set in turn drives the enriched phenotypes: "avoids bacterial lawn", "diplotene absent during oogenesis", "gonad vesiculated", "pachytene progression during oogenesis variant", and "rachis narrow." The expression of most of these ribosomal subunits is downregulated in aged animals or in fog-2(lf) mutants.

DISCUSSION
Defining an early aging phenotype Our experimental design enables us to decouple the effects of egg-laying from aging. As a result, we identified a set of almost 4000 genes that are altered similarly between wild-type and fog-2(lf) mutants. Owing to the read depth of our transcriptomic data (20 million reads) and the num-ber of samples measured (three biological replicates for four different life stages/genotypes), this dataset constitutes a high-quality description of the transcriptomic changes that occur in aging populations of C. elegans. Although our data only capture $50% of the expression changes reported in earlier aging transcriptome literature, this disagreement can be explained by a difference in methodology; earlier publications typically addressed the aging of fertile wild-type hermaphrodites only indirectly, or queried aging animals at a much later stage of their life cycle.

GLMs enable epistasis measurements
We set out to study the self-fertilizing (hermaphroditic) to self-sterile (female-like) transition by comparing wild-type animals with fog-2(lf) mutants as they aged. Our computational approach enabled us to distinguish between two biological processes that are correlated within samples. Because of this intrasample correlation, identifying this state via pairwise comparisons would not have been straightforward. Although it is a favored method among biologists, such pairwise comparisons suffer from a number of drawbacks. First, pairwise comparisons are unable to draw on the full statistical power available to an experiment because they discard almost all information except the samples being compared. Second, pairwise comparisons require a researcher to define a priori which comparisons are informative. For experiments with many variables, the number of pairwise combinations is explosively large. Indeed, even for this two-factor experiment, there are six possible pairwise comparisons. On the other hand, by specifying a linear regression model, each gene can be summarized with three variables, each of which can be analyzed and understood without the need to resort to further pairwise combinations. Figure 6 (A) A substrate-dependent model showing how fog-2 promotes sperm generation, whereas aging promotes sperm depletion, leading to entry into the female-like state. Such a model can explain why fog-2 and aging appear epistatic to each other. (B) The complete C. elegans life cycle. Recognized stages of C. elegans are marked by black arrows. States are marked by red arrows to emphasize that at the end of a state, the worm returns to the developmental time point it was at before entering the state. The L2d state is an exception. It is the only stage that does not return to the same developmental time point; rather, the L2d state is a permissive state that allows entry into either dauer or the L3 stage. We have presented evidence of a female-like state in C. elegans. At this point, it is unclear whether the difference between hermaphrodites and females is reversible by males. Therefore, it remains unclear whether it is a stage or a true state.
The C. elegans female-like state Our explorations have shown that the loss of fog-2(lf) partially phenocopies the transcriptional events that occur naturally as C. elegans ages from the first day of adulthood to the sixth day of adulthood. Moreover, epistasis analysis of these perturbations suggests that they act on the same pathway, namely sperm generation and depletion (see Figure 6). Self-sperm generation promotes the hermaphrodite state, whereas sperm depletion marks entry into the female-like state. Given the enrichment of neuronal transcription factors that are associated with sperm loss in our dataset, we believe this dataset should contain some of the transcriptomic modules that are involved in these pheromone production and behavioral pathways, although we have been unable to find these genes.
Behavioral and physiological changes upon mating are not unknown in other species. In particular, in the fruit fly Drosophila melanogaster, a sex peptide present in the male seminal fluid is known to drive changes in gene expression (Liu and Kubli 2003;Xue and Noll 2000;Avila et al. 2011;Heifetz et al. 2014;Rezával et al. 2014;Mack et al. 2006) as well as behavior. More recently, sperm was found to be necessary to drive changes in aggression in the fruit fly (Bath et al. 2017). These changes are often reversible upon the disappearance of seminal fluid or sperm. In the case of C. elegans, we have observed that sperm loss is associated with gene expression changes that probably reflect physiological changes in the worm. Our experimental design did not include a test for reversibility of these changes. The possibility of a rescue experiment with males raises interesting possibilities. What fraction of the changes observed upon loss of self-sperm are reversible? Does male seminal fluid or male sperm cause changes beyond rescue?
The C. elegans life cycle, life stages and life states C. elegans has a complicated life cycle, with two alternative developmental pathways that have multiple stages (larval development and dauer development), followed by reproductive adulthood. In addition to these developmental stages, researchers have recognized that C. elegans has numerous life states that it can enter into when given instructive environmental cues. One such state is the L1 arrest state, where development ceases entirely upon starvation (Johnson et al. 1984;Baugh and Sternberg 2006). More recently, researchers have described additional diapause states that the worm can access at the L3, L4, and young adult stages under conditions of low food (Angelo and Gilst 2009;Seidel and Kimble 2011;Schindler et al. 2014). Not all states of C. elegans are arrested, however (see Figure 6). For example, the L2d state is induced by crowded and nutrient-poor conditions (Golden and Riddle 1984). While in this state, the worm is capable of entry into either dauer or the L3 larval stage, depending on environmental conditions. Thus, the L2d state is a permissive state, and marks the point at which the nematode development is committed to a single developmental pathway.
Identification of the C. elegans life states has often been performed by morphological studies (as in the course of L4 arrest or L2d) or via time courses (L1 arrest). However, not all states may be visually identifiable, or, even if they are, the morphological changes may be very subtle, making positive identification difficult. However, the detailed information afforded by a transcriptome should in theory provide sufficient information to definitively identify a state, since transcriptomic information underlies morphology. Moreover, transcriptomics can provide an insight into the physiology of complex metazoan life states. By identifying differentially expressed genes and using ontology enrichment analyses to identify gene functions, sites of expression, or phenotypes that are enriched in a given gene set, we can obtain a clear picture of the changes that occur in the worm analogous to identifying gross morphological changes.
RNA-seq is a powerful technology that has been used successfully in the past as a qualitative tool for target acquisition, although recent work has also used RNA-seq to measure genetic interactions via epistasis (Dixit et al. 2016;Angeles-Albores et al. 2017b). Here, we have shown that whole-organism RNA-seq data can also be used to successfully identify internal states in a multicellular organism.