Extensive Epitranscriptomic Methylation of A and C Residues on Murine Leukemia Virus Transcripts Enhances Viral Gene Expression

The data presented in the present study demonstrate that MLV RNAs bear an exceptionally high level of the epitranscriptomic modifications m6A, m5C, and Nm, suggesting that these each facilitate some aspect of the viral replication cycle. Consistent with this hypothesis, we demonstrate that mutational removal of a subset of these m6A or m5C modifications from MLV transcripts inhibits MLV replication in cis, and a similar result was also observed upon manipulation of the level of expression of key cellular epitranscriptomic cofactors in trans. Together, these results argue that the addition of several different epitranscriptomic modifications to viral transcripts stimulates viral gene expression and suggest that MLV has therefore evolved to maximize the level of these modifications that are added to viral RNAs.

identified as a nuclear heterotrimer, consisting of the proteins METTL3, METTL14, and WTAP, that adds m 6 A to mRNA sites bearing the consensus sequence 5=-RA*C-3=, where R is a purine (1)(2)(3). Once added, m 6 A can be recognized by several "readers," including the nuclear YTHDC1 and cytoplasmic YTHDF2 proteins, which then regulate the splicing, translation and/or stability of that mRNA. Less is known about the m 5 C modification, although NSUN2 has been shown to add m 5 C to a few cellular mRNAs (4-7), and we have recently identified NSUN2 as the primary writer of m 5 C residues on the HIV-1 genome (8).
Previously, we reported that m 6 A residues enhance viral gene expression and replication for HIV-1, influenza A virus, and the polyomavirus simian virus 40 (SV40) (9)(10)(11), and others have also reported that m 6 A residues promote HIV-1 and enterovirus 71 replication (12,13) and play a role in the activation of lytic replication in Kaposi's sarcoma herpesvirus (KSHV)-infected cells (14,15). More recently, we reported that addition of m 5 C also enhances HIV-1 gene expression (8), and others have reported that Nm modifications on HIV-1 transcripts promote HIV-1 replication by inhibiting the detection of viral transcripts by the innate antiviral RNA sensor MDA5 (16). Consistent with a positive role for these epitranscriptomic modifications in the regulation of viral replication, we recently reported that HIV-1 genomic RNA (gRNA) bears a far higher level of m 6 A, m 5 C, and Nm residues than does the average cellular mRNA (8). Here, we extend these earlier findings by demonstrating that the addition of both m 6 A and m 5 C independently upregulates murine leukemia virus (MLV) gene expression and replication. Moreover, we further show that MLV gRNAs are also extensively epitrancriptomically modified, with m 6 A, m 5 C, and Nm residues all detected at levels that range from ϳ9to Ͼ20-fold higher than observed on cellular poly(A) ϩ RNA. Together, these observations are consistent with the hypothesis that sites of epitranscriptomic modification on viral mRNAs are under positive selection and suggest that many viruses may have independently evolved to use epitranscriptomic gene regulation as a mechanism to promote their replication and, hence, pathogenesis.

RESULTS
Extensive epitranscriptomic modification of the MLV RNA genome. The initial goal of this project was to quantify the epitranscriptomic RNA modifications present on MLV gRNA using ultrahigh-performance liquid chromatography and tandem mass spectrometry (UPLC-MS/MS) (8,17). This required the purification of the gRNA away from cellular tRNAs and other noncoding RNAs (ncRNAs) that are heavily modified. MLV virions released into the supernatant media from 3T3 cells infected with MLV derived from the pNCS proviral plasmid (18) were pelleted through a 20% sucrose cushion followed by banding on a 7.2 to 20% iodixanol gradient, which separates retroviral virions from cellular exosomes and debris (19). Because MLV virions contain high levels of tRNAs, 7SL RNA, and other cellular ncRNAs (20), isolation of MLV virion particles is necessary but not sufficient to yield pure MLV gRNA. Therefore, we next isolated total virion RNA, denatured it in urea loading dye and then size fractionated the RNA by electrophoresis on a 1.5% Tris-borate-EDTA (TBE) preparative agarose gel. The ϳ8-kb MLV gRNA was then visualized and excised. This procedure was performed in triplicate to yield three independent MLV gRNA preparations.
To assess the purity of the MLV gRNA preparations, we performed transcriptome sequencing (RNA-seq) and then aligned the reads obtained first to the mouse genome and then to the MLV genome. As shown in Fig. 1A, 99.48% of the reads obtained from gRNA preparation 1 aligned to the MLV genome, while only 0.52% aligned to the mouse genome, and closely similar results were obtained for MLV gRNA preparations 2 and 3. Within the average 0.49% of reads that mapped to the mouse genome, ϳ0.34% aligned to murine mRNAs, while ϳ0.17% aligned to ncRNAs. Of these, Ͻ0.001% aligned to tRNAs, which are highly epitranscriptomically modified, thus excluding cellular RNA contamination as a significant source of modified ribonucleotides. While these data also revealed that the MLV-specific reads obtained aligned to the entire MLV genome (Fig. 1B), we did observe consistent differences in the observed signal across the genome. This result, which has also been observed when purified HIV-1 gRNA was analyzed using the same RNA-seq technique (21), likely reflects differences in the efficiency of linker ligation to the fragmented gRNA and/or PCR amplification of different viral sequences. We next quantified the precise level of several epitranscriptomic modifications on the MLV RNA genome using UPLC-MS/MS, as previously described (17). Quantification of the level of multiple epitranscriptomic modifications across the three MLV gRNA preparations revealed a very high level of reproducibility ( Fig. 1C and D). These data also revealed a particularly high level of m 6 A (ϳ51 m 6 A residues per gRNA) and high levels of 2=O-methylated adenosine (Am), guanosine (Gm), and cytosine (Cm) (ϳ27, ϳ25, and ϳ18 residues, respectively, per gRNA), as well as a high level of m 5 C (ϳ13 residues per gRNA). These levels are Methylation of A and C Residues on MLV ® considerably higher than previously reported for cellular poly(A) ϩ RNA (2,8,22) and quantified here for highly purified poly(A) ϩ RNA isolated from the producer 3T3 cell line. Specifically, the level of m 6 A observed on 3T3 poly(A) ϩ RNA is ϳ0.24% of "A" residues, while ϳ2.4% of all "A" residues in the MLV genome are m 6 A (Fig. 1D). It was also found that ϳ0.06% of all "C" residues in 3T3 poly(A) ϩ RNA are modified to m 5 C, compared to ϳ0.54% of "C" residues in the MLV gRNA (Fig. 1D). Similarly, Am, Gm, and Cm represent from 0.74 to 1.25% of their cognate residue in the MLV genome, while we found that these 2=O-Me-modified nucleotides each represent Ͻ0.1% of the A, G, and C residues present on 3T3 poly(A) ϩ RNA. We therefore conclude that m 6 A, m 5 C, Am, Gm, and Cm are all highly overrepresented on MLV gRNAs compared to the average cellular mRNA.
While our data identify the five epitranscriptomic modifications listed above as unusually prevalent on MLV gRNA, with Ͼ10 modified residues of each per gRNA, we also detected several other modified nucleotides at levels ranging from ϳ2 to ϳ4 residues per MLV genome ( Fig. 1C and D). In the case of 1-methylguanosine (m 1 G) and 7-methylguanosine (m 7 G), the observed levels in the MLV genome are ϳ10-fold higher than the very low levels detected on 3T3 poly(A) ϩ RNA. Similarly, 1-methyladenosine (m 1 A) was detected at an ϳ20-fold higher level on MLV gRNAs than on cellular poly(A) ϩ RNA (0.20% versus 0.01%). While these three modified residues are all present at low levels on MLV gRNAs (Fig. 1C), we note that internal m 7 G residues have recently been proposed to facilitate the translation of cellular mRNAs (23), thus not only implying that even low levels of a given epitranscriptomic modification can exert a phenotypic effect but also providing a possible explanation for the overrepresentation of m 7 G residues on the MLV genome. While we did not detect any epitranscriptomically modified "U" residues, such as 2=O-Me-uridine (Um) or pseudouridine, we note that the UPLC-MS/MS method is less sensitive for detecting modified uridines at low concentrations of input RNA, as in this case, due to the comparatively inefficient ionization of uridine compared to other ribonucleosides. Nevertheless, our data do suggest that levels of Um in the MLV gRNA are Ͻ1nM, which equates to Ͻ20 Um residues per MLV gRNA.
Mapping of m 6 A and m 5 C residues on the MLV RNA genome. Having demonstrated that MLV gRNAs bear a substantial number of m 6 A and m 5 C residues, we wished to map these residues not only on the gRNA but also on MLV RNAs expressed in infected 3T3 cells. For this purpose, we infected 3T3 cells with MLV virions rescued from the pNCS proviral clone (18) and, at 48 h postinfection (hpi), pulsed the cells with the highly photoreactive uridine analog 4-thiouridine (4SU) for a further 24 h (24). We then isolated MLV virions, as described above, from the supernatant media of MLV-infected 3T3 cells and purified total virion RNA. In parallel, we also harvested total RNA from MLV-infected 3T3 cells and subjected this RNA to a single round of poly(A) ϩ isolation to enrich for mRNAs. The MLV virion and MLV-infected cell RNA preparations were then subjected to the previously described PA-m 6 A-seq or PA-m 5 C-seq procedures (8,25). Briefly, the purified 4SU-labeled RNAs were incubated with an antibody specific for either m 6 A or m 5 C and then UV irradiated to cross-link the antibody to the bound site. The resultant RNA-protein complexes were then incubated with RNase T 1 , to remove unbound RNA, and the bound ϳ20-nucleotide (nt) RNA fragments recovered, converted to cDNA, and subjected to deep sequencing. As may be observed in Fig. 2A, we detected several m 6 A peaks on the MLV gRNA, almost all of which were also observed on the MLV transcripts expressed in infected 3T3 cells. Consistent with the fact that all MLV virion RNAs are genome length, while ϳ50% of the MLV transcripts expressed in infected cells are env mRNAs that have been spliced to remove the Gag and Pol open reading frames, we detected a 2-to 3-fold-lower level of m 6 A binding sites in the MLV gag and pol genes compared to the env gene and long terminal repeat (LTR), in the intracellular RNA sample ( Fig. 2A). While our UPLC-MS/MS data indicate that each MLV gRNA contains ϳ51 m 6 A residues (Fig. 1D), we only detected ϳ20 major m 6 A sites using the PA-m 6 A-seq technique. While the reasons for this discrepancy are unclear, we note that it has been recently reported that m 6 A residues embedded in duplex RNA are not readily detected by m 6 A-specific antibodies (26). Nevertheless, this discrepancy does suggest that the m 6 A sites that were detected on the MLV RNA genome are likely to be heavily modified. We note that retroviral gRNAs, which are packaged as homodimers, are not thought to function as mRNAs in infected cells (27), and we therefore caution that, while the very similar localization of mapped m 6 A residues on MLV gRNAs and intracellular MLV mRNAs is certainly consistent with a comparable level of epitranscriptomic modification of these two viral RNA pools, this remains to be clearly demonstrated.
The similar PA-m 5 C-seq technique was used to map sites of m 5 C modification on the MLV genome and, as expected, revealed minimal overlap with the mapped m 6 A sites (Fig. 2B). We again detected a higher level of antibody binding in the gag/pol region of the MLV genome in the virion-derived RNA sample than in the intracellular MLV RNA, although this varied somewhat by peak. In contrast to the PA-m 6 A-seq data, which identified somewhat fewer m 6 A modification sites on the MLV gRNA than predicted by the UPLC-MS/MS data, the PA-m 5 C-seq data detected ϳ40 m 5 C sites on the MLV gRNA ( Fig. 2B), which is more than the ϳ13 sites predicted by the UPLC-MS/MS technique (Fig. 1D), suggesting that most of these m 5 C sites are likely to be only partially modified.
Epitranscriptomic addition of m 6 A and m 5 C facilitates MLV gene expression. One way to test whether the addition of m 6 A or m 5 C has any effect on MLV gene expression and replication is to mutate the locations of these modifications by, for example, changing mapped m 6 A residues to "G" residues and mapped m 5 C residues to "U" residues. In the case of m 6 A, the modified "A" is found in the context of the sequence 5=-RA*C-3=, where R is a purine (1), so modified "A" residues are easier to identify within the ϳ20-nt peaks mapped in Fig. 2A. Nevertheless, many peaks do contain more than one 5=-RA*C-3= consensus sequence. In contrast, m 5 C modifications on mRNAs do not occur in a sequence consensus (8), so there are generally multiple "C" residues within each of the m 5 C peaks mapped in Fig. 2B. This complicates the design of MLV mutants lacking specific m 6 A or m 5 C sites, as interpretation of the results obtained requires that all the introduced mutations are silent, which in practice means located in the wobble position of codons. In the case of m 6 A, it proved impossible to design silent mutations that would ablate most of the m 6 A sites mapped in Fig. 2A, though this was possible for all three of the 5=-RA*C-3= motifs found in one of the most prominent peaks, in the MLV env gene, to generate the MLV-Δm 6 A mutant (indicated by the symbol "#" in Fig. 3A, lane 1). In the case of m 5 C, we were fortunate that four prominent m 5 C peaks located in the MLV pol To confirm that these introduced mutations indeed resulted in the loss of the predicted modified residues, we rescued the pNCS-based MLV-Δm 6 A and MLV-Δm 5 C mutants by transfection into 293T cells followed by culture in susceptible 3T3 cells. We then performed PA-m 6 A-seq and PA-m 5 C-seq using intracellular RNA preparations derived from 4SU-pulsed 3T3 cells infected with wild-type MLV or with the MLV-Δm 6 A or MLV-Δm5C mutants. As shown in Fig. 3A (upper two panels), the mutations introduced into the MLV-Δm 6 A mutant, indicated by "#," resulted in the precise loss of the predicted major m 6 A peak, while other peaks were unaffected. In the case of the MLV-Δm 5 C mutant, the four introduced mutations not only ablated the four targeted peaks in the MLV pol gene (indicated by "#" in the lower two panels of Fig. 3A) but also seemed to result in an overall reduction in m 5 C addition, even at sites that were not altered. The reasons for this effect are not clear, but it could imply that m 5 C addition to RNAs, unlike m 6 A addition, is in some way cooperative.
Next, we assessed whether the introduced mutations affected MLV gene expression and/or replication. For this purpose, we transfected 293T cells with the wild-type MLV proviral expression vector pNCS or with the pNCS-Δm 6 A or pNCS-Δm 5 C mutant plasmids. While we observed a modest, ϳ30% reduction in the level of MLV Gag production in the 293T cells transfected with either MLV mutant plasmid (Fig. 3B), similar levels of MLV gRNA expression were observed (Fig. 3C). We note that transfection of 293T cells results in massive overexpression of the MLV genome that might tend to hide any minor differences in MLV RNA expression that result from the mutational inactivation of a subset of the mapped m 6 A and m 5 C sites. Analysis of the supernatant media revealed that the MLV-Δm 6 A mutant released 2-to 3-fold less MLV Gag protein into the supernatant media, and 293T cells transfected with the MLV-Δm 5 C mutant also released ϳ2-fold less MLV Gag protein (Fig. 3D). One possibility we considered is that the mutations present in the MLV-Δm 6 A or MLV-Δ m 5 C mutant might affect gRNA packaging into MLV virion particles. To address this, we performed quantitative reverse transcription-PCR (qRT-PCR) to measure the level of MLV gRNA isolated from the supernatant media of the transfected 293T cells and then normalized these data by qRT-PCR analysis of the level of cellular 7SL RNA, which is known to also be selectively packaged into MLV virions (20). As shown in Fig. 3E, this analysis did not reveal any reduced packaging into virions of the MLV gRNA produced by the MLV-Δm 6 A or MLV-Δm 5 C mutants, although clearly fewer virions were released (Fig. 3D).
Next, we normalized the MLV-containing supernatants obtained from the transfected 293T cells, using the MLV Gag quantitations shown in Fig. 3D and then used equal amounts of each MLV variant to infect susceptible 3T3 cells. At 72 hpi, which should equate to ϳ3 MLV replication cycles, we harvested the infected cells and analyzed MLV Gag protein expression (Fig. 3F) and gRNA expression (Fig. 3G). As may be observed, we detected a statistically significant reduction in the level of both Gag protein and MLV gRNA in the cultures infected with the MLV-Δm 6 A and MLV-Δm 5 C mutant, though this effect was only 2-to 3-fold. We note, however, that since these mutants retain a substantial number of m 6 A and m 5 C sites, a modest phenotype is not unexpected. Nevertheless, these data are clearly consistent with the hypothesis that the m 6 A and m 5 C epitranscriptomic modifications detected on MLV transcripts are both able to facilitate some aspect of MLV replication.
While the mutations introduced into the MLV-Δm 6 A and MLV-Δm 5 C mutants are designed to be silent and to not impact sequences with a known regulatory role, it remains possible that they could affect an important RNA structure or protein binding site unrelated to the targeted RNA modifications, for example, a sequence that modulates the level of MLV RNA splicing. We therefore wished to test whether inhibition of m 6 A or m 5 C addition, or their enhanced detection by cellular readers, might also impact MLV gene expression. Previously, we reported that overexpression of the key cellular m 6 A reader YTHDF2 increases viral gene expression for three distinct viral species, viz. HIV-1, influenza A virus and the SV40 (9-11), and we therefore sought to determine whether stable overexpression of murine YTHDF2 in 3T3 cells would also enhance MLV gene expression. For this purpose, we generated clonal 3T3 cell lines transduced with a lentiviral vector expressing either FLAG-tagged YTHDF2 or GFP and selected single cell clones expressing readily detectable levels of these proteins (Fig. 4A). We then infected these cells with wild type MLV and assessed viral gene expression at 48 and 72 hpi by Western blotting for MLV p65 Gag. As shown in Fig. 4A, YTHDF2 overexpression indeed resulted in a readily detectable increase in MLV Gag expression, thus again arguing that m 6 A addition to MLV transcripts facilitates viral gene expression and replication.
While the proteins that "write" and "read" m 6 A modifications are well defined, this is less clear for the m 5 C modification as several cytidine methyltransferases have been described. However, the cellular protein NSUN2 has previously been reported to add m 5 C to specific cellular mRNAs (4-7), and we have recently reported that NSUN2 is primarily responsible for the addition of m 5 C modifications to HIV-1 transcripts (8). An interesting aspect of NSUN2 is that it forms a transient covalent bond with the "C" residues it is methylating and release requires the action of a conserved cysteine residue located at position 271 in NSUN2. As a result, mutagenesis of cysteine 271 to alanine (C271A) leads to the spontaneous formation of NSUN2 cross-links to target "C" residues on RNAs (6). Therefore, in cells expressing NSUN2-C271A, we predicted that this mutant protein would cross-link to MLV gRNA and potentially be packaged into MLV virions. As shown in Fig. 4B, we indeed observed packaging of the NSUN2-C271A mutant, but not wild-type NSUN2, into MLV virions produced in transfected 293T cells, thus not only identifying NSUN2 as an enzyme that adds m 5 C to MLV gRNAs but also, more generally, confirming that MLV gRNAs do indeed bear "C" residues that are methylated in producer cells.
We next asked in reduced expression of NSUN2 would result in a reduction in MLV gene expression. This was achieved by the efficient knockdown of NSUN2 expression using RNA interference (RNAi), as shown in Fig. 4C. Importantly, knockdown of NSUN2 expression in 3T3 cells also resulted in a marked drop in the expression of the MLV Gag proteins (Fig. 4C). Therefore, consistent with the data presented in Fig. 3, these results argue that addition of m 5 C, like addition of m 6 A, to MLV transcripts enhances MLV RNA expression.

DISCUSSION
Previously, we and others have reported that the addition of m 6 A facilitates viral gene expression and replication for a range of distinct viruses, including HIV-1, influenza A virus, SV40, enterovirus 71, and KSHV (9)(10)(11)(12)(13)(14)(15). More recently, we have also presented evidence indicating that m 5 C promotes HIV-1 mRNA translation (8), while others have reported that Nm residues on HIV-1 transcripts promote HIV-1 replication by inhibiting activation of the innate antiviral RNA sensor MDA5 (16). Together, these observations indicate that at least a subset of the epitranscriptomic modifications found on mRNAs, specifically m 6 A, m 5 C, and Nm, each acts as a positive regulator of some aspect of the viral replication cycle and should therefore be selected for during viral evolution. Consistent with this hypothesis, we recently reported that m 6 A, m 5 C, and Nm ribonucleotides were all present at substantially higher levels on the HIV-1 RNA genome than on poly(A) ϩ RNA isolated from human cells, with m 5 C (ϳ20ϫ higher) and Nm (11ϫ to 28ϫ higher) being particularly enriched (8).
Despite the evidence delineated above arguing for a positive role for at least some epitranscriptomic modifications in the viral life cycle, this issue has remained controversial. Specifically, one group has argued that m 6 A actually inhibits HIV-1 gene replication (28), and others have suggested that m 6 A modification of flaviviral transcripts, including Zika virus RNAs, inhibited some aspect of the viral replication cycle (29,30). Why a rapidly evolving, lytically replicating virus, such as Zika virus, should retain m 6 A sites if these inhibit virus replication in cis was not, however, investigated.
Here, we have sought to shed further light on the role of epitranscriptomic modifications in regulating viral gene expression and replication, and we present three lines of evidence arguing that m 6 A and m 5 C indeed exert a positive effect on the replication of the retrovirus MLV when present in cis on viral RNAs. First, we demonstrate that MLV genomic RNAs are subject to exceptionally high levels of modification by addition of m 6 A and m 5 C, with observed levels that are from ϳ10-fold higher than observed in purified 3T3 poly(A) ϩ RNA (Fig. 1). Second, we mapped several sites of m 6 A and m 5 C addition on MLV transcripts and then mutationally ablated a subset of these by the introduction of silent mutations ( Fig. 2 and 3). While these two mutant viruses, termed MLV-Δm 6 A and MLV-Δm 5 C, retained the majority of their m 6 A and m 5 C residues, we nevertheless observed a modest but significant reduction in viral protein and RNA expression in infected 3T3 cells (Fig. 3). Finally, we sought to determine whether overexpression of the key m 6 A reader YTHDF2, or downregulation of the m 5 C writer NSUN2, would affect MLV gene expression. Indeed, and as previously also reported for HIV-1 (8,9), we observed enhanced MLV gene expression upon overexpression of murine YTHDF2 (Fig. 4A), and a substantial decline in MLV gene expression in 3T3 cells upon downregulation of NSUN2 expression using RNAi (Fig. 4C). These MLV data therefore confirm and extend our previously reported results, generated using HIV-1 (8,9), indicating that m 6 A and m 5 C are positive regulators of viral gene expression and further argue that MLV, like HIV-1 and likely many other virus species, has evolved to use the epitranscriptomic writers and readers expressed by infected cells as a way to increase viral gene expression and replication. While the data presented in this manuscript do not address the mechanisms by which m 6 A and m 5 C exert this positive effect, our previous work suggests that m 6 A may primarily act by increasing viral mRNA stability, while m 5 C enhances viral mRNA translation (8)(9)(10).

MATERIALS AND METHODS
Plasmids and cDNA cloning. A lentiviral vector was used to generate a clonal 3T3-derived cell line stably expressing FLAG-tagged mouse YTHDF2. Briefly, the mouse ythdf2 gene (NP_663368) was PCR amplified from a cDNA library and cloned into the pLEX vector (9) 5= to an internal ribosome entry site (IRES) and the puromycin (puro) resistance gene, all driven by the cytomegalovirus immediate-early promoter. A previously described (9) pLEX-based lentiviral vector expressing FLAG-tagged green fluorescent protein (GFP) was used as a control. A FLAG-tagged murine NSUN2 (NP_060225) expression plasmid was generated by PCR amplification of an NSUN2 cDNA that was then cloned into the pcDNA3.1 expression plasmid to generate pcDNA-FLAG-NSUN2. A mutant form of NSUN2 was generated by introducing the C271A mutation into pcDNA-FLAG-NSUN2. The NSUN2-C271A mutant spontaneously forms stable covalent bonds with target cytosines on RNA (6). The pcDNA-based expression plasmid ph3G-HA, expressing a hemagglutinin (HA) epitope-tagged form of human APOBEC3G (A3G) has been described (31). Here, we substituted the FLAG epitope tag for HA to generate ph3G-FLAG.
The pNCS MLV proviral expression vector has been described and was a gift from Stephen Goff (18). Two MLV mutant clones were generated, one mutated to remove a single m 6 A site (Δm 6 A) and the second mutated to remove four m 5 C sites (Δm 5 C). Only silent mutations were introduced at these sites. Two DNA gBlocks were synthesized by IDT containing these silent mutations and cloned into pNCS to generate pNCS-Δm 6 A and pNCS-Δm 5 C, respectively. Mutations introduced into pNCS-Δm 6 A were as follows, counting from the start codon of Env, with introduced mutations indicated by lowercase letters: nt 354, 5=-GAAGAgCCTctcACCTCC-3=. Mutations introduced into pNCS-Δm 5 C are as follows, counting Methylation of A and C Residues on MLV ® from the start codon of Gag, with introduced mutations again indicated in lowercase letters: nt 2898, 5=-GGtTTtTGTaGatTaTGGATt-3=; nt 3645, 5=-agtGCTCAGaGaGCTGAAtTGATAGCAtTgACt-3=; nt 4242, 5=-a GAACAtTaAAAAATATtACTGAGACtTGt-3=; and nt 4500, 5=-ATtTTtCCtAGGTTt-3=. Clone integrity was verified by Sanger sequencing.
MLV infections. To generate infectious MLV, pNCS-based proviral expression vectors (18) were transfected into 293T cells (CRL-3216; ATCC) using polyethyleneimine (PEI). At 24 h posttransfection, supernatant media were exchanged for fresh media. At 72 h posttransfection, the supernatant media containing infectious MLV virions were passed through a 0.45-m-pore size filter and then used for infection of 3T3 cells.
MLV gRNA purification. MLV virions were purified by a two-step method, as previously described (19). Briefly, the supernatant media from MLV-infected 3T3 cells were harvested at 72 hpi and passed through a 0.45-m-pore size filter, and the virions were then pelleted through a 20% sucrose cushion by ultracentrifugation. The virion pellet was resuspended and layered onto a 7.2 to 20% iodixanol gradient (OptiPrep; Axis-Shield) prior to ultracentrifugation to separate virions from cellular debris and exosomes. The virion band was then harvested and total RNA extracted using TRIzol. The isolated RNA was heat denatured in a loading buffer containing urea and run on a preparative 1.5% TBE agarose gel. An RNA band of ϳ8 kb, corresponding in size to the MLV gRNA, was visualized, excised, and RNA isolated using acid phenol, followed by phenol-chloroform extraction. To determine the amount of modified nucleotides on murine mRNA as a control, total RNA was extracted from 3T3 cells using TRIzol and then subjected to two sequential rounds of poly(A) ϩ RNA purification, using cellulose oligo(dT) beads, and one round of rRNA depletion using a RiboZero Gold kit (Illumina). The bulk of the purified MLV RNA was then used for UPLC-MS/MS analysis of RNA modifications, while a small aliquot was retained for RNA-seq analysis, which was used to determine the purity of the MLV gRNA sample. RNA-seq was performed using the SMARTer stranded total RNA-Seq kit v2 (NEB) according to the manufacturer's instructions.
RNA modification identification by UPLC-MS/MS. Nucleosides were generated from purified MLV RNA by nuclease P1 digestion (Sigma) in buffer containing 25 mM NaCl and 2.5 mM ZnCl 2 for 2 h at 37°C, followed by incubation with Antarctic phosphatase (NEB) for an additional 2 h at 37°C (32). Nucleosides were separated and quantified using UPLC-MS/MS as previously described (17), except acetic acid was used in place of formic acid. Triplicate MLV gRNA samples were assessed by this method, as well as one poly(A) ϩ RNA sample isolated from the producer 3T3 cell line.
PA-antibody-seq. PA-m 6 A-seq and PA-m 5 C-seq were performed as previously described (8)(9)(10). Briefly, 3T3 cells were infected with MLV as described above. At 48 hpi, cells were pulsed with 100 mM 4-thiouridine (4SU). After a further 24 h, total cellular RNA was extracted from the MLV-infected 3T3 cells using TRIzol, while MLV gRNA was extracted from virions that were collected by ultracentrifugation of the supernatant media through a 20% sucrose cushion. Total cellular poly(A) ϩ RNA was purified using oligo(dT) magnetic beads (AM1922; Invitrogen) and 10 g of poly(A) ϩ RNA or virion gRNA was then used according to the previously reported PA-m 6 A-seq protocol (10, 25) using either an m 6 A-specific (202111; Synaptic Systems) or m 5 C-specific (C15200081; Diagenode) polyclonal antibody.
NSUN2 packaging into virions. Plasmids expressing FLAG-tagged versions of the wild-type or C271A mutant form of murine NSUN2, or human A3G, were cotransfected with pNCS into 293T cells using PEI. At 72 h posttransfection, the supernatant media were passed through 0.45-m-pore size filters, and virions were harvested by ultracentrifugation through a 20% sucrose cushion. Protein was extracted from the virion pellet in Laemmli buffer (33) before analysis by Western blotting. Protein from producer cells was also harvested in Laemmli buffer to demonstrate protein expression from the transfected plasmids.
Western blots. Proteins were extracted using Laemmli buffer, sonicated and denatured at 95°C for 10 min and then separated on Tris-glycine-SDS polyacrylamide gels (Invitrogen). After electrophoresis, proteins were transferred to a nitrocellulose membrane, and then blocked in 5% milk in PBS plus 0.1% Tween (PBS-0.1% Tween). Membranes were incubated in primary and secondary antibodies diluted in 5% milk in PBS-0.1% Tween for 1 h each and then washed in PBS-0.1% Tween. Each antibody was used at a 1:5,000 dilution. The antibody targeting MLV Gag has been described (34) and was a gift from Stephen Goff. Antibodies targeting actin (60008; Proteintech), NSUN2 (20854; Proteintech), and the FLAG epitope tag (F1804; Sigma), as well as anti-mouse horseradish peroxidase (HRP)-conjugated (A9044; Sigma) and anti-rabbit HRP-conjugated (A6154; Sigma) antibodies, were also used. Western blot signals were visualized by chemiluminescence. ImageJ was used for quantification of the intensity of protein bands.
siRNA transfections. To investigate the effect of reduced NSUN2 protein levels on MLV protein expression, RNAi was utilized to knockdown NSUN2 expression in 293T cells. An siRNA specific to NSUN2 (siNSUN2), or a control small interfering RNA (siRNA; SR310319; Origene), was transfected into 293T cells at a concentration of 25 pmol/ml using Lipofectamine RNAiMAX (Invitrogen). At 48 h posttransfection, cells underwent a second siRNA transfection and were then incubated for a further 24 h. Cells were then transfected with pNCS. At 72 h posttransfection, the cells were harvested for Western blot analysis.
YTDHF2 overexpression in 3T3 cells. The lentiviral expression vectors pLEX-GFP and pLEX-YTHDF2 were transfected into 293T cells along with the packaging plasmids pMD2G (12259; Addgene) and pΔ8.74 (22036; Addgene). Media were changed at 24 h posttransfection. At 72 h posttransfection the supernatant media containing the lentiviral particles was harvested and passed through a 0.45-m-pore size filter and overlaid on 3T3 cells (CRL-1658; ATCC). At 48 hpi, transduced 3T3 cells were selected by the addition of puromycin to the culture media. After a further 72 h, cells were single cell cloned, expanded, and assessed by Western blotting for FLAG-GFP or FLAG-YTHDF2 expression. A single clone was then selected to determine the effect of overexpression of YTHDF2 on MLV infection. The GFP-or YTHDF2-overexpressing cell lines were infected with MLV and, at 72 hpi, protein was harvested for Western blot analysis.
Quantitative real-time PCR. Relative MLV gRNA expression levels were determined by qRT-PCR. The level of GAPDH (glyceraldehyde-3-phosphate dehydrogenase) mRNA was used to normalize all cellular qRT-PCR experiments, while 7SL RNA, which is packaged by MLV virions (20), was used to normalize virion qRT-PCR experiments. All primer sequences are listed below. RNA was extracted using the TRIzol method. cDNA was generated using the Ambion cDNA synthesis kit with random primers according to the manufacturer's protocol. All qRT-PCR experiments were performed using Thermo Fisher Power Sybr green PCR master mix (4367659; ABI) following the manufacturer's instructions. All qRT-PCR data were quantified using the ΔΔC T method.
Bioinformatic analysis. Read alignments were performed using Bowtie (35). Reads were first aligned to the mouse genome, allowing up to one mismatch, then unaligned reads were aligned to the pNCS MLV transcriptome, again allowing up to one mismatch. At least one characteristic T-to-C mutation, resulting from 4SU incorporation and cross-linking to the antibody used, were required in both mouse and viral aligned reads. All data were processed using in-house Perl scripts and SAMtools (36) and visualized with IGV, as previously described (10).
Data availability The raw sequencing data obtained by RNA-seq have been submitted to the NCBI expression omnibus and are available under GenBank accession number GSE129302.