A Novel Class of Somatic Small RNAs Similar to Germ Cell Pachytene PIWI-interacting Small RNAs*

Background: Germ cells exclusively express PIWI-interacting small RNAs for transposon and gene regulation. Results: Somatic cells express similar RNAs that do not require known small RNA proteins and that partially complement mRNAs. Conclusion: These somatic small RNAs represent a novel small RNA population, which potentially regulates mRNA translation. Significance: Defining novel small RNAs is essential for elucidating the mechanisms that control gene expression. PIWI-interacting RNAs (piRNAs) are small noncoding RNAs that bind PIWI family proteins exclusively expressed in the germ cells of mammalian gonads. MIWI2-associated piRNAs are essential for silencing transposons during primordial germ cell development, and MIWI-bound piRNAs are required for normal spermatogenesis during adulthood in mice. Although piRNAs have long been regarded as germ cell-specific, increasing lines of evidence suggest that somatic cells also express piRNA-like RNAs (pilRNAs). Here, we report the detection of abundant pilRNAs in somatic cells, which are similar to MIWI-associated piRNAs mainly expressed in pachytene spermatocytes and round spermatids in the testis. Based on small RNA deep sequencing and quantitative PCR analyses, pilRNA expression is dynamic and displays tissue specificity. Although pilRNAs are similar to pachytene piRNAs in both size and genomic origins, they have a distinct ping-pong signature. Furthermore, pilRNA biogenesis appears to utilize a yet to be identified pathway, which is different from all currently known small RNA biogenetic pathways. In addition, pilRNAs appear to preferentially target the 3′-UTRs of mRNAs in a partially complementary manner. Our data suggest that pilRNAs, as an integral component of the small RNA transcriptome in somatic cell lineages, represent a distinct population of small RNAs that may have functions similar to germ cell piRNAs.

During male germ cell development, repetitive sequence-derived piRNAs are mainly expressed in primordial germ cells in fetal testes, whereas nonrepetitive sequence-derived piRNAs are predominantly expressed in pachytene spermatocytes and round spermatids in adult testes (9). Therefore, these two types of piRNAs are also called prepachytene and pachytene piRNAs, respectively. Prepachytene piRNAs are produced through the ping-pong mechanism and consist of repetitive sequence-derived primary and secondary transcripts, the production of which involves both MILI (enriched in primary transcripts) and MIWI2 (enriched in secondary transcripts) (6,9,11). On the one hand, the ping-pong positive feedback loop inhibits transposon expression through post-transcriptional cleavage of transposon transcripts (11). On the other hand, these transposonderived piRNAs induce transposon silencing through piRNAdependent DNA methylation (9). In contrast, pachytene piRNAs associate with MIWI and MILI, creating two subcategories of piRNAs: MILI-associated and MIWI-associated piRNAs (12). These piRNAs do not show ping-pong characteristics such as secondary transcripts that are partially complementary to primary transcripts. Instead, pachytene piRNAs represent largely, if not exclusively, primary transcripts (12).
We and others have reported the existence of piRNA-like RNAs (pilRNAs) in both gonadal and somatic tissues (13)(14)(15)(16). However, a detailed characterization of this novel class of small RNAs expressed in somatic cells/tissues has been lacking. Here, we report the identification of hundreds of murine pilRNAs from small RNA libraries of murine testicular Sertoli cells, interstitial cells of Cajal (ICCs) from the murine gastrointestinal tract, and murine small intestine, in addition to millions of murine stomach small RNAs that align to newly defined germ cell piRNA clusters. Our in-depth analyses revealed a high degree of similarities between these somatic pilRNAs and pachytene piRNAs in size, nucleotide composition, clustering, genomic origin, and potential targets. However, unlike pachytene piRNAs, pilRNAs have a distinct ping-pong signature. Finally, biogenesis of pilRNAs is mediated by an unidentified mechanism different from any of the currently known small RNA biogenetic pathways indicative of an entirely novel population of small RNAs.

EXPERIMENTAL PROCEDURES
Knock-out Mouse Lines Used-All animal work was conducted following the animal use protocol approved by the Institutional Animal Care and Use Committee of the University of Nevada, Reno. For FACS-based Sertoli cell purification, control (Amh-Cre; Rosa26-mTmG ϩ/tg ), Sertoli cell-specific Dicer conditional knock-out (cKO) (Amh-Cre; Dicer lox/lox ; Rosa26-mTmG ϩ/tg ), and Sertoli cell-specific Drosha cKO (Amh-Cre; Drosha lox/ lox; Rosa26-mTmG ϩ/tg ) mice were generated as described (16 -18). Miwi universal KO mouse line was obtained from the Mutant Mouse Regional Resource Centers. Mov10l1 universal knock-out mice were purchased from the Jackson Laboratory. Miwi2 knock-out mice were generated as described (19).
Tissue Collection and Cell Purification-Tissue samples (brain, colon, heart, kidney, liver, lung, small intestine, spleen, stomach, and testis) were collected from adult wild-type or knockout C57/BL6 mice. Upon collection, samples were immediately snap frozen in liquid nitrogen followed by storage at Ϫ80°C until RNA isolation. Wild-type Sertoli cells were collected from postnatal day 6 mice as previously reported (16). Amh-cre control and cKO Sertoli cells from 2-day-old mice were isolated by FACS based on GFP expression from the mTmG reporter (20). ICCs were purified from adult mice using FACS as described (21).
qPCR Analyses of Small Noncoding RNAs Expression-Small noncoding RNAs (sncRNAs) were isolated from purified Sertoli cells, ICCs, and small intestine, and sncRNA cDNAs were prepared as previously described (22). qPCR analyses were performed using the 7900HT Fast Real-Time PCR System (Applied Biosystems). A TaqMan-based method was implemented using TaqMan probe (5Ј-FAM-CTCGGATCCACTA-GTC-MGB-3Ј (where FAM is 6-carboxyfluorescein and MGB is minor groove binder)) and U6 snRNA as an endogenous control (23). The levels in all somatic tissues were normalized to those in testis and knock-out samples to respective wild-type control samples. Primers for sncRNA qPCR are included in Table 1.
Small RNA Sequencing-Sequencing libraries prepared using sncRNAs from Sertoli cells, ICCs, and small intestine were sequenced on the 454 GS20 platform (454 Life Sciences, Roche, Branford, CT) as previously reported (16). Sequencing data has been deposited to the NCBI Gene Expression Omnibus database with an accession number of GSE48388 (24). 454 Sertoli pilRNA annotations were previously published and can be found with accession number GSE40692 (16).
Two biological replicates of FACS-purified Sertoli cells from control, Dicer cKO, and Drosha cKO testes (n ϭ 2) were sequenced using the Ion Torrent Personal Genome Machine (PGM) (Invitrogen) as described (17,18). Briefly, sncRNAs were isolated using the mirVana miRNA isolation kit (Invitrogen), and cDNA libraries were generated using the Ion Total RNA-Seq kit v2 (Invitrogen). Emulsion PCR was carried out on the Ion OneTouch (Invitrogen) using the Ion OneTouch 200 template kit v2 (Invitrogen) for cDNA amplification on ion sphere particles. Samples were enriched for positive spheres using the Ion OneTouch enrichment system (Invitrogen). After enrichment, sequencing was performed on the PGM using the Ion PGM 200 sequencing kit (Invitrogen) and Ion 316 chips (Invitrogen). Stomach samples (n ϭ 2) were sequenced in a similar fashion with the exception of 318 chips being used in lieu of 316 chips. Stomach data were submitted to the NCBI Gene Expression Omnibus under accession number GSE53780 (24).
Sequence reads from Ion Torrent sequencing of Sertoli cell sncRNA libraries were aligned to the mouse genome (build mm9) using the Ion Torrent Server (Suite 2.2) t-map aligner  (30). Resulting alignments to known germ cell piRNAs were classified as pilRNAs. To account for sequencing depth variation, the abundance of each pilRNA was normalized to the total aligned reads obtained from each sample, multiplied by 1 ϫ 10 6 , and averaged between biological replicates. pilRNAs with Յ1 read in all six libraries were removed from analysis. Resulting reads from PGM sequencing of wild-type stomach samples were mapped to mm9 using Ion Torrent Server Suite 3.2.1 t-map aligner (Invitrogen) (31). Recently redefined piRNA producing transcript genomic loci were obtained and primary miRNA locations (miRBase release 18) were omitted from these clusters to avoid contaminating miRNA reads (28,34). In-house python scripts were used to ascertain reads mapped within piRNA clusters and their corresponding nucleotide lengths and composition.
Precursor Transcript RT-PCR-Large RNA from adult wildtype mouse brain and testis was isolated using the mirVana miRNA isolation kit (Invitrogen). RNA was treated with either DNaseI amplification grade (Invitrogen) or RNase A (Sigma-Aldrich). DNaseI treatment was performed according to the manufacturer's recommendations. RNase A treatment was performed at 37°C for 60 min with 0.5 g of RNase A in 1X DNaseI reaction buffer, followed by the same reaction conditions used for the DNaseI treatment (without the DNase). Using 1 g of treated RNA, reverse transcription was performed using the SuperScript III first-strand synthesis system kit with random hexamers (Invitrogen). A reaction volume of 20 l was used, consisting of 10 l of 2ϫ GoTaq (Promega), 10 pmol of forward and reverse primer, and 1 l of cDNA. PCR conditions were as follows: 1 cycle at 95°C/2min, 28 cycles of amplification (95°C for 30 s, 58°C for 30 s, and 72°C for 45 s), followed by 1 cycle of 72°C for 5 min. PCR product was run on a 2% agarose gel with the 100-bp DNA ladder H3 (Caisson Labs). Primers and antic-ipated amplicon sizes for the precursor transcript RT-PCR are included in Table 2.
pilRNA Post-Transcription Target Analysis-Annotated pilRNA sequences were reverse complement aligned to all known murine 3Ј-UTRs, 5Ј-UTRs, and coding sequences (CDSs) using Sequery v1.0 (16). 3Ј-UTRs, 5Ј-UTRs, and CDSs were collected from ENSEMBL release 71 (35). Reverse complement analysis was first performed allowing up to two mismatches and then repeated allowing up to five mismatches with no gaps allowed. Gene features aligned to were considered potential post-transcriptional targets of pilRNAs.
Ping-Pong Analysis-Reads were subjected to a pairwise analysis of 5Ј ends with a 10-or 11-nt overlap using Sequery v1.0 (16). Reads that reverse complemented at their 5Ј ends with a 10-nt overlap were considered ping-pong pilRNAs.

RESULTS
Identification of pilRNAs in Somatic Cells/Tissues-We initially searched for pilRNAs from sequence reads of three small noncoding RNA (sncRNA) libraries prepared using murine Sertoli cells purified from postnatal day 6 testes, ICCs of the murine gastrointestinal tract, and small intestine based on criteria described previously (16). Among 696 nonredundant pilRNAs identified, 117 were from the ICC sncRNA library, 263 were found in Sertoli cells, and 316 were found in the small intestine; ϳ40 -60% of these pilRNAs have sequences identical to known germ cell piRNAs. The abundance of pilRNAs varied among the three somatic cell sncRNA libraries, with relatively more pilRNAs identified in ICCs than in small intestine or Sertoli cells (Fig. 1A and supplemental Tables S1-S4). The size of germ cell piRNAs ranges from 24 to 32 nt with MILI-, MIWI2-, and MIWI-associated piRNAs being ϳ26, 28, and 30 nt long, respectively (6). The lengths of identified pilRNAs ranged from 20 to 32 nt with the majority at 30 nt (Fig. 1B), which is similar to the major size of pachytene/MIWI-associated piRNAs.
Recent data redefined murine genomic loci of piRNA-producing transcripts using a series of sequencing techniques including small RNA-Seq, sequencing of samples enriched for piRNAs based on 2Ј-O-methyl-modified 3Ј termini, cap analysis of gene expression sequencing, and polyadenylation site  (34). These techniques clarified the termini of transcripts that produced piRNAs and were combined with temporal analyses allowing for the classification of piRNA clusters as prepachytene, hybrid, or pachytene piRNAs (34). Using these redefined clusters and our recent stomach small RNA deep sequencing data, we were able to map ϳ1.6 million stomach sncRNA reads to these germ cell piRNA clusters ( Fig. 1C and supplemental Table S5). This pilRNA population was the largest classifiable population, taking up 18% of the stomach small RNA sequencing reads. Coinciding with our initial pilRNA results, stomach pilRNAs aligned to the redefined germ cell piRNA clusters were predominantly 30 nt in length, similar to the pachytene/MIWI-associated piRNAs (Fig. 1D). Stomach replicates had a R 2 of 0.98, showing good correlation and corroboration of the stomach pilRNA results (Fig. 1E).
Validation of pilRNA Expression in Somatic Tissues-We verified the expression of three novel pilRNAs identified from murine small intestine (piR-in3, piR-in18, and piR-in87) in 10 different tissues using qPCR (Fig. 2). As expected, the two germ cell-expressed piRNAs (piR-118029 and piR-126541) were predominantly detected in the testis. Interestingly, all three small intestine pilRNAs also showed higher expression in the testis than in most of the somatic tissues analyzed. The highest levels of pilRNA-in3 were detected in brain, whereas pilRNA-in18 was most abundant in heart. The top three organs expressing pilRNA-in87 included testis, brain, and heart (Fig. 2).
pilRNAs Resemble Pachytene piRNAs-Repeat-associated piRNAs are produced through the ping-pong mechanism involving MILI and MIWI2 in mice (6,9,11). These piRNAs include both primary (contain 5Ј uracil) and secondary (contain 10 th position adenine) piRNA transcripts, and there is a consistent 10-nt complementary overlap between primary and secondary transcripts (11). piRNAs produced through the pingpong mechanism tend to display an accumulation of secondary piRNAs, which contain an adenine at the 10 th nt position (6). We analyzed the pilRNA sequences and found that the 10 th nt preference for adenine was absent. In addition, a strong preference for the signature 5Ј uracil of primary piRNA transcripts was detected in over 50% of pilRNAs (Fig. 3A). Stomach pilRNAs aligned to redefined germ cell piRNA loci confirmed this data with most reads preferentially containing a 5Ј uracil opposed to the 10 th nt adenine (Fig. 3B). The nucleotide preferences of novel pilRNAs and pilRNAs that align to known piRNA sequences were investigated separately in ICCs, Sertoli cells, and small intestine (Fig. 4, A-C, respectively). A bias toward a 5Ј uracil and a lack of a 10 th nt adenine preference was observed in both novel pilRNAs and pilRNAs that align to known piRNA sequences. Together, these data suggest that FIGURE 2. Validation of pilRNA expression in multiple murine tissues. Levels of three pilRNAs (pilRNA-in3, pilRNA-in18, and pilRNA-in87) identified from the small intestine sncRNA libraries and two germ cell piRNAs (piR-118029 and piR-126541) were determined using qPCR in 10 tissues including nine somatic tissues and the testis. U6 snRNA was used as an endogenous control, and all somatic tissue values are relative to levels of testis expression. FIGURE 3. Somatic pilRNAs resemble germ cell pachytene piRNAs. A, somatic cell-expressed pilRNAs identified by 454 sequencing display a preference for 5Ј uracil but lack enrichment of an adenine at the 10 th position. B, PGM sequenced stomach pilRNAs display a preference for 5Ј uracil but lack enrichment of an adenine at the 10 th position. C, clustering analyses of pilRNAs by aligning pilRNAs to known prepachytene and pachytene piRNA clusters. The number of pilRNAs found within one or both of the two known types of piRNA clusters is indicated. The majority of pilRNAs appear to belong to pachytene piRNA clusters. D, clustering analyses of pilRNAs by aligning pilRNAs to recently redefined prepachytene, hybrid, and pachytene piRNA clusters. The number of pilRNAs found within each category of piRNA clusters is indicated. The majority of stomach pilRNAs belong to pachytene piRNA clusters. Nt, nucleotide; Sm Int, small intestine. pilRNAs resemble pachytene piRNAs and are predominantly primary pilRNAs.
pilRNAs Tend to Cluster with Pachytene piRNAs-Clustering is a key characteristic of piRNAs. However, lone piRNAs derived from unique, unclustered genomic locations have been found in large quantities as well (12,14). To properly classify the pilRNAs identified, we collected genomic coordinates containing murine prepachytene and pachytene piRNA clusters, and pilRNAs were aligned and categorized within these clusters (9,12). 60 of 117 ICC pilRNAs (ϳ51%) were found to cluster with pachytene piRNA clusters, whereas none belonged to prepachytene piRNA clusters (Fig. 3C). Although 3 of 263 Sertoli cell pilRNAs were found to belong to prepachytene piRNA clusters, these 3, together with 31 additional Sertoli pilRNAs, also fell into pachytene piRNA clusters (Fig. 3C). Similarly, 117 of 316 small intestine pilRNAs (ϳ37%) belonged to pachytene piRNA clusters, with only 2 also found in prepachytene piRNA clusters (Fig. 3C). The preferential clustering with pachytene piRNAs again supports the notion that pilRNAs are more like pachytene piRNAs. We confirmed that these results with stomach reads aligned to the redefined germ cell piRNA clusters. Of ϳ1.6 million, reads only 7,850 aligned to prepachytene clusters and 27,945 to hybrid clusters, whereas 98% (1.54 million) of the stomach pilRNAs aligned to pachytene clusters (Fig. 3D). Recent data have demonstrated that pachytene piRNA clusters are predominantly intergenic (34). This was found to be true for stomach pilRNAs as well, with an average pilRNA density of 15,800 pilRNA per intergenic cluster in contrast to 422 per nonintergenic cluster (supplemental Table S5).
pilRNA Clusters Are Similar to Pachytene piRNA Clusters-Like pachytene piRNAs, somatic pilRNAs predominantly originate from intergenic genomic loci. Fig. 5A shows an example of two intergenic bidirectional pachytene piRNA clusters (data from NCBI Gene Expression Omnibus accession number GSM1096603) (34). The corresponding expression of stomach pilRNAs is similar to that seen in the pachytene spermatocytes within these clusters. In addition, the stomach pilRNAs that originate from these clusters are largely 30 nt in length and have a preference for 5Ј uracil similar to pachytene piRNAs (Fig. 5, B and C). RT-PCR detected the presence of several precursor transcripts in both the stomach and testis, suggesting that these clusters are also transcribed in somatic cells (Fig. 5D).
Although pilRNAs Are Similar to Pachytene piRNAs, pilRNAs Exhibit Ping-Pong Activity-To evaluate whether pilRNAs are in fact products of the ping-pong mechanism, we performed pairwise analyses of the 5Ј end of pilRNA pairs. In the small intestine sncRNA library, one single ping-pong match was identified between piR-141849 and pilRNA-in87, which, in addition to having a 10-nt overlap, also had the signature 5Ј uracil and 10 th nt adenine. The ICC and Sertoli cells did not have any ping-pong signature detected. In contrast, the stomach had a large fraction (28%) of pilRNAs that had the canonical 10-nt overlap at their 5Ј ends (Fig. 6A). A similar analysis looking for 11-nt overlaps had markedly reduced alignments (7%), indicating that a 10-nt overlap of 5Ј ends is indeed prominent in the stomach pilRNAs. The stomach pilRNAs that generate pairs with a 10-nt overlap also have a preference for a 5Ј uracil, as well as the 10 th nt adenine indicative of ping-pong activity (Fig. 6B). These ping-pong pilRNAs predominantly align to pachytene clusters (Fig. 6C). In addition, the presence of secondary transcripts with 10 th nt adenines do not seem to be dependent on the type of cluster the pilRNA is derived from (Fig. 6D). In fact all ping-pong pilRNAs, regardless of the cluster they originate from, have similar nucleotide profiles.
pilRNA Biogenesis Is Independent of Known miRNA or piRNA Biogenetic Pathways-DICER and DROSHA are two RNase III enzymes essential for the biogenesis of the best-studied small RNAs, miRNAs (36,37). In addition, DICER is also required for the production of endo-siRNAs (37). Because of their necessity in miRNA and/or endo-siRNA production, we examined the effects of DICER and DROSHA depletion on pilRNA expression in murine Sertoli cells. Using Sertoli cells purified from Sertoli cell-specific Dicer (Amh-Cre; Dicer lox/lox ; Rosa26-mTmG ϩ/tg ) and Drosha (Amh-Cre; Drosha lox/lox ; Rosa26-mTmG ϩ/tg ) conditional knock-out testes, we performed FIGURE 4. Nucleotide preference of pilRNAs that align to known piRNA and novel pilRNAs. A, ICC pilRNAs show a 5Ј uracil preference and no 10 th nt preference in both groups. B, Sertoli pilRNAs that align known piRNAs show a 5Ј uracil preference for both groups but less so in novel pilRNAs. Both groups show no 10 th nt preference. C, small intestine pilRNAs show a 5Ј uracil preference and no 10 th nt preference in both groups. Nt, nucleotide; Sm Int, small intestine. NOVEMBER 21, 2014 • VOLUME 289 • NUMBER 47 JOURNAL OF BIOLOGICAL CHEMISTRY 32829 sncRNA deep sequencing (sncRNA-Seq) (17,38). Sequencing reads aligned to known piRNAs were used for differential pilRNA expression analyses. Although levels of Sertoli cell pilRNAs were slightly altered, almost all pilRNAs were still present at similar levels to control, as shown in the heat map representing the pilRNA transcriptomes of control, Dicer-and Drosha-null Sertoli cells (Fig. 7, A and B, and supplemental Table S6). The persistent expression of pilRNAs in the absence of either DICER or DROSHA suggests that pilRNA production is independent of the two proteins. To see whether pilRNA production is dependent on piRNA biogenetic pathways, we examined the expression of two pilRNAs similar to known piR-NAs and three novel pilRNAs in 10 organs of MIWI, MIWI2, and MOV10L1 universal KO mice (Fig. 7C and supplemental Table S7). As expected, the two known piRNAs showed minimal or no expression in the testes of the three KO lines. The three novel pilRNAs were also detected in testes but consistently decreased in the knock-outs. In comparison with the somatic tissues, testis samples had the largest decrease in expression between KOs and controls. This is possibly due to the decrease in germ cell content in KO testis, a phenomenon observed previously in MIWI2 KO lines (19). Despite the fact that there was some differential pilRNA expression between control and KO lines in all somatic tissues, pilRNAs were still detected at variable levels, suggesting that their biogenesis in somatic cells does not rely on the known piRNA biogenetic pathways.

Somatic piRNA-like RNA Characterization
pilRNAs Can Target 3Ј-UTRs by Partial Annealing-Posttranscriptional regulation is one of the most important functions of small RNAs, e.g. miRNAs and endo-siRNAs (39 -41). miRNAs recognize their targets by partial complementary annealing to their target sequences mostly located in 3Ј-UTRs of mRNAs, whereas endo-siRNAs tend to bind to 3Ј-UTRs in a perfect complementary manner (40 -42). If pilRNAs also function as a post-transcriptional regulator in a manner similar to that of miRNAs and endo-siRNAs, we would expect to see that  (Table 2). RNA was either treated with DNase (D columns) or RNase (R columns) as a negative control. All amplicons matched to their anticipated sizes ( Table 2). these pilRNAs could bind mRNAs, especially 3Ј-UTRs. We therefore performed bioinformatic analyses by aligning ICC, Sertoli, and small intestine pilRNAs in a reverse complementary manner to all known mouse 3Ј-UTRs, 5Ј-UTRs, and CDSs (35). We first allowed a 1-2-nt mismatch and found a small number of piRNAs (24 of 263 Sertoli cells and 5 of 316 small intestine pilRNAs) could complementarily anneal to murine mRNAs with the majority (88% Sertoli cells and 80% small intestine) matched to 3Ј-UTRs ( Fig. 8A and supplemental Tables S8 and S9). We then increased the mismatch allowance to 1-5 nt. Interestingly, a drastic increase in alignments was observed in piRNAs of all three tissue-origins, with 12 ICC, 96 Sertoli cells, and 66 small intestine pilRNAs annealed to mRNA sequences ( Fig. 8A and supplemental Tables S10 -S12). The targeting sites were mainly located in 3Ј-UTRs, which displayed the highest percentage and were potentially targeted by 67% ICC, 79% Sertoli cells, and 62% small intestine pilRNAs when one to five mismatches were allowed (Fig. 8A). An increase in CDS-targeting pilRNAs was also noted when the mismatch allowance was increased to five, with many of the same pilRNAs annealing to both the CDSs and 3Ј-UTRs of mRNAs. We examined the number of mRNAs that could potentially be targeted by pilRNAs in a manner of partial complementary annealing, and a preference of potential pilRNA binding sites in 3Ј-UTRs was observed (Fig. 8B). This was further exemplified by the fact that at least 2.4 times more mRNAs displayed potential 3Ј-UTR pilRNA binding sites in comparison with those in 5Ј-UTRs and CDSs combined (Fig. 8B).

DISCUSSION
Given that three PIWI proteins are all expressed exclusively in germ cells in gonads, piRNAs have long been regarded as germ cell-specific (1,3,(5)(6)(7)(8). Several reports have documented the detection of piRNA-like RNAs in somatic cell types (13)(14)(15)(16), but systematic characterization of pilRNA structural features, genomic origins, biogenesis, and functions have not been reported. Approximately 6 years ago, we performed sncRNA deep sequencing on two somatic cell types (Sertoli cells purified from the postnatal day 6 testes and interstitial cells of Cajal purified from gastrointestinal tract of adult mice) and somatic murine small intestine tissue using the 454 sequencing platform. We identified a total of 696 pilRNAs from sncRNA libraries of Sertoli cells, ICCs, and small intestine. Our recent Ion Torrent PGM-based sncRNA-Seq analyses on the murine stomach resulted in 1.6 million reads aligned to germ cell piRNA clusters. By analyzing sequence reads from the two types of sncRNA-Seq analyses, we defined the pilRNA transcriptomes in two murine somatic cell types (i.e. Sertoli cell and ICC) and two murine organs (i.e. stomach and small intestine). Interestingly, pilRNAs are very similar to non-repeat-associated intergenic piRNAs expressed abundantly and predominantly in pachytene spermatocytes and round spermatids in adult testes. This notion is supported by the following findings: 1) the major size of pilRNAs is 30 nt, which is closer to that of pachytene piRNAs; 2) pilRNAs largely fall within known pachytene piRNA instead of prepachytene piRNA clusters; and 3) similar to pachytene piRNAs, pilRNAs are mainly derived from intergenic regions (6). In contrast to pachytene piRNAs, which only have ϳ2% ping-pong activity, pilRNAs have a definite ping-pong signature with ϳ30% of stomach pilRNAs having a 10-nt overlap at their 5Ј ends (12). Therefore, we propose that pilRNAs are similar to germ cell pachytene piRNAs but are different in the fact that they have ping-pong activity like prepachytene piRNAs. It remains unknown how pachytene piRNAs are produced and what roles pachytene piRNAs play in spermatogenesis. Elucidation of piRNA biogenesis and functions would help us understand those of pilRNAs, and vice versa.
Although the function of pilRNAs remains elusive, their dynamic, tissue-specific expression patterns strongly suggest that pilRNAs are under strict regulation and thus may play an important role in cellular functions. Given that significantly more hits were observed when one to five mismatches were allowed, pilRNAs may be able to target mRNAs by partial annealing to their binding sites located in 3Ј-UTRs of mRNAs. In this aspect, pilRNAs or pachytene piRNAs are similar to miRNAs because both pilRNAs/piRNAs and miRNAs can tolerate several mismatches in target recognition. Upon increasing the mismatch allowance from up to two to up to five, an increase in CDS-targeting pilRNAs was also observed. This . pilRNA production is independent of the known miRNA/endo-siRNA or piRNA biogenetic pathways. A, heat map representing the pilRNA transcriptome in wild-type, Dicer-or Drosha-null murine Sertoli cells based on small RNA sequencing. Expression is based on a Log 10 scale. B, scatter plots depicting log 2 (normalized value) expression between replicates from FACS-purified Sertoli cells of Amh-cre control, Dicer conditional knock-out, and Drosha conditional knock-out mice. R 2 values were at least 0.82, indicating relatively good correlation between replicates. Samples had an average depth of ϳ450,000 aligned reads. C, heat maps representing qPCR analyses of expression levels of three intestinal pilRNAs (pilRNA-in3, pilRNA-in18, and pilRNA-in87) and two known germ cell piRNAs (piR-118029 and piR-126541) in 10 organs of MIWI, MIWI2, and MOV10L1 global KO mice. U6 snRNA was used as an endogenous control, and values in KO samples were relative to respective values of wild-type samples.
observation may be the result of CDSs having longer sequences than UTRs, thereby increasing the random chance of a complementary match. Further bioinformatic analyses may reveal the rules that pilRNAs follow to recognize their mRNA targets, and their effects on mRNA fate need to be confirmed experimentally in the future.
Although both DICER and DROSHA are ubiquitously expressed, DROSHA substrates must contain either the stem-loop structure or double-stranded RNA. Because neither pilRNAs nor pachytene piRNAs contain such a structure, it is not surprising to see that inactivation of either Dicer or Drosha, despite changes in pilRNA levels probably caused by secondary effects, does not block pilRNA biogenesis. We know pachytene piRNAs require MIWI and several other proteins including MOV10L1, MITOPLD, and GASZ, which are exclusively expressed in male germ cells (25,(43)(44)(45). In addition, our lab has confirmed the lack of MIWI, MIWI2, and MILI expression in several somatic tissues including murine stomach and confirmed the absence of these proteins by Western blot (data not shown). Therefore, it is expected that inactivation of MIWI/MIWI2 or MOV10L1 would not abolish pilRNA production. This was also anticipated, because although pilRNAs are similar to pachytene piRNAs, they do have a key difference in that they have pingpong mechanism activity. However, given the high degree of similarities between pilRNAs and pachytene piRNAs, similar machinery is likely operating in somatic cells for pilRNA production. Thus, it is critical to identify the proteins responsible for somatic pilRNA production. The elucidation of pilRNA biogenesis will further help define the physiological role of pilRNAs in somatic cell lineages.
In summary, our data confirm that somatic cells indeed express abundant pilRNAs, and these pilRNAs are similar to pachytene piRNAs with the exception of ping-pong activity. pilRNAs are synthesized using a yet to be defined pathway, and pilRNAs may function in a manner similar to miRNAs as posttranscriptional regulators. With more understanding of piRNAs, the biogenesis and function of pilRNAs, the somatic counterparts of germ cell piRNAs, will be elucidated in the near future.