Gemin4 is an essential gene in mice, and its overexpression in human cells causes relocalization of the SMN complex to the nucleoplasm

ABSTRACT Gemin4 is a member of the Survival Motor Neuron (SMN) protein complex, which is responsible for the assembly and maturation of Sm-class small nuclear ribonucleoproteins (snRNPs). In metazoa, Sm snRNPs are assembled in the cytoplasm and subsequently imported into the nucleus. We previously showed that the SMN complex is required for snRNP import in vitro, although it remains unclear which specific components direct this process. Here, we report that Gemin4 overexpression drives SMN and the other Gemin proteins from the cytoplasm into the nucleus. Moreover, it disrupts the subnuclear localization of the Cajal body marker protein, coilin, in a dose-dependent manner. We identified three putative nuclear localization signal (NLS) motifs within Gemin4, one of which is necessary and sufficient to direct nuclear import. Overexpression of Gemin4 constructs lacking this NLS sequestered Gemin3 and, to a lesser extent Gemin2, in the cytoplasm but had little effect on the nuclear accumulation of SMN. We also investigated the effects of Gemin4 depletion in the laboratory mouse, Mus musculus. Gemin4 null mice die early in embryonic development, demonstrating that Gemin4 is an essential mammalian protein. When crossed onto a severe SMA mutant background, heterozygous loss of Gemin4 failed to modify the early postnatal mortality phenotype of SMA type I (Smn−/−;SMN2+/+) mice. We conclude that Gemin4 plays an essential role in mammalian snRNP biogenesis, and may facilitate import of the SMN complex (or subunits thereof) into the nucleus.


INTRODUCTION
Pre-mRNA splicing is a central feature of the eukaryotic gene expression program. The removal of intronic sequences from pre-mRNAs is catalyzed by a macromolecular machine called the spliceosome. Key components of spliceosomes include the small nuclear ribonucleoproteins (snRNPs). Each of these snRNPs contains a common set of seven RNA binding factors, called Sm proteins, that forms a heptameric ring around the snRNA, known as the Sm core. Biogenesis of the Sm core is carried out by another macromolecular assemblage called the Survival Motor Neuron (SMN) complex, consisting of at least nine proteins (Gemins 2-8, unrip and SMN) (reviewed in Battle et al., 2006a;Matera et al., 2007;Matera and Wang, 2014).
Following RNA polymerase II-mediated transcription in the nucleus, pre-snRNAs are exported to the cytoplasm for assembly into stable RNP particles (Jarmolowski et al., 1994;Ohno et al., 2000). The SMN complex is thought to bind both the Sm proteins (B, D1, D2, D3, E, F, G) and the uridine-rich snRNAs (U1, U2, U11, U12, etc.), bringing them together and forming the Sm core RNP Pellizzoni et al., 2002). Following 3′ processing and hypermethylation of the snRNA 5′ cap, the Sm core RNP is transported from the cytoplasm back into the nucleus. This process requires one of two known nuclear localization signals (NLSs), the 5′ trimethylguanosine (TMG) cap and the Sm core (Fischer et al., 1993;Marshallsay and Luhrmann, 1994). The adaptor that recognizes the cap is a protein called Snurportin (Huber et al., 1998), whereas the Sm core adaptor is recognized by factor(s) within the SMN complex itself (Narayanan et al., 2004). Both adaptors use a common import receptor protein, Importin-β (Huber et al., 2002;Palacios et al., 1997).
The cytoplasmic SMN complex is thought to chaperone RNP biogenesis by conferring stringent specificity toward snRNAs and preventing illicit Sm core assembly on nontarget RNAs Pellizzoni et al., 2002). The function of the nuclear SMN complex is completely unclear, and Gemin4 is the only member of the complex that contains a classical NLS motif (see below). A number of subcomplexes containing SMN and/or various Gemins have also been described (Battle et al., 2007;Hao et al., 2007;Yong et al., 2010), the functions of which are also unknown. Biochemical and cell biological analyses of the SMN complex have begun to elucidate functions of some of its individual components. SMN and Gemin2 form the core of the complex and are thought to be the primordial proteins in the evolution of the complex (Kroiss et al., 2008). Gemin5 is thought to deliver snRNAs to the SMN complex by recognizing specific RNA structural and sequence features, including the Sm protein binding site (Battle et al., 2006b;Lau et al., 2009;Yong et al., 2010) and the 7-methylguanosine cap (Bradrick and Gromeier, 2009). Gemins 6 and 7 are hypothesized to act as scaffolding intermediates during assembly of the Sm core (Battle et al., 2006a;Ma et al., 2005). Gemin8 interacts directly with SMN and forms a bridge for the Gemin6/7 heterodimer complex with unrip, bringing this trimeric module into the rest of the SMN complex (Carissimi et al., 2005(Carissimi et al., , 2006a. Gemin3/dp103/Ddx20 is a DEAD box RNA helicase (Charroux et al., 1999;Yan et al., 2003) that is essential for Sm core formation (Shpargel and Matera, 2005) as well as for metazoan development (Mouillet et al., 2008;Shpargel et al., 2009). Gemin4 does not bind SMN directly, but is brought to the complex by virtue of its interaction with Gemin3 (Charroux et al., 2000;Otter et al., 2007). Gemin4 is necessary for Sm core formation in vitro (Shpargel and Matera, 2005), but little else is known regarding the function of this protein.
Here, we show that Gemin4 is an essential gene in the mouse and that the protein displays a dominant effect on the localization of SMN, Gemin3 and other members of the SMN complex when overexpressed in cultured human cells. This relocalization effect was completely dependent on the presence of an eight amino acid NLS within the N-terminal region of Gemin4 that is necessary and sufficient for targeting heterologous GFP constructs to the nucleus.

Gemin4 contains a functional nuclear localization signal
Like most of the Gemin proteins, Gemin4 has no other known paralogs in the vertebrate proteome, and no known orthologs among non-vertebrate species (Kroiss et al., 2008). In humans and mice, Gemin4 is a 1058 amino acid (a.a.) protein with few known sequence features. Bioinformatic analysis using several web-based prediction tools suggested the existence of three classical ( pat7 subtype) NLS sequences, one located within a slightly larger leucine zipper motif in the carboxy (C) terminal half and two others in the amino (N) terminal half of the protein (Fig. 1A).
The sequences of human and mouse Gemin4 are >84% identical and >90% similar. In order to distinguish endogenous from exogenous Gemin4 we created expression vectors that encode N-or C-terminal GFP-tagged versions of mouse Gemin4 (GFP-Gemin4 and Gemin4-GFP, respectively). As with SMN, endogenous Gemin4 typically localizes diffusely throughout the cytoplasm and in distinct nuclear foci called Cajal bodies (Charroux et al., 2000) (Fig. 1B). Curiously, both myc and GFP N-terminally tagged mouse or human Gemin4 proteins predominantly localized to the nucleoplasm ( Fig. 1 and see below). This accumulation in the nucleus was neither due to the placement of the tag nor due to the tag itself, as C-terminally tagged Gemin4-GFP (or -myc) showed the same pattern and GFP-SMN mirrored SMN's endogenous distribution ( Fig. 1B and data not shown). Thus, overexpression of Gemin4 results in a dominant gain-of-function phenotype, perhaps exposing a cis-acting NLS that is normally masked in the cytoplasm or titrating a trans-acting factor that normally facilitates Gemin4 nuclear export.
We mapped the NLS activity to the N-terminal half of Gemin4 using N-and C-terminal truncations (Fig. 2). Precise deletions of the predicted NLS motifs ( Fig. 2A) identified an eight amino acid sequence that regulates GFP-Gemin4 nuclear import. Deletion of NLS1 (a.a. residues 62-69) or the leucine zipper (a.a. 714-735) had little effect; however, deletion of  relocalized the construct to the cytoplasm (Fig. 2B). Consistent with these findings, Lorson et al. (2008) reported that a ∼50 a.a. sequence (a.a. 194-243) overlapping this region was required for Gemin4 import. The NLS2 motif is therefore necessary for nuclear import of Gemin4. To address whether NLS2 is sufficient for this process, we fused it to the carboxy terminus of GFP. Two constructs were expressed, one consisting of only the NLS2 sequence (NLS2 min), and one with five additional residues flanking the NLS in both directions (NLS2ext). As shown in Fig. 2B, native GFP displayed an overall pan-cellular localization pattern, whereas GFP-NLS2 min was much more concentrated in the nucleus, albeit with a weak signal in the cytoplasm. In contrast, the GFP-NLS2ext construct was almost entirely nuclear (Fig. 2B). These observations demonstrate that the Gemin4 NLS2 motif is both necessary and sufficient for nuclear import.

Gemin4 overexpression redistributes the SMN complex to the nucleoplasm
Although the tudor domain of SMN can interact directly with Importin-β (Narayanan et al., 2004), Gemin4 is the only SMN complex member that contains an NLS. We therefore examined the ability of Gemin4 to influence the subcellular localization of other SMN complex proteins. HeLa cells were transiently transfected with GFP-Gemin4 and immunostained for SMN, Gemin2, Gemin3 and Unrip (Fig. 3). In each case, overexpression of GFP-Gemin4 caused relocalization of the endogenous protein to the nucleus. There was a clear dosage effect to the redistribution. In weakly transfected cells, SMN was qualitatively reduced (but still visible) in the cytoplasm (Fig. 3A,B). In cells strongly expressing GFP-Gemin4, endogenous SMN was no longer detectable in the cytoplasm and the nuclear SMN foci (Cajal bodies) were much less frequent (Fig. 3A,B). Similar effects on the cellular distributions of SMN and Gemin2 were observed upon overexpression of myc-Gemin4 (Fig. 4A,C). Quantitative analysis showed that expression of myc-or GFP-Gemin4 significantly relocalized endogenous SMN and Gemin2 to the nucleoplasm (Fig. 4B,F).

High levels of Gemin4 overexpression disrupts Cajal bodies
Gemin4 overexpression drives cytoplasmic SMN complexes into the nucleoplasm, but these proteins frequently fail to concentrate in Cajal bodies (Fig. 3). We quantified the loss of SMN, Gemin2, and Gemin3 nuclear foci in myc-or GFP-Gemin4 expressing cells and found that the ability of these SMN complex components to accumulate within nuclear foci was significantly  (Park et al. 2001;Di et al. 2003). Three predicted nuclear localization signal (NLS) sequences are shown (in bold type), one of which resides within a larger C-terminal leucine-rich motif (gray type). (B) Endogenous Gemin4 protein (α-Gemin4) localizes throughout the cytoplasm as well as in distinct nuclear foci called Cajal bodies. Full-length GFP-tagged mouse constructs (GFP-Gemin4 and Gemin4-GFP), as well as human Gemin4 (GFP-hGemin4) localize to the nucleoplasm when transiently transfected into HeLa cells. GFP-Smn localizes primarily to the cytoplasm, accumulating in nuclear foci, called Cajal bodies. Insets in upper left corners are the DAPI-stained nuclei. Scale bar: 5 µm. inhibited ( Fig. 4 and data not shown). To determine if the loss of these nuclear foci in myc-or GFP-Gemin4 expressing cells was due to Cajal body disassembly, we examined the distribution of known Cajal body marker proteins: coilin, WDR79 and NPAT ( Fig. 5A-C). Nuclear foci with these Cajal body markers were largely unaffected in cells expressing lower levels of GFP-Gemin4, but we observed an appreciable reduction of nuclear foci in cells with high GFP-Gemin4 expression (Fig. 5C). We then quantified the loss of coilin-positive nuclear foci and found a significant increase in cells that contained no distinguishable nuclear foci (Fig. 5D). Thus, the observed Gemin4-dependent relocalization of SMN and its binding partners to the nucleoplasm perturbs Cajal body integrity in a dose-dependent manner. Because coilin interacts with SMN and mediates recruitment of the SMN complex to Cajal bodies (Hebert et al., , 2001, it is likely that the SMN binding sites on coilin are simply swamped by the preponderance of nuclear SMN.

The Gemin4 NLS is required for SMN and Cajal body disruption
To further examine the critical Gemin4 regions involved in SMN relocalization, we overexpressed Gemin4 lacking the NLS (myc-G4ΔNLS2). We predicted that NLS2 was responsible for the disruption of SMN nuclear foci upon overexpression of Gemin4. Middle row: expression of internal deletion constructs shows that only NLS2 is required for nuclear accumulation of Gemin4. Bottom row: although GFP alone displays pan-cellular localization, the minimal NLS2 motif (GFP-NLS2 min) is primarily nuclear. The extended NLS sequence (GFP-NLS2ext) is exclusively nuclear. Insets in upper left corners of panels show the DAPIstained nuclei. Scale bar: 5 µm.
We found that SMN nuclear foci were relatively unaffected by myc-or GFP-Gemin4ΔNLS2 expression (Fig. 6A,B), demonstrating that the loss of SMN nuclear foci upon Gemin4 overexpression requires the presence of Gemin4 NLS2. In contrast, both Gemin2 and Gemin3 nuclear foci were significantly reduced upon myc-or GFP-Gemin4ΔNLS2 expression (Fig. 6). This observation indicates that Gemin2 and Gemin3 are sequestered by Gemin4ΔNLS2 in the cytoplasm. Notably, the reduction in Gemin3 nuclear foci was much more robust than that of Gemin2 (Fig. 6D,F). This is perhaps due to the fact that Gemin4 is a direct binding partner of Gemin3, whereas Gemin2 is not (Otter et al., 2007). Additionally, we note that Coilin nuclear foci were observed in nearly all myc-or GFP-Gemin4ΔNLS2 transfected and control cells (Fig. 6G,H). Taken together, these data suggest that the disruption of coilin and SMN nuclear foci depends upon a functional Gemin4 NLS.
Nucleoplasmic relocalization of Gemin3 requires a C-terminal motif in Gemin4 As mentioned above, Gemin4 does not interact directly with SMN; it is thought to be tethered to the SMN complex via its strong and direct interaction with Gemin3 (Charroux et al., 2000;Otter et al., 2007). As shown in Fig. 3C and Fig. 7A, overexpression of GFP-Gemin4 mislocalizes endogenous Gemin3 to the nucleoplasm. We used this effect as a way to map the interaction between Gemin4 and Gemin3. Interestingly, the GFP-Gemin4ΔCT construct, which contains the NLS and localizes to the nucleus, did not perturb localization of endogenous Gemin3 (Fig. 7B). This finding suggests that the truncated Gemin4 no longer interacts with Gemin3 and that the domain responsible for this interaction is located in the C-terminal half of Gemin4. Consistent with this interpretation, we found that relocalization of Gemin3 depended on the presence of the putative leucine zipper motif in Gemin4. Cells transfected with GFP-Gemin4Δzip displayed a Gemin3 localization pattern comparable to that of non-transfected cells (Fig. 7C). These data demonstrate that the Gemin4-dependent translocation of the SMN complex to the nucleus depends on a leucine-rich domain in the Gemin4 Cterminus and further suggest that Gemin4 may bind to Gemin3 via this motif.

Characterization of a Gemin4 gene trap allele in the mouse
Experiments in cultured cells can contribute a great deal of information for studying protein function. However, it is difficult to extrapolate the relevance of a given protein from single cell studies to the importance that it may have in the context of organismal development. Therefore, we created and characterized a loss-offunction Gemin4 mouse model. We obtained mouse embryonic stem (ES) cells (Lexicon Genetics) that contain a retroviral insertion (Zambrowicz et al., 1998) in the single Gemin4 intron. This 'gene trap' insertion is derived from an engineered retroviral cassette that preferentially inserts itself into upstream introns within the mouse genome. The genomic organization of Gemin4 is ideal for this kind of gene-trapping scheme because it is a single-intron gene and the upstream exon (exon 1) is particularly short, encoding only the first three a.a. residues (see Fig. 8). Exon 2, thus, contains essentially the entire protein coding sequence. The gene trap contains an upstream element consisting of a 5′ splice acceptor site fused to a β-galactosidase/neomycin-resistance (β-geo) cassette that also contains a transcription termination sequence. The β-geo cassette is transcribed from the endogenous promoter of the Gemin4 gene, located on mouse chromosome 11. The result is a fusion transcript in which the exon upstream of the insertion site is spliced in-frame to the β-geo reporter. The fusion transcript encodes three residues of the Gemin4 polypeptide fused to the β-geo cassette. The downstream element contains a puromycin-resistance cassette along with its own PGK promoter that 'hijacks' expression of exon 2. The gene trap has stop codons introduced in all three downstream reading frames. There is a splice donor site that fuses this transcript with exon 2 (Fig. 8A).
Blastocysts harboring the mutant ES cells were injected into pseudopregnant females, resulting in four chimeric males. Two of these mice displayed germline transmission of the gene trap, which was ascertained by PCR genotyping of the founder progeny (Fig. 8B). Founder mice were then backcrossed for three generations onto the C57BL/6J genetic background.
Gemin4 is an essential gene in the mouse Heterozygous animals were intercrossed and the resulting offspring were PCR genotyped at various developmental stages. Although heterozygous and wild-type progeny were readily detected at postnatal day (P) 1, embryonic day (E) 13.5, E7.5 and E5.5, no Gemin4 homozygotes were detected at any of these time points (Table 1 and data not shown). These findings demonstrate that although Gemin4 is thought to be an evolutionary newcomer to the SMN complex (it has been identified only in vertebrate genomes) (Kroiss et al., 2008), it is an essential mammalian gene. The data are also consistent with gene-targeting experiments showing that mouse Gemin2 and Gemin3 knockouts are also early embryonic lethal mutations (Jablonka et al., 2002;Mouillet et al., 2008).

Transgenic suppression of Smn lethality by SMN2 is background-dependent
The human genome contains two copies of the SMN gene, SMN1 and SMN2. Homozygous loss of SMN2 is asymptomatic, but loss of SMN1 results in a neuromuscular disease, spinal muscular atrophy (SMA) (Lefebvre et al., 1995). SMA is caused by hypomorphic reduction of SMN protein levels, whereas complete loss of gene function is embryonic lethal (reviewed in Burghes and Beattie, 2009;Cauchi, 2010). The mouse genome contains only a single copy of the Smn gene, null mutation of which is early embryonic lethal (Schrank et al., 1997). Transgenic expression of human SMN2 in the Smn knockout background rescues embryonic lethality, and recapitulates the SMA type I phenotype (Hsieh-Li et al., 2000;Monani et al., 2000).
In order to analyze the effect of Gemin4 copy number on the SMA phenotype, we generated a colony of Smn +/− ;SMN2 +/+ mice and backcrossed them onto the C57BL/6J inbred background for more than six generations. We then intercrossed the animals and were surprised to find that the Smn −/− ;SMN2 +/+ genotype was never detected postnatally (Table 2). Similar results were observed on a mixed C57BL/6J;129X1/SvJ background (Table 2). In contrast, and consistent with previous findings (Monani et al., 2000), Smn −/− ; SMN2 +/+ animals were detected in their expected numbers on the FVB/NJ hybrid background. The expected 1:2 ratios of wild-type to heterozygous mice in the non-FVB strains (Table 2) suggest that the Smn −/− ;SMN2 +/+ embryos in those strains died in utero. We conclude that SMN2 function as a genetic suppressor of the Smn embryonic lethal phenotype is background dependent.

Gemin4 does not function as a genetic modifier of mouse Smn
In order to assay the effects of Gemin4 copy number on the SMA phenotype, we crossed the Gemin4 gene trap onto the SMA background to obtain Gemin4 +/− ;Smn +/− ;SMN2 +/+ animals. We intercrossed these mice and analyzed their progeny. As shown in Fig. 6. Immunofluorescence of endogenous SMN complex and coilin proteins in cells transfected with GFP-Gemin4ΔNLS2. HeLa cells were transfected with GFP-Gemin4ΔNLS2 (shown in green) and then co-stained (in red) with antibodies targeting SMN (A), Gemin2 (C), Gemin3 (E) or coilin (G). As quantified above, GFP-Gemin4ΔNLS2 expression disrupts Gemin2 (F) and Gemin3 (D), but not SMN (B) or coilin (H) nuclear foci. Chi squared analysis shows P=1.0 for coilin, P>0.7 for SMN, and P<0.001 for Gemin3 and Gemin2. Insets in upper left corners of GFP images are the DAPI-stained nuclei. Arrowheads indicate transfected cells with reduced cytoplasmic staining of endogenous proteins. Arrows indicate nuclear foci in non-transfected cells. Scale bars: 5 µm. Table 3, SMA mice were present in expected numbers irrespective of the Gemin4 copy number. This failure to exacerbate (or ameliorate) the early lethality phenotype was puzzling, considering the fact that both Gemin2 and Gemin3 heterozygous mice produce roughly half the protein levels compared to wild-type and have associated phenotypes as a result (Jablonka et al., 2002;Mouillet et al., 2008). Notably, Gemin4 heterozygotes are phenotypically indistinguishable from their wild-type littermates.
We therefore examined the expression profile of Gemin4 in various tissues. Consistent with microarray expression profiles (NCBI gene ontology database profile GDS3142), Gemin4 was readily detectable in all the tissues we examined (data not shown). Although wild-type adult and neonatal mice expressed approximately twofold greater amounts of Gemin4 mRNA than their heterozygous littermates, the protein levels from these animals were consistently equivalent (Fig. S1). It is possible that posttranslational mechanisms exist to regulate Gemin4 protein levels in heterozygotes. We screened numerous commercial and noncommercial antibodies against Gemin4 for their ability to recognize the mouse protein (not shown). Although a few worked nominally well for western blot analyses, unfortunately none of them were suitable for immunofluorescence experiments. Thus we were unable to analyze Gemin4 preimplantation embryos to carry out additional phenotypic analyses. We also note that Gemin4 heterozygous mice are phenotypically identical to wild-type mice and, as a result, they are unable to modify the SMA phenotype.

DISCUSSION
Gemin4 contains a functional NLS in its N-terminal domain that is necessary and sufficient for nuclear import of exogenous cargoes. Curiously, the localization pattern of epitope-tagged versions of Gemin4 does not mirror that of the endogenous protein. Ectopically expressed Gemin4 is primarily nucleoplasmic, whereas the endogenous protein is localized throughout the cytoplasm, and in nuclear Cajal bodies. Mislocalization of the tagged constructs was not due to the presence of the tag itself, as a construct containing a deletion of the NLS (GFP-Gemin4ΔNLS2; Fig. 2) was completely excluded from the nucleus, with the exception of a very small number of cells that had distinct myc-Gemin4ΔNLS2 nuclear foci that colocalized with SMN (Fig. 6A). This rare targeting to Cajal bodies suggests that a small fraction of myc-Gemin4ΔNLS2  proteins may gain access to the nucleus by piggybacking onto Gemin3 and the rest of the SMN complex. We also discovered that relocalization of Gemin3 depends on the presence of the leucine zipper motif in Gemin4 (Fig. 7). Because Gemin4ΔNLS2 retains this motif, it is somewhat surprising to find that Gemin4ΔNLS2 is apparently limited in its ability to piggyback onto Gemin3 for import into the nucleus. The fact that SMN, but not other SMN complex components like Gemin2 and Gemin3, retains the ability to localize to Cajal bodies upon overexpression of Gemin4ΔNLS2 suggests that SMN can be imported independently from the other Gemins. Consistent with this observation is the fact that SMN was shown to directly interact with Importin-β (Narayanan et al., 2002(Narayanan et al., , 2004.
Gemin4 is thought to be tethered to the SMN complex via its interaction with Gemin3, and it was shown to form a subcomplex together with Gemin5 to create a Gemin3/4/5 heterotrimer (Battle et al., 2007;Carissimi et al., 2005Carissimi et al., , 2006b. Overexpression of Gemin4 likely alters the stoichiometry of the various members of the SMN complex (or subcomplexes thereof ). Thus it is also a bit surprising to find that Gemin4 overexpression results in the nucleoplasmic accumulation of SMN and all other tested members of the complex. These findings indicate that Gemin4 may play a regulatory role involved in the nuclear import of the SMN complex. SMN enters the nucleus during import of newlyformed snRNPs and is part of a two-component nuclear import signal (Fischer et al., 1993;Narayanan et al., 2002Narayanan et al., , 2004. Depending on the combination of binding partners in a given complex or subcomplex, it is possible that the Gemin4 NLS is masked. Under certain conditions, the Gemin4 NLS might serve to tip the balance of import in one direction or the other.

Characterization of Gemin4 loss-of-function mice
Our data clearly demonstrate that Gemin4 is an essential mammalian gene. Importantly, null mutations in Smn, Gemin2 and Gemin3 are all embryonic lethal, demonstrating that these genes are also essential (Jablonka et al., 2002;Mouillet et al., 2008;Schrank et al., 1997). U snRNP biogenesis is an essential cellular process that ensures the availability of splicing factors required for gene expression. Previous knockdown experiments revealed that depletion of Gemin4 (or Gemin3) causes an intermediate, yet significant, loss of U snRNP assembly activity in a HeLa cells (Shpargel and Matera, 2005). This intermediate effect was not overly detrimental, as cell death was not nearly as pronounced as it is upon SMN knockdown (Lemm et al., 2006;I.D.M. and A.G.M., unpublished observations). These findings suggest that Gemin4 and Gemin3 may be dispensable when it comes to the basal levels of U snRNPs needed to maintain cells in culture. However, during mammalian development U snRNA and snRNP levels increase dramatically from the 2-16 cell stage to the blastocyst stage (Dean et al., 1989;Lobo et al., 1988). The SMN complex would most likely need to operate at peak efficiency to account for the large increase in U snRNP production during these critical stages of development and the full complement of Gemins may be required to stabilize the complex or directly aid in assembly of snRNPs (Strzelecka et al., 2010). This idea is bolstered by the fact that Smn, Gemin2, Gemin3 and Gemin4 null mice all die during early embryogenesis (Jablonka et al., 2002;Mouillet et al., 2008;Schrank et al., 1997).
Gemin4 was also reported to reside in certain mammalian miRNP complexes (Dostie et al., 2003;Hutvagner and Zamore, 2002;Mourelatos et al., 2002). Given their wide range of gene regulatory roles, including post-transcriptional mRNA cleavage or translational repression (He and Hannon, 2004), miRNPs are required for mammalian development (Bernstein et al., 2003;Chen et al., 2004;Houbaviy et al., 2003). Gemins 3 and 4 were reported to bind in a separate complex along with the Argonaute protein, eIF2C2/Ago2 (Mourelatos et al., 2002;Nelson et al., 2004), although the significance of these findings has not been explored. Because relatively few proteins make up this miRNP subclass, it is likely that removal of any of its members would render it nonfunctional. Gemin4 is not known to have a paralog or redundant equivalent; it is reasonable to assume that if Gemin4 is required for miRNP assembly, loss of its function would be detrimental to this pathway as well as for snRNP biogenesis.
In summary, we have shown that Gemin4 has the potential to redirect the localization of other SMN complex members from the cytoplasm to the nucleoplasm in cultured mammalian cells. Gene disruption in mice demonstrated that Gemin4 is required for embryonic viability. We also found that SMN2 failed to rescue the embryonic lethality phenotype of Smn knockouts, when bred on the C57BL/6J inbred background. These data demonstrate the existence of additional genetic modifiers of SMA. We were unable to conclude whether Gemin4 is such a modifier, as we found that heterozygous Gemin4 mice express wild-type levels of protein.

Plasmid construction
Mouse total brain cDNA was used to PCR amplify mouse Gemin4. The amplicon was cloned into the pEGFP (Clontech) or pcDNA-myc (Invitrogen) vectors to express both N-and C-terminally tagged proteins. The QuickChange site-directed mutagenesis kit (Stratagene) was used to create the deletion constructs following the manufacturer's protocol ( primer sequences available upon request). Gemin4 heterozygotes were intercrossed and the genotypes of F1 progeny were analyzed by PCR at P1, E13.5 and E7.5.  Parental mice with the genotype of Gemin4 +/-;Smn +/− ;SMN2 +/+ were intercrossed and the resulting progeny were PCR genotyped to look for SMA type I-like mice that were either wild type or heterozygous for Gemin4.

Mouse lines and crosses
Various inbred mouse strains used in this study (FVB/NJ, C57BL/6J and 129Sv/J) were obtained from the Jackson Laboratory. Wild-type animals thus obtained were used for colony maintenance and to outcross Gemin4 and Smn mutants onto the various genetic backgrounds described in the Results section. Mice carrying a human SMN2 transgene in the background of a null mutation in the endogenous Smn gene [SMA type I mice: FVB.Cg-Tg(SMN2)89 Ahmb ;Smn tm1Msd /J] were obtained from the Jackson Laboratory. Gemin4 mice were created by purchasing ES cells with the Gemin4 gene-trap cassette from Lexicon Genetics and these cells were injected into donor blastocysts and subsequently injected into a pseudopregnant female mouse by the Case Western Reserve University transgenic mouse facility. All strains were maintained on a standard diet of 50/10 food pellets and sterile water. These mice were housed in microisolation chambers. Breeding pairs for SMA type I mice consisted of mice that were homozygous for the transgene and heterozygous for the knockout allele, which resulted in pups that display the SMA phenotype and control littermates. Breeding pairs for Gemin4 mice were heterozygous for the gene trap cassette. All mice were humanely euthanized according to protocols and standards set forth by the appropriate Institutional Animal Care and Use Committees (IACUC): the Case Western Reserve University Animal Resource Center (CWRU ARC) and the University of North Carolina Division of Laboratory Animal Medicine (UNC DLAM).

Reverse transcription and quantitative real-time PCR
RNA was extracted from homogenized liver using an RNeasy kit (Qiagen), according to the manufacturer's protocol, including an RNasefree DNase (Qiagen) on-column digestion step to remove genomic DNA. A SuperScript First Strand Synthesis System for reverse transcription polymerase chain reaction (RT-PCR) (Invitrogen) was used to synthesize cDNA in a 20 µl reaction containing 5 µg RNA, 179 ng random hexamer primer and 40 U RNaseOut RNase inhibitor (Invitrogen). The reverse transcription reaction was performed (Invitrogen SuperScript), according to the manufacturer's protocol. In addition to the PCR mastermix buffer (gotaq, Promega), each PCR reaction mixture (20 µl) contained 1 µl cDNA at 1:10 dilution. Primers used are listed below. PCR conditions consisted of one step at 95°C with 5 min hold and two-segment cycles (95°C with 15 s hold and 60°C with 1 min hold) followed by a terminal step 72°C.

Western blotting
Protein lysates were prepared by flash freezing mouse tissues in liquid nitrogen then crushing the tissue into a fine powder. This powder was then transferred into RIPA-buffer with protease inhibitor (Thermo Fisher Scientific), homogenated with a 27.5G syringe and centrifuged at maximum speed at 4°C for 10 min. Supernatants were quantified using a standard Bradford assay protocol. Equal amounts of samples (60 µg) were subjected to 4-12% MOPs/NuPage gradient gel system (Invitrogen) and transferred onto a nitrocellulose membrane (Schleicher and Schuell) following standard protocols. Lanes were transferred to nitrocellulose membranes and incubated with appropriate antibodies. Gemin4 immunoreactive bands were detected using the polyclonal Gemin4 antibody and horseradish peroxidase-conjugated goat antibodies to rabbit IgG (Gemin4 Ab, 1:200, Santa Cruz Biotechnology), monoclonal anti α-tubulin antibody (1:5000, Sigma-Aldrich) and secondary mouse and rabbit antibodies (1:10,000 and 1:5000, respectively, Thermo Fisher Scientific). Detection of protein bands was achieved using standard chemiluminescence substrates (SuperSignal West Femto, Thermo Fisher Scientific).