Enforcement of developmental lineage specificity by transcription factor Oct1

Embryonic stem cells co-express Oct4 and Oct1, a related protein with similar DNA-binding specificity. To study the role of Oct1 in ESC pluripotency and transcriptional control, we constructed germline and inducible-conditional Oct1-deficient ESC lines. ESCs lacking Oct1 show normal appearance, self-renewal and growth but manifest defects upon differentiation. They fail to form beating cardiomyocytes, generate neurons poorly, form small, poorly differentiated teratomas, and cannot generate chimeric mice. Upon RA-mediated differentiation, Oct1-deficient cells induce lineage-appropriate developmentally poised genes poorly while lineage-inappropriate genes, including extra-embryonic genes, are aberrantly expressed. In ESCs, Oct1 co-occupies a specific set of targets with Oct4, but does not occupy differentially expressed developmental targets. Instead, Oct1 occupies these targets as cells differentiate and Oct4 declines. These results identify a dynamic interplay between Oct1 and Oct4, in particular during the critical window immediately after loss of pluripotency when cells make the earliest developmental fate decisions. DOI: http://dx.doi.org/10.7554/eLife.20937.001


Introduction
The mammalian blastocyst inner cell mass (ICM) contains undifferentiated, pluripotent cells capable of generating all tissue lineages of the embryo proper. Cultured embryonic stem cells (ESCs) are derived from these cells and have similar capabilities (Abranches et al., 2009). The POU transcription factor Oct4/Pou5f1 is an indispensable component of the regulatory circuitry underlying these properties (Morey et al., 2015). It is expressed in the ICM and in ESCs where its loss accompanies differentiation (Nichols et al., 1998). Oct4 is also widely used to generate induced pluripotent stem cells (iPSCs) from somatic cells (Takahashi and Yamanaka, 2006).
Together with other factors, Oct4 sustains pluripotency by activating 'core' targets such as Pou5f1 (encoding Oct4 itself) and Nanog (Boyer et al., 2005). It also maintains 'poised' targets, including developmentally critical transcription regulators, in a silent but readily inducible state (Bernstein et al., 2006;Meissner et al., 2008). These genes frequently encode developmentally important transcription factors and are marked with a bivalent chromatin signature defined by the simultaneous presence of H3K4me3 and H3K27me3 (Azuara et al., 2006;Bernstein et al., 2006;Ku et al., 2008;Pan et al., 2007).
Here, we show that ESCs lacking Oct1 have no discernable defects when maintained in an undifferentiated state, but that silent, normally poised developmental-specific genes fail to induce properly upon differentiation. Additionally, genes specific for alternative developmental lineages are inappropriately expressed. Most prominently, placenta-specific genes not normally expressed in any ESC-derived lineage are induced, indicating that Oct1 restricts extra-embryonic gene expression in differentiating ESCs. Additionally, these cells show phenotypic defects when differentiated into multiple lineages, form smaller and less differentiated teratomas, and fail to generate chimerism when injected into blastocysts. ChIPseq identifies a group of targets co-bound by Oct1 and Oct4 in ESCs associated with non-classical binding sites termed MOREs (More Palindromic Octamer Related Elements, ATGCATATGCAT). These sites are inducibly bound by Oct1 in somatic cells lacking Oct4. The function of Oct1 at these genes is to insulate their expression against repression by oxidative stress, and consistently Oct1-deficient ESCs are hypersensitive to oxidative stress. Oct1 associates with developmentally poised targets upon differentiation and Oct4 loss, explaining the altered gene expression observed with RNAseq. These results establish Oct1 as a key mediator of both developmental-specific gene induction and repression, and identify a dynamic interplay in which Oct1 replaces Oct4 at target genes as ESCs differentiate and early decisions about induction or repression of lineage-specific genes are made. eLife digest Humans and most other animals are composed of hundreds of different types of cell, including nerve cells, muscle cells and blood cells. Despite performing many different roles, these cells all develop from a single fertilized egg, which divides to make a particular group of cells that when studied in the laboratory are called embryonic stem cells (or ESCs for short).
The ability of a cell to become a different cell type is defined as "potency". ESCs are unique because they can specialize into any type of cell present in the adult organism, and they are therefore called "pluripotent". However, as the embryo develops, its ESCs gradually lose their potency, and become more and more specialized. The activity of a great number of genes must be regulated during the transition from pluripotent to specialized cells, and some of the mechanisms involved in this transition are still unclear.
ESCs are known to need a gene-regulating protein called Oct4 to remain pluripotent and Shen, Kang, Shakya et al. now show that a similar protein named Oct1 is essential for their transition to becoming more specialized. When the gene for Oct1 was deleted from mouse ECSs, they behaved largely like "normal" ESCs, but could not properly mature into certain cell types such as heart and nerve cells. Molecular analyses revealed that Oct4 and Oct1 compete to regulate the activity of many common genes with opposing outcomes: Oct4 keeps ESCs pluripotent while Oct1 leads them to specialize. The Oct4 protein is abundant in ESCs and prevails over Oct1, but as the cells mature, the levels of Oct4 drop, and Oct1 takes over in the regulation of their common target genes.
Going forward, a better understanding of how ESCs become specialized will help basic research in the laboratory and allow scientists to tackle new questions about how the human body develops and how our organs work. In the longer-term, these findings might also have applications in the field of regenerative medicine, which aims to repair or replace a person's cells, tissues or organs to improve their health.

Results
Oct1 germline-deficient ESCs are phenotypically normal but differentiate abnormally We derived Oct1-deficient ESC lines by intercrossing Pou2f1 germline heterozygotes (Wang et al., 2004). Oct1-deficient animals die in utero (Sebastiano et al., 2010;Wang et al., 2004), but survive long enough to derive ESCs. Two Oct1-deficient lines and two littermate WT controls were generated. All had normal karyotypes (not shown). Oct1-deficient ESCs proliferate at normal rates (not shown), are morphologically normal ( Figure 1A) and can be propagated for a month in culture with no loss of ESC morphology (not shown). They express normal levels of Oct4, Sox2, and Nanog protein but no Oct1 ( Figure 1B). In addition, cells express the pluripotency-associated Pou5f1 (Oct4), Sox2, Nanog, and Dppa4 mRNAs at normal levels ( Figure 1C). Ahcy, a stress-inducible Oct1 target in which the function of Oct1 is to prevent stress-associated repression (Kang et al., 2009;Shakya et al., 2011), was also unaltered.
To study differentiation, we used early-passage Oct1-deficient and WT control ESCs to form embryoid bodies (EBs). Oct1-deficient ESCs were able to aggregate into EBs at d four with morphology similar to WT ( Figure 1D). Similar results were obtained at days 1 and 2 (Figure 1-figure supplement 1A). During EB formation, Pou5f1 and Sox2 were down-modulated with similar kinetics in Oct1-deficient and WT cells, while Pou2f1 (Oct1) remained undetectable ( Figure 1E). Sox17 (endoderm), Brachyury (T, definitive mesoderm), and Fgf5 (definitive ectoderm) expression in Oct1-deficient EBs was grossly similar to WT at some (days 4, 9, or 14) timepoints ( Figure 1F), consistent with findings that Oct1 is dispensable for gastrulation (Sebastiano et al., 2010;Wang et al., 2004). However, there were consistent defects in expression in the Oct1-deficient condition at day 14 for Sox17 and day 4 for T and Fgf5. Sox17, T and Fgf5 are known Oct4 targets . By day 5, Oct1-deficient EBs were somewhat smaller in appearance ( Figure 1-figure supplement 1B). We therefore looked for further evidence of defects in induction kinetics in three other known silent but developmentally inducible Oct4 target genes: Hoxa5, Hoxc6, and Gata2 . Each of these genes showed a similar pattern of defective induction in Oct1-deficient EBs relative to WT controls ( Figure 1G).
To study gene induction using a more developmentally restricted system, we analyzed expression of known developmentally inducible Oct4 target genes during RA-mediated differentiation of WT and Oct1-deficient ESCs. RA treatment of ESCs ultimately results in a largely neuronal phenotype, but waves of gene expression, differentiation, proliferation, and cell death take place during the course of RA treatment (Walker et al., 2007). Upon differentiation, ESCs ±Oct1 lose their clustered, spherical, refractile morphology with similar kinetics (not shown). Pou5f1 and Sox2 were also lost with similar kinetics ±Oct1, while Pou2f1 was not detectable in KO cells ( Figure 2A). To study developmental gene expression, we tested Hoxa5, Hoxc6, Cdx2, and Sox17. These genes encode developmentally important transcription factors and are known Oct4 targets , but are silent in ESCs. Upon RA-mediated differentiation, lineage-appropriate (ectoderm) genes such as Hoxa5 and Hoxc6 (Jiang et al., 2011) normally 'resolve' their bivalent state by losing H3K27me3 and becoming induced, while lineage-inappropriate (e.g. endoderm) genes such as Cdx2 and Sox17 normally resolve by losing H3K4me3, gaining DNA methylation, and becoming stably silenced. Induction of Hoxa5 and Hoxc6 was robust following RA-mediated differentiation of WT cells, but defective in the Oct1 KO condition ( Figure 2B). In contrast, Cdx2 was ectopically activated upon RA-mediated differentiation of Oct1-deficient ESCs ( Figure 2B). Similarly, the definitive endodermspecific gene Sox17 is not normally induced upon RA-mediated differentiation, but showed ectopic expression in the absence of Oct1 ( Figure 2B). The ectopic Sox17 expression observed with RA differentiation differed from the expression defects observed in Oct1-deficient EBs, which include endodermal lineages and which showed 100-fold stronger Sox17 expression ( Figure 1F). These results indicate that Oct1-deficient ESCs induce lineage-appropriate developmental genes poorly, while ectopically expressing lineage-inappropriate genes.
In order to determine whether forced Oct1 expression during differentiation was sufficient to correct defects in gene expression, we differentiated Oct1-deficient ESCs using RA and infected the cell during the differentiation timecourse with lentiviral vectors expressing Oct1 and a puromycin reistance cassette, or empty vectors containing the puromycin resistance cassette alone. We confirmed that cells transduced with this vector overexpressed Oct1 by immunoblotting ( Figure 2C). Cells Three representative images of each genotype from wells of a 96-well plate are shown. (E) EBs were collected at 4, 9, and 14 days, and cDNA was prepared and subjected to RT-qPCR. Expression levels were normalized to GAPDH. Pluripotency genes (Pou5f1, Sox2) and Pou2f1 were tested. Three biological replicates were performed. Error bars denote ±standard deviation. (F) Additional genes representative of all three germ layers, Sox17, T ,and Fgf5, were tested as in E. (G) Three known poised Oct4 target genes, Hoxa5, Hoxc6, and Gata2, were tested as in E. DOI: 10.7554/eLife.20937.003 The following figure supplement is available for figure 1:  b-actin is shown as a loading control. (D) Oct1-deficident ESCs were differentiated using RA for 14 days. 4 days into the timecourse, cells were infected with lentiviruses expressing Oct1 and a puromycin resistance cassette, or an empty vector (EV) control. Cells were selected with puromycin for the remainder of the timecourse. cDNAs from the endpoint cultured were used to study expression of Hoxa5 relative to an RPL13 ribosomal protein internal standard. Cells were prepared in triplicate for each condition. Error bars denote ±standard deviation. *denotes p<0.05. (E) Immunofluorescence images of WT and Oct1-deficient ESCs differentiated into neurons. Cells were cultured as EBs for 8 days, followed by culture for a further 8 days in neuralizing media (see Materials and methods). b-tubulin III and DAPI staining are shown.  were infected over 2 consecutive days and selected with puromycin throughout the remained of the 14-day differentiation timecourse. At timepoints after day 6, infection and selection with empty vector skewed the expression of genes such as Hoxa5, suggesting that infection and selection were skewing the populations of cells in the culture. Cells infected at 4 and 5 days, however, did not show major differences (not shown), suggesting that the composition of cells in culture was not being significantly altered. We therefore infected differentiating Oct1-deficient cells consecutively on days 4 and 5, prepared cDNAs at day 14 and examined gene expression. By RT-qPCR, Oct1 was undetectable in cells transduced and selected with empty vector but robustly expressed by cells transduced with Oct1 (not shown). Expression of the developmentally-inducible Hoxa5 gene was significantly augmented (p=0.026) by ectopic Oct1 expression ( Figure 2D). These results indicate that restoration of Oct1 expression at these times and conditions can correct at least some of the gene expression defects associated with Oct1 deficiency.
RA-mediated differentiation yields neuronal precursor cells but not neurons. We used a differentiation system involving EB generation and culture in insulin, transferrin and selenium (see Materials and methods) to generate arborized neurons that express the marker b-tubulin III (Tubb3) and the neuroectoderm genes Nestin (Nes) and Map2. Staining of 2-day-old EBs for b-tubulin III prior to laminin/poly-L-lysine dish attachment -early in the differentiation protocol -revealed fewer b-tubulin IIIpositive cells in the Oct1 deficient condition (Figure 2-figure supplement 1). Upon complete differentiation (8 d EBs, 8 days in neuralizing monolayer culture), WT ESCs formed neurons robustly ( Figure 2E, asterisk) while few b-tubulin III-expressing neurons were formed from Oct1-deficient ESCs. Oct1-deficient cells that did induce b-tubulin III tended to do so at lower levels, and the few cells that did express b-tubulin III robustly were nevertheless abnormal ( Figure 2E-F, arrow). To test if Oct1 loss induced a kinetic delay that could be overcome by longer culture, cells were incubated for 8 or 16 additional days (16 or 24 days in neuralizing medium, 24 or 32 days total differentiation). In neither case were neurons formed ( Figure 2G-H and data not shown). To study gene expression, individual wells of common genotypes differentiated for 16 days were pooled and subjected to RT-qPCR for Tubb3, Nes and Map2. Each of these genes showed defective expression in the absence of Oct1 ( Figure 2I).
To test an unrelated developmental system, we performed cardiomyocyte differentiation by culturing EBs in hanging drops followed by culture with gelatin (see Materials and methods). Oct1-deficient ESCs failed to form beating cardiomyocytes, unlike WT ( Figure 3A and Videos 1-12). RNA was collected from pooled beating and non-beating WT colonies, and Oct1-deficient colonies, and used to analyze Mef2c and Hand1, regulators of cardiomyocyte differentiation, and the terminal differentiation markers Mlc2v and Mlc2a. We observed Mlc2v expression defects in the Oct1-deficient condition equivalent to non-beating cardiomyocyte colonies from WT ESCs. Mlc2a and Mef2c expression was even weaker than non-beating WT cardiomyocyte colonies ( Figure 3B). In contrast, Hand1 showed no expression defects ( Figure 3B), indicating that Oct1 deficiency does not globally down-regulate genes associated with cardiomyocyte differentiation. Cumulatively, the results indicate that although Oct1-deficient ESCs appear normal in the absence of differentiation cues, they do not induce poised developmentally inducible genes and fail to repress lineage-inappropriate genes such as Cdx2 and Sox17, resulting in multiple cellular defects following differentiation.

Oct1 conditional-deficient ESCs display abnormal gene expression upon differentiation
Although the ESC lines described above were derived from littermate animals and had normal karyotypes, it was possible that the developmental phenotypes and altered gene expression patterns resulted from differences unrelated to Oct1 status. Furthermore, the observed gene expression defects could result from compensatory changes due to development in an Oct1-deficient environment. Finally, the allele used to generate these lines is a severe hypomorph rather than a complete null (Wang et al., 2004). To circumvent these issues, we generated tamoxifen-inducible, Oct1 conditional-deficient ESCs.
We previously described Pou2f1 conditional (floxed) mice (Shakya et al., 2015b). We generated inducible-conditional Oct1 ESCs by crossing the floxed allele onto Rosa26-Cre-ERT2 and Rosa26lox-stop-lox-YFP (see Materials and methods). Pregnant animals were used to isolate Pou2f1 f1/fl ; Rosa26-Cre-ERT2;Rosa26-lox-stop-lox-YFP ESC lines in which Oct1 could be acutely deleted and YFP induced by 4-hydroxytamoxifen (4-OHT) administration. Treatment of parent ESCs with 4-OHT resulted in variegated YFP + colonies ( Figure 4A, step one at top). Colonies with good morphology were picked (red arrow), trypsinized and expanded into derived ESC lines ( Figure 4A, step two at  Figure 4B). The designation D will be used to differentiate this allele from the germline deficient allele used in Figures 1-3. As with Oct1 germline-deficient ESCs, derived Pou2f1 D/D ESC lines displayed normal colony morphology ( Figure 4C), proliferated normally (not shown) yet expressed no Oct1 ( Figure 4D). The derived cells showed normal karyotype profiles and could be propagated for >1 month without loss of an undifferentiated phenotype (not shown). Similar to germline Oct1 deficient ESCs, derived Pou2f1 D/D cells also expressed Oct4, Sox2 and Nanog at normal levels ( Figure 4D).
In differentiated cells, Oct1 promotes glycolysis and dampens mitochondrial function. Oct1 deficiency dramatically increases mitochondrial amino acid oxidation and oxygen consumption while decreasing glycolysis and to a lesser extent glucose oxidation (Shakya et al., 2009). These changes contribute to failure of fibroblasts to undergo oncogenic transformation, despite the fact that they grow at normal rates and can be immortalized by serial passage. To test if similar changes occur in ESCs lacking Oct1, we analyzed the metabolic profile of these cells. Few differences were noted, with only phosphoethanolamine (also known as phosphorylethanolamine, p=0.047) and inositol (p=0.037) showing significant changes (Figure 4-figure supplement 1). The lack of difference may be due to redundant functions of co-expressed Oct4 and Oct6, or co-selection for metabolic stability when selecting and propagating ESCs.
Derived Pou2f1 D/D ESC lines, and parent cell line controls, were subjected to RA-mediated differentiation. Similar to results using Oct1 germline-deficient ESCs, the derived Pou2f1 D/D ESCs lost Oct4 and Sox2 expression with kinetics identical to the parent line ( Figure 4E). Microscopic imaging of the differentiating cells revealed that they were morphologically similar until approximately d 12, at which point Pou2f1 D/D cells showed an increase in columnar/epithelial appearance (Figure 4-figure supplement 2). Also as before, the induction of silent, developmentally poised genes was defective: Hoxa5 and Hoxc6 both showed reduced expression in timecourse assays ( Figure 4F). The cells also showed ectopic Cdx2 expression upon RA treatment ( Figure 4F). As with germline-deficient ESCs, Pou2f1 D/D ESCs did not generate true neurons efficiently ( Figure 4G).
To determine the effect of conditional Oct1 loss during differentiation, parent cells were treated with 4-OHT following 8 d EB formation and 4 d in insulin, transferrin and selenium. After an additional 4 days, cells were fixed and stained with antibodies against b-tubulin III to score neurogenesis and YFP to score deletion. 40-50% of the treated cells induced YFP. Nearly all cells that induced btubulin III and/or generated neuron morphology lacked YFP expression ( Figure 4I and Figure 4J). A few cells (2/~700) were both YFP-and b-tubulin III-positive (not shown), though it is possible that these cells are Pou2f1 heterozygous as 4-OHT treatment can result in recombination of only one allele ( Figure 4B).
Oct1-deficient ESCs form smaller, less differentiated teratomas and fail to generate chimeric mice Parent and Pou2f1 D/D ESCs were injected subcutaneously into contralateral flanks of NCr Nude immunocompromised animals to generate teratomas. ESCs lacking Oct1 consistently generated smaller tumors ( Figure 5A-C). Immunoblotting confirmed that recovered tumors maintained their original Oct1 status ( Figure 5D). Histological analysis confirmed that parent cells generated mature teratomas that included, e.g., glial tissue, and glandular epithelial and squamous elements ( Figure 5E). In contrast, Oct1 deficient ESCs generated areas of focally immature cells, consistent with reduced differentiation. Occasionally tumors were comprised virtually entirely of primitive malignant cells resembling a germ cell tumor ( Figure 5E, lower right). A standard measure of pluripotency is the ability to contribute efficiently to adult cells and tissues (De Los Angeles et al., 2015). We injected parent and Pou2f1 D/D ESCs into albino C57BL/6 blastocysts, resulting in high contribution in the case of the parent line ( Figure 5F, left), but no contribution in the case of the derived lines (right). The average percent chimerism from two separate sets of injections (33 animals from parent cell line injections, 36 combined from two different Pou2f1 D/D lines) confirmed the lack of contribution ( Figure 5G). 18/33 animals injected with parent ESCs showed some detectable chimerism (55%), while 1/36 animals injected with conditional knockout ESCs showed transient trace chimerism in the eye (0.03%). The cells were imaged immediately prior to blastocyst injection to confirm an undifferentiated phenotype ( Figure 5-figure supplement 1).

Defects in lineage-specific gene expression in differentiated Oct1deficient ESCs
To identify gene expression changes stemming from loss of Oct1, we performed RNAseq with undifferentiated and 14 d RA-differentiated parent and Pou2f1 D/D ESCs. Three independent replicates were performed for each of the four conditions. Between 18.1 and 24.9 million sequence reads were generated for each sample, 73% to 82% of which aligned uniquely to the mouse Mm10 reference genome. 99.6% of the reads within coding regions aligned to the correct strand. Variance between replicates was similar regardless of genotype or differentiation state (not shown). Unsupervised hierarchical clustering indicated that 0 and 14 days samples separated clearly from each other regardless of genotype, while within each timepoint the KO and parent WT samples clustered together (Figure 6-figure supplement 1A). These results indicated that the effect of RA treatment and differentiation on gene expression was far stronger than the effect of Oct1 deletion. Plotting gene expression levels in the parent vs Pou2f1 D/D cells ( Figure 6A) showed relatively few gene expression changes in the undifferentiated condition (>2.5 fold, p<0.01, 253 total genes). These genes never changed by >7 fold (Supplementary file 1). In contrast, 1123 genes change expression in differentiated Pou2f1 D/D cells, some of which varied by >200 fold. Plotting gene expression fold change vs. pvalue ( Figure 6B) recapitulated these findings. Comparing genes differentially expressed at the two timepoints revealed little overlap (23 genes, Figure 6C). Analysis of the genomic alignments revealed that expression of control genes such as Tbp was unaltered, while pluripotency genes such as Nanog were silenced equivalently ( Figure 6D). Other pluripotency genes such as Pou5f1, Klf4, Dnmt3l, and Dppa4 behaved similarly to Nanog (not shown). One gene showing increased expression in the undifferentiated state was Pou2f3 (Oct11, Figure 6-figure supplement 1B). Pou2f3 shows low but detectable mRNA expression in WT ESCs, and is repressed upon differentiation to nearly undetectable levels. It is slightly elevated in Oct1-deficient ESCs but decreases to an even greater extent upon differentiation. RT-qPCR confirmed these changes in the context of overall low expression (Figure 6-figure supplement 1C). It is therefore unlikely that this protein provides a compensatory function upon differentiation. In addition, Ahcy, a stress-responsive Oct1 target (Kang et al., 2009;Shakya et al., 2011) showed weakened expression in the absence of Oct1 specifically in the differentiated condition ( Figure 6E). An even larger cohort of~800 genes was aberrantly expressed upon differentiation of Pou2f1 D/D cells. These genes are strongly associated with alternative developmental fates ( Figure 6F and Figure 6-figure supplement 2B). Examples include Sox17, Cdx2 and Gata4 (endoderm), Fgb (endoderm/liver), Gata2 (mesoderm/endothelial), Pparg and Irx3 (mesoderm/mesenchymal), Muc13 (epithelial/hematopoietic) and Tnfrsf9 (which encodes CD137/hematopoietic). The difference in Sox17 and Gata2 expression between EBs (defective) and RA-differentiated cells (elevated) likely arises from the additional developmental fates specified in EBs.
Unexpectedly, differentiating Pou2f1 D/D cells also resulted in inappropriate expression of genes associated with trophoblast and placental development, the specification of which is normally restricted to trophectoderm cells rather than the inner cell mass (from which ESCs are derived). Examples include Cdx2 (which is also expressed in endoderm), Prl8a6, Hand1 (which is also expressed in cardiomyocytes), Pappa2, Prl3b1, and Psg27 ( Figure 6F and Figure 6-figure supplement 2B). Some of these genes are also expressed in other lineages while others are highly specific. In aggregate, they indicate improper activation of an extra-embryonic program. Using RT-qPCR we confirmed unperturbed expression of Tbp, defective expression of Ahcy, and elevated expression of Prl8a6 in differentiating Oct1-deficient ESCs ( Figure 6G). These results indicate that Oct1 deficiency results in defective lineage specification upon differentiation.
We used ChIPseq to identify common and unique Oct1 and Oct4 target genes in ESCs. We also performed H3K4me3 ChIPseq as a control. The ChIPseq data were of high quality based on measures of signal/noise ratio (see Materials and methods). After filtering, 27.3 (Oct1), and 23.7 (Oct4) million alignable reads were generated, corresponding to 692 (Oct1), and 8673 (Oct4) peaks. Allocating the peaks to nearest genes revealed 209 unique Oct1 target genes, 356 common targets, and 5563 unique Oct4 targets ( Figure 7A). The smaller size of the Oct1 target pool relative to Oct4 may be attributable to >10 fold lower Oct1 levels in ESCs as observed by RT-qPCR ( Figure 1C, Figure 1E, Figure 2A) and RNAseq (not shown). Oct1 may also require a more open chromatin context, and/or the presence of specific co-bound factors, to access DNA. For example Il2 and Ifng are known Oct1 targets in differentiated T cells (Shakya et al., 2015b(Shakya et al., , 2011 but were not identified as targets in this analysis. Motif analysis of unique and co-bound peaks revealed significant differences in recognized DNA elements. Regions associated exclusively with Oct4 were significantly enriched for Oct-Sox compound elements that likely also associate with Sox2 in ESCs ( Figure 7B). In contrast, target regions preferentially associated with Oct1 were enriched for the simple octamer element ATTTGCAT (shown by the software as an Oct4 motif in Figure 7B). Interestingly, co-occupied peaks strongly associate with a motif termed a MORE that is known to bind two Oct protein molecules (Reményi et al., 2001;Tomilin et al., 2000). In differentiated cells lacking Oct4, oxidative stress induces homodimeric Oct1 binding to MORE-containing genes such as Polr2a, Ahcy, Ell, and Rras2. Oxidative stress-induced binding occurs via phosphorylation of a conserved serine residue in the DNA-binding domain (Kang et al., 2009). These genes were constitutively co-bound by Oct1 and Oct4 in ESCs (Figure 7-figure supplement 1). Additional examples of genes associated with Oct4 alone (Pou5f1), or Oct1 alone (Taf12) are shown in Figure 7C. This panel also shows another example of a MORE containing gene (Polr2a, two tandem MOREs binding four molecules) that also associates with both proteins but shows an Oct1 bias, as well as an example (Pax6) that is bound by both proteins but in two different locations. Using ChIP-qPCR we validated two genes, Polr2a (Oct1enriched) and Pou5f1 (Oct4-enriched, Figure 7D). The complete set of identified targets is shown in Supplementary file 2.
Intersecting the ChIPseq and RNAseq data revealed little overlap. Only 34 Oct1-bound or Oct1/Oct4 co-bound targets showed differential expression following RA-mediated differentiation ( Figure 7A). Examples include Pank4, Cdh5 and Med16. 193 Oct1-bound and 325 Oct1/ Oct4-co-bound genes did not show expression differences at d 14. Examples include Tbx3, Tcf4, and Txb6, which also showed no differences throughout the differentiation timecourse ( Figure 7E). Instead, 1066 genes with altered expression in differentiated Oct1-deficient cells showed Oct4 but not Oct1 enrichment. These findings indicate that (1)  differentiation, and (2) developmental genes shown to be differentially expressed in RA treated Oct1-deficient cells were not Oct1 targets in ESCs.
The above findings could be reconciled by postulating that (1) Oct functions at co-bound targets in ESCs to buffer them against oxidative stress as described previously in fibroblasts (Kang et al., 2009;Shakya et al., 2011), and (2) developmental genes that are differentially expressed in Oct1-deficient cells but exclusively bound by Oct4 in ESCs become Oct1 targets during the differentiation process as Oct4 is lost. To test the first hypothesis, we studied the effect of H 2 O 2 exposure on the expression of two cobound genes, Ahcy and Polr2a, in ESCs ±Oct1. Both genes contain conserved MORE sequences ( Figure 8A). Treatment of cells with 2 mM H 2 O 2 resulted in a rapid loss of Ahcy and Polr2a mRNA specifically in Oct1-deficient ESCs ( Figure 8B), exactly as observed in fibroblasts (Kang et al., 2009;Shakya et al., 2011). As expected, these cells were hypersensitive to H 2 O 2 ( Figure 8C). These results suggest that as in other cell types, Oct1 functions in ESCs to buffer these genes from oxidative stress-associated inhibition.
To test the hypothesis that Oct1 occupies Oct4 targets as cells differentiate and Oct4 is lost, we performed ChIP-qPCR timecourses using differentiating ESCs and antibodies against Oct1 and Oct4. Material was collected from 0, 2, 4, 6, 8, 10, 12 and 14 d of differentiation with RA. We chose a gene, Hoxc5, that contains a conserved perfect octamer sequence ( Figure 8D), but is not an Oct1 target based on ChIPseq ( Figure 8E). Hoxc5 also shows poor induction in upon RA-mediated differentiation of Pou2f1 D/D cells (Figure 6-figure supplement 2A). Oct1 ChIP-qPCR revealed no binding in ESCs, as expected based on the ChIPseq ( Figure 8F), however robust binding was transiently observed at 6 d. By 14 d of differentiation Oct1 binding was again undetectable. We also examined a target region between the linked Myf5 and Myf6 (Mrf4) loci on chromosome 10 that contains several near-perfect octamer sites (not shown), and is strongly bound by Oct4 but not Oct1 ( Figure 8E). Oct1 inducibly occupied this region even more rapidly (2 d) as Oct4 binding was lost, and in this case Oct1 binding was maintained, at varying levels, during ESC differentiation ( Figure 8F). Finally, we studied two additional genes, Pou5f1 and Rest (Nrsf), which also contain conserved perfect octamer sequences in their regulatory regions and also show exclusive Oct4 binding. These genes are both expressed in ESCs and silenced as differentiation proceeds. Here early Oct1 binding was identified, which was maintained at low levels during the differentiation timecourse in the case of Pou5f1 but transient in the case of Rest ( Figure 8F). These results indicate a highly dynamic interplay between Oct1 and Oct4 in differentiating cells.

Discussion
Our results indicate that Oct1-deficient ESCs are unperturbed in terms of morphology, growth, metabolism, and gene expression. EBs formed from these cells are microscopically normal at early timepoints and express genes associated with all three germ layers. However, Oct1-deficient ESCs show phenotypic and molecular defects upon differentiation. These cells fail to form neurons and cardiomyocytes, generate smaller and less differentiated teratomas, and fail to contribute to adult mouse tissues. Prior work has shown that partial knockdown of Oct1 also inhibits neuron formation in the context of knockout of the related protein Oct2 (Theodorou et al., 2009).
Three molecular defects manifest upon differentiation of Oct1-deficient ESCs. First, loss of Oct1 results in a failure to fully induce genes associated with a given developmental lineage. Second, Oct1 is necessary for the repression of alternative embryonic developmental lineages. As a result, upon differentiation gene expression programs are marked not only by poor induction of lineageappropriate gene expression, but also by ectopic expression of genes specific to alternative lineages  For example, the Cdx2 promoter contains a perfect consensus octamer element and is a known Oct1 target in somatic cells (Jin and Li, 2001;Wang et al., 2009). In the early embryo, Cdx2 promotes trophectoderm fate and is under tight repression by Oct4 (Yeap et al., 2009;Yuan et al., 2009). Later in development, Cdx2 is induced in the endoderm-derived developing gastrointestinal tract (Guo et al., 2004;Lu et al., 2008) and during primitive hematopoiesis , but is not widely expressed in ectoderm (Suh and Traber, 1996). In RA-differentiated cells, Cdx2 thus represents both a lineage-inappropriate gene and an extra-embryonic lineage. Cdx2 is misexpressed following RA-mediated differentiation of both germline and inducible-conditional Oct1deficient ESCs. Interestingly Oct1 may execute the opposite function in extra-embryonic tissue, as germline Oct1-deficient mice show defects in extra-embryonic tissues including poor expression of Cdx2 (Sebastiano et al., 2010).
ChIPseq experiments reveal that Oct1 and Oct4 regulate common and distinct targets in ESCs. These differences in bound targets lead to functional consequences, as the two proteins recruit different cofactors such as Jmjd1a in the case of Oct1 and Jmjd1c in the case of Oct4 (Shakya et al., 2015a(Shakya et al., , 2011. Oct4 occupies a large group of >5000 genes, including developmentally poised genes such as Hoxc5 and Myf5, and core pluripotency genes such as Pou5f1 and Nanog. Oct1 does not occupy these genes in ESCs, consistent with the ability of Oct1-deficient ESCs to maintain pluripotency. Instead Oct1 co-occupies a cohort of 325 genes with Oct4 that are highly enriched for a motif known as a MORE (Reményi et al., 2001;Tomilin et al., 2000). Oct proteins are known to homo-and hetero-dimerize (Kang et al., 2009;Tantin et al., 2008;Tomilin et al., 2000;Verrijzer et al., 1992). The configuration of Oct proteins can determine cofactor association and hence regulatory output (Reményi et al., 2001;Tomilin et al., 2000). Many of these constitutively co-bound genes were previously shown to become occupied by Oct1 upon oxidative stress exposure in differentiated cells lacking Oct4 (Kang et al., 2009). The function of Oct1 at these genes is to insulate them against inhibition by oxidative stress. Fibroblasts lacking Oct1 show inappropriate repression of MORE-containing genes following H 2 O 2 exposure (Kang et al., 2009;Shakya et al., 2011). We demonstrate the identical phenotype using Oct1-deficient ESCs. Oct1 also exclusively associates with a small number (~200) of other genes including Taf12, which contains another binding site variant known as a TMFORE (Kang et al., 2009).
Notably, in undifferentiated cells Oct1 does not associate with developmental targets that become deregulated upon differentiation of Oct1-deficient ESCs. Oct4 is present at higher levels in  ESCs compared to Oct1, suggesting that mass action may contribute to the lack of Oct1 binding. This model predicts that Oct1 would occupy these genes as Oct4 is lost during differentiation. We tested four regions bound by Oct4 but not Oct1 in ESCs, Hoxc5, Myf5/Myf6, Rest and Pou5f1, predicting that Oct1 binding will manifest as cells differentiate and Oct4 is lost. In all cases, Oct1 binding was observed at one or more points during the differentiation timecourse. We propose that Oct1 transiently replaces Oct4 at many such Oct4 target genes upon differentiation, where it promotes lineage-appropriate target gene expression, and represses expression of lineage-inappropriate targets. The binding events occur during a brief but important window during which critical decisions about suppression or potentiation of lineage-specific developmental Oct4 target genes are made. Binding also occurs before many of the affected target genes are induced, suggesting that Oct1 is not the principal driver of expression of these genes, but instead establishes a chromatin context in which these genes remain poised for expression, or become permanently repressed. Of the genes tested in ChIP-qPCR RA differentiation timecourses, the lineage-appropriate Hoxc5 gene shows poor induction in upon differentiation of Oct1-deficient cells ( Figure 6-figure supplement 2A), Myf5 and Myf6 are mesoderm-specific and lineage-inappropriate, Rest is both pluripotencyassociated and lineage-inappropriate, and Pou5f1 is more restricted to ESCs. These latter genes showed no evidence of ectopic expression. This observation can be reconciled with our model by positing that redundant mechanisms, perhaps mediated by other Oct proteins such as Oct6/Pou3f1, enforce their repression in differentiating ESCs.
The ability of Oct1 to suppress genes for alternative developmental lineages is reminiscent of findings using T cells in which Oct1 suppresses alternative T cell lineage genes via inter-chromosomal communication between gene loci that execute opposing gene expression programs (Kim et al., 2014). Oct1 interacts with CTCF (Kim et al., 2014), helping it foster exclusive gene expression programs in T cells. More work is required to determine if Oct1 insures mutually exclusive embryonic developmental gene expression programs through similar mechanisms.

Derivation of Oct1 germline and conditional ESCs
All mice were C57BL/6J background. Oct1 germline-deficient ESCs were generated by intercrossing heterozygous Pou2f1 -/+ mice (Wang et al., 2004) to generate a 1:2:1 ratio of Pou2f1 -/-: Pou2f1 -/+ : Pou2f1 +/+ embryonic offspring. ESCs were derived from preimplantation blastocysts and genotyped. Heterozygous ESCs were not studied further. Littermate WT ESCs lines constituted the controls for these experiments. Oct1 inducible-conditional ESCs were generated by first separately crossing mice with the Pou2f1 conditional (floxed) allele (Shakya et al., 2015b) to the YFP reporter B6.129 Â 1-Gt(ROSA)26Sor tm1(EYFP)Cos/J (Jackson labs #006148) and inducible cre transgenic line B6.129-Gt (ROSA)26Sor tm1(cre/ERT2)Tyj/J (Jackson labs #008463). Resulting Pou2f1 fl/fl animals were intercrossed to generate embryonic Pou2f1 fl/fl offspring in which LSL-YFP was expressed from one Rosa26 allele and Cre-ERT2 was expressed from the other. Parent ESCs were derived from these preimplantation blastocysts. The parent lines constituted the controls for derived 4-OHT-treated, Pou2f1 D/D :YFP + lines. Cell lines were routinely authenticated by genotyping. Mycoplasma testing was conducted regularly in-house using a previously published method (Molla Kazemiha et al., 2009). Cells were negative throughout the study. Top shows best consensus sequences associated with binding. Bottom shows best matches to annotated weight matrices. In the case of known motifs, deviation of physiological binding sites from consensus causes recurring sequences meet threshold criteria for the compound 'Oct4-Sox2' site but not for a simple octamer site ('Oct4'). This is why the percentage of target sites computationally associated with 'Oct4-Sox2' is higher than for 'Oct4.   Figure 7B is shown at top. The MORE sequence (Reményi et al., 2001;Tomilin et al., 2000) is shown at bottom. Mammalian (mouse, human, dog) conservation is shown. MORE position relative to TSS is shown in parentheses. Polr2a contains two adjacent MOREs (Kang et al., 2009)

Cell culture
ESCs were cultured as previously (Shakya et al., 2015a) with 2i conditions: the ERK inhibitor PD0325901 (1 mM, LC Laboratories) and the GSK3 inhibitor CHIR99021 (3 mM, LC Laboratories). 4-OHT (Sigma) was dissolved in ethanol and used at 500 nM for 24 hr. Two methods were used to generate EBs. Low-attachment dishes were used to generate WT and Oct1-deficient EBs for microscopic analysis, RT-qPCR and the generation of neurons. Briefly, ESCs were trypsinized and feeders depleted by binding to gelatin-coated dishes for 30-60 min. ESC suspensions were plated on lowattachment dishes for 5-7 days. For cardiomyocyte differentiation, the hanging drop method (Wang and Yang, 2008) was used in order to generate single EBs in 96-well plates. Individual EBs were then used to generate cardiomyocyte colonies in 24-well plates. Generation of neurons was accomplished as in (Bain et al., 1995), with modifications. Briefly, EBs were formed for 4 days using low-attachment dishes, followed by culture for a further 4 days as EBs in 0.1 mM RA/DMEM. After 8 days, EBs were trypsinized and cultured for 8 days in 1:1 F12:DMEM, 10 mg/mL insulin (SAFC Biosciences), 5.5 mg/mL transferrin and 38.7 mM sodium selenite (ThermoFisher) on laminin/poly-Llysine-coated ChamberSlides (Corning). Cells in Figure 2E-F were cultured for eight additional d. For H 2 O 2 treatment, ESCs were seeded 24 hr prior to treatment on 6-well plates with sparse feeders. Cells were treated with 2 mM H 2 O 2 (Sigma) for the indicated times.

RT-qPCR
RNA was isolated using TRIzol (Thermo Fisher, Waltham MA), followed by RNAeasy purification (Qiagen) using the RNA cleanup procedure. cDNA was synthesized using SuperScript III and random hexamers (Thermo Fisher). RT-qPCR oligonucleotide primers are listed in Supplementary file 3.

Lentiviral Oct1 complementation
The Oct1 (Pou2f1) cDNA and IRES (internal ribosomal entry site) elements were amplified and cloned together by overlap PCR. In the first PCR, primers to the 5' end of Oct1 containing a NotI restriction site and to the 3' end of Oct1 that contained a 5' extension of IRES-complementary DNA were used. The sequences were: Oct1-NotI-For: 5'-AATGAAAAAAGCGGCCGCCATGAATAATCCA TCAGAAAC-3'; Oct1-Rev-IRES: 5'-TTAGGGGGGGGGGAGGGATCTTCACTGTGCCTTGGAG-3'. In the second PCR, an IRES sequence was amplified using primers to the 5' end of the IRES containing a 5' extension of DNA complementary of the Oct1 3' end, and primers to the 3' end of the IRES containing an NdeI restriction site. The sequences were: IRES-overlap-FOR: 5' AGATCCC TCCCCCCCCCCTAACGTTACTGGCCGAA-3'; IRES-Rev-NdeI: 5'-GGGAATTCCATATGTGTGGCCA TATTATCATCGTGT-3'. The third PCR used as a template the PCR products from the first two rounds, along with the Oct1-NotI-For and IRES-Rev-NdeI primers. This process generated a DNA fragment containing an Oct1 cDNA fused to an IRES at the 3' end, along with a NotI site at the 5' terminus and an NdeI site at the 3' terminus. The fragment was cloned into the optimized, self-inactivating, nonreplicative pHAGE lentiviral vector using the NotI and NdeI restriction sites. To insert a Puro cassette after the IRES, the cDNA was amplified using primers containing 5' NdeI and 3' ClaI restriction sites. The sequences were Puro-NdeI-For: 5'-GGAATTCCATATGATGACCGAG Figure 8 continued example of neuronal differentiation is shown. In stem cells, Oct1 and Oct4 collaborate at constitutively expressed MORE-containing targets such as Polr2a and Ahcy to insulate them against oxidative stress (red and black short dashed lines). Oct4 poises developmental genes of all embryonic lineages (long dashed black line) and repress trophectoderm-specific genes (solid black block line). Oct4 additionally activates pluripotency genes (solid black arrow). In differentiating cells, Oct1 occupies MORE genes in response to oxidative stress and buffers their expression, as described previously (Kang et al., 2009;Shakya et al., 2011). Oct1 also contributes to eventual lineage-specific developmental gene activation (solid red line), and alternate developmental lineage gene repression (including trophectoderm, solid red block line). DOI: 10.7554/eLife.20937.030 TACAAGCCCACGGT-3'; Puro-ClaI-Rev: 5' GGTTTATCGATTCAGGCACCGGGCTTGC-3'. Because the IRES apparently attenuated expression of the Puro resistance cassette in this vector, puromycin selection was performed at 0.75 mg/mL. To generate an empty vector control, the vector was cut with NdeI and NotI, filled in with Klenow fragment, and re-ligated.

Immunofluorescence
Immunofluorescence was performed as described previously (Kang et al., 2013), using mouse monoclonal antibodies against b-tubulin-III (R and D Systems MAB1195) and rabbit polyclonal antibodies against YFP (Life Technologies A6455). Secondary antibodies used were goat anti-rabbit-Alexa568 (Life Technologies A-11011) and goat anti-mouse-Alexa488 (Life Technologies A-11001).

Teratoma formation
Teratomas were generated as described (Nelakanti et al., 2015) by injecting parent or KO ESCs into contralateral flanks of female NCr Nude mice (NCRNU-F, Taconic). Mice were sacrificed at four wk. Tumors were excised, washed with cold PBS and weighed. 1/3 of the excised tumor was used to make lysates for protein analysis using a Dounce homogenizer with RIPA lysis buffer (50 mM Tris pH 7.4, 150 mM NaCl, 0.1% SDS, 0.1% sodium deoxycholate, 1 mM EDTA and protease inhibitors [Roche]) on ice. Lysates were centrifuged 10,000 Â g for 10 min. Supernatant protein concentrations were normalized using Bradford assays. 6Â Laemmli sample buffer was added. The mixture was boiled for 5 min and resolved using a 10% SDS-PAGE gel. The remainder of the tumor was fixed in formaldehyde, paraffin-embedded, sectioned and H and E stained for histological analysis by a blinded pathologist.

RNAseq
RNA was prepared from three independent cultures of undifferentiated or 14 d RA-differentiated parent Pou2f1 fl/fl or 4-OHT treated Pou2f1 D/D ESCs. Concentration was determined using a Quant-iT RNA assay kit and a Qubit fluorometer (Thermo Fisher). Intact poly(A) RNA was purified from total RNA samples (100-500 ng) with oligo(dT) magnetic beads, and stranded mRNA sequencing libraries were prepared as described using the Illumina TruSeq mRNA library preparation kit. Purified libraries were qualified on an Agilent Technologies 2200 TapeStation using a D1000 ScreenTape assay. Molarity of adapter-modified molecules was defined by qPCR using the Kapa Biosystems Library Quant Kit. Individual libraries were normalized to 10 nM and equal volumes were pooled in preparation for Illumina sequencing. Sequencing libraries (25 pM) were chemically denatured and applied to an Illumina HiSeq v4 paired end flow cell using an Illumina cBot. Hybridized molecules were clonally amplified and annealed to sequencing primers with reagents from an Illumina HiSeq PE Cluster Kit v4-cBot. Following transfer of the flowcell to an Illumina HiSeq 2500 instrument (HCS v2.2.38 and RTA v1.18.61), a 125-cycle paired-end sequence run was performed using HiSeq SBS Kit v4 sequencing reagents. Fastq data quality were checked using Fastqc verision 0.10.1 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Quality scores dipped towards the 3' end of the reads, so reads were trimmed at 50 bases to eliminate poor-quality data. The resulting 50-base reads were aligned to the mouse mm10 genome (GRCm38, December 2011) plus splice junctions using novoalign version 2.08.01 (http://www.novocraft.com). Alignments to splice junctions were translated back to genome coordinates using the SamTranscriptomeParser application in the USeq package (Nix et al., 2010). Aligned reads were quality checked using the Picard tools' CollectRnaSeqMetrics command (https://broadinstitute.github.io/picard/). On average 99.0% of the reads aligned to the mouse genome, with 78% of reads providing unique alignments, and 86% of reads providing alignments to protein coding and UTR regions of the genome. Tests for differential gene expression were performed with DESeq2, version 1.10.0 (Love et al., 2014). Genes with a count of at least 50 in one or more samples were tested. Genes showing at least 2.5-fold change of expression at an adjusted p-value of <0.01 were selected as differentially expressed. Figures were generated in R version 3.2.3 (http://www.r-project.org) using functions from the gdata and gplots libraries.

ChIP/ChIPseq
ChIP was performed as described (Shakya et al., 2015a). ChIP oligonucleotide primers are listed in Supplementary file 3. Antibodies used were the following: Oct1 (Bethyl, a mixture of A301-716A + A301-717A), Oct4 (Santa Cruz, sc-8629) and H3K4me3 (Millipore, 07-473). ChIPseq was performed as described previously (Shakya et al., 2015a(Shakya et al., , 2015b, using a single IP per condition and clones of parent or derived Oct1-deficient ESCs. For ChIPseq, reads were aligned to the mouse reference genome (mm10) with the Burrows-Wheeler Aligner (BWA, version: 0.5.9). Reads were filtered for alignment quality of >Q10 and duplicates were removed using Picard tools (function MarkDuplicates). After filtering there were 21.1 (H3K4me3), 27.3 (Oct1), and 23.7 (Oct4) million reads. MACSv2 peak caller (version: 2.1) was used to call ChIPseq regions of enrichment with the following parameters (-p 1e-5 -nomodel -shiftsize <fragment_length/2> for Oct1, Oct4 and -p 1e-2 -broad for H3K4me3). To estimate the -shiftsize parameter (predominant fragment length divided by 2) we performed strand cross-correlation analysis using SPP R package (version: 1.10.1) with default parameters. Peaks overlapping with ENCODE blacklisted regions were filtered using BEDtools (function itersectBed). We also discarded peaks localized to mitochondria, chromosome Y, and unmapped contigs. After filtering we had 692 (Oct1), and 8673 (Oct4) peaks. Signal to noise ratio was assessed by calculating normalized strand coefficient (NSC) and relative strand correlation (RSC) using the SPP R package with default parameters (version: 1.10.1). The obtained values of NSC and RSC (H3K4me3: 2.28, 1.25; Oct1: 1.02, 1.45; Oct4: 1.05, 2.32) indicate highly enriched datasets with large fragment-length peak as compared to read-length peak. The NSC value for Oct1 transcription factor was somewhat smaller but typical for high quality datasets generated for factors with small numbers of genuine binding sites (692 MACS2-identified peaks for Oct1). We used MACSv2 function bdgdiff to build fold-enrichment signal tracks for all positions in the genome. Signal tracks were converted to TDF files using igvtools (https://www.broadinstitute.org/igv/igvtools). Peaks were allocated to genes using the annotatePeaks.pl program from HOMER suite (Hypergeometric Optimization of Motif Enrichment, version: 4.7, http://homer.salk.edu/homer/) by determining the closest RefSeq transcription start sites of the genes to the peaks. Functional enrichment analysis was performed using the findGO.pl program from HOMER and Bonferroni as well as Benjamini and Hochberg correction for multiple testing corrections. Robustness of the analysis was confirmed using MEME-ChIP (Machanick and Bailey, 2011), which generate highly similar motifs.

Motif analysis
Transcription factor enrichment within ChIPSeq peaks (de novo motif discovery and known motif matching) was determined using findMotifsGenome.pl program from HOMER. Motif analysis was run on overlapped and separately on unique Oct1 and Oct4 ChIPseq peaks. Oct1 and Oct4 ChIPseq peak overlaps were defined by requiring the distance between peak summits to be 100 bp. Motif lengths of 6-24 bp were identified within 200 bp regions centered on peak summits and an option of random background was selected for motif discovery.

Funder Grant reference number Author
National Institute of Allergy and Infectious Diseases

R01AI100873 Dean Tantin
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.