Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Investigation of the molecular biology underlying the pronounced high gene targeting frequency at the Myh9 gene locus in mouse embryonic stem cells

  • Lei Tan ,

    Contributed equally to this work with: Lei Tan, Yi Hu, Yalan Li

    Roles Data curation, Formal analysis, Writing – original draft

    Affiliation Laboratory of Animal Disease Prevention & Control and Animal Model, The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, Hunan Agricultural University (HUNAU), Changsha, Hunan, China

  • Yi Hu ,

    Contributed equally to this work with: Lei Tan, Yi Hu, Yalan Li

    Roles Formal analysis

    Affiliation Laboratory of Animal Disease Prevention & Control and Animal Model, The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, Hunan Agricultural University (HUNAU), Changsha, Hunan, China

  • Yalan Li ,

    Contributed equally to this work with: Lei Tan, Yi Hu, Yalan Li

    Roles Formal analysis

    Affiliation Laboratory of Animal Disease Prevention & Control and Animal Model, The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, Hunan Agricultural University (HUNAU), Changsha, Hunan, China

  • Lingchen Yang,

    Roles Formal analysis

    Affiliation Laboratory of Animal Disease Prevention & Control and Animal Model, The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, Hunan Agricultural University (HUNAU), Changsha, Hunan, China

  • Xiong Cai,

    Roles Formal analysis

    Affiliation Institute of Innovation and Applied Research in Chinese Medicine, Hunan University of Chinese Medicine, Changsha, Hunan, China

  • Wei Liu,

    Roles Formal analysis

    Affiliation Laboratory of Animal Disease Prevention & Control and Animal Model, The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, Hunan Agricultural University (HUNAU), Changsha, Hunan, China

  • Jiayi He,

    Roles Formal analysis

    Affiliation Laboratory of Animal Disease Prevention & Control and Animal Model, The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, Hunan Agricultural University (HUNAU), Changsha, Hunan, China

  • Yingxin Wu,

    Roles Formal analysis

    Affiliation Laboratory of Animal Disease Prevention & Control and Animal Model, The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, Hunan Agricultural University (HUNAU), Changsha, Hunan, China

  • Tanbin Liu,

    Roles Formal analysis

    Affiliation Laboratory of Animal Disease Prevention & Control and Animal Model, The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, Hunan Agricultural University (HUNAU), Changsha, Hunan, China

  • Naidong Wang,

    Roles Formal analysis

    Affiliation Laboratory of Functional Proteomics (LFP), The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, HUNAU, Changsha, Hunan, China

  • Yi Yang,

    Roles Data curation, Formal analysis

    Affiliation Laboratory of Functional Proteomics (LFP), The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, HUNAU, Changsha, Hunan, China

  • Robert S. Adelstein,

    Roles Validation, Writing – original draft, Writing – review & editing

    Affiliation Laboratory of Molecular Cardiology (LMC), NHLBI/NIH, Bethesda, MD, United States of America

  • Aibing Wang

    Roles Funding acquisition, Visualization, Writing – original draft, Writing – review & editing

    bingaiwang@hunau.edu.cn

    Affiliations Laboratory of Animal Disease Prevention & Control and Animal Model, The Key Laboratory of Animal Vaccine & Protein Engineering, College of Veterinary Medicine, Hunan Agricultural University (HUNAU), Changsha, Hunan, China, Laboratory of Molecular Cardiology (LMC), NHLBI/NIH, Bethesda, MD, United States of America

Abstract

The generation of genetically modified mouse models derived from gene targeting (GT) in mouse embryonic stem (ES) cells (mESCs) has greatly advanced both basic and clinical research. Our previous finding that gene targeting at the Myh9 exon2 site in mESCs has a pronounced high homologous recombination (HR) efficiency (>90%) has facilitated the generation of a series of nonmuscle myosin II (NM II) related mouse models. Furthermore, the Myh9 gene locus has been well demonstrated to be a new safe harbor for site-specific insertion of other exogenous genes. In the current study, we intend to investigate the molecular biology underlying for this high HR efficiency from other aspects. Our results confirmed some previously characterized properties and revealed some unreported observations: 1) The comparison and analysis of the targeting events occurring at the Myh9 and several widely used loci for targeting transgenesis, including ColA1, HPRT, ROSA26, and the sequences utilized for generating these targeting constructs, indicated that a total length about 6 kb with approximate 50% GC-content of the 5’ and 3’ homologous arms, may facilitate a better performance in terms of GT efficiency. 2) Despite increasing the length of the homologous arms, shifting the targeting site from the Myh9 exon2, to intron2, or exon3 led to a gradually reduced GT frequency (91.7, 71.8 and 50.0%, respectively). This finding provides the first evidence that the HR frequency may also be associated with the targeting site even in the same locus. Meanwhile, the decreased trend of the GT efficiency at these targeting sites was consistent with the reduced percentage of simple sequence repeat (SSR) and short interspersed nuclear elements (SINEs) in the sequences for generating the targeting constructs, suggesting the potential effects of these DNA elements on GT efficiency; 3) Our series of targeting experiments and analyses with truncated 5’ and 3’ arms at the Myh9 exon2 site demonstrated that GT efficiency positively correlates with the total length of the homologous arms (R = 0.7256, p<0.01), confirmed that a 2:1 ratio of the length, a 50% GC-content and the higher amount of SINEs for the 5’ and 3’ arms may benefit for appreciable GT frequency. Though more investigations are required, the Myh9 gene locus appears to be an ideal location for identifying HR-related cis and trans factors, which in turn provide mechanistic insights and also facilitate the practical application of gene editing.

Introduction

Transgenic animals that have had their genomes modified by genetic manipulation are extensively used to investigate the in vivo biological functions of genes as well as to mimic human diseases [15]. The mouse is the most widely engineered and commonly used organism for biomedical research. Its embryonic stem (ES) cell-mediated gene targeting has been a critical technology for generating such genetically modified mice [6, 7]. Moreover, the homologous recombination (HR)-based insertion, deletion or point mutation of the genome in ES cells and the consequent generation of targeted gain- and loss-of-function alleles have allowed the creation of thousands of mouse models for different purposes [1, 8, 9]. It is well known that the event of HR-mediated modification is rare. Even in mouse ES cells, obtaining the desired ES clone has therefore been a critical but time-consuming and labor-intensive step in previous studies [3, 10, 11]. Numerous factors including the length of the homologous arms, and the structure of the targeting vector, the targeted loci, the utilization of isogenic DNA and the status of ES cells, can affect the HR efficiency have been investigated and described [1216]. However, the HR efficiency varies greatly even when the general principles are complied with. Furthermore, the conclusions from investigating the above factors may not be always consistent. For instance, the length of targeting arms is thought to be an important determinant of targeting efficiency [17], while increasing the targeting arm length does not always lead to the elevation of HR frequency [14]. These suggest the mechanisms underlying these influencing factors still remain elusive.

With the recent advent of a series of programmable nucleases or genome editing tools including zinc-finger nucleases (ZFN), transcription activator-like effector nucleases (TALEN) and Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated protein9 (Cas9) (CRISPR/Cas9), the GT frequency in mammalian cells including mouse ES cells has been dramatically improved [1823], suggesting the compatibility of artificial nucleases with traditional gene targeting methods. Therefore, these new technologies have stimulated scientific attention, since they have the potential of bypassing the requirement of using mouse ES cells in the generation of genetically engineered mice. However, these technologies are also accompanied by some disadvantages, in particular, the off-target effects and the inability to insert large DNA fragments [24], as further substantiated by a recent study [25]. In contrast, the time-consuming shortcoming of the traditional method has been overcome by a recent report, which demonstrated that the approach of gene targeting in mouse extended pluripotent stem (EPS) cells, coupled with tetraploid complementation technology, can generate mutant mice in approximately 2 months [26]. Additionally, targeted alterations of the genome in ES cells, created with or without the assistance of gene editing tools, are widely used for various purposes, such as the in vitro differentiation of ES cells and the screening of druggable chemicals [23]. These highlight the necessity to investigate the intrinsic properties of HR-based gene targeting in ES cells for further improving the efficiency and for providing valuable clues to apply gene targeting in other mammalian cells.

Our previous finding indicated that gene targeting at the exon2 site of the Myh9 gene which encodes the protein of nonmuscle myosin heavy chain IIA (NMHC IIA) has a pronounced high HR efficiency (>90%) in mouse ES cells. This unique property has been applied to the production of a series of nonmuscle myosin II (NM II) related mutant mouse lines (≥7) [2730]. Moreover, this finding has also been extended to the site-specific insertion of other exogenous genes in mouse ES cells [31]. In the present study, we intended to investigate the molecular biology underlying this distinctive HR efficiency from multiple aspects.

Materials and methods

This work was approved by the Animal Ethics Committee of Hunan Agricultural University, Hunan, China. No animal experiments were conducted, no ethic permits were therefore needed for this report, which complied with all of the relevant regulations.

Bioinformatics analysis

A 3950 bp sequence immediately before and a 1327 bp sequence immediately after the Myh9 gene exon2 across various species (including mouse, human, chimpanzee, pig, rabbit, and rat) were retrieved from useast.ensembl.org and further confirmed with genome.ucsc.edu. The comparison and alignment of the sequence region (<6 kb) around the Myh9 gene exon2 across distinct species was performed by using Clustal W in the DNAStar version 7.10 software (Lasergene) [32]. The DNA features including GC-content, simple sequence repeat (SSR), short interspersed nuclear elements (SINEs), and long interspersed nuclear elements (LINEs), other DNA elements in the sequences used for creating targeting constructs were analyzed via Repeat Masker Web Server (http://www.repeatmasker.org/cgi-bin/WEBRepeatMasker). The CpG islands in these sequences were determined using CpG report software (https://www.ebi.ac.uk/Tools/seqstats/emboss_newcpgreport/).

Generation of targeting constructs

The constructs targeting to the Myh9 gene exon2, intron2 and exon3 sites were separately generated by using the same strategy as described previously [27, 31]. Briefly, DNA fragments for the 5’ and 3’ homologous arms were amplified by PCR using the primer pairs listed in S1 Table and the template of 129/Sv genomic BAC clone DNA, and then cloned into the vector mpNTKV-LoxP described previously [27]. Likewise, the truncated 5’ and 3’ homologous arms were firstly amplified by PCR with the primer pairs listed in S1 Table and the template of 129/Sv genomic BAC clone DNA, and then various combinations of the 5’ and 3’ homologous arms were cloned into the mpNTKV-LoxP vector. Nucleotide sequences of the cloned DNA fragments were verified in all cases by sequencing. All targeting vectors were linearized by restriction enzymes before electroporation.

Culture and electroporation of mouse ES cells

The mouse V6.5 ES cells used in this study were originally derived by Eggan et al [33] from F1 hybrid mice with 50% C57BL/6 and 50% 129/Sv mixed genetic background (Notably, the sequences of the targeting constructs are isogenic to both chromosomes from C57BL/6 and 129/Sv genetic background in terms of the Myh9 gene region involved.), cultured and electroporated with linearized targeting vectors as described before [27, 29], followed by drug selection using 400μg/ml G418 and 200μM ganciclovir. Drug resistant colonies were picked and expanded for the preparation of genomic DNA.

PCR identification of recombination events

HR events occurring at the Myh9 gene exon2 (including those truncated targeting constructs), intron2 and exon3 sites were separately identified by PCR with the primer pair (the forward primer located at the neomycin, while the reverse primer resided immediately outside the 3’ short arm) listed in S1 Table. The preparation of genomic DNA, the components of PCR system and the PCR reaction conditions completely followed the previous description [31]. The PCR products were analyzed by electrophoresis on 1.0% agarose gel, the wild-type allele produced no band while the targeted allele yielded an expected ≥2.1, 2.2, 2.3 kb band, respectively, as indicated in S1 Table. At least five randomly selected PCR-amplified products were excised, mixed, extracted and cloned into the T-easy vector for sequencing (Promega). The sequencing results could further confirm the gene targeting recombinants.

Data analysis

The relationship between the length of homologous arms and the targeting efficiency, as well as the significance, were analyzed with GraphPad.Prism.v5.0 software (GraphPad Software inc, CA, USA).

Results and discussion

Bioinformatic analysis of the Myh9 gene locus and other widely used loci for site-specific integration of exogenous genes

As demonstrated in previous studies, gene targeting at the Myh9 gene locus, in particular the exon2 site, had a pronounced high HR efficiency (>90%) in mESCs [27, 2931]. Although GT efficiency in mESCs is considered 10–100 fold higher than that in mammalian somatic cells [34]. We are not aware of any reports showing such a high GT efficiency in mESCs under similar experimental conditions. Thus, the Myh9 gene locus can be added to the list of a few identified mitotic HR hotspots [3537]. This finding also evoked our further curiosity about the molecular biology underlying it. Furthermore, the mechanisms responsible for the higher relative rate of gene targeting in mESCs than those in somatic cells remain unclear. As an initial effort, we examined whether the Myh9 locus has some common genetic features with other widely used loci for site-specific integration of exogenous genes, including collagen alpha 1 (ColA1), hypoxanthine phosphoribosyltransferase (HPRT) and ROSA26. To this aim, we firstly collected and summarized targeting events occurring at these loci from the references, as indicated in Table 1. Though there were several differences in the length of 5’ and 3’ arms, the ratio of them, ES cell lines used, the selection markers utilized, appreciable targeting efficiencies were obtained in these loci. Without considering other factors, several characteristics could be observed: a) These loci have at least one unique property, such as only one copy of exogenous gene insertion at the HPRT locus, high expression of exogenous gene at the ColA1, Myh9 and ROSA26 loci, thereby being widely used for targeted transgenesis; b) Generally, a 5–6 kb total length of homologous arms was utilized, while longer total length of homologous arms did not mean higher targeting efficiency; c) It seemed that the length ratio of the 5’ and 3’ arms also matters, as further demonstrated in our subsequent experiments. Next, we examined whether there exist some common genetic features among these loci. For this purpose, the sequences used for generating the constructs targeting to these loci were analyzed as described in Materials and Methods, the analytical results based on the 5’ or 3’ arm were summarized in Table 2. As far as these sequences were concerned, several features could be generalized: 1) GC-content in these sequences for the 5’ and 3’ arms displayed a range of 38–68%, a similar and about 50% GC-content seemed better than other cases with regard to GT efficiency; 2) There existed different DNA elements in the 5’ or 3’ arm of these constructs, such as simple sequence repeat (SSR), short interspersed nucleotide elements (SINEs), long interspersed nucleotide elements (LINEs), even CpG island or other elements, the effects of these elements on targeting efficiency have also been suggested [38]; 3) Notably, different DNA elements were distributed in the 5’ and 3’ arms of them, no common genetic properties could therefore be identified from these sequences. Furthermore, the practical influences of these DNA elements on GT frequency remained to be substantiated.

thumbnail
Table 1. Comparison of the HR efficiency and features of frequently targeted loci/sites.

https://doi.org/10.1371/journal.pone.0230126.t001

thumbnail
Table 2. Comparison of the sequence features of several widely used loci for site-specific insertion of exogenous gene.

https://doi.org/10.1371/journal.pone.0230126.t002

GT efficiency at various targeting sites of the Myh9 gene locus

The effects of loci, including chromosome position and sequence context on gene targeting efficiency are well recognized [17, 37], but the effects of shifting the position of homologous arms on the same locus remain to be explored. Undoubtedly, the property of high GT efficiency at the Myh9 locus is useful for investigating this aspect. For this purpose, a series of constructs targeting to various sites of the Myh9 gene locus including the exon2, intron2 and exon3 positions, were generated. Notably, the constructs targeting to the intron2 and exon3 sites even had slightly longer homologous arms as indicated in Fig 1A since the GT efficiency was suggested to be directly proportional to the length of the homologous arms. Gene targeting experiments with these constructs were performed and HR events were identified as described above (Fig 1B and 1C). In contrast to previous studies [16, 47, 48], our results demonstrated that longer arms are not always associated with higher GC efficiency and also revealed that there is obvious difference in the GT efficiency at various sites even in the same locus. Briefly, the highest GT efficiency occurred at the exon2 site (91.7%), followed by that at the intron2 site (71.8%), while the exon3 site had the lowest efficiency (50.0%), displaying a trend of gradual decrease with the shift of the targeting site from the exon2 to intron2 and exon3 (Table 3). Consistent with a previous idea that the nature of the locus is critical [17], our result further highlighted the importance of the suitable targeting site in the same gene locus for desirable GT frequency. Furthermore, this view was also supported by recent studies, which suggested that even the sites for the application of gene editing tools to some specific loci should be considered carefully [49, 50]. Notably, our previous studies indicated that in spite of high GT frequency at the Myh9 gene locus, both PCR and Southern Blot did not identify ES clones with double knockout alleles [27, 2931]. The major reason might be that homozygous knockout of the Myh9 gene can cause abnormal proliferation and ESC morphology [51], which in turn result in bias for colony picking, so does the current study.

thumbnail
Fig 1. Gene targeting at various sites of the mouse Myh9 locus.

(A) Diagram of the constructs targeting to different sites of the Myh9 gene locus. Shown are part of the Myh9 gene locus including exons1-4, the distance between them, the down arrows representing the targeted insertion of Neomycin (upper), the constructs targeting to distinct sites of the Myh9 gene locus and the length of the 5’ and 3’ arms of these constructs (middle), as well as the whole sequence region used for the generation of these constructs (lower). The HR efficiency of these targeting constructs is summarized in Table 3. The average percentage of HR efficiency is derived from two repeats. (B) Representative gel image of PCR identification of the GT recombinants at the Myh9 gene exon2 site. (C) Representative gel image of PCR identification of the GT recombinants at the Myh9 gene exon3 site. For representative gel image of PCR identification of the GT recombinants at the Myh9 intron2 site, please see reference [31]. M: DNA Marker; P: Positive control; N: Negative control; PC: positive clone; NC: negative clone.

https://doi.org/10.1371/journal.pone.0230126.g001

thumbnail
Table 3. Gene targeting frequency at distinct targeting sites of the Myh9 locus in mouse ES cells.

https://doi.org/10.1371/journal.pone.0230126.t003

To explore the underlying reasons responsible for obvious difference in the GT efficiency at various sites of the Myh9 locus, the whole sequences used to create these targeting constructs were analyzed in detail as described in Materials and Methods. As summarized in Table 4, the GC-content in these three sequences for constructs targeting to the Myh9 exon2 (50.5%), intron2 (50.6%) and exon3 (51.1%) was only slightly increased with the extended length of the homologous arms, while the percentage of SSR and SINEs in them was markedly decreased. Briefly, the construct targeting to the Myh9 exon2 possessed the highest percentage of SSR (11.1%) and SINEs (9.2%), followed by those of the construct targeting to the intron2 (10.1%, 8.0%, respectively), while the construct targeting to the Myh9 exon3 had the lowest ones (4.2%, 2.3%, respectively). Interestingly, the trends of decreased percentage of SSR and SINEs in the homologous arms of these constructs were completely consistent with that of the GT efficiency for them. Importantly, the potential effects of these elements on gene targeting have been suggested [5256], though direct evidence remain to be substantiated. Therefore, this finding presented here not only supports the concept, but also provide an excellent locus for in-depth investigating this issue in the future.

thumbnail
Table 4. The bioinformatic analysis of the sequences used to create the constructs targeting to different sites of the Myh9 locus.

https://doi.org/10.1371/journal.pone.0230126.t004

GT efficiency at the Myh9 gene locus with various homologous arm lengths

Earlier investigations suggested that there is a good correlation between GT efficiency and the length of homologous arms [47, 5659], but another report also showed that increasing the length of the targeting sequences does not always enhance the efficiency of HR [14]. This idea was partially supported by our observation that the construct targeting to the exon2 site of the Myh9 gene locus with a total length of approximate 5.7 kb homologous arms obtained an unprecedented HR frequency [27, 29, 30]. Additionally, our effort to use the constructs with increasing length of homologous arms targeting to distinct sites of the Myh9 gene locus also confirmed this concept, as demonstrated above. Next, two questions related to the targeting events at the Myh9 gene locus were raised about whether this phenomenon is associated with specific sequences in the homologous arms, how long the homologous arms and what the ratio for them are required for optimal GT efficiency. In particular, there is no exact answer for the latter one. On the one hand systematic studies have not been conducted, and on the other hand comparing results generated from different studies is problematic because of the presence of multiple variables in addition to the extent of homology [60]. To answer these questions, a series of truncated targeting constructs with various lengths of the 5’ and 3’ homologous arms were generated as described in Materials and Methods and indicated in Fig 2A. Similarly, PCR was utilized to identify targeting recombinants (Fig 2B). The GT frequency for this series of constructs was summarized in Table 5. Additionally, the trend and relationship of the length of homologous arms and the GT efficiency was analyzed as indicated in Fig 2C. The results indicated: a) With the shortening of the length of the homologous arms either at the 5’ or 3’, the GT frequency displayed a generally downward trend, suggesting the effect of the homologous arm length on the GT efficiency, as consistent with previous reports [57, 59, 60]. Furthermore, the total length of the homologous arms had a positive correlation with the GT frequency (R2 = 0.7256, p<0.01); b) The shortening of 5’ and 3’ arms seemed to have different effects on the GT frequency, overall with that of the former more pronounced than that of the latter; c) Only when the truncated length of the 5’ arm reached one fourth of the original length, the decrease of the targeting efficiency approached an order of magnitude. This differed from a previous report [47] and suggested the effect of the length of the homologous arms on the targeting frequency is complicated and may not be pronounced as anticipated [14]; d) The effect of the absence of the fragment containing the exon2 sequence which is more conserved than others in the 3’ arm on the targeting efficiency was not as large as expected; e) A ratio of 2:1 for the length of the 5’ and 3’ arms seemed better than other combinations (Table 5). Collectively, a targeting construct with a 5.7 kb sequence surrounding the Myh9 gene exon2 as the homologous arms produces a pronounced high GT efficiency in mESCs, while this efficiency is correlated with the length of homologous arms but not to a specific sequence within them. Additionally, an optimal ratio of 2:1 for the length of the homologous arms is suggested even in the presence of gene editing tools for desirable targeting efficiency. Lastly, we examined whether the shortening of 5’ or 3’ homologous arm alters the amount of DNA elements in these sequences, which in turn influences GT efficiency. Similarly, the presence of different DNA elements in each truncated 5’ or 3’ arm was analyzed as above and summarized in Table 6. Firstly, the shortening of 5’ or 3’ arm seemed not to obviously change the GC-content. Secondly, only the shortening of 5’ arm from 4 kb to 2 kb led to a markedly increase of the percentage of SINEs, which in turn alleviates the effect of the shortening of the 5’ arm on GT efficiency. This fact was to some extent consistent with the idea from a previous report [38], which suggests the integration frequency of targeting vector correlates with the amount of SINEs present in the arms.

thumbnail
Fig 2. Gene targeting at the Myh9 gene exon2 site with various length of the homologous arms.

(A) Diagram of a series of truncated targeting constructs. The 5’ homologous arm is left-side truncated from 4, 3, 2 to 1 kb, while the 3’ one is right-side truncated from 1.7, 1.1 to 0.6 kb. Additionally, a left-side truncated 3’ arm with the deletion of the exon2 and beyond (about 0.5 kb) is also generated. Each targeting construct is given a number (No) as indicated at the left, the total length, the length of each 5’ or 3’ arm and the ratio between them are also indicated. Detailed information of these constructs and their HR frequency in mouse ES cells are summarized in Table 5. The average percentage of HR efficiency is derived from two repeats. (B) Shown is representative gel image of PCR identification of the GT recombinants for the targeting construct No4. M: DNA marker; PC: positive clone; NC: negative clone. (C) The graph indicates a downward trend of the gene targeting efficiency with the decreased length of the homologous arms and reflects the correlation of the gene targeting frequency with the total length of the homologous arms, the R2 value shown in the graph is calculated from thirteen points (p<0.01).

https://doi.org/10.1371/journal.pone.0230126.g002

thumbnail
Table 5. Summary of the HR frequency for each targeting construct.

https://doi.org/10.1371/journal.pone.0230126.t005

thumbnail
Table 6. Genetic features of the sequences used for creating different 5’ and 3’ truncated homologous arms.

https://doi.org/10.1371/journal.pone.0230126.t006

Conclusion

In the current study we further investigated our previous finding that gene targeting at the Myh9 gene locus has a pronounced high HR frequency. Our study clarified several points and provided some new insight into homologous recombination-based gene targeting. Firstly, through the analysis of the Myh9 and several other widely used loci for targeting transgenesis, the properties of a 6 kb total length, and 50% GC-content present in the homologous arms, may facilitate the achievement of desirable GT efficiency. Secondly, consistent with the central role for the loci, our study also highlights the importance for the selection of an exact targeting site in the practical application of gene targeting, this may be associated with the amount of DNA elements like SINEs in the homologous arms. Thirdly, the targeting efficiency has a positive correlation with the total length of the homologous arms. Moreover, an optimal ratio of 2:1 for the length of the 5’ and 3’ arms with 50% GC-content and the potential effects of DNA elements including SINEs in the arms on GT frequency, are further suggested. Lastly, the in-depth investigations of gene targeting greatly benefit the understanding of the cellular machinery related to HR, the regulation of it, as well as HR-associated factors. The Myh9 gene locus as a potential mitotic recombination hotspot undoubtedly is an ideal molecular and genetical location for further exploration of these aspects.

Supporting information

S1 Table. The PCR primers used in this study.

https://doi.org/10.1371/journal.pone.0230126.s001

(DOCX)

Acknowledgments

The authors acknowledge Dr. Wen Xie for the mES cell culture and electroporation and Dr Chengyu Liu and Mary Ann Conti for reading and revising the manuscript.

References

  1. 1. Mak TW. Gene targeting in embryonic stem cells scores a knockout in Stockholm. Cell. 2007;131(6):1027–31. Epub 2007/12/18. pmid:18083089.
  2. 2. Gama Sosa MA, De Gasperi R, Elder GA. Animal transgenesis: an overview. Brain Struct Funct. 2010;214(2–3):91–109. Epub 2009/11/26. pmid:19937345.
  3. 3. Menke DB. Engineering subtle targeted mutations into the mouse genome. Genesis. 2013;51(9):605–18. Epub 2013/08/06. pmid:23913666.
  4. 4. Du TT, Dewey JB, Wagner EL, Cui R, Heo J, Park JJ, et al. LMO7 deficiency reveals the significance of the cuticular plate for hearing function. Nat Commun. 2019;10(1):1117. Epub 2019/03/10. pmid:30850599.
  5. 5. Clauss S, Bleyer C, Schüttler D, Tomsits P, Renner S, Klymiuk N, et al. Animal models of arrhythmia: classic electrophysiology to genetically modified large animals. Nat Rev Cardiol. 2019. Epub 2019/03/22. pmid:30894679.
  6. 6. Capecchi MR. Gene targeting in mice: functional analysis of the mammalian genome for the twenty-first century. Nat Rev Genet. 2005;6(6):507–12. Epub 2005/06/03. pmid:15931173.
  7. 7. Kumagai K, Takanashi M, Ohno SI, Kuroda M, Sudo K. An improved Red/ET recombineering system and mouse ES cells culture conditions for the generation of targeted mutant mice. Exp Anim. 2016. Epub 2016/11/29. pmid:27890869.
  8. 8. Belizário JE, Akamini P, Wolf P, Strauss B, Xavier-Neto J. New routes for transgenesis of the mouse. J Appl Genet. 2012;53(3):295–315. Epub 2012/05/10. pmid:22569888.
  9. 9. Singh M, Murriel CL, Johnson L. Genetically engineered mouse models: closing the gap between preclinical data and trial outcomes. Cancer Res. 2012;72(11):2695–700. Epub 2012/05/18. pmid:22593194.
  10. 10. Müller U. Ten years of gene targeting: targeted mouse mutants, from vector design to phenotype analysis. Mech Dev. 1999;82(1–2):3–21. Epub 1999/06/04. pmid:10354467.
  11. 11. Aida T, Imahashi R, Tanaka K. Translating human genetics into mouse: the impact of ultra-rapid in vivo genome editing. Dev Growth Differ. 2014;56(1):34–45. Epub 2014/01/22. pmid:24444057.
  12. 12. te Riele H, Maandag ER, Berns A. Highly efficient gene targeting in embryonic stem cells through homologous recombination with isogenic DNA constructs. Proc Natl Acad Sci USA. 1992;89(11):5128–32. Epub 1992/06/01. pmid:1594621; PubMed Central PMCID: PMC49242.
  13. 13. Templeton NS, Roberts DD, Safer B. Efficient gene targeting in mouse embryonic stem cells. Gene Ther. 1997;4(7):700–9. Epub 1997/07/01. pmid:9282171.
  14. 14. Lu ZH, Books JT, Kaufman RM, Ley TJ. Long targeting arms do not increase the efficiency of homologous recombination in the beta-globin locus of murine embryonic stem cells. Blood. 2003;102(4):1531–3. Epub 2003/05/06. pmid:12730107.
  15. 15. Bouabe H, Okkenhaug K. Gene targeting in mice: a review. Methods Mol Biol. 2013;1064:315–36. Epub 2013/09/03. pmid:23996268; PubMed Central PMCID: PMC4524968.
  16. 16. Saito S, Adachi N. Advances in the Development of Gene-Targeting Vectors to Increase the Efficiency of Genetic Modification. Biol Pharm Bull. 2016;39(1):25–32. Epub 2016/01/05. pmid:26725425.
  17. 17. Yáñez RJ, Porter AC. A chromosomal position effect on gene targeting in human cells. Nucleic Acids Res. 2002;30(22):4892–901. Epub 2002/11/16. pmid:12433992; PubMed Central PMCID: PMC137162.
  18. 18. Osiak A, Radecke F, Guhl E, Radecke S, Dannemann N, Lütge F, et al. Selection-independent generation of gene knockout mouse embryonic stem cells using zinc-finger nucleases. PLoS ONE. 2011;6(12):e28911. Epub 2011/12/24. pmid:22194948; PubMed Central PMCID: PMC3237556.
  19. 19. Flemr M, Bühler M. Single-Step Generation of Conditional Knockout Mouse Embryonic Stem Cells. Cell Rep. 2015. Epub 2015/07/21. pmid:26190102.
  20. 20. Oji A, Noda T, Fujihara Y, Miyata H, Kim YJ, Muto M, et al. CRISPR/Cas9 mediated genome editing in ES cells and its application for chimeric analysis in mice. Sci Rep. 2016;6:31666. Epub 2016/08/18. pmid:27530713.
  21. 21. Wang B, Li K, Wang A, Reiser M, Saunders T, Lockey RF, et al. Highly efficient CRISPR/HDR-mediated knock-in for mouse embryonic stem cells and zygotes. BioTechniques. 2015;59(4):201–8. Epub 2015/10/14. pmid:26458548.
  22. 22. Ukai H, Kiyonari H, Ueda HR. Production of knock-in mice in a single generation from embryonic stem cells. Nat Protoc. 2017;12(12):2513–30. Epub 2017/12/01. pmid:29189772.
  23. 23. Liu P, Li Y, Lei J, Dong L. Creating Knockin Alleles in Mouse Embryonic Stem Cells by CRISPR/Cas9-Mediated Homologous Recombination Without Drug Selection. Methods Mol Biol. 2019;1874:115–37. Epub 2018/10/25. pmid:30353511.
  24. 24. Gurumurthy CB, Lloyd KCK. Generating mouse models for biomedical research: technological advances. Dis Model Mech. 2019;12(1). Epub 2019/01/11. pmid:30626588.
  25. 25. Rezza A, Jacquet C, Le Pillouer A, Lafarguette F, Ruptier C, Billandon M, et al. Unexpected genomic rearrangements at targeted loci associated with CRISPR/Cas9-mediated knock-in. Sci Rep. 2019;9(1):3486. Epub 2019/03/07. pmid:30837594.
  26. 26. Li H, Zhao C, Xu J, Xu Y, Cheng C, Liu Y, et al. Rapid generation of gene-targeted EPS-derived mouse models through tetraploid complementation. Protein Cell. 2018. Epub 2018/06/28. pmid:29948855.
  27. 27. Wang A, Ma X, Conti MA, Liu C, Kawamoto S, Adelstein RS. Nonmuscle myosin II isoform and domain specificity during early mouse development. Proc Natl Acad Sci USA. 2010;107(33):14645–50. Epub 2010/08/04. pmid:20679233; PubMed Central PMCID: PMC2930417.
  28. 28. Wang A, Ma X, Conti MA, Adelstein RS. Distinct and redundant roles of the non-muscle myosin II isoforms and functional domains. Biochem Soc Trans. 2011;39(5):1131–5. Epub 2011/09/23. pmid:21936777; PubMed Central PMCID: PMC3512102.
  29. 29. Zhang Y, Conti MA, Malide D, Dong F, Wang A, Shmist YA, et al. Mouse models of MYH9-related disease: mutations in nonmuscle myosin II-A. Blood. 2012;119(1):238–50. Epub 2011/09/13. pmid:21908426; PubMed Central PMCID: PMC3251230.
  30. 30. Zhang Y, Liu C, Adelstein RS, Ma X. Replacing nonmuscle myosin 2A with myosin 2C1 permits gastrulation but not placenta vascular development in mice. Mol Biol Cell. 2018:mbcE17120713. Epub 2018/07/26. pmid:30044719.
  31. 31. Liu T, Hu Y, Guo S, Tan L, Zhan Y, Yang L, et al. Identification and characterization of MYH9 locus for high efficient gene knock-in and stable expression in mouse embryonic stem cells. PLoS ONE. 2018;13(2):e0192641. Epub 2018/02/14. pmid:29438440.
  32. 32. Burland TG. DNASTAR's Lasergene sequence analysis software. Methods Mol Biol. 2000;132:71–91. Epub 1999/11/05. pmid:10547832.
  33. 33. Eggan K, Akutsu H, Loring J, Jackson-Grusby L, Klemm M, Rideout WM, et al. Hybrid vigor, fetal overgrowth, and viability of mice derived by nuclear cloning and tetraploid embryo complementation. Proc Natl Acad Sci USA. 2001;98(11):6209–14. Epub 2001/05/02. pmid:11331774; PubMed Central PMCID: PMC33447.
  34. 34. Porteus M. Using homologous recombination to manipulate the genome of human somatic cells. Biotechnol Genet Eng Rev. 2007;24:195–212. Epub 2007/12/07. pmid:18059634.
  35. 35. Haber JE. Exploring the pathways of homologous recombination. Curr Opin Cell Biol. 1992;4(3):401–12. Epub 1992/06/01. pmid:1323316.
  36. 36. Buzina A, Shulman MJ. An element in the endogenous IgH locus stimulates gene targeting in hybridoma cells. Nucleic Acids Res. 1996;24(8):1525–30. Epub 1996/04/15. pmid:8628687; PubMed Central PMCID: PMC145805.
  37. 37. Raynard SJ, Read LR, Baker MD. Evidence for the murine IgH mu locus acting as a hot spot for intrachromosomal homologous recombination. J Immunol. 2002;168(5):2332–9. Epub 2002/02/23. pmid:11859123.
  38. 38. Ishii A, Kurosawa A, Saito S, Adachi N. Analysis of the role of homology arms in gene-targeting vectors in human cells. PLoS ONE. 2014;9(9):e108236. Epub 2014/09/25. pmid:25250686; PubMed Central PMCID: PMC4176728.
  39. 39. Lee H. Genetically engineered mouse models for drug development and preclinical trials. Biomol Ther (Seoul). 2014;22(4):267–74. Epub 2014/08/22. pmid:25143803; PubMed Central PMCID: PMC4131519.
  40. 40. Beard C, Hochedlinger K, Plath K, Wutz A, Jaenisch R. Efficient method to generate single-copy transgenic mice by site-specific integration in embryonic stem cells. Genesis. 2006;44(1):23–8. Epub 2006/01/10. pmid:16400644.
  41. 41. Bronson SK, Plaehn EG, Kluckman KD, Hagaman JR, Maeda N, Smithies O. Single-copy transgenic mice with chosen-site integration. Proc Natl Acad Sci USA. 1996;93(17):9067–72. Epub 1996/08/20. pmid:8799155; PubMed Central PMCID: PMC38596.
  42. 42. Palais G, Nguyen Dinh Cat A, Friedman H, Panek-Huet N, Millet A, Tronche F, et al. Targeted transgenesis at the HPRT locus: an efficient strategy to achieve tightly controlled in vivo conditional expression with the tet system. Physiol Genomics. 2009;37(2):140–6. Epub 2009/01/15. pmid:19141541.
  43. 43. Samuel MS, Munro J, Bryson S, Forrow S, Stevenson D, Olson MF. Tissue selective expression of conditionally-regulated ROCK by gene targeting to a defined locus. Genesis. 2009;47(7):440–6. Epub 2009/04/25. pmid:19391117.
  44. 44. Dickel DE, Zhu Y, Nord AS, Wylie JN, Akiyama JA, Afzal V, et al. Function-based identification of mammalian enhancers using site-specific integration. Nat Methods. 2014;11(5):566–71. Epub 2014/03/25. pmid:24658141; PubMed Central PMCID: PMC4008384.
  45. 45. Soriano P. Generalized lacZ expression with the ROSA26 Cre reporter strain. Nat Genet. 1999;21(1):70–1. Epub 1999/01/23. pmid:9916792.
  46. 46. Strathdee D, Ibbotson H, Grant SG. Expression of transgenes targeted to the Gt(ROSA)26Sor locus is orientation dependent. PLoS ONE. 2006;1:e4. Epub 2006/12/22. pmid:17183668; PubMed Central PMCID: PMC1762389.
  47. 47. Deng C, Capecchi MR. Reexamination of gene targeting frequency as a function of the extent of homology between the targeting vector and the target locus. Mol Cell Biol. 1992;12(8):3365–71. Epub 1992/08/01. pmid:1321331; PubMed Central PMCID: PMC364584.
  48. 48. Capecchi MR. Altering the genome by homologous recombination. Science. 1989;244(4910):1288–92. Epub 1989/06/16. pmid:2660260.
  49. 49. Ge X, Xi H, Yang F, Zhi X, Fu Y, Chen D, et al. CRISPR/Cas9-AAV Mediated Knock-in at NRL Locus in Human Embryonic Stem Cells. Mol Ther Nucleic Acids. 2016;5(11):e393. Epub 2016/11/30. pmid:27898094.
  50. 50. Goyal A, Myacheva K, Groß M, Klingenberg M, Duran Arqué B, Diederichs S. Challenges of CRISPR/Cas9 applications for long non-coding RNA genes. Nucleic Acids Res. 2017;45(3):e12. Epub 2017/02/10. pmid:28180319.
  51. 51. Conti MA, Even-Ram S, Liu C, Yamada KM, Adelstein RS. Defects in cell adhesion and the visceral endoderm following ablation of nonmuscle myosin heavy chain II-A in mice. J Biol Chem. 2004;279(40):41263–6. Epub 2004/08/05. pmid:15292239.
  52. 52. Cornea AM, Russell DW. Chromosomal position effects on AAV-mediated gene targeting. Nucleic Acids Res. 2010;38(11):3582–94. Epub 2010/02/27. pmid:20185563; PubMed Central PMCID: PMC2887962.
  53. 53. Tikhonov AP, Lavie L, Tatout C, Bennetzen JL, Avramova Z, Deragon JM. Target sites for SINE integration in Brassica genomes display nuclear matrix binding activity. Chromosome Res. 2001;9(4):325–37. Epub 2001/06/23. pmid:11419796.
  54. 54. Cai HW, Inoue M, Yuyama N, Takahashi W, Hirata M, Sasaki T. Isolation, characterization and mapping of simple sequence repeat markers in zoysiagrass (Zoysia spp.). Theor Appl Genet. 2005;112(1):158–66. Epub 2005/10/20. pmid:16235048.
  55. 55. Deyle DR, Hansen RS, Cornea AM, Li LB, Burt AA, Alexander IE, et al. A genome-wide map of adeno-associated virus-mediated human gene targeting. Nat Struct Mol Biol. 2014;21(11):969–75. Epub 2014/10/06. pmid:25282150; PubMed Central PMCID: PMC4405182.
  56. 56. Seibt KM, Wenke T, Muders K, Truberg B, Schmidt T. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization. Plant J. 2016. Epub 2016/03/22. pmid:26996788.
  57. 57. Reid LH, Shesely EG, Kim HS, Smithies O. Cotransformation and gene targeting in mouse embryonic stem cells. Mol Cell Biol. 1991;11(5):2769–77. Epub 1991/05/01. pmid:1850104; PubMed Central PMCID: PMC360052.
  58. 58. Valancius V, Smithies O. Double-strand gap repair in a mammalian gene targeting reaction. Mol Cell Biol. 1991;11(9):4389–97. Epub 1991/09/01. pmid:1875928; PubMed Central PMCID: PMC361301.
  59. 59. Vasquez KM, Marburger K, Intody Z, Wilson JH. Manipulating the mammalian genome by homologous recombination. Proc Natl Acad Sci USA. 2001;98(15):8403–10. Epub 2001/07/19. pmid:11459982; PubMed Central PMCID: PMC37450.
  60. 60. Smith K. Theoretical mechanisms in targeted and random integration of transgene DNA. Reprod Nutr Dev. 2001;41(6):465–85. Epub 2002/07/20. pmid:12126294.