Modification of Paraburkholderia TALE-Like Protein to Activate Transcription of Target Genes in Plant

1Department of Genetics, Faculty of Agriculture, Ain Shams University, Cairo, Egypt. 2Department of Biological Sciences, Faculty of Science, King Abdulaziz University (KAU), P.O. Box 80141, Jeddah 21589, Saudi Arabia. 3Department of Biological Sciences, Science and Arts College, Rabigh Campus, King Abdulaziz University (KAU), Jeddah, Saudi Arabia. 4Department of Chemistry, Science and Arts College, Rabigh Campus, King Abdulaziz University (KAU), Jeddah, Saudi Arabia.

Many approaches to controlling gene expression were undertaken. They include overexpression of natural transcription factors (TFs) in transgenics, tissue-specific expression of transcription factors and modified expression of transcription factors. Some of these methods were successful, but none has proven to be broadly applicable as the case of synthetic transcription factors. 1 Synthetic transcriptional factors are hybrid of (i) designed DNA binding domains that recognize specific sequences in the promoter present upstream the target gene; (ii) effector domains that activate or repress the transcription of target genes; and (iii) nuclear localization signal (NLS) for translocating the synthetic TF to the nucleus where transcription occurs. The protein product of the designed transcription factors is engineered to target specific genomic sequences to enable the regulation of the target genes. 1 Zinc finger proteins (ZF) are first protein domain engineered to in vitro bind to any chosen locus 2 . Engineering of the modular DNA binding domain (DBD) of TALE protein with predetermined sequence specificity is a first step to building synthetic transcription factors. Recently, Transcriptional Activator-Like Effector (TALE) DNA binding domains from plant pathogen Xanthomonas sp have also been used to target pre-determined genomic loci. [3][4][5][6][7] TALEs function as transcription activators during infection as one of a large group of effector proteins secreted in plant cells via type III secretion system. The N-terminal of TALE contains secretion and translocation signal. The central part of this protein contains unique DNA binding domain composed of repeating modular DNA-binding domain, nuclear localization signals (NLS) and a highly conserved acidic activation domain (AD). 8 The central repetitive sequence is nearly identical except the residues 12 and 13, named repeat variable diresidues (RVDs). 9 Different TALEs vary in repeat number and RVD composition. 8 Many TALE homologs were identified in different bacterial species (Ralstonia solanacearum and Paraburkholderia rhizoxinica. [10][11][12] The pathogenesis role in plants was major concern for scientists until the DNA binding code was deciphered. 9,13 The latter groups identified the four major RVDs comprising HD, NI, NG and NN, which bind to cytosine (C), adenine (A), thiamine (T), and guanine (G)/A, respectively.
The major challenge of using of DNA binding domain of TALE in plants is the pathogenic nature of Xanthomonas and Ralstonia TALE, which may lead to adaptive immunity in some plant species. Testing new TALE-like protein in plant system could increase our repertories for plant gene expression custom control. The present work aimed at developing new synthetic transcription factor for controlling gene expression in plants based on nonpathogenic bacteria. I n t h e p r e s e n t s t u d y, w e u s e d Paraburkholderia TALE-like protein to synthesize new TALE from non-pathogenic origin and approach to generate this TALE can be used to monitor expression of any desired gene.

Materials and Methods
O v e r a l l d e s c r i p t i o n m o d i f i e d Paraburkholderia TALE-like protein construction to activate transcription of target gene in plants is shown in Figure 1.

transcription activator designing
Paraburkholderia TALE-like protein sequence (uniprot: E5AV36) was modified by reordering repeat domain region at the DNA level to the new repeat variable di-residues (RVDs) sequence NI-NG-NI-NG-NI-NI-NI-ND-ND-NG-NI-NI that target the effector binding element (EBE) (e.g., ATATAAACCTAA) of the minimal BS3 promoter. Subsequently, DNA sequence of the nuclear localization signal (NLS) (encoding PKKKRK) was fused to N-terminus and DNA sequence of the Herpes simplex virus VP16 (encoding ADFEFEQMFTDALGIDELPQ) was fused to the DNA sequence encoding the C-terminus of the protein. Modified DNA sequence was reverse translated using Vector NTI Software v11.5 (Thermo Fisher, USA). Codon usage table of Nicotiana benthamiana [gbpln]:100 ( Table  1) was used as a reference and codons for each amino acid with the highest frequencies were selected taking into consideration the avoidance of repetitive sequences. NcoI restriction site were added to reverse translated sequence by adding two cytosine bases upstream open reading frame (ORF), this addition was enough to generate NcoI site, XbaI site was added downstream ORF.

dna synthesis and Vectors Construction
The modified reverse translated sequence was synthesized by Integrated DNA Technology Inc. (IDT, USA) and cloned in pIDSMART. Synthesized BAT was subcloned in pRT103 plant expression vector14 at NcoI and XbaI restriction sites. HindIII fragment of pRT103 was subsequently cloned to binary vector pCAMBIA1300, 10 colonies were picked and cultivated overnight in Luria-Bertani (LB) broth media supplemented with 50 µM kanamycin, plasmids were extracted by alkaline lysis15 and digested with HindIII Inc. (IDT, USA), and resulted mBS3 clones were digested with NcoI restriction enzyme followed by agarose gel electrophoresis. Then, the P-mBS3 fragment was purified by QIAquick Gel Extraction kit (Cat. No. 28704, Qiagen INC., USA) following the instruction manual. Plant Gly GGG 11.1

Fig. 2.
Steps for construction the expression vector pCAMBIA1300-dBAT binary vector pCAMBIA2201 was digested with NcoI restriction enzyme to remove tail-to-tail CaMV 35S promoters followed by electrophoresis and a 9914 bp fragment was purified from the gel. P-mBS3 NcoI fragment and the 9914 bp NcoIdigested pCAMBIA2201 fragment was ligated using T4 ligase (New England Biolabs, USA). Resultant of ligation reaction was transformed in E. coli strain MachI. Transformed bacteria were plated on LB medium supplemented with 50 µM kanamycin and 2 µg/ml X-Gal overnight of which 10 white colonies were selected for cultivation and plasmid extraction. Resulted plasmids were checked for the incorporation of P-mBS3 by NcoI digestion followed by gel electrophoresis. Then, 5 ng of each positive clone were used for polymerase chain reaction (PCR) using 10 pmol primers of OriF: CAGCCGATTGTCTGTTGTG (in the NPTII gene upstream the promoter at bases 9823 up to 9843 of pCAMBI2201) and 10 pmol of OriR: CGGGTGACAAGAACAAGAGG (within P-mBS3 at bases 10002 up to 9082 of pCAMBI2201-mBS3::GUS), 5 µL GoTaq® Master Mix, and final reaction volume adjusted to 10 µL. PCR amplification was carried out in a thermocycler programmed as the following: initial start separation cycle at 94°C for 2 min, 32 cycles including a denaturation step at 94°C for 30 sec, an annealing step at 52°C for 30 sec and polymerization step at 72°C for 30 sec, Then a final extension cycle at 72°C for 10 min. The PCR products (180 bp) were screened by agarose gel electrophoresis at 2% and positive clones, which indicate the right promoter orientation were termed pCAMBIA-mBS3 ( Figure 4). Then, 1 ug DNA each from pCAMBIA-mBS3::GUS  18. Infiltrated 1-cm discs were collected 48 h post inoculation. Then, 4 leaf discs were stored at -80°C for RNA extraction. Then, 2 leaf discs were used for qualitative GUS assay. GUS staining was conducted by immersing leaf discs in GUS staining solution (which contains 10 mM sodium phosphate [pH 7], 10 mM EDTA, 0.1% Triton X-100, 0.1%

Validation of GUs assay via sqrt-PCr
Total RNA was isolated from 2 leaf discs of 4 different infiltrated plants using SV Total RNA extraction kit (Promega Inc., USA) according to the manufacturer protocol. First-strand cDNA synthesis was performed by The ImProm-II™ Reverse Transcription System (Promega Inc., USA) as follows: 1.5?µg of total RNA and 50 µmol poly-T primer were incubated at 70°C for 5?min, then reaction quickly chilled on ice before addition of 2?µL of 10?mM dNTPs, 5?µL of 5x buffer, 1?µL of 40 U RNasin® Ribonuclease Inhibitor and 1?µL of ImProm-II™ Reverse Transcriptase in a total of 25?µL mixture. Then, the mixture was incubated at 42°C for 1?h and 5 µL of first strand reaction were used as template for PCR as follows: 12.5 µL GoTaq® Master Mix, 10 pmol of each primer (gusF: GCGTAATGCTCTACACCACG, gusR: AAGGGTAATGCGAGGTACGG). PCR product of bp was expected to generate. Actin gene was used as a house-keeping. For PCR, 10 pmol actnF: AAGATACTCACAGAAAGAGGCTACTC and 10 pmol actnR: GGGAGCTAATGCAGT AATTTCCTT, were used to amplify bp and reaction volume was adjusted to 25 µL by ddH2O. PCR conditions for amplifying actin gene fragment was the same as those used for amplifying GUS gene fragment.

resUlts and disCUssion
Transcription activator like effectors (TALE) from Xanthomonas sp. is used as a modular DNA binding protein. However, Xanthomonas sp. is a plant pathogen, but defense mechanisms are developed by plants to neutralize Xanthomonas TALE. 19 TALE-like proteins identified in several bacterial species, particularly the repeat domain region.20 One of those is Paraburkholderia rhizoxinica, which is an endosymbiotic bacterium of Rhizopus microspores. Paraburkholderia TALElike protein is computationally identified from Paraburkholderia rhizoxinica genome. 12

design of transcription activator
Paraburkholderia TALE-like protein repetitive sequences has similarity with Xanthomonas TALE repeats except for the lack of TALE secreting signal in N-terminus and the activation domain and the nuclear localization signal (NLS) in C-terminus ( Figure 5). These domains are crucial to translocation and function in plant cells as transcriptional activators8. In this study, nuclear localization signal (PKKKRK) was fused to N-terminus and Herpes simplex virus VP16 (ADFEFEQMFTDALGIDELPQ) was fused to C-terminus of the original Paraburkholderia TALE-like protein (BAT) (uniprot: E5AV36) to facilitate transcription activation function in plants ( Figure 5). New modified NLS:BAT2:VP16 protein sequence (Figure 6) was reverse translated following codon usage table of Nicotiana benthamiana (Nb)[gbpln]:100 used as a reference. Nb translation table was used to maximize protein translation rate in the cell and avoid the recovery of premature or troncated proteins. NcoI and XbaI resteriction site were added at the 5´and 3´ ends, Fig. 6. Paraburkholderia TALE-like protein with RVD (in bold yellow letters inside respectively, for subsequent cloning into pRT103 plant expression vector (Figure 7). Synthesized DNA sequence of the new BAT was confirmed by Sanger sequencing in which results indicated 100% match with our designed sequence ( Figure  7). Resterction digestion by HindIII enzyme for pRT103 resulted in the recovery of 2728 bp. Purified HindIII flanking region was used for subcloning into pCAMBIA1300 binary vector used for transient transformation or agro-infiltration technique. Agro-ifiltration offers a roboust method for testing expression of forign DNA in plant leaves at the transient level.

transient Gene expression assay in nicotiana Benthamiana leaves
Since Paraburkholderia TALE-like protein is entirely formed of DNA binding domain (no activation domain, NLS, or secretion signal), we added Herpes simplex virus VP 16 activation domain in order to make designer transcription activator (Figure 7). We hypothesized that modification will not affect the DNA binding capability of BAT protein and the VP16 domain transcriptional activation will remain functioning. To investigate the capability of DNA binding in vivo and activate gene transcription in plants, N. benthamiana leaves were co-infiltrated by pCAMBIA1300-dBAT and pCAMBIA2201-mBS3::GUS21. Two days post inoculation, leaf discs were stained using X-Gluc stain and blue signals were detected in leaf discs co-infiltrated with the two plasmids, while no signal was detected with pCAMBIA2201-mBS3::GUS infiltrated leaves, only (Figure 8). Semi-quantitative real time polymerase chain reaction (sqRT-PCR) for GUS gene indicated that BAT transcription was detectable in N.b. plants (Figure 9).
Our results demonstrate that dBAT cassette is active in tobacco as DNA binding protein. Despite the previously identified TALE effector, Paraburkholderia TALE-like protein lacks NLS or transcription activation domain. Original BAT sequence failed to transport to the nucleus in plant cells22. In this study, we demonstrated adding nuclear localization signal to facilitate nuclear localization and gene activation in plant leaves in vivo via agro-infiltration technique.
Paraburkholderia TALE-like (BAT) protein is functionally different when compared to xanthomonas TALE proteins, where gene activation by dBAT results in lower signal than previously reported23. This result could be due to the lack of N-terminus and C-terminus domains, which may play important roles in protein/DNA binding stabilization. However, we suggest another strategy for enhanced synthetic transcription activator by using Paraburkholderia TALE-like repeat array with Xanthomonas TALE scaffold. This strategy may overcome the low binding capability of BAT, and also helps to escape from Further studies are needed to test the toxicity of Paraburkholderia TALE-like protein in different plant species and to develop programmable transcription activators of controlling any desired gene with higher DNA affinity. Finally, we claim that the approach for synthesizing a new TALE of non-pathogenic origin opens an avenue towards the possibility to manipulate and monitor expression of any desired gene via transgenesis.
As compared to CRISPR/Cas9 technology that is considered an easier approach in gene edition, while TALE was abandoned as the latter is considered a time-consuming and low-efficiency process in construction controlling elements. However, CRISPR/Cas9 approach has an offtarget activity that declines its application in vivo, while TALE still can be applied in vivo for gene editing and therapy because of its high targeting capability. We introduce a high-efficient method for constructing custom TALEs to overcome its key limitation. We created a new DNA binding protein as a molecular tool for specific targeting of any desired. The new method was verified by the use of GUS as a reporter gene. Our new approach is fast and less demanding. Recently, Qasim et al. 24 developed a TALEN-mediated gene editing system to promote the T cell receptor a chain and CD52 gene loci. Such a bridge-to-transplantation strategy promoted the therapeutic potential of TALEN geneediting technology.
More recently, Zhang et al.3 also used a new method to customize TALEs to be as easy and rapid as CRISPR. They developed a whole pipeline to assemble custom TALEs in as little as few hours. They verified their new method by using customized TALEs to target promoters of two transcription factor genes namely HNF4a and E47. The new constructed TALEs induced expression of two endogenous genes in two cancer cells, namely HepG2 and PANC1, which resulted in promoting successful differentiation of these cancer cells.
The results of the present study as well as those of recent studies 3,24 might act in re-promoting the application of TALE-based technology in more therapeutic applications in the future.
In conclusion, use of Paraburkholderia TALE-like protein in synthesizing new TALE from non-pathogenic origin successfully monitored expression of desired genes.