Synthetic reconstruction of the hunchback promoter specifies the role of Bicoid, Zelda and Hunchback in the dynamics of its transcription

For over 40 years, the Bicoid-hunchback (Bcd-hb) system in the fruit fly embryo has been used as a model to study how positional information in morphogen concentration gradients is robustly translated into step-like responses. A body of quantitative comparisons between theory and experiment have since questioned the initial paradigm that the sharp hb transcription pattern emerges solely from diffusive biochemical interactions between the Bicoid transcription factor and the gene promoter region. Several alternative mechanisms have been proposed, such as additional sources of positional information, positive feedback from Hb proteins or out-of-equilibrium transcription activation. By using the MS2-MCP RNA-tagging system and analysing in real time, the transcription dynamics of synthetic reporters for Bicoid and/or its two partners Zelda and Hunchback, we show that all the early hb expression pattern features and temporal dynamics are compatible with an equilibrium model with a short decay length Bicoid activity gradient as a sole source of positional information. Meanwhile, Bicoid’s partners speed-up the process by different means: Zelda lowers the Bicoid concentration threshold required for transcriptional activation while Hunchback reduces burstiness and increases the polymerase firing rate.


Sample-size estimation
 You should state whether an appropriate sample size was computed when the study was being designed  You should state the statistical method of sample size computation and any required assumptions  If no explicit power analysis was used, you should describe how you decided what sample (replicate) size (number) to use Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission:

Replicates
 You should report how often each experiment was performed  You should include a definition of biological versus technical replication  The data obtained should be provided and sufficient information should be provided to indicate the number of independent biological and/or technical replicates  If you encountered any outliers, you should describe how these were handled  Criteria for exclusion/inclusion of data should be clearly stated  High-throughput sequence data should be uploaded before submission, with a private link for reviewers provided (these are available from both GEO and ArrayExpress) Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: In this study, we analyzed movies of developing embryos with specific maternal and zygotic genotypes. The sample size is the number of nuclei captured within the imaging region of 300 µm (Antero-Posterior axis) by 100µ (Ventral Dorsal axis). This region in our cycle of interest (nc13) contains ~1000 nuclei. There is no explicit power analysis for this size. We consider this sample size is sufficient given the reproducibility of expression patterns found in this study (Fig. 1H, Fig. 2A 2 Statistical reporting  Statistical analysis methods should be described and justified  Raw data should be presented in figures whenever informative to do so (typically when N per group is less than 10)  For each experiment, you should identify the statistical tests used, exact values of N, definitions of center, methods of multiple test correction, and dispersion and precision measures (e.g., mean, median, SD, SEM, confidence intervals; and, for the major substantive results, a measure of effect size (e.g., Pearson's r, Cohen's d)  Report exact p-values wherever possible alongside the summary statistics and 95% confidence intervals. These should be reported for all key questions and not only when the p-value is less than 0.05.
Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: (For large datasets, or papers with a very large number of statistical tests, you may upload a single table file with tests, Ns, etc., with reference to sections in the manuscript.)

Group allocation
 Indicate how samples were allocated into experimental groups (in the case of clinical studies, please specify allocation to treatment method); if randomization was used, please also state if restricted randomization was applied  Indicate if masking was used during group allocation, data collection and/or data analysis Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: Additional data files ("source data")  We encourage you to upload relevant additional data files, such as numerical data that are represented as a graph in a figure, or as a summary table In our study, the number of biological replicates refers to the number of embryos of a given genotype analyzed. The information is indicated in the figure legends. All embryos were taken except for the very rare ones experiencing aberrant development.
We analyzed 3 to 7 embryos for each genotype. Given the very high reproducibility of the expression patterns between samples of the same genotype (Fig. 1H, Fig. 2A,B and Fig. 3G, I), we consider this size is sufficient to capture the features of expression patterns within the acceptable error of 2 % embryo length (a nucleus length). As our data is obtained from real-time experiments, we don't use technical replicates in our study.
In Fig. 1, 2 and 3, pattern features are shown with 95% confidence intervals (indicated in Figure legends). In Fig. 5, when distinguishing the models' goodness of fit to the data, we use a likelihood ratio test with a threshold for p-value equal 0.05.
This information doesn't apply to our submission because we did not allocate samples in experimental groups.