Coronary blood vessels from distinct origins converge to equivalent states during mouse and human development

Most cell fate trajectories during development follow a diverging, tree-like branching pattern, but the opposite can occur when distinct progenitors contribute to the same cell type. During this convergent differentiation, it is unknown if cells ‘remember’ their origins transcriptionally or whether this influences cell behavior. Most coronary blood vessels of the heart develop from two different progenitor sources—the endocardium (Endo) and sinus venosus (SV)—but whether transcriptional or functional differences related to origin are retained is unknown. We addressed this by combining lineage tracing with single-cell RNA sequencing (scRNAseq) in embryonic and adult mouse hearts. Shortly after coronary development begins, capillary endothelial cells (ECs) transcriptionally segregated into two states that retained progenitor-specific gene expression. Later in development, when the coronary vasculature is well established but still remodeling, capillary ECs again segregated into two populations, but transcriptional differences were primarily related to tissue localization rather than lineage. Specifically, ECs in the heart septum expressed genes indicative of increased local hypoxia and decreased blood flow. Adult capillary ECs were more homogeneous with respect to both lineage and location. In agreement, SV- and Endo-derived ECs in adult hearts displayed similar responses to injury. Finally, scRNAseq of developing human coronary vessels indicated that the human heart followed similar principles. Thus, over the course of development, transcriptional heterogeneity in coronary ECs is first influenced by lineage, then by location, until heterogeneity declines in the homeostatic adult heart. These results highlight the plasticity of ECs during development, and the validity of the mouse as a model for human coronary development.


Sample-size estimation
• You should state whether an appropriate sample size was computed when the study was being designed • You should state the statistical method of sample size computation and any required assumptions • If no explicit power analysis was used, you should describe how you decided what sample (replicate) size (number) to use Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission:

Replicates
• You should report how often each experiment was performed • You should include a definition of biological versus technical replication • The data obtained should be provided and sufficient information should be provided to indicate the number of independent biological and/or technical replicates • If you encountered any outliers, you should describe how these were handled • Criteria for exclusion/inclusion of data should be clearly stated • High-throughput sequence data should be uploaded before submission, with a private link for reviewers provided (these are available from both GEO and ArrayExpress) Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: Sample size was chosen empirically according to previous experience in the calculation of experimental variability. No statistical method was used to predetermine sample size.
Information about the number of replicates and/or samples used for each experiment is located in the methods section. All replicates are biological replicates. All data collected is included in analysis unless indicated in the methods section (for instance, poor quality cells were excluded from sequencing analysis). Gene count tables for the e12 mouse, e17 mouse, human, and adult mouse datasets are included with the submission.

Statistical reporting
• Statistical analysis methods should be described and justified • Raw data should be presented in figures whenever informative to do so (typically when N per group is less than 10) • For each experiment, you should identify the statistical tests used, exact values of N, definitions of center, methods of multiple test correction, and dispersion and precision measures (e.g., mean, median, SD, SEM, confidence intervals; and, for the major substantive results, a measure of effect size (e.g., Pearson's r, Cohen's d) • Report exact p-values wherever possible alongside the summary statistics and 95% confidence intervals. These should be reported for all key questions and not only when the p-value is less than 0.05.
Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: (For large datasets, or papers with a very large number of statistical tests, you may upload a single table file with tests, Ns, etc., with reference to sections in the manuscript.)

Group allocation
• Indicate how samples were allocated into experimental groups (in the case of clinical studies, please specify allocation to treatment method); if randomization was used, please also state if restricted randomization was applied • Indicate if masking was used during group allocation, data collection and/or data analysis Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: Additional data files ("source data") • We encourage you to upload relevant additional data files, such as numerical data that are represented as a graph in a figure, or as a summary table • Where provided, these should be in the most useful format, and they can be uploaded as "Source data" files linked to a main figure or table • Include model definition files including the full list of parameters used • Include code used for data analysis (e.g., R, MatLab) • Avoid stating that data files are "available upon request" Please indicate the figures or tables for which source data files have been provided: Exact p-values are shown in the figures, the statistical tests used are reported in the methods section, and the precision measures are defined in the figure legends. Ns can be found in the methods section and in the figures in some cases.
For both RNA-seq and injury comparisons of SV-and Endo-lineage cells, the cells being compared (SV-derived vs Endo-derived) were in the same hearts and therefore processed together. Cell lineage was only revealed based on presence of a fluorescent marker during the analysis step.