Macrophages directly contribute collagen to scar formation during zebrafish heart regeneration and mouse heart repair.

Canonical roles for macrophages in mediating the fibrotic response after a heart attack include extracellular matrix turnover and activation of cardiac fibroblasts to initiate collagen deposition. Here we reveal that macrophages directly contribute collagen to the forming post-injury scar. Unbiased transcriptomics shows an upregulation of collagens in both zebrafish and mouse macrophages following heart injury. Adoptive transfer of macrophages, from either collagen-tagged zebrafish or adult mouse GFPtpz-collagen donors, enhances scar formation via cell autonomous production of collagen. In zebrafish, the majority of tagged collagen localises proximal to the injury, within the overlying epicardial region, suggesting a possible distinction between macrophage-deposited collagen and that predominantly laid-down by myofibroblasts. Macrophage-specific targeting of col4a3bpa and cognate col4a1 in zebrafish significantly reduces scarring in cryoinjured hosts. Our findings contrast with the current model of scarring, whereby collagen deposition is exclusively attributed to myofibroblasts, and implicate macrophages as direct contributors to fibrosis during heart repair.

Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency in reporting. For further information on Nature Research policies, seeAuthors & Referees and theEditorial Policy Checklist .

Statistics
For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section.
n/a Confirmed The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly The statistical test(s) used AND whether they are one-or two-sided Only common tests should be described solely by name; describe more complex techniques in the Methods section.
A description of all covariates tested A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted Give P values as exact values whenever suitable.

For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings
For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated Our web collection on statistics for biologists contains articles on many of the points above.

Software and code
Policy information about availability of computer code Imaging was performed using either an Olympus FV1000 confocal microscope, a Leica DM600 CFS confocal microscope or a Zeiss 780 Upright MP confocal microscope (as described in our methods section). RNA-seq raw sequencing data files (fastq) were downloaded from Illumina Basespace using Basespace module v0.2 to our Computational Bioinformatics Resource Group (CBRG) core server.
Next Generation Sequencing was performed on a NextSeq500 platform using a NextSeqTM500 150 cycle High Output Kit) (Illumina) to generate 80-basepair paired end reads. Read quality was evaluated using FastQC. Zebrafish reads were mapped to the Jul. 2014 Zv10/ danRer10 assembly version of the zebrafish genome using STAR (v.2.4.2a) splice-aware aligner. Count tables were generated using subread FeatureCounts (v1.4.5-p1q), with standard parameters. Differential expression was carried out using DESeq2 R package. Hierarchical clustering was performed on genes significantly differentially expressed in at least one comparison (p-value<0.05). Comparisons where p-values were not available were excluded. log2 Fold Change values from DESeq2 analysis were used to calculate Euclidian distances, and Ward's minimum variance method (Ward D) was used to perform hierarchical clustering (k=9). For mouse, trimmed reads (Trim Galore v.0.3.7, http://www.bioinformatics.babraham.ac.uk/projects/trim_galore, with default settings) were screened for quality (FastQC v.011.3, http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc) and then mapped to the mouse genome (Mm10) using TopHat63 v2.1.10, with Bowtie64 v.2.2.6, using the "b2-sensitive" setting with mate inner distance = 208 and mate standard deviation = 213 determined from test alignments. Transcriptome annotations downloaded from the Illumina iGenomes website (UCSC Mm10, https://support.illumina.com/sequencing/sequencing_software/igenome.html) were used with Cufflinks for transcript mapping. Cuffdiff was used for pair-wise comparisons. Genes with log2 fold change > 2 and FDR < 0.05 were deemed significantly differentially expressed. Cuffdiff results were loaded into R (http://www.R-project.org) for further analysis and plotting, including Principal components analysis (PCA) using the pcaMethods package and heatmap generation with pheatmap (http://CRAN.Rproject.org/package=pheatmap). For temporal analyses, raw counts from Cuffdiff were retrieved and genes with < 5 counts in all samples were removed before carrying out tests. DESeq2 was used to identify genes showing differential temporal regulation and STEM v.1.3.8 was used for temporal clustering and gene ontology enrichment analysis of temporal profiles. HMRA Analysis was done on the BioRad Precision Melt Analysis software (v1.3). FACS plots were generated with the FACSDIva 8.0.1 software.

October 2018
For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information.

Data
Policy information about availability of data All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: -Accession codes, unique identifiers, or web links for publicly available datasets -A list of figures that have associated raw data -A description of any restrictions on data availability Field-specific reporting Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection.

Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences
For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf

Life sciences study design
All studies must disclose on these points even when the disclosure is negative.  Supplementary Figures 1, 2, 6, 7 and 9 are provided with the paper.
To ensure the reproducibility and statistical power all experiments were performed in at least triplicates, with a sample size ranging from n=3 to n=9 per replicate.
No data were excluded from the analyses.
All experiments were performed at least in 3 independent replicates and the findings were replicated.
Samples were not allocated to experimental groups. Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information.

Animals and other organisms
Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research Laboratory animals

Wild animals
Field-collected samples

Ethics oversight
Note that full information on the approval of the study protocol must also be provided in the manuscript.

Human research participants
Policy information about studies involving human research participants Population characteristics

Recruitment
Ethics oversight Note that full information on the approval of the study protocol must also be provided in the manuscript.

Clinical data Policy information about clinical studies
All manuscripts should comply with the ICMJEguidelines for publication of clinical research and a completedCONSORT checklist must be included with all submissions.

Clinical trial registration
Study protocol Confirm that both raw and final processed data have been deposited in a public database such as GEO.
Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks.

Data access links
May remain private before publication.

Files in database submission
Genome browser session The axis labels state the marker and fluorochrome used (e.g. CD4-FITC).
The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers).
All plots are contour plots with outliers or pseudocolor plots.
A numerical value for number of cells or percentage (with statistics) is provided.

Methodology
Sample preparation Instrument Software Cell population abundance Gating strategy