Genome-wide chromatin interaction map for Trypanosoma cruzi

Díaz-Viraqué, Florencia; Chiribao, María Laura; Libisch, María Gabriela; Robello, Carlos

doi:10.1038/s41564-023-01483-y

Download PDF

Article
Open access
Published: 12 October 2023

Genome-wide chromatin interaction map for Trypanosoma cruzi

Nature Microbiology volume 8, pages 2103–2114 (2023)Cite this article

4010 Accesses
2 Citations
59 Altmetric
Metrics details

Subjects

Abstract

Trypanosomes are eukaryotic, unicellular parasites, such as Trypanosoma brucei, which causes sleeping sickness, and Trypanosoma cruzi, which causes Chagas disease. Genomes of these parasites comprise core regions and species-specific disruptive regions that encode multigene families of surface glycoproteins. Few transcriptional regulators have been identified in these parasites, and the role of spatial organization of the genome in gene expression is unclear. Here we mapped genome-wide chromatin interactions in T. cruzi using chromosome conformation capture (Hi-C), and we show that the core and disruptive regions form three-dimensional chromatin compartments named C and D. These chromatin compartments differ in levels of DNA methylation, nucleosome positioning and chromatin interactions, affecting genome expression dynamics. Our data reveal that the trypanosome genome is organized into chromatin-folding domains and transcription is affected by the local chromatin structure. We propose a model in which epigenetic mechanisms affect gene expression in trypanosomes.

Metacyclogenesis defects and gene expression hallmarks of histone deacetylase 4-deficient Trypanosoma cruzi cells

Article Open access 04 November 2021

Bromodomain factor 5 is an essential regulator of transcription in Leishmania

Article Open access 13 July 2022

An updated map of Trypanosoma cruzi histone post-translational modifications

Article Open access 25 March 2021

Main

Gene organization, regulation of genome expression and RNA metabolism in trypanosomes differ from that in other eukaryotes. Genes are organized into directional gene clusters (DGCs) separated by strand-switch regions (SSRs), where the transcription sense converges or diverges. The DGCs are transcribed as large polycistronic transcription units (PTUs)¹, and messenger RNA maturation occurs by co-transcriptional trans-splicing of spliced leader RNA and polyadenylation². It is widely accepted that post-transcriptional regulation is the main mode of gene expression regulation^3,4,5. However, recent reports have highlighted the impact of histone post-translational modifications, histone variants, nucleosome positioning, base J and chromatin organization on the regulation of gene expression, cell cycle control and differentiation^{6,7,8,9,10,11,12,13}.

Epigenetic mechanisms are important for transcriptional regulation in several protozoan parasites^14,15, including trypanosomes⁸. Recently, it was reported that specific spatial genome organization is required for the monogenic expression of the variant surface glycoprotein (VSG) in Trypanosoma brucei⁹, denoting the importance of the genome architecture on gene expression in trypanosomes. The surface of trypanosomes is covered with glycoproteins¹⁶ whose composition varies among different species, probably owing to molecular mechanisms that underpin the regulation of the expression of these genes. T. brucei is extracellular during completion of its life cycle and relies on antigenic variation of VSGs to evade the host immune system^17,18. By contrast, Trypanosoma cruzi, which has intracellular and extracellular life-cycle stages, simultaneously expresses thousands of slightly different surface proteins to evade host immunity^19,20,21.

The genomes of trypanosomes are partitioned into two large regions, formally known as compartments^22,23. In T. cruzi, the core compartment (which presents synteny among trypanosomatids) is composed of conserved genes and the disruptive (loss of synteny) compartment is where most of the surface coding genes are located; in particular, the disruptive compartment is enriched in genes coding for mucins, mucin-associated surface protein and trans-sialidases²². In T. brucei, genes encoding for VSGs are also located in a particular compartment in the genome, which is located mostly in the extreme ends (subtelomeres) of the chromosomes²³. While regions have been demarcated, the role of chromatin organization in controlling the expression of non-antigenic genes, general gene expression and pathogenesis in T. cruzi is still largely uncharted.

Stage-specific expression of surface proteins is essential for trypanosomes to complete their life cycles, so surface-protein-encoding genes need to be tightly regulated. Physical separation in the genome would be an efficient mechanism to regulate these genes. We hypothesized that specialized genome organization might enable gene regulation mechanisms in trypanosomes. To test this hypothesis, we analysed how the spatial organization of the genome influences gene expression in the context of previously defined genome compartments and report our results here.

T. cruzi genome is organized into chromatin-folding domains

On the basis of our previous observation that the T. cruzi genome is partitioned into compartments²², we wondered whether this linear organization correlates with three-dimensional conformation. To characterize chromatin conformation in T. cruzi cells, we mapped genome-wide interactions by analysing chromosome conformation capture (Hi-C) data that were previously used for genome assembly²⁴. The analysis of interaction matrices at various bin sizes revealed that there are several regions in close spatial proximity indicating the presence of chromatin-folding conformations (Fig. 1a). To identify all topological domains, we explored the formation of chromatin-folding domains (CFDs) with three different algorithms (Supplementary Table 1 and Extended Data Fig. 1). As in other organisms²⁵, the structural domains showed hierarchical organization, suggesting multiple levels of genome organization. Using a Hi-C map with 5 kb resolution, we identified 173 domains with 129 kb mean size, occupying 60% of the genome. We noticed that a considerable fraction of the genome remains structured in these folding domains, representing a prominent feature of genome organization. Notably, the structures we identified are smaller than topologically associating domains (TADs) usually described in other eukaryotes²⁶. TAD predictor algorithms failed to detect domains in the smallest chromosomes (<1.1 Mb) and at common resolutions for TAD identification. This result is reasonable as the chromosomes of T. cruzi are smaller (<3 Mb) and exhibit reduced non-coding regions (for example, lack of introns, and short intergenic and inter-DGC regions). In addition, the concept of TADs has been mainly developed for complex eukaryotes, mainly mammals^27,28. For these reasons, we refer to these structures as CFDs. Both compartments contain CFDs, and remarkably, 80% of them belong to only one compartment, indicating that CFDs in each compartment are mutually exclusive (Supplementary Table 1). Therefore, the previously defined linear genome compartments²² correspond to three-dimensional compartments of the nucleus. From here, core and disruptive will refer to the linear organization of the trypanosome genome and C and D to the three-dimensional compartments. The D compartment exhibits smaller CFDs (almost half the size of those in the C compartment) (Fig. 1b) and a higher frequency of interactions at different distances (Fig. 1c). Examining chromosomal interactions, we found that the D compartment presents several inter-chromosomal contacts, while in the C compartment, interactions are predominantly intra-chromosomal (Fig. 1d).

**Fig. 1: Chromatin interaction in *T. cruzi* genome compartments.**

Genome regions have different patterns of gene expression

To explore the influence of the three-dimensional genome organization on global gene expression, we performed RNA sequencing (RNA-seq) of different stages of the parasite, and we classified the genes into high-, medium- and low-expressed genes according to the number of transcripts detected. We found that the core compartment predominantly consists of moderate- and high-expressed genes in all the stages analysed. By contrast, the disruptive compartment, which in some cases comprises almost entire chromosomes, is enriched in low-expressed genes or genes with undetectable RNA levels (Fig. 2a). These differences are statistically significant in all the stages (Fig. 2b). Moreover, the core compartment exhibits similar levels of expression whereas the mean expression of the disruptive compartment increases in trypomastigotes (Extended Data Fig. 2a). We also evaluated the RNA levels of individual genes, and we observed that while core genes are expressed in similar proportions in all stages, the disruptive genes show lower overall expression (Fig. 2c,d and Extended Data Fig. 2b) and bimodal distribution in the trypomastigote stage, in which a discrete number of genes increase their expression (Fig. 2d and Extended Data Fig. 2b). Similar differences in expression between core and disruptive compartments were also obtained with the Brazil A4 strain (Supplementary Fig. 1). Thus, these differences in gene expression between genomic compartments are characteristic of T. cruzi and are not strain specific.

**Fig. 2: Transcriptional heterogeneity in the genomic compartments of *T. cruzi*.**

Regarding differentially expressed genes (DEGs), we found that 94% of genes enriched in epimastigotes belong to the core compartment, while in trypomastigotes, 20% of DEGs are in the core and 80% are in the disruptive compartment (Supplementary Table 3). Thereby, the core compartment presents stage-specific genes in both stages, whereas the disruptive compartment is characteristic of the trypomastigote stage, where long stretches of the genome concentrate many DEGs (Fig. 2a).

Next, we investigated whether these global differences of expression in the compartments are due to nuclear mechanisms of regulation or are mainly controlled by post-transcriptional mechanisms, generally cytoplasmic. To this end, we compared the expression of the compartments using cytoplasmic and nuclear transcriptomes²⁹, and we found that the expression differences can already be observed in the nucleus (Fig. 2e and Supplementary Fig. 2). To further explore these results, we examined nascent mRNAs, by measuring unprocessed transcripts from RNA-seq reads spanning the spliced leader acceptor (SLA) site, allowing the quantification of nascent or immature transcripts³⁰. We found similar levels of unprocessed transcripts corresponding to the core compartment in both epimastigotes and trypomastigotes and a reduction in the number of unprocessed transcripts from the disruptive compartment in epimastigotes, indicative of a reduced transcription or RNA maturation (Fig. 2f and Supplementary Fig. 2). Taken together, these results confirm that the reduced expression of the disruptive compartment is a consequence of nuclear mechanisms of regulation.

Well-positioned nucleosomes are enriched in the disruptive compartment

To determine whether the nucleosome landscape correlates with the transcriptional state, we assessed nucleosome positioning using micrococcal nuclease sequencing (MNase–seq) data⁷ from epimastigotes and trypomastigotes. In concordance with previous reports⁸, the core compartment exhibits a higher density of nucleosome occupancy in both stages (Fig. 3a). However, we identified a correlation between the percentage of disruptive genes in a scaffold and the density of well-positioned nucleosomes (Fig. 3b). Notably, the disruptive compartment exhibited double the density of well-positioned nucleosomes compared with the core compartment (Fig. 3c,d). To characterize the dynamic organization of nucleosomes, we compared nucleosome architecture local changes between epimastigotes and trypomastigotes. We found that the changes in the nucleosome map were more prominent in the core compartment (Fig. 3e); there were few differences in the presence and absence of nucleosomes between stages, and nucleosome shifting is the most abundant alteration. On the contrary, regarding nucleosome movement, few changes were detected in the disruptive compartment, consistent with more static regions.

**Fig. 3: Nucleosome positioning in the genomic compartments of *T. cruzi*.**

Genome-wide map of 5mC and 6mA marks in T. cruzi

As DNA methylation is a crucial determinant of spatial chromatin organization and gene expression³¹, we compared the genomic distribution of 5mC and 6mA DNA methylation marks in the infective and non-infective stages of T. cruzi. We found 6mA at low levels (0.04% of all adenines are methylated) both in epimastigotes and trypomastigotes and present at the TCG6mATC motif (Fig. 4a). In contrast, 5mC is the predominant DNA methylation mark we found—0.75% and 0.77% of all cytosines are methylated in epimastigotes and trypomastigotes, respectively—with a marked difference in distribution between compartments—a higher percentage of 5mC is present in the core in both stages (Fig. 4b,c). This modification is present mainly asymmetrically with the coding strand, with 64% of 5mC occurring in the antisense strand of coding regions (Fig. 4d,e), and most methylated cytosines were found at the dinucleotide GC (Fig. 4a). We found that 50.3% of methylated cytosines are located within genes, while this number rises to 65.4% in the case of adenines. In turn, 39% of the genes present 5mC modification, and 70% of them are shared between epimastigotes and trypomastigotes. Similarly, half of the genes with 6mA are shared between the two stages.

**Fig. 4: DNA methylation in the *T. cruzi* genome.**

Local chromatin structure affects global transcription in trypanosomes

Global expression analysis in T. cruzi revealed the presence of large actively expressed genome stretches flanked by silent regions—with very low or undetectable RNA levels—of average length 14 kb throughout the life cycle (Extended Data Fig. 3 and Supplementary Table 4). Considering it is proposed that DGCs are transcribed as PTUs, we expected these expressed regions to coincide with the extremes of DGCs, and those depleted of RNAs to coincide with SSRs. Notably, most of these silent regions (65%) correspond to internal areas of the PTUs, indicating that in several cases, there is no coincidence between DGCs and PTUs. Strikingly, the boundaries of transcribed regions are very well correlated with the CFDs we described (Fig. 5a). Therefore, the transcriptionally active regions of the chromosome between two silent regions are flanked by genome sites in close three-dimensional proximity. These results show that transcription is driven by the local structure of the chromatin and that there is not necessarily a coincidence between DGCs and PTUs.

**Fig. 5: Chromatin organization of trypanosomes affects gene expression.**

The transcriptionally active regions of the chromosomes in the core compartment can span, on average, 64 kb (up to 384 kb), and the number of genes they contain can vary from 1 up to 173 genes (mean of 29 genes; Supplementary Table 4). CFD boundaries suppress the expression of the genes, and the transcriptionally active region in between includes fractions of different DGCs (up to three different DGCs). The presence of these regions where a large set of linearly associated genes are transcribed is not as evident in the disruptive compartment that contains the antigenic genes, even in trypomastigotes.

To study whether this is a common feature of trypanosomes, we performed the same analyses in T. brucei Lister 427 using Hi-C and RNA-seq data from the bloodstream form (BF)^9,23. The CFDs determined also coincide with several DGC-internal regions where the transcription is interrupted in T. brucei (Fig. 5b). To further characterize these results, we analysed chromatin immunoprecipitation sequencing (ChIP–seq) data³² of the RNA polymerase II large subunit, RBP1. As expected, an enrichment of RBP1 was found at divergent and convergent SSRs, where transcription starts and ends. However, a striking enrichment of RBP1 was also observed at the internal regions of the DGCs that are flanked by the chromatin domains, indicating that they constitute transcription start and end regions. These results indicate that transcription initiation and termination are determined not only by DGC boundaries but also by chromatin structural domains in T. brucei.

Chromosome-folding domains are conserved across strains and stages

To determine the spatial genome organization at different stages of the T. cruzi life cycle and confirm that gene expression is affected by the local structure of the chromatin, we performed chromatin conformation capture (3C) experiments. To select which topological domain boundaries to analyse, we used the Hi-C maps from Brazil A4 to design 3C primers (Supplementary Table 5) in syntenic loci with the Dm28c strain (Supplementary Fig. 3). The chromatin interactions we observed in Brazil A4 epimastigotes by Hi-C analysis (chromosome (chr) 10: 660,000–700,000; chr29: 140,000–380,000) were also determined in 3C assays of Dm28c epimastigotes and trypomastigotes (PRFA01000007: 363,000–427,000; PRFA01000013: 272,663–513,267) (Fig. 6a,b). The resulting CFDs on Dm28c are 64 and 241 kb in length and are composed of 32 and 124 genes, respectively. Therefore, not only are collinearity and synteny blocks of genes at the sequence level present but also the three-dimensional organization of the genome is maintained between the strains and different stages of the parasite life cycle.

**Fig. 6: Self-interacting domain boundaries are conserved across cell types and strains.**

Finally, we investigated whether chromatin domain conservation also occurs during different stages of the T. brucei life cycle. We compared the locations of local minima of the insulation score (indicative of regions between two self-interacting domains) identified using Hi-C maps of procyclic forms (PFs) and BFs. Most of the boundary regions are shared between both stages (Fig. 6c). Of the 318 domain boundaries determined in the BF and 295 boundaries in the PF, 120 are common between both stages, corresponding to 37.7% and 40.5%, respectively. Moreover, we observed no variation in the contact counts by genomic distance, neither in short-range nor in long-range contacts (Fig. 6d). These results indicate that the overall three-dimensional organization of the core compartment of the genome is largely invariant in trypanosomes.

Discussion

The genome organization of trypanosomes has a common property in that genomes are divided into a core, syntenic among trypanosomatids, and a disruptive, or non-syntenic, compartment. The disruptive compartment is located in the subtelomeres in the African trypanosomes²³ while it is widely distributed along the genome in T. cruzi²². The disruptive compartment in T. cruzi comprises, in several cases, almost the entire chromosome, representing a relevant difference from T. brucei. This particular genome arrangement suggests an impact on three-dimensional chromosome organization. In the absence of enhancer–promoter regulation, spatial chromatin conformation could constitute a relevant mechanism for regulating gene expression in these parasites. Analysing the frequency of intra-chromosomal DNA–DNA interactions, we observed that the junctions between the core and disruptive compartments are prominent insulating boundaries of chromatin interactions. Therefore, the linear genome architecture (core and disruptive) has a three-dimensional—C and D—counterpart both in T. cruzi and T. brucei²³.

The genome of T. cruzi is organized into CFDs. These CFDs are smaller in the D chromatin and present a higher frequency of contacts, indicating that chromatin is more compact in this compartment, which is in agreement with recent FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements)–seq studies⁸. In this sense, the D compartment is densely packed with well-positioned nucleosomes, suggesting a less permissive chromatin structure. In turn, the D compartment exhibits a high frequency of inter-chromosomal contacts. This higher compaction of D chromatin may have the role of both avoiding spurious transcription and facilitating DNA recombination events necessary for generating the antigenic variability essential for the immune system evasion strategy of T. cruzi. Comparably, the global mechanism in T. brucei implies the recombination of the VSG genes into subtelomeric regions to switch the expression of the protein that covers the parasite’s surface as the infection progresses and the immune system matures its response³³. Therefore, both parasite genomes exhibit a D compartment composed of species- and stage-specific surface proteins, which would generate favourable conditions for generating antigenic diversity, by either recombination, allelic exclusion or differential transcription. On the contrary, it is expected that the C compartment, which does not undergo antigenic variation, presents less variability concerning the structure of the chromatin. In agreement with this, we observed well-conserved CFDs in this compartment in different stages and strains in both parasites. Thus, the main differences in genome architecture are those that allow T. brucei to survive extracellularly and T. cruzi to adapt to both intracellular and extracellular forms.

One mechanism to ensure coordinated regulation of gene expression of large stretches is the formation of higher-order three-dimensional chromatin structures³⁴. We investigated whether this three-dimensional genome organization implies selective regulation of gene expression, and we found that the C chromatin is mainly composed of genes with high and medium expression, while most of the genes from the D chromatin exhibit low or undetectable expression and a discrete number of genes highly increase their expression in trypomastigotes. The C compartment functions as a constitutively expressed region as its genes are expressed similarly throughout the life cycle, whereas the D compartment constitutes a specialized region for the infective stage. These findings modify the accepted paradigm that these parasites transcribe indiscriminately and then modulate post-transcriptionally their expression. This is observed in the C compartment, but it does not apply to the D compartment as very few of the hundreds of genes in each multigenic family are highly expressed. Both in T. cruzi and T. brucei, the D compartment constitutes regions of high chromatin compaction, which could favour gene-to-gene regulation, preventing spurious transcription and favouring intra- and inter-chromosomal recombination. In T. brucei, multiple VSG genes are transcribed before a single gene is chosen⁹, and this choice relies on the formation of an extra-nucleolar structure that contains a local reservoir of RNA polymerase I and RNA processing factors^9,35. In T. cruzi, the expression of the highly expressed surface genes is probably determined by the nucleoplasmic location of the loci and the proximity to the transcription and processing machinery, but Hi-C data of the infectious stage are necessary to study this and other aspects of the genome biology at this stage. Finally, the maintenance of three-dimensional organization along the infective and non-infective stages can constitute an advantage in driving life-cycle-specific transcriptional regulation.

The genome regions between two CFDs are characteristically RNA silent, while the regions contained in CFDs are actively transcribed, indicating that the expression is strongly determined by the three-dimensional organization of the chromatin and does not necessarily correlate with the DGCs. Therefore, the paradigm that there is a correspondence between DGCs and PTUs is not always fulfilled. Importantly, this is more than just a feature of T. cruzi. In T. brucei, when we eliminated the bias of DGCs as a reference for transcription start and end, we also found a similar correlation between the CFD and transcription in the C compartment, denoting a common feature of trypanosomes. RBP1 is enriched in these DGC-internal regions that are flanked by CFDs, endorsing that these are regions where transcription starts and ends. Transcription initiation and termination internal to DGC units were previously described in trypanosomes^36,37, and future studies are necessary to determine whether these DGC-internal transcription start and stop sites present similar characteristics as those in the SSRs (for example, histone modifications, histone variants, base J, methylation). Taken together, these results strongly support that the flow of genetic information from DNA to RNA in trypanosomes is regulated at transcription initiation through epigenetic mechanisms.

The most accepted idea regarding the differential regulation of gene expression in trypanosomes is that transcription is constitutive and that transcript levels are individually modulated by post-transcriptional mechanisms. However, here we provide evidence that the expression of the D compartment in T. cruzi does not respond to this widely accepted model: most of their genes exhibit low and undetectable levels, and a discrete number greatly increase their expression in the trypomastigote stage. These results do not go against the statement ‘trypanosomes mainly regulate at the post-transcription level’, but rather, this would be the expression modulation model in the C compartment. We propose that there are two major regulation models and that they are spatially isolated in the nucleoplasm by the C and D compartments in which the chromatin is organized. The compartmentalization of the genome makes it possible to isolate the molecular mechanisms necessary for the regulation of each type of gene. Moving forwards, the incorporation of time-course data and single-cell studies will reveal further insights into the mechanisms of three-dimensional chromatin structure and gene regulation in trypanosomes.

Methods

Cell culture

T. cruzi Dm28c³⁸ parasites were cultured axenically in liver infusion tryptose medium supplemented with 10% (v/v) inactivated fetal bovine serum (Gibco) at 28 °C. Trypomastigotes and intracellular amastigotes were collected from infected Vero cells as described³⁹. Vero cells were cultivated in Dulbecco’s modified Eagle’s medium (DMEM (1×) + GlutaMAX-l, Gibco by Life Technologies) supplemented with 10% (v/v) fetal bovine serum, (Gibco), penicillin 100 U ml⁻¹ and 100 µg ml⁻¹ streptomycin (Thermo Fisher Scientific) at 37 °C in a humidified 5% CO₂ atmosphere.

Methylation analysis

High-molecular-weight genomic DNA was purified using phenol–chloroform–isoamyl alcohol extraction followed by EtOH precipitation⁴⁰. Purified genomic DNA was fragmented to 10 kb using g-TUBEs (Covaris) to maximize the yield of nanopore sequencing. Genomic libraries were prepared using the Ligation Sequencing Kit (SQK-LSK109) and Native Barcoding Expansion Kit (EXP-NBD104, Oxford Nanopore Technologies) following the protocol for native barcoding of genomic DNA. Pooled samples were quantified using the Qubit dsDNA HS Assay Kit, loaded onto a R9.4.1 Flow Cell (FLO-MIN106D) and sequenced on an Oxford Nanopore Technologies (ONT) MinION platform for 72 h. The coverage per sample ranged from 29× to 34×. Raw data (multi_read_fast5 files) were converted to single_read_fast5 files using ont_fast5_api v3.1.6 (ONT) and base called using Guppy v3.6.0 (ONT) with dna_r9.4.1_450bps_modbases_dam-dcm-cpg_hac.cfg configuration. The strand-sensitive and single-nucleotide-based detection of DNA base modifications was performed with DeepMod⁴¹ v0.1.3, which detects m5C and m6A in DNA using the recurrent neural networks model. Only genomic positions with >5 read coverage and >90% of methylation percentage were considered for the analysis. Dm28c reference genome (TcruziDm28c2018 from TriTrypDB version 51) was used as input, and methylation positions were assigned to different genomic regions using BEDTools⁴² v2.27.1. Core and disruptive methylation quantifications were made in the largest 30 scaffolds of the Dm28c 2018 genome assembly, representing 38% of the genome size.

Nucleosome position analysis

MNase–seq data⁷ was obtained from National Center for Biotechnology Information (NCBI; BioProject PRJNA665060) using fasterq-dump v3.0.0. Quality control was performed using FastQC v0.11.9, and reads with less than 70 bp after trimming low-quality nucleotides (–nextseq-trim = 20) and clipping the adapter sequence were removed using Cutadapt⁴³ v2.7. For nucleosome calling, paired-end reads were aligned to both reference genomes Dm28c (TcruziDm28c2018 from TriTrypDB version 51) and Brazil A4 (TcruziBrazilA4 from TriTrypDB version 53) using Bowtie⁴⁴ v1.3.1, allowing up to two mismatches and a maximum insert size of 500 bp. The resulting BAM files were imported and processed in R v4.1.3 to merge biological replicates by chromosome. After duplicate removal with Picards MarkDuplicates (Picard Toolkit 2019), Fourier transform filtering to remove noise and peak calling to accurately define and classify the location of nucleosomes across the genome were performed with nucleR⁴⁵. Nucleosomes with sharp signals of the shape of the associated peaks were labelled as well-positioned (W) nucleosomes (high localization score), while flat peaks were labelled as fuzzy nucleosomes (low localization score)⁴⁵. In this analysis, well-positioned nucleosomes were considered when the nucleR peak width score and height score were higher than 0.6 and 0.4, respectively. Finally, Nucleosome Dynamics⁴⁶ v1.0 was used to study differences between stages: occupancy differences (insertions and evictions) and displacement of nucleosomes (shifts).

3C

Parasites (1 × 10⁸ trypomastigotes and 5 × 10⁹ epimastigotes) were collected through centrifugation at 2,200 × g for 10 min at room temperature and washed twice with PBS 1×. Cells were fixed with 1% formaldehyde for 15 min at room temperature on a shaker. The cross-linking was quenched by adding 2.5 ml of 1 M glycine (125 mM) and incubating for 5 min at room temperature, then on ice for 15 min. Fixed cells were then centrifuged, washed twice with ice-cold PBS 1× and resuspended in 1.4 ml of cold lysis buffer (10 mM Tris–HCl pH 8, 10 mM NaCl, 0.2% Nonidet P-40, 1× protease inhibitors). After homogenization with a Dounce homogenizer pestle A, the nuclei were washed twice with 1.25× rCutSmart buffer (New England BioLabs) and resuspended in 0.5 ml of rCutSmart buffer (New England BioLabs) containing 0.3% SDS. Samples were incubated for 40 min at 65 °C and then for 20 min at 37 °C. Then, 2% Triton X-100 was added, and the nuclei were further incubated for 1 h at 37 °C to sequester the SDS. Cross-linked DNA was digested using 400 U of EcoRI-HF (R3101 New England BioLabs) overnight at 37 °C, and the enzyme was inactivated with 1% SDS at 65 °C for 20 min. Digestion products were diluted by the addition of 4 ml 1.1× T4 DNA Ligase Reaction Buffer (New England BioLabs) with 1% Triton X-100 and incubated for 1 h at 37 °C. DNA was ligated for 4 h in a water bath at 16 °C (30 Weiss units of T4 DNA Ligase, M0202; New England BioLabs). Then, 300 μg of proteinase K was used to reverse the cross-linking overnight at 65 °C. Finally, samples were incubated with 300 μg RNase A, and the DNA was purified by phenol extraction followed by ethanol precipitation. The control library with all possible ligation products was prepared in parallel using non-cross-linked genomic DNA. Ligation products were detected and quantified in control and cross-linked libraries with PCR using locus-specific primers. We designed a set of six oligos excluding sequences from repetitive regions (Supplementary Table 5). The linear range of amplification was determined for all the libraries by serial dilution of DNA amounts.

Transcriptomic analysis

Total RNA from cell-derived trypomastigotes, epimastigotes and intracellular amastigotes was treated with RiboZero Gold magnetic beads from a eukaryote kit (250 ng as input) as described previously⁴⁷. Library preparation of purified RNA was then performed using the TruSeq Stranded Total RNA kit (Illumina) with random primers. A total of 80 bp paired-end reads were generated using an Illumina NextSeq 500 MID platform. Raw reads were processed with Cutadapt⁴³ v2.7 to trim low-quality nucleotides (–nextseq-trim = 20) and clip the adapter sequence. The resulting reads with less than 70 bp were discarded (–minimum-length = 70). Trimmed reads were checked for per-base quality using FastQC v0.11.9. Cleaned reads were then aligned to the Dm28c reference genome (TcruziDm28c2018 from TriTrypDB version 51) using Hisat2 (ref. ⁴⁸) v2.1.0 (–no-spliced-alignment–rna-strandness RF), and mapped reads with a mapping quality score <10 were discarded with SAMtools⁴⁹ v1.16.1 as it was previously described⁵⁰. The coverage files were smoothed, and counts per million (CPM) normalized in a bin size of 10 bp using the bamCoverage function from deepTools⁵¹. Salmon⁵² v1.5.1 was used to estimate transcript levels, and statistical analysis of differential expression of mRNAs was tested with DESeq2 (ref. ⁵³) v1.38.3. The expression of each transcript was quantified in transcript per million (TPM) units. Consistency between replicates was assessed with Pearson correlation and principal component analysis. Genes were considered as differentially expressed when they were statistically significant (P_adj < 0.001) and had a fold change in transcript abundance of at least two (|log2FC| > 1). To study the dynamic of RNA levels in genome compartments along the stages, we classified the transcripts into three groups based on their relative abundance. High-, medium- and low-expression genes were determined using quantiles on normalized counts. Also, to investigate expression patterns, we classified the scaffolds into ‘core’, ‘disruptive’ and ‘mixed’ according to the percentage of compartments that make them up. The bimodality coefficient was calculated using the mousetrap⁵⁴ v3.2.1 R package. The transcriptomic data corresponding to the different subcellular compartments of T. cruzi, Brazil A4 and T. brucei were analysed identically.

To study immature and nascent transcripts, 5′ untranslated regions (UTR) were first annotated and assigned to protein-coding genes using UTRme⁵⁵ v1.1.0. SLA sites identified at the same positions in all transcriptomes were used for the analysis to avoid bias of differential processing between stages. Alignments of reads spanning SLA sites were quantified, normalized and classified as processed and unprocessed according to the presence or absence of spliced leader sequence in the read, respectively.

To identify RNA zero coverage regions, genome coverage for all positions in BEDGRAPH format was calculated from mapping-quality-filtered BAM files using the genomecov function from BEDTools⁴² v2.27.1 with the -bga option. Then, all regions of the genome with zero coverage were extracted, and intervals <500 bp in length were filtered out.

Chromatin immunoprecipitation analysis

The 37 × 2 bp paired-end raw reads were processed with Cutadapt⁴³ v2.7 to remove low-quality bases from both ends (-q 30), remove flanking N bases from each read (–trim-n) and discard reads containing more than 5 N bases (–max-n 5). The resulting reads with less than 35 bp were discarded (–minimum-length = 35). Raw and trimmed reads were checked for per-base quality using FastQC v0.11.9. Cleaned reads were then aligned to T. brucei reference genome (TbruceiLister427_2018 from TriTrypDB version 53) using Bowtie⁴⁴ v1.3.1, allowing two mismatches (-v 2). Unique alignments reported (-m 1) were compressed and sorted using SAMtools⁴⁹ v1.16.1, and duplicate reads were removed using Picard tools v2.25.5 (MarkDuplicates). Normalization and peaking calling were conducted using MACS2 (ref. ⁵⁶) v2.2.7.1 with a false discovery rate of 5%, and peaks with low fold enrichment (-q 0.05 and –fe-cutoff 1.5) were filtered out. The –g parameter was set at 50081021.

Chromatin conformation computational analysis

Hi-C datasets were processed from raw reads to normalized contact maps using HiC-Pro⁵⁷ v3.1.0. Quality was assessed using MultiQC. TAD calling was performed with TADtools⁵⁸ and the hicFindTADs function of HiCExplorer⁵⁹ v3.7.2. The insulation score was calculated at 30, 50, 70, 100 and 150 kb sliding windows with FAN-C tool⁶⁰ v0.9.26b2. The data integration, comparison and visualization of the different datasets were performed using pyGenomeTracks⁶¹ v3.7.

All figures were designed using GIMP v2.10.32.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Raw sequencing data for Illumina and Oxford Nanopore Technologies platforms were deposited to NCBI: RNA-seq to BioProject ID PRJNA850400 and Methylation to BioProject ID PRJNA935260. Hi-C and RNA-seq data for T. cruzi Brazil A4 epimastigotes from a previous study²⁴ are available from NCBI under accession numbers SRR11803985 and SRR12792489. Hi-C data for the T. brucei PF and BF, as well as RNA-seq data from the BF from previous studies^9,23, are available from the NCBI SRA (Sequencing Read Archive) under accession numbers ERR3712002, ERR3712009, SRR7721317, SRR7721318 and SRR5809498–SRR5809500. RBP1 ChIP–Seq³² and MNase–seq⁷ data are available under SRR9022833, SRR9022834, SRR13260277, SRR13260279 and SRR12710803–SRR12710808 accession numbers. RNA-seq data from different subcellular compartments of T. cruzi²⁹ are available from the NCBI under accession numbers SRR4232036–SRR4232038. Genome FASTA and GFF files were retrieved from TritrypDB (https://tritrypdb.org/tritrypdb/app): TcruziDm28c2018 version 51, TcruziBrazilA4 version 53 and TbruceiLister427_2018 version 53. Source data are provided with this paper.

References

Johnson, P. J., Kooter, J. M. & Borst, P. Inactivation of transcription by UV irradiation of T. brucei provides evidence for a multicistronic transcription unit including a VSG gene. Cell 51, 273–281 (1987).
Article CAS PubMed Google Scholar
Matthews, K. R., Tschudi, C. & Ullu, E. A common pyrimidine-rich motif governs trans-splicing and polyadenylation of tubulin polycistronic pre-mRNA in trypanosomes. Genes Dev. 8, 491–501 (1994).
Article CAS PubMed Google Scholar
Clayton, C. & Shapira, M. Post-transcriptional regulation of gene expression in trypanosomes and leishmanias. Mol. Biochem. Parasitol. 156, 93–101 (2007).
Article CAS PubMed Google Scholar
Clayton, C. Regulation of gene expression in trypanosomatids: living with polycistronic transcription. Open Biol. 9, 190072 (2019).
Article CAS PubMed PubMed Central Google Scholar
Chávez, S. et al. Extensive translational regulation through the proliferative transition of Trypanosoma cruzi revealed by multi-omics. mSphere 6, e0036621 (2021).
Article PubMed Google Scholar
Maree, J. P. et al. Trypanosoma brucei histones are heavily modified with combinatorial post-translational modifications and mark Pol II transcription start regions with hyperacetylated H2A. Nucleic Acids Res. 50, 9705–9723 (2022).
Article CAS PubMed PubMed Central Google Scholar
Lima, A. R. J. et al. Nucleosome landscape reflects phenotypic differences in Trypanosoma cruzi life forms. PLoS Pathog. 17, e1009272 (2021).
Article CAS PubMed PubMed Central Google Scholar
Lima, A. R. J. et al. Open chromatin analysis in Trypanosoma cruzi life forms highlights critical differences in genomic compartments and developmental regulation at tDNA loci. Epigenetics Chromatin 15, 22 (2022).
Article CAS PubMed PubMed Central Google Scholar
Faria, J. et al. Spatial integration of transcription and splicing in a dedicated compartment sustains monogenic antigen expression in African trypanosomes. Nat. Microbiol. 6, 289–300 (2021).
Article CAS PubMed PubMed Central Google Scholar
Respuela, P., Ferella, M., Rada-Iglesias, A. & Åslund, L. Histone acetylation and methylation at sites initiating divergent polycistronic transcription in Trypanosoma cruzi. J. Biol. Chem. 283, 15884–15892 (2008).
Article CAS PubMed PubMed Central Google Scholar
Rosón, J. N. et al. H2B.V demarcates divergent strand-switch regions, some tDNA loci, and genome compartments in Trypanosoma cruzi and affects parasite differentiation and host cell invasion. PLoS Pathog. 18, e1009694 (2022).
Article PubMed PubMed Central Google Scholar
Nunes, V. S. et al. Trimethylation of histone H3K76 by Dot1B enhances cell cycle progression after mitosis in Trypanosoma cruzi. Biochim. Biophys. Acta Mol. Cell Res. 1867, 118694 (2020).
Article CAS PubMed Google Scholar
Ramos, T. C. P. et al. Expression of non-acetylatable lysines 10 and 14 of histone H4 impairs transcription and replication in Trypanosoma cruzi. Mol. Biochem. Parasitol. 204, 1–10 (2015).
Article CAS PubMed Google Scholar
Lizarraga, A. et al. Adenine DNA methylation, 3D genome organization, and gene expression in the parasite Trichomonas vaginalis. Proc. Natl Acad. Sci. USA 117, 13033–13043 (2020).
Article CAS PubMed Central Google Scholar
Bunnik, E. M. et al. Changes in genome organization of parasite-specific gene families during the Plasmodium transmission stages. Nat. Commun. 9, 1910 (2018).
Article PubMed PubMed Central Google Scholar
Ferguson, M. A. J. The surface glycoconjugates of trypanosomatid parasites. Philos. Trans. R. Soc. Lond. B 352, 1295–1302 (1997).
Article CAS Google Scholar
Vickerman, K. On the surface coat and flagellar adhesion in trypanosomes. J. Cell Sci. 5, 163–193 (1969).
Article CAS PubMed Google Scholar
Cross, G. A. M. Identification, purification and properties of clone-specific glycoprotein antigens constituting the surface coat of Trypanosoma brucei. Parasitology 71, 393–417 (1975).
Article CAS PubMed Google Scholar
Buscaglia, C. A. et al. The surface coat of the mammal-dwelling infective trypomastigote stage of Trypanosoma cruzi is formed by highly diverse immunogenic mucins. J. Biol. Chem. 279, 15860–15869 (2004).
Article CAS PubMed Google Scholar
Dos Santos, S. L. et al. The MASP family of Trypanosoma cruzi: changes in gene expression and antigenic profile during the acute phase of experimental infection. PLoS Negl. Trop. Dis. 6, e1779 (2012).
Article PubMed PubMed Central Google Scholar
De Pablos, L. M. & Osuna, A. Conserved regions as markers of different patterns of expression and distribution of the mucin-associated surface proteins of Trypanosoma cruzi. Infect. Immun. 80, 169–174 (2012).
Article PubMed PubMed Central Google Scholar
Berná, L. et al. Expanding an expanded genome: long-read sequencing of Trypanosoma cruzi. Microb. Genom. 4, e000177 (2018).
PubMed PubMed Central Google Scholar
Müller, L. S. M. et al. Genome organization and DNA accessibility control antigenic variation in trypanosomes. Nature 563, 121–125 (2018).
Article PubMed PubMed Central Google Scholar
Wang, W. et al. Strain-specific genome evolution in Trypanosoma cruzi, the agent of Chagas disease. PLoS Pathog. 17, e1009254 (2021).
Article CAS PubMed PubMed Central Google Scholar
Fraser, J. et al. Hierarchical folding and reorganization of chromosomes are linked to transcriptional changes in cellular differentiation. Mol. Syst. Biol. 11, 852 (2015).
Article PubMed PubMed Central Google Scholar
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Article CAS PubMed PubMed Central Google Scholar
Yu, M. & Ren, B. The three-dimensional organization of mammalian genomes. Annu. Rev. Cell Dev. Biol. 33, 265–289 (2017).
Article CAS PubMed PubMed Central Google Scholar
Beagan, J. A. & Phillips-Cremins, J. E. On the existence and functionality of topologically associating domains. Nat. Genet. 52, 8–16 (2020).
Article CAS PubMed Central Google Scholar
Pastro, L. et al. Nuclear compartmentalization contributes to stage-specific gene expression control in Trypanosoma cruzi. Front. Cell Dev. Biol. 5, 8 (2017).
Article PubMed PubMed Central Google Scholar
López-Escobar, L. et al. Stage-specific transcription activator ESB1 regulates monoallelic antigen expression in Trypanosoma brucei. Nat. Microbiol. 7, 1280–1290 (2022).
Article PubMed PubMed Central Google Scholar
Chen, K., Zhao, B. S. & He, C. Nucleic acid modifications in regulation of gene expression. Cell Chem. Biol. 23, 74–85 (2016).
Article CAS PubMed PubMed Central Google Scholar
Cordon-Obras, C. et al. Identification of sequence-specific promoters driving polycistronic transcription initiation by RNA polymerase II in trypanosomes. Cell Rep. 38, 110221 (2022).
Article CAS PubMed Google Scholar
McCulloch, R., Morrison, L. J. & Hall, J. P. J. in Mobile DNA III (eds Chandler, M. et al.) 409–435 (ASM Press, 2015); https://doi.org/10.1128/9781555819217.ch19
Bhat, P., Honson, D. & Guttman, M. Nuclear compartmentalization as a mechanism of quantitative control of gene expression. Nat. Rev. Mol. Cell Biol. 22, 653–670 (2021).
Article CAS PubMed Google Scholar
Navarro, M. & Gull, K. A pol I transcriptional body associated with VSG mono-allelic expression in Trypanosoma brucei. Nature 414, 759–763 (2001).
Article CAS Google Scholar
Siegel, T. N. et al. Four histone variants mark the boundaries of polycistronic transcription units in Trypanosoma brucei. Genes Dev. 23, 1063–1076 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kolev, N. G. et al. The transcriptome of the human pathogen Trypanosoma brucei at single-nucleotide resolution. PLoS Pathog. 6, e1001090 (2010).
Article PubMed PubMed Central Google Scholar
Contreras, V. T. et al. Biological aspects of the DM 28C clone of Trypanosoma cruzi after metacylogenesis in chemically defined media. Mem. Inst. Oswaldo Cruz 83, 123–133 (1988).
Article CAS PubMed Google Scholar
Díaz-Viraqué, F. et al. Old yellow enzyme from Trypanosoma cruzi exhibits in vivo prostaglandin F2α synthase activity and has a key role in parasite infection and drug susceptibility. Front. Immunol. 9, 456 (2018).
Article PubMed PubMed Central Google Scholar
Díaz-Viraqué, F., Greif, G., Berná, L. & Robello, C. in Parasite Genomics (eds de Pablos, A. M. & Sotillo, J.) 3–13 (Humana, 2021); https://doi.org/10.1007/978-1-0716-1681-9_1
Liu, Q. et al. Detection of DNA base modifications by deep recurrent neural network on Oxford Nanopore sequencing data. Nat. Commun. 10, 2449 (2019).
Article PubMed PubMed Central Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 17, 10–12 (2011).
Article Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Article PubMed PubMed Central Google Scholar
Flores, O. & Orozco, M. nucleR: a package for non-parametric nucleosome positioning. Bioinformatics 27, 2149–2150 (2011).
Article CAS PubMed Google Scholar
Buitrago, D. et al. Nucleosome Dynamics: a new tool for the dynamic analysis of nucleosome positioning. Nucleic Acids Res. 47, 9511–9523 (2019).
Article CAS PubMed PubMed Central Google Scholar
Greif, G., Berná, L., Díaz-Viraqué, F. & Robello, C. in T. cruzi Infection: Methods and Protocols (eds Gómez, K. A. & Buscaglia, C. A.) 35–45 (Springer, 2019); https://doi.org/10.1007/978-1-4939-9148-8_3
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
Hutchinson, S., Glover, L. & Horn, D. High-resolution analysis of multi-copy variant surface glycoprotein gene expression sites in African trypanosomes. BMC Genomics 17, 806 (2016).
Article PubMed PubMed Central Google Scholar
Ramírez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165 (2016).
Article PubMed PubMed Central Google Scholar
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 14, 417–419 (2017).
Article CAS PubMed PubMed Central Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Article PubMed PubMed Central Google Scholar
Wulff, D. U., Kieslich, P. J., Henninger, F., Haslbeck, J. & Schulte-Mecklenbeck, M. Movement tracking of cognitive processes: a tutorial using mousetrap. Preprint at OSF https://doi.org/10.31234/osf.io/v685r (2021).
Radío, S., Fort, R. S., Garat, B., Sotelo-Silveira, J. & Smircich, P. UTRme: a scoring-based tool to annotate untranslated regions in trypanosomatid genomes. Front. Genet. 9, 671 (2018).
Article PubMed PubMed Central Google Scholar
Zhang, Y. et al. Model-based analysis of ChIP–seq (MACS). Genome Biol. 9, R137 (2008).
Article PubMed PubMed Central Google Scholar
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
Article PubMed PubMed Central Google Scholar
Kruse, K., Hug, C. B., Hernández-Rodríguez, B. & Vaquerizas, J. M. TADtool: visual parameter identification for TAD-calling algorithms. Bioinformatics 32, 3190–3192 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ramírez, F. et al. High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nat. Commun. 9, 189 (2018).
Article PubMed PubMed Central Google Scholar
Kruse, K., Hug, C. B. & Vaquerizas, J. M. FAN-C: a feature-rich framework for the analysis and visualisation of chromosome conformation capture data. Genome Biol. 21, 303 (2020).
Article PubMed PubMed Central Google Scholar
Lopez-Delisle, L. et al. pyGenomeTracks: reproducible plots for multivariate genomic datasets. Bioinformatics 37, 422–423 (2021).
Article CAS PubMed Google Scholar
Pfister, R. & Janczyk, M. Confidence intervals for two sample means: calculation, interpretation, and a few simple rules. Adv. Cogn. Psychol. 9, 74–80 (2013).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank A. Enright (University of Cambridge, United Kingdom) for facilitating the RNA-seq, N. G. Jones (University of York, United Kingdom) for critical reading of the manuscript, C. Salazar (Institut Pasteur de Montevideo) for technical assistance with Oxford Nanopore sequencing and A. Lizarraga (University of Chicago) and other members of the Laboratorio de Interacciones Hospedero–Patógeno—UBM (Institut Pasteur de Montevideo) for their helpful comments and many interesting discussions. This work was supported by the Research Council United Kingdom Grand Challenges Research Funder (GCRF) ‘A Global Network for Neglected Tropical Diseases’ MR/P027989/1 (C.R.); Agencia Nacional de Investigación e Innovación (ANII, Uruguay) DCI-ALA/2011/023–502, ‘Contrato de apoyo a las políticas de innovación y cohesión territorial’, Fondo para la Convergencia Estructural del Mercado Común del Sur (FOCEM) 03/1 (C.R.); Universidad de la República (Uruguay), CSIC Iniciacion 22320200200121UD (F.D.-V.); and Agencia Nacional de Investigación e Innovación (ANII, Uruguay), Proyecto de Investigación Fundamental ‘Fondo Clemente Estable’ FCE_3_2022_1_172653 (F.D.-V.). F.D.-V. received fellowships from the ANII (POS_NAC_2016_1_129916) and CAP (Universidad de la República, BFPD_2021_1#45569540). F.D.-V., M.L.C. and C.R. are members of the Sistema Nacional de Investigadores (SNI-ANII, Uruguay). C.R. and M.L.C. are Programa de Desarrollo de Ciencias Básicas (PEDECIBA, Uruguay) researchers.

Author information

Authors and Affiliations

Laboratorio de Interacciones Hospedero–Patógeno—UBM, Institut Pasteur de Montevideo, Montevideo, Uruguay
Florencia Díaz-Viraqué, María Laura Chiribao, María Gabriela Libisch & Carlos Robello
Departamento de Bioquímica, Facultad de Medicina, Universidad de la República, Montevideo, Uruguay
María Laura Chiribao & Carlos Robello

Authors

Florencia Díaz-Viraqué
View author publications
You can also search for this author in PubMed Google Scholar
María Laura Chiribao
View author publications
You can also search for this author in PubMed Google Scholar
María Gabriela Libisch
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Robello
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.R. conceived and supervised the project. F.D.-V., M.L.C., G.L. and C.R. designed the transcriptomics experiments. F.D.-V., M.L.C. and G.L. performed the transcriptomics experiments. F.D.-V. performed the computational analysis of Hi-C, Nanopore sequencing, RNA-seq, DNA methylation and nucleosome data integration. F.D.-V. and C.R. wrote the paper and the other authors revised it. C.R. and F.D.-V. acquired funding.

Corresponding author

Correspondence to Carlos Robello.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Microbiology thanks Paul Denny and Fotini Papavasiliou for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Normalized Hi-C interaction frequencies of chromosomes 3, 9 and 10.

Results are displayed as a two-dimensional heatmap at 10 kb resolution. Chromatin Folding Domains (CFDs) identified using HiCExplorer and TADtools (triangles on Hi-C map) are indicated in yellow. The local minimum insulation score calculated using FAN-C (representing the region between two self-interacting domains) is indicated as a black dash. The genomic position of core and disruptive genome compartments are represented in grey and yellow, respectively. Hi-C coverage is plotted.

Extended Data Fig. 2 RNA expression.

a, Mean scaffold expression of scaffold composed by core or disruptive compartment in amastigotes (A), epimastigotes (E) and trypomastigotes (T). Scaffolds were classified as core or disruptive when one of the genome compartments spans 80–100% length of the scaffold. Scaffolds containing the core compartment present similar mean expressions in the different stages while the mean scaffold expression increase in T in the disruptive scaffolds. Values are p value from Welch Two Sample t-test (two-sided). b, Representative RNA-seq sequencing coverage plots in entire scaffolds normalized using counts per million (CPM). Bin size 10 bp. For each condition two biological replicates are plotted.

Extended Data Fig. 3 Data integration.

Normalized Hi-C interaction frequencies of chromosome 10 of T. cruzi displayed as a two-dimensional heatmap at 10 kb resolution. Chromatin Folding Domains (CFDs) identified using TADtools are indicated as triangles on the Hi-C map. Insulation score values were calculated using FAN-C and the black horizontal dashes represent the local minima of the insulation score, indicatives of the domain boundaries. RNA expression across the chromosome is depicted as coverage plots using a bin size of 10 bp. For each condition, two biological replicates are plotted. A: amastigotes; E: epimastigote; and T: trypomastigote forms. Vertical gray bars indicate the position of all well-positioned nucleosomes. Polycistronic Transcription Units (PTUs). This is a core chromosome as the core compartment comprises the entire chromosome. Some disruptive genes present in this chromosome are indicated as yellow arrows. Zero coverage regions (\(\ge\) 500 bp length) are shown as black vertical lines. Low coverage regions were determined considering all transcriptomes and are represented as blue blocks.

Supplementary information

Supplementary Information

Supplementary Figs. 1–3 and Tables 1–5.

Reporting Summary

Source data

Source Data

Statistical source data and agarose gel.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Díaz-Viraqué, F., Chiribao, M.L., Libisch, M.G. et al. Genome-wide chromatin interaction map for Trypanosoma cruzi. Nat Microbiol 8, 2103–2114 (2023). https://doi.org/10.1038/s41564-023-01483-y

Download citation

Received: 14 March 2023
Accepted: 25 August 2023
Published: 12 October 2023
Issue Date: November 2023
DOI: https://doi.org/10.1038/s41564-023-01483-y