Nanoscale organization of rotavirus replication machineries

Version of Record

Accepted for publication after peer review and revision.

Download
Cite
Share
CommentOpen annotations (there are currently 0 annotations on this page).

Version of Record updated: September 17, 2019 (This version)
Version of Record updated: August 22, 2019 (Go to version)
Version of Record published: August 13, 2019 (Go to version)
Accepted Manuscript published: July 25, 2019 (Go to version)
Accepted: July 22, 2019
Received: October 17, 2018

1. Of interest
Staphylococcus aureus FtsZ and PBP4 bind to the conformationally dynamic N-terminal domain of GpsB

Michael D Sacco, Lauren R Hammond ... Yu Chen

Research Article Apr 19, 2024
Further reading

Abstract
eLife digest
Introduction
Results
Discussion
Materials and methods
Appendix 1
Data availability
References
Article and author information
Metrics

Abstract

Rotavirus genome replication and assembly take place in cytoplasmic electron dense inclusions termed viroplasms (VPs). Previous conventional optical microscopy studies observing the intracellular distribution of rotavirus proteins and their organization in VPs have lacked molecular-scale spatial resolution, due to inherent spatial resolution constraints. In this work we employed super-resolution microscopy to reveal the nanometric-scale organization of VPs formed during rotavirus infection, and quantitatively describe the structural organization of seven viral proteins within and around the VPs. The observed viral components are spatially organized as five concentric layers, in which NSP5 localizes at the center of the VPs, surrounded by a layer of NSP2 and NSP4 proteins, followed by an intermediate zone comprised of the VP1, VP2, VP6. In the outermost zone, we observed a ring of VP4 and finally a layer of VP7. These findings show that rotavirus VPs are highly organized organelles.

https://doi.org/10.7554/eLife.42906.001

eLife digest

Rotaviruses are small viruses that can infect cells in the intestine. They are responsible for most cases of severe infectious diarrhea, the most common cause of death among young children in developing countries. Controlling the spread of rotavirus infections is difficult, even with high levels of hygiene, so effective treatments are essential to curtail the virus’ infections. Understanding how new rotaviral particles are made in infected cells is one of the first steps toward developing new therapies.

Once rotaviruses enter the cells, proteins from the virus and the cell aggregate into compact spheres called viroplasms to make new viral particles. Studying these viroplasms used to be difficult because they are too small to see with the resolution of standard microscopes. In recent years, advances in microscopy and mathematical methods have focused on breaking the existing resolution limits, leading to the development of super-resolution microscopy. This new technique has made it possible to study objects with sizes in the order of a billionth of a meter, known as nanoscopic structures, including viroplasms.

Garcés et al. use super-resolution microscopy to determine how viral proteins are arranged in the viroplasm and gain a better understanding of how the viruses are assembled. The images revealed that, in infected monkey kidney cells, rotavirus proteins inside the viroplasm form highly organized concentric layers. This arrangement is reliably repeated in viroplasms of different sizes, indicating that the organization of the proteins is likely set up when the viroplasm starts to form.

These findings make use of new microscopy, image analysis and statistical tools to study rotaviruses, providing a new framework to understand many aspects of rotaviral biology. Additionally, the result showing that proteins organize consistently in viroplasms is a first step towards understanding how the machinery that makes new rotaviruses works, which could lead to future treatments for severe infectious diarrhea.

https://doi.org/10.7554/eLife.42906.002

Introduction

Rotavirus is a non-enveloped virus composed of three concentric layers of proteins that enclose a genome constituted by eleven segments of double stranded RNA (dsRNA) that encode six structural proteins (VP1 to VP4, VP6 and VP7) and six non-structural proteins (NSP1 to NSP6). The inner layer is formed by dimers of VP2 that enclose the viral genome and small numbers of molecules of the viral RNA-dependent RNA polymerase (RdRp), VP1, and the capping enzyme, VP3. This nucleoprotein complex constitutes the core of the virus, which is surrounded by an intermediate protein layer of trimers of VP6, to form double-layered particles (DLPs). The surface of the virion is occupied by two polypeptides, VP7, a glycoprotein, and VP4, which forms spikes that protrude from the VP7 shell (Estes and Greenberg, 2013). Replication of the rotavirus genome and assembly of DLPs take place in cytoplasmic electron dense inclusions termed viroplasms (VPs) (Estes and Greenberg, 2013). Once the double-shelled particles are assembled, they bud from the cytoplasmic VPs into the adjacent endoplasmic reticulum (ER). During this process, which is mediated by the interaction of DLPs with the ER transmembrane viral protein NSP4, the particles acquire a temporary lipid bilayer, modified by VP7 and NSP4, which after being removed in the lumen of the ER by an unknown mechanism, yields the mature triple-layered virions (Estes and Greenberg, 2013). It has been reported that VP4 is located between the VP and the ER membrane and it is incorporated into triple-layered particles (TLPs) during the budding process and maturation of the virus particle inside the ER (Estes and Greenberg, 2013; Navarro et al., 2016).

The viral non-structural proteins NSP2 and NSP5 serve a nucleation role that is essential for the biogenesis of VPs (Fabbretti et al., 1999; Silvestri et al., 2004; Vascotto et al., 2004; Campagna et al., 2005). In addition to viral proteins and genomic dsRNA, cellular proteins such as ER chaperones (Maruri-Avidal et al., 2008), proteins associated with lipid droplets (Cheung et al., 2010), and ribonuclear proteins (Dhillon et al., 2018), have been shown to colocalize with VPs. Several studies have characterized the intracellular distribution of the rotavirus proteins (González et al., 2000; Petrie et al., 1982; Petrie et al., 1984; Richardson et al., 1986). Immunofluorescence studies, based upon epifluorescence or confocal microscopy, have described the viral proteins that conform the VPs, however the images are inherently diffraction-limited to a spatial resolution in the range of hundreds of nanometer, precluding the identification of the nanoscopic molecular scale organization of VPs (González et al., 1998; González et al., 2000; Eichwald et al., 2004; López et al., 2005b; Criglar et al., 2014; Martin et al., 2011; Contin et al., 2010). On the other hand, transmission electron microscopy (TEM) studies often provide images with nanometric resolution, nevertheless, immunoelectron microscopy is challenging when looking for the localization of more than a single protein (Altenburg et al., 1980; Petrie et al., 1982; Petrie et al., 1984). Over the past 15 years, a variety of super-resolution microscopy (SRM) techniques have been developed to observe subcellular structures beneath the diffraction limit of optical microscopes, with resolutions in the tens of nanometers (Schnitzbauer et al., 2017; Deschout et al., 2014; Cox et al., 2011). In this work, we determined the organization of rotaviral proteins within and around VPs through the ‘Bayesian Blinking and Bleaching’ (3B) SRM technique. We developed a segmentation algorithm to automatically analyze and quantify the relative distribution of seven viral proteins, and propose a model that describes their relative spatial distribution. Also, we present a dependency model that explains the relationship between the viral proteins. This work establishes a structural framework for VP organization that future mechanistic and functional studies must take into account, and establishes key methodologies for future investigations on this subject.

Results

Qualitative analysis of VP morphology and structure through SRM

Rotavirus VPs are complex signaling hubs composed of viral and cellular proteins, packed together with viral RNAs. By TEM, they roughly resemble circular electrodense structures whose internal components lack an obvious degree of spatial organization (Altenburg et al., 1980; Eichwald et al., 2012). In this work, we determined the relative spatial distribution of VPs components by immunofluorescence and SRM in MA104 cells infected with the rhesus rotavirus strain RRV at 6 hr post-infection (hpi), using protein-specific antibodies. Due to their important role as nucleating factors during VP biogenesis, we selected either NSP2 or NSP5 as spatial relative reference for the distribution of the VP1, VP2 and VP6 proteins. VPs were optically sectioned through total internal reflection fluorescence microscopy (TIRF), with an excitation depth of field restricted to 200 $n m$ from the coverslip. This approach avoids excitation of fluorophores marking structural components located away from this plane, that is towards the inner cellular milieu. Additionally, NSP2 was also co-immunostained with the viral outer layer protein VP4 as well as with the ER resident proteins NSP4 and VP7, all of which have been reported to form separate ring-like structures that closely associate with VPs (González et al., 2000). In order to gain more insight into the morphogenesis of rotavirus, we analyzed the distribution of both VP7 monomers (VP7-Mon) and trimers (VP7-Tri) since this protein is assembled into virus particles in the latter form (Kabcenell et al., 1988). The nanoscale distribution of VPs was then analyzed through 3B-SRM, with improvements in the technique, developed in the present work, to solve nanoscopic structures (‘Stochastic model fitted for 3B super resolution microscopy’Appendix 1). By different methods of analysis VPs exhibit roughly a circular shape (Figure 1A–E). However, unlike the diffraction-limited image (Figure 1B), in super-resolution microscopy structural details of VP are appreciated, like the different layer distributions of viral components with respect to NSP2 (Figure 1C–E). In addition to VPs, by diffraction-limited TIRF microscopy we detected in the cytoplasm several small and dispersed puncta of fluorescence (Figure 1B), and in these images it is also sometimes possible to differentiate the distribution of NSP2 from that of VP4, a closely viroplasm-associated viral protein (see also González et al., 2000); in this case, VP4 is detected as a ring-like structure that surrounds the VP. Nevertheless, the small size of the VPs effectively precludes measurement of component distribution for the majority of its structural elements, as their separation is below the spatial resolution of typical optical microscopes. In contrast, images obtained by 3B-SRM do allow the study of the relative distribution of the VP components (Figure 1C–E). In the case of SRM images of VP4 (Figure 1C), we observed that this protein forms a ring-like structure that does not colocalize with NSP2, and also ribbon-like projections that extend towards the cytoplasm, details that were not apparent in images captured with conventional fluorescence microscopy (Figure 1B). Additionally, we observed that the small puncta of proteins detected in the cytoplasm were in fact ribbon-like structures composed of various viral proteins that may represent different organization forms of the viroplasmic proteins (Figure 1C). In this regard, it is interesting to note that both NSP2 and VP4 have been reported to have at least two different intracellular distributions (González et al., 2000; Nejmeddine et al., 2000; Criglar et al., 2014). An examination of 3B-SRM images of VPs (Figure 1C–E) revealed that the viral components form ring like structures within the VPs and are arrayed as rather discrete concentric layers. As seen in Figure 1C–E, we find that although the structural proteins VP1, VP2 and VP6 partially overlap in position with NSP2, the bulk of the proteins form separate and distinct layers. Also, the monomeric as well as the trimeric forms of VP7 are clearly distinguished from NSP2, forming an outer ring. Of interest, the spatial distribution of NSP4 colocalized with that of NSP2, an unexpected result since, as mentioned, NSP4 is an ER integral membrane protein (see the Discussion section), and as such it was expected to colocalize with VP7 rather than with an internal viroplasmic protein (Petrie et al., 1984). With regard to NSP5, it was observed distributed inside the ring formed by NSP2 (Figure 1E).

Figure 1

Download asset Open asset

Relative distribution of viral components in rotavirus-VPs.

RRV-infected MA104 cells (6hpi) were fixed and processed for transmission electron microscopy or immunofluorescence microscopy. (A) Transmission electron microscopy of a VP (identified by the dotted white ellipse). (B) Diffraction-limited image of VPs (white arrows). (C) 3B-SRM image reconstructed from B. (**D–E**) 3B-SRM images of individual VPs labeled with different antibodies (see Methods).

https://doi.org/10.7554/eLife.42906.003

Quantitative characterization of VPs structure by a novel segmentation algorithm

A qualitative analysis of the distribution of the VP components through 3B-SRM suggested that these are arranged as concentric spherical shells; thus, we set out to quantitatively validate the circularity of the VP shape. For this, we developed a segmentation algorithm based on a least squares approach, which we called ‘Viroplasm Direct Least Squares Fitting Circumference’ (VP-DLSFC) (see ‘Segmentation Algorithm’ in Appendix 1), to measure the spatial distribution of the components within individual VPs by adjusting concentric circumferences. This method is automatic, deterministic, easy to implement, and has a linear computational complexity. The performance of VP-DLSFC was tested on approximately 40,000 ‘ground truth’ (GT) synthetic images, showing a high robustness to noise and partial occlusion scenarios. Additionally, we compared our method with two other alternative methods (Gander et al., 1994), and our approach displayed an improved performance (see ‘Algorithm Validation’ in Appendix 1). Based on this new algorithm, we find that the mean radius of the NSP5 distribution was smaller than that of NSP2, suggesting that NSP5 is located in the innermost section, as a component of the core of VPs (Figure 2A). On the other hand, the distribution of the structural proteins VP1, VP2 and VP6 exhibit slightly larger mean radii than that of NSP2, and are thus primarily localized in a zone surrounding NSP2. Continuing further towards the outer regions of the VP, we observed a region occupied by the spike protein VP4. Finally, the two different forms of VP7 (VP7-Mon and VP7-Tri) were located together, close to the most external region of the VPs (Figure 2A). The distribution of the glycoprotein NSP4 showed a similar mean radius to that of NSP2 (around $0.4 μ m$ ) suggesting, as described above, that these two proteins are located in the same structural layer of the VP (Figure 2A).

Figure 2

Download asset Open asset

Exploratory analysis of the results obtained by the algorithm VPs-DLSFC.

(A) Boxplot for the radius of the fitting circumferences. In each experimental condition we plot two boxes, the red box is for the radius of NSP2 (reference protein), and the blue box represents the radius of the accompanying VP components (names in x-axis). (B) Boxplot and results of the Mann-Whitney hypothesis test for the distance between each viral element and NSP2. Each combination of the Mann-Whitney test is linked by a line, and the result of the test it is above the line. Note that this test reports significant differences between the distribution of the distance to NSP2 of two different VP components. (C) Distance of VP1 and VP2 to NSP2 and result of the Mann-Whitney test. Because the distributions of NSP2 in combination with VP1 and VP2 are statistically different to the other NSP2 distributions (see Appendix 1—figure 7B), we show these two cases independently in this exploratory analysis. (D) Hierarchically clustered heatmap for the standard deviation of the distance to NSP2, the mean distance to NSP2, the mean radius of NSP2, and the mean radius of the accompanying protein layers, NSP5, NSP4, VP6, VP4, and VP7.

https://doi.org/10.7554/eLife.42906.004

In order to confirm our preliminary observations and clarify the nanoscopic organization of the VPs, we evaluated the relative separation between NSP2 and each accompanying protein. Again, the results show a remarkable degree of organization in the structure of the VP (Figure 2B). As predicted from Figure 2A, we found that NSP5 is located in the internal part of the VP, in close proximity ( $\approx 0.05 μ m$ ) to the area occupied by proteins NSP2 and NSP4, which themselves show the closest association. After the NSP2-NSP4 region, VP6 occupies a middle region at $\approx 0.05 μ m$ from NSP2, followed by the VP4 protein, which were located at a distance of $\approx 0.18 μ m$ . Finally, the VP7-Mon and VP7-Tri were situated at $\approx 0.38 μ m$ from NSP2 (Figure 2B). A Mann-Whitney test showed that the distances of the various viral components in relation to NSP2 were significantly different (Figure 2B), suggesting that they are situated in specific areas of the VPs. The two forms of VP7 were located at the same distance to NSP2, suggesting that the formation of trimers of VP7 takes place at the ER membrane, where the VP7 monomers should also be located. Note that in Figure 2B the relative distance of VP1 and VP2 to NSP2 was not included, since the radii obtained for NSP2 in these two combinations were significantly smaller than those found when it was determined in combination with the other VP components (see ‘Supplementary Exploratory Analysis’). In addition to this, we found no significant differences between the distance of both VP1 and VP2 to NSP2 (Figure 2C). Nonetheless, based on the inferential analysis, we could place these two proteins in the same layer as VP6 (see below).

Next, through a hierarchical cluster analysis, we studied the relationship between the components of the VP, taking into account multiple variables at the same time, like the mean distance to NSP2 [‘Mean(Dist)”], the standard deviation of the distance to NSP2 [‘Std(Dist)”], the mean radius of NSP2 [‘Mean(NSP2)”], and the radii of the other proteins [‘Mean(Other)”] (Figure 2D). Note that the proteins within a cluster should be as similar as possible and proteins in one cluster should be as dissimilar as possible from proteins in another. Because our variables are related with the distance to NSP2 and the radii of the proteins, this is a no-parametric analysis that should provide evidence about the spatial distribution/order of the viral proteins into the VP. As we are considering the distance to NSP2, VP1 and VP2 were not included in this analysis. The first level of the hierarchical agglomerative cluster (Figure 2D, left) partitioned the VPs and the surrounding proteins in five clusters, composed by NSP4, NSP5, VP6, VP4 and {VP7-Mon, VP7-Tri}, which suggest that these five proteins compose different layers of the VP. The second agglomerative level merged into the same group the proteins NSP4 and NSP5, meanwhile VP6 and VP4 continue as independent clusters, which indicate that NSP5 and NSP4 are closer to each other than to VP6 and VP4 in the VP. In the third level, VP6 and VP4 are clustered in the same group, and as consequence are more related between them than with the others viral proteins. The subsequent groups in the clustering analysis indicate that VP7 remains as an independent layer with respect to the other proteins. Based on this analysis, the viral proteins seem to be highly organized, with VP7 conforming the most external layer, while NSP5, NSP4, VP6 and VP4 are distributed very close but as independent layers. The clusters between NSP5-NSP4 and VP6-VP4 suggest that these two pairs of proteins (in each cluster) conform continuous layers in the VP.

The relative spatial organization of VPs is maintained regardless their size

The scatterplot between the radius of the spatial distribution of NSP2 (independent variable, x-axis) and the radius of the distribution of other viral components (response variable, y-axis) showed a strong linear relationship (Figure 3A). The distribution of NSP5 grows $0.87 μ m$ for each $1 μ m$ increase in the radius of NSP2 (slope interpretation), whereas the radius of the distribution of NSP4 increases $0.99 μ m$ (Figure 3B). These findings indicate that NSP5 is distributed in a proportionally smaller region than NSP2 regardless of the absolute size of the VP, supporting our observation that NSP5 is a constituent of the core of the VP. Moreover, the fact that the increase in the radius of the fitted distribution of NSP4 is directly proportional to the same parameter measured for NSP2 supports the idea that these proteins are both constituents of a putative second layer. VP1, VP2 and VP6 exhibit similar slopes which diverge between 0.03 and 0.05 μm (Figure 3B and Appendix 1—table 6); thus, these results confirm that VP1, VP2 and VP6 are components of the same layer in the VPs which, from the data in Figure 3, is located just after the layer of NSP2 and NSP4. Finally, as noted in our quantitative analysis, VP4 and VP7 form consecutive external layers with a slope of 1.39 and 1.94 μm, respectively (Figure 3B and Appendix 1—table 6). These findings indicate that the spatial distribution of the viral components in the VPs and in the surrounding areas is conserved regardless of their absolute size, and also form the basis of a predictive model, where, for a given radius of distribution of NSP2, it is possible to predict the radii of the remaining VP components (NSP5, NSP4, VP1, VP2, VP6) and of VP4 and VP7 proteins. This predictive model is available as a web app at https://yasel.shinyapps.io/Nanoscale_organization_of_rotavirus_replication_machineries/. The mathematical details and the residual analysis that validate these linear models are available in the Appendix 1, section ‘Linear dependency between the viral components’, Appendix 1—table 6 and Appendix 1—figure 9.

Figure 3

Download asset Open asset

The organization of VPs scales with its size.

(A) Simple linear regression analyses for each component combination (eight subpanels). In all subpanels, the x-axis represents the radius of the distributions of NSP2, and the y-axis the radius of the distribution of the accompanying VP component. The 95% confidence interval, marked in grey, is imperceptible due to goodness of fit of the linear regression (solid line). (B) Slope and confidence interval for each linear regression model (dependent variables in x-axis). The slopes values were shown under each confidence interval.

https://doi.org/10.7554/eLife.42906.005

The structural organization of VPs is independent of the reference protein chosen for pairwise comparison

In order to confirm the observed structural organization of VPs, we analyzed two more experimental conditions in which we chose a different reference protein for pairwise comparisons. The first was based on the distribution of NSP5 and its comparison with the relative localizations of VP6 and VP4, and the second considered NSP4 as the reference protein to compare with the distribution of VP6. We found that both analyses produced an identical structural organization for the VPs, with a comparative localization error of approximately $0.05 μ m$ between models (close to the effective resolution limit of the 3B algorithm; see ‘NSP5 and NSP4 as reference proteins’ in Appendix 1). An extensive quantitative validation regarding the congruence between the NSP2, NSP5 and NSP4 models is available in the Appendix 1.

Based on our extensive quantitative, descriptive and inferential statistical analyses, we propose that the VP and the surrounding viral proteins form an ordered biological structure composed of at least five concentric layers organized as depicted in Figure 4. In this structure, NSP5 constitutes the innermost layer, followed by a {NSP2-NSP4} layer. Then, there is a layer composed by {VP1-VP2-VP6} and two consecutive external layers formed by VP4 and VP7. The different layers of proteins are most likely highly porous to allow the entry of positive-sense single-stranded viral RNA (+RNA) during genome replication and also of the antibodies used for VP staining.

Figure 4

Download asset Open asset

Relative structural distribution of VP components.

The radii of the circumferences maintain the relative values determined for the different VP layers.

https://doi.org/10.7554/eLife.42906.006

Discussion

VPs have been previously studied using electron and fluorescence microscopy, however, due to the limited resolution of classic fluorescence microscopy techniques, and the difficulty of analysis of immunoelectron microscopy, the existence of any complex structural organization of the viral elements inside the VPs has not been reported. In recent years, the development of SRM has facilitated research into the nanoscale organization of a diverse range of cellular structures (Grant et al., 2018; Reznikov et al., 2018), however, until now SRM had not been applied to study the replication cycle of rotavirus. In this work, thanks to the use of the 3B SRM algorithm, we visualized and determined quantitatively the location of several viroplasmic proteins, leading us to propose a detailed model of the VP that should be of great value for understanding virus morphogenesis.

Other SRM algorithms had been used to study the organization of viral and cellular structures showing concentric arrangements, as those proposed by Laine et al. (2015) and Manetsberger et al. (2015). The main similarities between those studies and our approach is the use of conics, such as circles or ellipses, to fitting structures showing concentric organization. The method provided by Manetsberger et al. (2015) could actually be implemented to analyze our data set, which as outcome will produce similar results. This method could also provide information about the degree of asymmetry within the VP, which may be valuable to establish functional relationships between the protein distribution belts that shape these intriguing structures. The selection of the 3B SRM algorithm over other super-resolution approaches was based on the fact that this method allows to deal with samples with high density of labeling, obtaining data with a reasonable resolution, although at the cost of higher computational effort.

The quantification of the viral protein distribution within the VPs was possible thanks to a novel segmentation algorithm (VPs-DLSFC) that was proven to be robust and efficient in noisy and partial occlusion scenarios. The manual pre-segmentation step of this algorithm was necessary in our case because we did not want to introduce any bias in the isolation of the VPs through an automatic approach. Setting aside the manual pre-segmentation step, the VPs-DLSFC algorithm is automatic, deterministic, non-iterative and has a linear computational complexity.

Previous reports have suggested that VPs have a spherical-like structure (Eichwald et al., 2004; Cabral-Romero and Padilla-Noriega, 2006; Campagna et al., 2013); in this study we confirmed this suggestion by comparing the VPs-DLSFC approach with a similar approach based on an ellipse adjustment (Garcés et al., 2016). The results showed no significant statistical differences between these two models, and as consequence we can confidently model the structure of the VPs as a circumference. We also ratified that the center displacement of the circumferences that adjust two paired proteins are not statistically different.

Our study indicates that the viral components in the VPs, as well as VP4 and VP7, are arranged as largely discrete concentric layers (note that we are describing the structure of viroplasms, not of virus particles). This organization, however, does not preclude the interaction among the VP components proposed in this model as being located in separate layers since, for instance, NSP5 has been shown by different biochemical methods to interact with NSP2 (Eichwald et al., 2004; Poncet et al., 1997; Afrikanova et al., 1998; Jiang et al., 2006), VP1 (Afrikanova et al., 1998) and VP2 (Berois et al., 2003). In this regard, based on the super resolution microscopy images, it seems clear that there is some overlapping between different protein layers, as is the case for NSP2 and NSP5 in Figure 1E, but also between NSP2 and VP1, VP2, VP6 and NSP4 in Figure 1. These general overlapping zones between different proteins most likely are relevant for coordinating the genome replication and virion assembly, as suggested. Of interest, we observed the presence of protein projections (‘spike-like’) from different viral shells that could also contribute to the interaction of proteins mapped to different layers (Appendix 1—figure 8). Although our present analysis is limited to a general characterization of the spatial distribution of the viral proteins within VPs, and not to understand specific details about the interactions between proteins in different layers, it could be used as departure point to analyze these interactions. Taking as initial solution the result of the algorithm VPs-DLSFC and the SRM image, it is possible to employ other segmentation approaches, like deformable/active contours (snakes) (Kass et al., 1988), level-set (Osher and Fedkiw, 2003), or region growing methods (Mehnert and Jackway, 1997; Synthuja et al., 2012), to evolve the circular contour and fit precisely the spatial distribution of the viral proteins. Then, establishing a polar coordinate system in the VP, and considering the results of both segmentation algorithms, it would be possible to quantify the radial angle in which a spike from the central distribution of a viral protein that interacts with a different protein exists. It would be also possible to determine how strong these interactions are (intersection between two segmentation curves), and to study whether the spikes are randomly distributed between layers or a specific pattern in the connection between different protein layers exists. In the latter case, this could allow us to explore whether these patterns influence the assembly of the virus-like particles or only provide a skeleton that maintain the structure of the VP. The results obtained could also be used to study topological changes that the VP might experience at different times post infection, and associate these changes with maturation of the subviral particles. In this regard, in preliminary experiments carried out at three hpi, the viral proteins in he VP have been found to have a similar ‘layered’ organization as shown for the mature VP at six hpi (data not shown). This observation indicates that this organization is already present when the formation of viral particles has not yet taken place, suggesting that it might be important for the assembly of DLPs within VPs. In an additional application, SRM could also be used to observe the assembly of the virus particles and the interactions that may occur of these particles in different layers of the VP. Nevertheless, to develop this idea it would be important to establish an experimental protocol to observe the viral particles during the early stages of the assembly process, to distinguish simultaneously the layers of the viroplasm and the viral particles, and to collect the SRM images with a very short acquisition rate and a very high resolution (25–30 nm), which makes this experimental plan a challenge.

Previous studies based on conventional microscopy techniques have reported that NSP5 and NSP2 colocalize (González et al., 2000; Eichwald et al., 2004; Fabbretti et al., 1999); in contrast, we found that although NSP5 and NSP2 are located in close proximity, their positions in the VP were separable. This difference is attributable to the increased spatial resolution in the final image created by the super-resolution techniques employed in our study. Here, NSP5 was found to represent the innermost layer of the VPs, suggesting that this protein might serve as the core scaffold upon which the subsequent viroplasmic proteins are assembled to form the VPs. This finding contrasts with a report by Eichwald et al. (2004), who described that NSP5 locates to a region external to NSP2. In addition to the superior spatial resolution obtainable through 3B-SRM, compared to the traditional confocal microscopy employed in the previous report, the difference might be due to the fact that in our study we characterized the endogenous structures produced during virus replication, while Eichwald et al. characterized VP-like structures formed by transiently expressed proteins fused to GFP.

Immediately outside the NSP5 core, we observed a layer composed of NSP2 and NSP4 proteins. The finding that NSP4 is located in the inner part of VPs was unexpected, since it is known that NSP4 is an integral membrane protein of the ER and since it has been reported that functions as a receptor for the new DLPs located at the periphery of the VPs, during their budding towards the lumen of the ER (Chasey, 1980; Petrie et al., 1982; Petrie et al., 1984; Au et al., 1989). Furthermore, it has been shown that NSP4 associates with VP4 and VP7 to form a hetero-oligomeric complex that could be involved in the last steps of rotavirus morphogenesis (Maass and Atkinson, 1990). Based on these findings, NSP4 was expected to locate close to VP4 and VP7, in the surroundings of the VP. On the other hand, and in line with our observations, previous confocal microscopy studies have shown that a portion of NSP4 also shows a limited colocalization with NSP2 (González et al., 2000).

The dual location of NSP4 as an integral glycoprotein of the ER membrane and as internal to VPs, as our results indicate, is not easy to reconcile; however, in a previous work it was suggested that there are three pools of intracellular NSP4 molecules. The first pool is represented by NSP4 localized in the ER, a second minor pool localized in the ERGIC compartment, and the third pool distributed in cytoplasmic vesicular structures associated with the autosomal marker LC3 (Berkova et al., 2006). Furthermore, in that work the authors suggested that NSP4 and autophagic marker LC3-positive vesicles may serve as a lipid membrane scaffold for the formation of large VPs by recruiting early VPs or VP-like structures formed by NSP2 and NSP5 (Berkova et al., 2006). This observation is in line with our model that NSP4 lies in an internal protein shell within VPs.

An additional, and very interesting possibility to explain the internal location of NSP4 in VPs is the hypothesis that VP morphogenesis occurs on the surface of lipid droplets (LDs) (Cheung et al., 2010). In that work, it was proposed that LDs serve as a platform to which NSP2 and NSP5 proteins attach to form VP-like structures; NSP2 octamers, in turn, associate with the viral polymerase VP1 and rotavirus +RNAs. The assorted RNA complex containing NSP2, VP1, the capping enzyme VP3 and viral +ssRNA is predicted to nucleate VP2 core assembly. In this model, core assembly results in the displacement of +RNA-bound NSP2 octamers, while VP1 within new formed cores direct dsRNA synthesis, using +RNAs as templates (Cheung et al., 2010; Borodavka et al., 2017; Borodavka et al., 2018). These events are followed by incorporation of the middle virus capsid protein VP6 to form DLPs. At some stage, these assemblies become VPs containing cores and DLPs and may lose some or all of their lipids (Cheung et al., 2010). In this regard, it is important to have in mind that the currently accepted model for the LD biogenesis is that neutral lipids are synthesized between the leaflets of the ER membrane, and the mature LD is then thought to bud from the ER membrane to form an independent organelle that is contained within a limiting monolayer of phospholipids and associated LD proteins (Walther and Farese, 2012). Thus, during budding of the LDs from the ER membrane they could take along rotavirus NSP4 (topologically oriented towards the cell's cytoplasm) which could help as a scaffold on the surface of LDs for the assembly of other rotavirus viroplasmic proteins, localizing then to the interior of VPs.

Further support for our model of localization of at least one pool of NSP4 molecules inside of the VPs is the observation that knocking-down the expression of NSP4 by RNA interference significantly reduces the number and size of VPs present in the cell, as well as the production of DLPs (López et al., 2005a). That study also showed that during RNAi inhibition of NSP4 expression the NSP2 and NSP5 proteins maintained an intracellular distribution restricted to VPs, while the VP2, VP4, VP6 and VP7 proteins failed to locate to VPs. Based on these observations, it is tempting to suggest that, in addition to the role NSP4 has on the budding of DLPs into the ER lumen, it may also play an important role as a regulator of VP assembly.

After the NSP2/NSP4 layer, we observed a middle zone composed of the structural proteins VP1, VP2 and VP6. Their location in the same zone is expected given their close association in the assembled DLPs (Estes and Greenberg, 2013). Also, the fact that VP1, VP2 and VP6 form a complex with NSP2 that has replicase activity (Aponte et al., 1996), suggests that the production of new DLPs could take place in this zone of the VP.

Finally, we found that VP4 and VP7 conform independent layers just external to the viroplasmic proteins. The position of these two proteins agrees with the proposed model of rotavirus morphogenesis in which VP4 is assembled first on DLPs, and subsequently VP7 binds the particles and locks VP4 in place (Trask and Dormitzer, 2006). Furthermore, the fact that VP7-Mon and VP7-Tri occupied the same layer in our model indicates that in the ER sites into which the DLPs bud, VP7 is already organized as trimers, which are subsequently assembled into the virus particles. Of interest, VP4 has been reported to exist in two different forms in infected cells. One of them is associated with microtubules (Nejmeddine et al., 2000), while the other one has been reported to be found between the VP and the ER membrane (González et al., 2000). In this regard, based on our findings, we suggest that the latter form of VP4 can be actually considered as an integral component of the VP. Since several studies have found the presence of different cellular proteins and lipids in association to VPs (Maruri-Avidal et al., 2008; Cheung et al., 2010; Dhillon et al., 2018), it will be interesting to study the relative localization of this components using the methodologies described here.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Virus strain (Rhesus rotavirus)	RRV	Harry B. Greenberg, Stanford University.
Cell line (Cercopithecus aethiops)	MA014 cells	American Type Culture Collection	ATCC:CRL-2378.1; RRID:CVCL_3846
Antibody	Mouse monoclonal antibody 3A8	Harry B. Greenberg, Stanford University.		IF (1:1000)
Antibody	Mouse monoclonal antibody 2G4	Harry B. Greenberg, Stanford University. PMID: 2431540		IF (1:1000)
Antibody	Mouse monoclonal antibody 255/60	Harry B. Greenberg, Stanford University. PMID: 6185436		IF (1:1000)
Antibody	Mouse monoclonal antibody M60	Harry B. Greenberg, Stanford University. PMID: 2431540		IF (1:2000)
Antibody	Mouse monoclonal antibody 159	Harry B. Greenberg, Stanford University. PMID: 2431540		IF (1:2000)
Antibody	Mouse polyclonal antibody VP1	Our Laboratory.	RRID:AB_2802095	IF (1:500)
Antibody	Mouse polyclonal antibody NSP2	Our Laboratory. PMID: 9645203	RRID:AB_2802096	IF (1:100)
Antibody	Rabbit polyclonal antibody NSP2	Our Laboratory. PMID: 9645203	RRID:AB_2802097	IF (1:2000)
Antibody	Rabbit polyclonal antibody NSP4	Our Laboratory. PMID: 18385250	RRID:AB_2802094	IF (1:1000)
Antibody	Rabbit polyclonal antibody NSP5	Our Laboratory. PMID: 9645203	RRID:AB_2802098	IF (1:2000)
Software, algorithm	R	R Development Core Team, 2017. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.r-project.org/	RRID:SCR_001905	Version 3.4.4 (2018-03-15)
Software, algorithm	Matlab	MATLAB and Statistics Toolbox Release 2018b, The MathWorks, Inc, Natick, Massachusetts, United States.	RRID:SCR_001622
Software, algorithm	Fiji	PMID:22743772	RRID:SCR_002285
Software, algorithm	VP-DLSFC	This paper		See ‘Segmentation Algorithm’ in Appendix 1.

Cell and virus

Request a detailed protocol

The rhesus monkey kidney epithelial cell line MA104 (ATCC) was grown in Dulbecco’s Modified Eagle Medium-Reduced Serum (DMEM-RS) (Thermo-Scientific HyClone, Logan, UT) supplemented with 5% heat-inactivated fetal bovine serum (FBS) (Biowest, Kansas City, MO) at 37°C in a 5% CO₂ atmosphere. The cells were confirmed to be free of mycoplasm by testing with the INTRON Mycoplasma PCR Detection Kit (#25234). Rhesus rotavirus (RRV) was obtained from H. B. Greenberg (Stanford University, Stanford, Calif.) and propagated in MA104 cells as described previously (Pando et al., 2002). Prior to infection, RRV was activated with trypsin (10 μg/ml; Gibco, Life Technologies, Carlsbad, CA) for 30 min at 37°C.

Antibodies

Monoclonal antibodies (MAbs) to VP2(3A8), VP4 (2G4), VP6 (255/60), VP7 (60) and VP7 (159) were kindly provided by H. B. Greenberg (Stanford University, Stanford, CA) (Shaw et al., 1986; Greenberg et al., 1983). The rabbit polyclonal sera to NSP2, NSP4 and NSP5, and the mouse polyclonal serum to NSP2 were produced in our laboratory (González et al., 1998). The hyperimmune serum to NSP4 (C-239) was generated in our laboratory by immunizing New Zealand white rabbits with a recombinant protein expressed in E. coli with a histidine-tail, representing the carboxy-terminal end (amino acids 120 to 175) of the rhesus rotavirus RRV NSP4 protein; see also Maruri-Avidal et al. (2008), in which this serum was used. The hyperimmune serum to VP1 was also generated in our laboratory by immunizing BALB/c mice with a recombinant protein expressed in E. coli with a histidine-tail, representing amino acids 227 to 539 of the rhesus rotavirus RRV VP1 protein. Goat anti-mouse Alexa-488- and Goat anti-rabbit Alexa-568-conjugated secondary antibodies were purchased from Molecular Probes (Eugene, Oreg.).

Immunofluorescence

Request a detailed protocol

MA104 cells grown on glass coverslips were infected with rotavirus RRV at a multiplicity of infection (MOI) of 1. Six hours post infection, the cells were fixed with and processed for immunofluorescence as described (Silva-Ayala et al., 2013). Finally, the coverslips were mounted onto the center of glass slides with storm solution (1.5% glucose oxidase $+ 100$ mM $β$ -mercaptoethanol) to induce the blinking of the fluorophores (Dempsey et al., 2011; Heilemann et al., 2009).

Transmission electron microscopy

Request a detailed protocol

Cells grown in 75- $c m^{2}$ flasks were infected with rotavirus RRV at an MOI of 3 as described above. Six hours postinfection the cells were fixed in 2.5% glutaraldehyde-0.1 M cacodylate (pH 7.2), postfixed with 1% osmium tetroxide, and embedded in Epon 812 resin. The ultrathin sections obtained were stained with 2% uranyl acetate-1% lead citrate (Reynolds mix). The grids were examined with a Zeiss EM-900 electron microscope at 80 kV.

Set up of the optical microscope

Request a detailed protocol

All super-resolution imaging measurements were performed on an Olympus IX-81 inverted microscope configured for total internal reflection fluorescence (TIRF) excitation (Olympus, cellTIRFTM Illuminator). The critical angle was set up such that the evanescence field had a penetration depth of ~200 nm (Xcellence software v1.2, Olympus soft imaging solution GMBH). The samples were continuously illuminated using excitation sources depending on the fluorophore in use. Alexa Fluor 488 and Alexa Fluor 568 dyes were excited with a 488 nm or 568 nm diode-pumped solid-state laser, respectively. Beam selection and modulation of laser intensities were controlled via Xcellence software v.1.2. A full multiband laser cube set was used to discriminate the selected light sources (LF 405/488/561/635 A-OMF, Bright Line; Semrock). Fluorescence was collected using an Olympus UApo N $100 x / 1.49$ numerical aperture, oil-immersion objective lens, with an extra 1.6x intermediate magnification lens. All movies were recorded onto a 128 × 128-pixel region of an electron-multiplying charge coupled device (EMCCD) camera (iXon 897, Model No: DU-897E-CS0-#BV; Andor) at 100 nm per pixel, and within a 50 ms interval (300 images per fluorescent excitation).

Bayesian analysis of the blinking and bleaching

Request a detailed protocol

Sub-diffraction images were derived from the Bayesian analysis of the stochastic Blinking and Bleaching of Alexa Fluor 488 dye (Cox et al., 2011). For each super-resolution reconstruction, 300 images were acquired at 20 frames per second with an exposure time of 50 ms at full laser power, spreading the bleaching of the sample over the length of the entire acquisition time. The maximum laser power coming out of the optical fiber measured at the back focal plane of the objective lens, for the 488 nm laser line, was 23.1 mW. The image sequences were analyzed with the 3B algorithm considering a pixel size of 100 nm and a full width half maximum of the point spread function of 270 nm (for Alexa Fluor 488), measured experimentally with 0.17 μm fluorescent beads (PS-SpeckTM Microscope Point Source Kit, Molecular Probes, Inc). All other parameters were set up using default values. The 3B analysis was run over 200 iterations, as recommended by the authors in Cox et al. (2011), and the final super-resolution reconstruction was created at a pixel size of 10 nm with the ImageJ plugin for 3B analysis (Rosten et al., 2013), using parallel computing as described in Hernández et al. (2016). The resolution increase observed in our imaging set up by 3B analysis was up to five times below the Abbe’s limit (~50 nm). The resolution provided by 3B was improved by computing the photo-physical properties of Alexa Fluor 488, and Alexa Fluor 568 dyes, which were provided to 3B algorithm, as an input parameter which encompass the probability transition matrix between fluorophore’s states. The method was validated with 40 nm gattapaint nanorules (PAINT 40RG, gattaquant, Inc) labeled with ATTO 655/ATTO 542 dyes (see ‘3B Algorithm’ in Appendix 1).

Code and statistical analysis

Request a detailed protocol

The segmentation algorithm (VPs-DLSFC) was developed in Matlab R2018a (9.4.0.813654) software. A detailed explanation of each the developed methods is available in Appendix 1. Statistical analysis were performed using R version 3.4.4 (2018-03-15) software. All the codes are available at https://github.com/Yasel88/Nanoscale_organization_of_rotavirus_replication_machineries (Garcés Suárez, 2019; copy archived at https://github.com/elifesciences-publications/Nanoscale_organization_of_rotavirus_replication_machineries).

Appendix 1

This appendix is divided in seven sections. In –Segmentation algorithm–, we discuss the mathematical details of the algorithm 'Viroplasm Direct Least Square Fitting Circumference' (VPs-DLSFC). In Section –Algorithm validation–, we compare the algorithm VPs-DLSFC with other two classic methods for the adjustment of circumferences; this analysis was done thanks to the use of approximately 40000 synthetic 'ground truth' images. In Section – Model Considerations–, we prove that the distribution of the studied viral elements follow a concentric circumference spatial distribution. Sections –Exploratory Analysis– and – Linear Regression Model– contains a supplementary exploratory analysis about the spatial distribution of the viral elements, and details about the results and the residual error analysis of the linear regression models, respectively. In Section –NSP5 and NSP4 as reference proteins– we developed a similar study that was made with NSP2, but considering NSP5 and NSP4 as reference proteins. Also, we shown that there is no statistically significative differences between the distributions of the viral elements when we changed the reference protein. Finally, in Section –3B Algorithm– we explain the fitting model for the 3B algorithm where the transition matrix is modeled by ordinary differential equations (ODE).

Segmentation algorithm

The use of primitive models for the segmentation of the SRM images has many benefits, like computational efficiency, simple programmation, and understandable information about the objects that were segmented. In this regard, we developed a simple method for fitting a circumference to scattered data, which we called ‘Direct Least Squares Fitting Circumference’ (DLSFC). This approach considers basic mathematical analysis tools for the computation of the extreme value of a continuous function with N variables.

Direct least squares fitting circumference (DLSFC)

The spatial distribution of a viral proteins into the VP can be seen as a set of points $P = {(x_{i}, y_{i}) | i = \bar{1 : N}} \in R^{2}$ (scattered data in the plane). Taking into account the implicit equation of a circumference ${(x - x_{c})}^{2} + {(y - y_{c})}^{2} = r^{2}$ , where $(x_{c}, y_{c})$ is the center, and $r$ is the radius. The problem of adjust a circumference to $P$ , is a optimization problem given by:

\min_{x_{c}, y_{c}, r} \sum_{i = 1}^{N} {({(x_{i} - x_{c})}^{2} + {(y_{i} - y_{c})}^{2} - r^{2})}^{2} .

The center of mass of $P$ is given by:

(\bar{x}, \bar{y}) = (\frac{1}{N} \sum_{i = 1}^{N} x_{i}, \frac{1}{N} \sum_{i = 1}^{N} y_{i}) .

Then, considerering Equation (2), the problem (Equation (1)) can be rewritten as:

\min_{u_{c}, v_{c}, α} \sum_{i = 1}^{N} {({(u_{i} - u_{c})}^{2} + {(v_{i} - v_{c})}^{2} - α)}^{2}

where $u_{i} = x_{i} - \bar{x}$ , $v_{i} = y_{i} - \bar{y} (for all i = 1, 2, \dots, N)$ , $α = r^{2}$ , and $(u_{c}, v_{c})$ is the center of the circumference after the change of variable.

For simplicity, we will define as $f_{u_{c}, v_{c}, α} (u_{i}, v_{i}) := {(u_{i} - u_{c})}^{2} + {(v_{i} - v_{c})}^{2} - α$ the distance function, and by $F (u_{c}, v_{c}, α) := \sum_{i = 1}^{N} f_{u_{c}, v_{c}, α} {(u_{i}, v_{i})}^{2}$ the objective function of our minimization problem.

Note 1 Note that, with the new convention of notation, the problem (Equation (3)) is equivalent to:

\min_{u_{c}, v_{c}, α} F (u_{c}, v_{c}, α)

A simple alternative to resolve Equation (4) is to consider the extreme values of $F (u_{c}, v_{c}, α)$ , this is:

\nabla F (u_{c}, v_{c}, α) = (\frac{\partial F}{\partial u_{c}}, \frac{\partial F}{\partial v_{c}}, \frac{\partial F}{\partial α}) = (0, 0, 0) .

The partial derivates of the function $F (u_{c}, v_{c}, α)$ are:

1. For $α$ :

\frac{\partial F}{\partial α} = - 2 \sum_{i = 1}^{N} f_{u_{c}, v_{c}, α} (u_{i}, v_{i}),

Taking into account the condition given in Equation (5) for $\frac{\partial F}{\partial α}$ , we obtain that:

\sum_{i = 1}^{N} f_{u_{c}, v_{c}, α} (u_{i}, v_{i}) = 0 .

2. For $u_{c}$ :

\begin{aligned} \frac{\partial F}{\partial u_{c}} & = - 4 \sum_{i = 1}^{N} [f_{u_{c}, v_{c}, α} (u_{i}, v_{i}) (u_{i} - u_{c})] \\ = - 4 [\sum_{i = 1}^{N} u_{i} f_{u_{c}, v_{c}, α} (u_{i}, v_{i}) - u_{c} \sum_{i = 1}^{N} f_{u_{c}, v_{c}, α} (u_{i}, v_{i})] \\ = - 4 \sum_{i = 1}^{N} u_{i} f_{u_{c}, v_{c}, α} (u_{i}, v_{i}) \end{aligned}

3. For $v_{c}$ : It is the same process than $u_{c}$ , the final result is,

\frac{\partial F}{\partial v_{c}} = - 4 \sum_{i = 1}^{N} v_{i} f_{u_{c}, v_{c}, α} (u_{i}, v_{i}) .

Expanding the Equation (7) we obtain:

\begin{array}{ll} \frac{\partial F}{\partial u_{c}} & = - 4 \sum_{i = 1}^{N} u_{i} f_{u_{c}, v_{c}, α} (u_{i}, v_{i}) = - 4 \sum_{i = 1}^{N} u_{i} [(u_{i} - u_{c})^{2} + (v_{i} - v_{c})^{2} - α] \\ = - 4 \sum_{i = 1}^{N} u_{i} [(u_{i}^{2} + u_{c}^{2} - 2 u_{i} u_{c}) + (v_{i}^{2} + v_{c}^{2} - 2 v_{i} v_{c}) - α] \\ = - 4 [\sum_{i = 1}^{N} u_{i}^{3} + u_{c}^{2} \sum_{i = 1}^{N} u_{i} - 2 u_{c} \sum_{i = 1}^{N} u_{i}^{2} + \sum_{i = 1}^{N} u_{i} v_{i}^{2} + v_{c}^{2} \sum_{i = 1}^{N} u_{i} - 2 v_{c} \sum_{i = 1}^{N} u_{i} v_{i} - α \sum_{i = 1}^{N} u_{i}], \end{array}

and taking into account that $u_{i} = x_{i} - \bar{x} \Rightarrow \sum_{i = 1}^{N} u_{i} = \sum_{i = 1}^{N} x_{i} - N \bar{x} = \sum_{i = 1}^{N} x_{i} - N \frac{\sum_{i = 1}^{N} x_{i}}{N} = 0$ , we obtain that:

\frac{\partial F}{\partial u_{c}} = - 4 [\sum_{i = 1}^{N} u_{i}^{3} - 2 u_{c} \sum_{i = 1}^{N} u_{i}^{2} + \sum_{i = 1}^{N} u_{i} v_{i}^{2} - 2 v_{c} \sum_{i = 1}^{N} u_{i} v_{i}] .

Let $C_{(u^{k}, v^{t})} := \sum_{i = 1}^{N} u_{i}^{k} v_{i}^{t}$ , then the equation Equation (9) can be written as:

\frac{\partial F}{\partial u_{c}} = - 4 [C_{(u^{3}, v^{0})} - 2 u_{c} C_{(u^{2}, v^{0})} + C_{(u^{1}, v^{2})} - 2 v_{c} C_{(u^{1}, v^{1})}],

and considering the extreme condition $\frac{\partial F}{\partial u_{c}} = 0$ (see Equation (5)) we obtain:

u_{c} C_{(u^{2}, v^{0})} + v_{c} C_{(u^{1}, v^{1})} = \frac{1}{2} (C_{(u^{3}, v^{0})} + C_{(u^{1}, v^{2})}) .

Following the same process, but now considering the Equation (8), we obtain:

u_{c} C_{(u^{1}, v^{1})} + v_{c} C_{(u^{0}, v^{2})} = \frac{1}{2} (C_{(u^{0}, v^{3})} + C_{(u^{2}, v^{1})}) .

The extreme values $\hat{u_{c}}$ and $\hat{v_{c}}$ of the function $F (u_{c}, v_{c}, α)$ are obtained as solution of the linear system formed by Equation (10) and Equation (11).

Note 2 The coordinate system translation to the mass center of the original data, allow the simplification of the equation (Equation (9)).

The radius of the circumference is obtained developing the Equation (6):

\begin{array}{ll} 0 & = \sum_{i = 1}^{N} [u_{i}^{2} + u_{c}^{2} - 2 u_{i} u_{c} + v_{i}^{2} + v_{c}^{2} - 2 v_{i} v_{c} - α] \\ = \sum_{i = 1}^{N} u_{i}^{2} + N u_{c}^{2} + \sum_{i = 1}^{N} v_{i}^{2} + N v_{c}^{2} - N α, \end{array}

and writing the equation in function of $α$ , we obtain that:

\hat{α} = u_{c}^{2} + v_{c}^{2} + \frac{C_{(u^{2}, v^{0})} + C_{(u^{0}, v^{2})}}{N} .

The point $(\hat{u_{c}}, \hat{v_{c}}, \hat{α})$ is a critical point of the function $F (u_{c}, v_{c}, α)$ . Finally, we need to compute the Hessian matrix of $F (u_{c}, v_{c}, α)$ to test if $(\hat{u_{c}}, \hat{v_{c}}, \hat{α})$ is a maximum, a minimum, or an inflection point:

H = (\begin{matrix} \frac{\partial^{2} F}{\partial α^{2}} & \frac{\partial^{2} F}{\partial α \partial u_{c}} & \frac{\partial^{2} F}{\partial α \partial v_{c}} \\ \frac{\partial^{2} F}{\partial u_{c} \partial α} & \frac{\partial^{2} F}{\partial u_{c}^{2}} & \frac{\partial^{2} F}{\partial u_{c} \partial v_{c}} \\ \frac{\partial^{2} F}{\partial v_{c} \partial α} & \frac{\partial^{2} F}{\partial v_{c} \partial u_{c}} & \frac{\partial^{2} F}{\partial v_{c}^{2}} \end{matrix}) = (\begin{matrix} 2 N & - 4 N u_{c} & - 4 N v_{c} \\ 0 & 8 \sum_{i = 1}^{N} u_{i}^{2} & 8 \sum_{i = 1}^{N} u_{i} v_{i} \\ 0 & 8 \sum_{i = 1}^{N} u_{i} v_{i} & 8 \sum_{i = 1}^{N} v_{i}^{2} \end{matrix})

The principal minors of $H$ are:

\begin{aligned} H_{1} & = 2 N > 0, \\ H_{2} & = | \begin{matrix} 2 N & - 4 N u_{c} \\ 0 & 8 \sum_{i = 1}^{N} u_{i}^{2} \end{matrix} | = 16 N \sum_{i = 1}^{N} u_{i}^{2} > 0, \\ H_{3} & = | \begin{matrix} 2 N & - 4 N u_{c} & - 4 N v_{c} \\ 0 & 8 \sum_{i = 1}^{N} u_{i}^{2} & 8 \sum_{i = 1}^{N} u_{i} v_{i} \\ 0 & 8 \sum_{i = 1}^{N} u_{i} v_{i} & 8 \sum_{i = 1}^{N} v_{i}^{2} \end{matrix} | = 128 N \underset{Cauchy- Schwarz inequality}{\underset{⏟}{(\sum_{i = 1}^{N} u_{i}^{2} \sum_{i = 1}^{N} v_{i}^{2} - {(\sum_{i = 1}^{N} u_{i} v_{i})}^{2}]}} > 0. \end{aligned}

Note 3 The Cauchy-Schwarz inequality is equal to zero if and only if exist a real value $r$ such that $u_{i} r + v_{i} = 0$ , for all $i = 1, 2, \dots, N$ , that is, all the points are in a straight line. In our problem, the proteins do not follow a straight line distribution in the space, then, we can guarantee the strict inequality. Note that $H_{2} = 0$ if and only if $u_{i} = 0, for all i = 1, 2, \dots, N$ , which is a particular case of a straight line.

The principal minors of the matrix $H$ are greater than zero, then, the point $(\hat{u_{c}}, \hat{v_{c}}, \hat{α})$ is a local minimum, but this point is the only critical point of the function $F (u_{c}, v_{c}, α)$ , and as consequence it is a global minimum.

The center of the fitted circumference in the original coordinate system is $(x_{c}, y_{c}) = (\hat{u_{c}}, \hat{v_{c}}) + (\bar{x}, \bar{y})$ , and the radius is $R = \sqrt{\hat{α}}$ .

Note 4 This method is based on a simple and very popular mathematical techniques (specifically Differential Calculus). Even when we do not found a published paper with this approach, we want to take cares and not adjudicate this technique to our work.

Note 5 Note that, by definition, the distance function $f_{u_{c}, v_{c}, α} (u_{i}, v_{i})$ is not a geometric distance to the circle, which is probably more intuitive to use in this kind of problems. Let consider for a second the geometric distance of a set of points $P$ to a circle, which is given by:

G (x_{c}, y_{c}, r) := \sum_{i = 1}^{N} {(r - \sqrt{{(x_{i} - x_{c})}^{2} + {(y_{i} - y_{c})}^{2}})}^{2},

where $(x_{c}, y_{c})$ is the center of the circumference and $r$ is the radius. Considering the extreme condition for the function $G (x_{c}, y_{c}, r)$ , it is obtained that:

\begin{aligned} \frac{\partial G}{\partial r} & = - 2 \sum_{i = 1}^{N} \sqrt{(x_{i} - x_{c})^{2} + (y_{i} - y_{c})^{2}} + 2 N r \\ \frac{\partial G}{\partial x_{c}} & = 2 r \sum_{i = 1}^{N} \frac{x_{i} - x_{c}}{\sqrt{(x_{i} - x_{c})^{2} + (y_{i} - y_{c})^{2}}} - 2 N \bar{x} + 2 N x_{c} \\ \frac{\partial G}{\partial y_{c}} & = 2 r \sum_{i = 1}^{N} \frac{y_{i} - y_{c}}{\sqrt{(x_{i} - x_{c})^{2} + (y_{i} - y_{c})^{2}}} - 2 N \bar{y} + 2 N y_{c}, \end{aligned}

where $\bar{x}$ and $\bar{y}$ are the mean values of the variables $x$ and $y$ respectively. Simultaneously equating these partials to zero does not produce closed form solutions for $x_{c}, y_{c},$ and $r$ . With numerical methods it is possible to carry out the optimization of these parameters, but these alternatives are iterative, no-deterministic process, which also are highly sensitives to the initial values, while our proposal not have these kind of unwished characteristics. In any case, in Appendix 1 Section –Algorithm validation– we compared our approach with an algorithm based on geometric distance and explained in details the benefits of each one.

The final segmentation algorithm is composed by three steps (Appendix 1—figure 1). The first is a manual pre-segmentation that guarantees the existence of one and only one viroplasm per image (Appendix 1—figure 1A). In the second step, we adjust a circumference to the reference protein (remember that these are paired experiments, where NSP2 is present in all the combinations as reference protein) through the algorithm DLSFC (Appendix 1—figure 1B), and finally the radius of the accompanying protein is adjusted using the Equation (12). Note that, the center of the accompanying protein is the same that the center of the reference protein (concentric model). Out of the manual pre-segmentation step, this algorithm is automatic, deterministic, not iterative and with a linear computational complexity.

Appendix 1—figure 1

Download asset Open asset

Scheme of the ‘Viroplasm Direct Least Square Fitting Circumference’ algorithm (VPs-DLSFC).

SRM) Complete SRM image; A) Manual pre-segmentation step, an expert selects and isolates each viroplasm as a single image; B) Fit a circumference to the reference protein through the algorithm DLSFC; C) The center of the reference protein is taken as the center of the accompanying protein, and then the radius of the adjust circumference for this second protein is computed.

https://doi.org/10.7554/eLife.42906.009

Note 6 The hypothesis that the viral proteins in the viroplasm are distributed like as concentric circumferences was proved in Appendix 1 Section – Model Considerations– (‘Model Considerations’).

Algorithm validation

The algorithm DLSFC was compared with other two approaches proposed by Gander et al. (1994). For a more comprehensive and clear comparison we named these others two methods as ‘Algebraic Least Square Fitting Circle’ (ALSFC) and ‘Geometric Least Square Fitting Circle’ (GLSFC).

The algorithm ALSFC considers the algebraic representation of a circle in the plane:

F (p) = A p^{t} p + B^{t} p + C;

where $p = (x, y)$ is a point in $ℝ^{2}$ , $B = (B_{1}, B_{2}) \in ℝ^{2}$ , $A \neq 0$ , and $C$ is the independent term. Then, let $p_{i} = (x_{i}, y_{i}) \in ℝ^{2}, i = 1, 2, \dots, N$ the set of points that we want to adjust. The objective function for this minimization problem is $\sum_{i = 1}^{n} F (p_{i})$ , which can be represented as matrix form:

Ψ u = \underset{ψ}{\underset{⏟}{⟮ \begin{matrix} x_{1}^{2} + y_{1}^{2} & x_{1} & y_{1} & 1 \\ x_{2}^{2} + y_{2}^{2} & x_{2} & y_{2} & 1 \\ ⋮ & ⋮ & ⋮ & ⋮ \\ x_{n}^{2} + y_{n}^{2} & x_{n} & x_{n} & 1 \end{matrix} ⟯}} \underset{u}{\underset{⏟}{⟮ \begin{matrix} A \\ B_{1} \\ B_{2} \\ C \end{matrix} ⟯}}

The final optimization problem is given by:

{\begin{matrix} \min_{u} ∥ Ψ u ∥ \\ subjet to: ∥ u ∥ = 1 \end{matrix},

where $∥ u ∥ = 1$ is a contrain to avoid the trivial solution $u = (0, 0, 0, 0)$ . The notation $‖ v ‖ = \sqrt{v_{1}^{2} + v_{2}^{2} + \dots + v_{m}^{2}}$ represent the Euclidean norm. Finally, the problem (Equation (16)) is solved considering the right singular eigenvector associated with the smallest singular eigenvalue of $Ψ$ (Gander et al., 1994).

On the other hand, the algorithm GLSFC deals with the minimization of the geometric distance:

\min_{x_{c}, y_{c}, r} \sum_{i = 1}^{n} {(∥ (x_{c}, y_{c}) - p_{i} ∥ - r)}^{2} .

Note that ${(∥ (x_{c}, y_{c}) - p_{i} ∥ - r)}^{2}$ is the geometric distance of the point $p_{i}$ to the circle ${(x - x_{c})}^{2} + {(y - y_{c})}^{2} = r^{2}$ . This problem is nonlinear, and in Gander et al. (1994), the authors consider to resolve it trought a Gauss-Newton algorithm. The Gauss-Newton method is an iterative algorithm that depends of an initial approximation, and in this case, Gander and collaborators consider the solution of the problem (Equation (16)) as the initial value for the iterative process. More details about the algorithms ALSFC and GLSFC can be consulted in Gander et al. (1994). In summary, the method ALSFC is very simple and computational efficient, but the algebraic distance might not reflect with accuracy the real distance between the points and the circumference and, as consequence, the parameters of the model might be biased. On the contrary, the algorithm GLSFC is highly precise, but it is an iterative process (demand more computation time), and depends of good initial values to achieve the convergence. The Matlab codes for the algorithms GLSFC and ALSFC were obtained from Brown (2007).

For the validation, we simulated the spatial distribution of viral elements as circumferences, for which we know their parametric form (‘ground truth’ dataset) (Appendix 1—figure 2A). The use of ‘ground truth’ allows us to quantify the error in the adjustment of the algorithms DLSFC, ALSFC and GLSFC at different noise levels (Appendix 1—figure 2B) and partial occlusion conditions (Appendix 1—figure 2C). We generated over 40 000 images (size $512 \times 512$ pixels) taking into account different levels of additive white Gaussian noise (AWGN) in order to consider auto-fluorescence and the error in the localization of the fluorophores by the algorithm 3B-ODE (see Appendix 1 Section –3B Algorithm–), and partial occlusion angles. Several radii and position of the synthetic viroplasms were generated randomly through a uniform distribution function. The codes for this simulation are availables at https://github.com/Yasel88/Nanoscale_organization_of_rotavirus_replication_machineries (Garcés Suárez, 2019; copy archived at https://github.com/elifesciences-publications/Nanoscale_organization_of_rotavirus_replication_machineries).

Appendix 1—figure 2

Download asset Open asset

Simulation of the viral proteins.

(A) ‘Ground truth’ (GT) circumference; (B) Addition of gaussian noise to the GT circumference (see Appendix 1 subsection –noise generation–); (C) Generation of partial occlusion (see Appendix 1 subsection –Partial Occlusion Generation).

https://doi.org/10.7554/eLife.42906.010

Noise generation

Let $ℐ$ a synthetic image and $p_{i} \in ℝ^{2}, i = 1, 2, \dots, N$ points of the ‘ground truth’ circunference $𝒞 \in ℐ$ , the AWGN over $p_{i}, \forall i = \bar{1, N}$ was generated as:

p_{i}^{c} = p_{i} + σ X_{i},

where $p_{i}^{c}$ is the corrupted point after the contamination with AWGN, $σ$ is a scalar that represents the level of noise, and $X_{i}$ is a trial of a random variable $X \sim N (0, 1)$ (standard gaussian distribution). In our experimental design we included 20 different levels of noise $σ = 0 : 0.5 : 10$ (increase by 0.5 from 0 to 10). Note that the noise was added to the points of the circumferences and not to the images. This is because we are working with SRM images, and as consequence does not exist background noise.

Partial occlusion generation

The partial occlusion can be seen as the existence of incomplete information of the objects of interest, for example, an image where we have only information about the half of the VPs. In our perspective, it is important that a segmentation algorithm have a good performance even in these situations.

Definition 1 We called an angle $θ$ as a partial oclussion angle for a set of points $p_{i} = (x_{i}, y_{i}) \in R^{2}, i = 1, 2, \dots, N$ , if not exist $j \in {1, 2, \dots, N}, r \in R$ and $β \in [α, α + θ]$ such as:

p_{j} = (\bar{x} + r \cos β, \bar{y} + r \sin β),

where $α$ is an angle in $[0, 2 π]$ , and $(\bar{x}, \bar{y}) = (\frac{1}{N} \sum_{i = 1}^{N} x_{i}, \frac{1}{N} \sum_{i = 1}^{N} y_{i})$ is the center of mass of the points $p_{i}$ .

Note 7 Note that, in the Definition 1 we considered the interval $[α, α + θ]$ as a partial oclusion angle $θ$ , but it is possible to obtain the same oclussion angle with $[α, α - θ]$ . For our validation we took $α = 0$ for simplicity, but this has not consequences in our experiments, because we generate the points as a circumference.

The relationship between cartesian and polar coordinates is given by:

{\begin{matrix} x & = a + r \cos β \\ y & = b + r \sin β \end{matrix},

where, (a, b) is the circumference center, $r$ is the radius and $β$ is the angular coordinate. Then, for example, to provoke a partial occlusion of $θ$ , it is just needed to avoid the generation of points in the angle $[0, θ]$ or $[2 π, 2 π - θ]$ through the Equation (19). Appendix 1—figure 3 shows this strategy, in this example, a half of the information is removed from the ‘ground truth’ circumference.

Appendix 1—figure 3

Download asset Open asset

Generation of partial occlusion in the angle $θ = π$ (red line).

The circle just conserves the information relative to the angle $[0, π]$ .

https://doi.org/10.7554/eLife.42906.011

In general, the average of the points that will be conserved after the generation of partial occlusion is given by:

P_{%} = 100 (1 - \frac{θ}{2 π}) .

In the validation of the algorithm DLSFC, we considered four different partial occlusion conditions: 1- Not partial occlusion: Conserves all the points of the ‘ground truth’ circumference; 2– 25% of partial occlusion: Partial occlusion angle $θ = π / 2$ ; 3– 50% of partial occlusion: Partial occlusion angle $θ = π$ ; 4– 75% of partial occlusion: Partial occlusion angle $θ = 3 π / 2$ .

In the experiments, we represent the noise through the mean and the standard deviation of the distance of the points $p_{i}^{c}, i = 1, 2 \dots, n$ to the ‘ground truth’ circumference $𝒞$ , this is:

\underset{Mean Value}{\underset{⏟}{\bar{d_{𝒞}} = \frac{\sum_{i = 1}^{n} d_{i 𝒞}}{n}}}, \underset{Standard Deviation}{\underset{⏟}{σ_{d_{𝒞}} = \sqrt{\frac{\sum_{i = 1}^{n} {(d_{i 𝒞} - \bar{d_{𝒞}})}^{2}}{N - 1}}}}

where $d_{i 𝒞} = | ∥ μ - p_{i}^{c} ∥ - r |$ is the geometric distance between the point $p_{i}^{c}$ and the circumference $𝒞$ , $| \cdot |$ represent the absolute value, $μ = (x_{c}, y_{c})$ is the center of the circumference and $r$ is the radius. The mean and standard deviation are an useful alternative to understand the amount of dispersion generated by a noise $σ$ in a circumference $𝒞$ .

The Appendix 1—figure 4 shows the adjustment through the algorithms DLSFC, GLSFC and ALSFC of one ‘ground truth’ circumference (solid black line) that was corrupted by different levels of AWGN and partial occlusion angles. As was expected, the performance of the algorithms seems to get worse when increase the level of partial occlusion, but the three algorithms shows good results for high AWGN levels. This example suggests that the algorithm GLSFC (solid red line) has a better performance than DLSFC (solid blue line) and ALSFC (solid green line) when the partial occlusion is less than $π$ (columns A-C), but when $θ = 3 π / 2$ , this method breaks down, and this observation is more relevant when increase the level of noise. The algorithm ALSFC shows relative good results for all the experimental conditions, but it is interesting to note that, in most cases, the adjustment is worse than in the DLSFC and GLSFC alternatives, probably as consequence of the algebraic distance (see Appendix 1 Section –Algorithm validation–). This example is just for a first visual analysis, but a complete study of the results obtained by the three algorithms over the 40 000 synthetic images will be presented below.

Appendix 1—figure 4

Download asset Open asset

Data adjustment through the algorithms DLSFC (solid blue line), ALSFC (solid green line) and GLSFC (solid red line).

The data was generated corrupting the points of a ‘ground truth’ circumference (solid black line) with different noise levels (rows) and four differents occlusion angles conditions (columns). Column 1 (A.1-A.4) Not Occlusion angle, Column 2 (B.1-B.4) $θ = π / 2$ , Column 3 (C.1-C.4) $θ = π$ , Column 4 (D.1-D.4) $θ = 3 π / 2$ . The noise increase by rows: Row 1: $\bar{d_{𝒞}} \in [1.5 μ m, 1.54 μ m]$ , $σ (d_{𝒞}) \in [1.29, 1.74]$ , $σ_{AWGN} = 2.1053$ ; Row 2: $\bar{d_{𝒞}} \in [4.19 μ m, 5.28 μ m]$ , $σ (d_{𝒞}) \in [3.64, 5.71]$ , $σ_{AWGN} = 4.7368$ ; Row 3: $\bar{d_{𝒞}} \in [7.5 μ m, 9.23 μ m]$ , $σ (d_{𝒞}) \in [7.49, 10.38]$ , $σ_{AWGN} = 7.3684$ ; Row 4: $\bar{d_{𝒞}} \in [12.58 μ m, 17.78 μ m]$ , $σ (d_{𝒞}) \in [11.12, 17.96]$ , $σ_{AWGN} = 10$ .

https://doi.org/10.7554/eLife.42906.012

A circumference can be represented as a vector $𝒞 = (r, x_{c}, y_{c}) \in ℝ^{3}$ , where $r$ is the radius, and $(x_{c}, y_{c})$ the center. Let $\hat{𝒞} = (\hat{r}, \hat{x_{c}}, \hat{y_{c}})$ and $𝒞 = (r, x_{c}, y_{c})$ the ‘ground truth’ and the adjusted circumference, respectively. The fit error committed by the algorithms can be quantified as:

ε = ‖ \hat{𝒞} - 𝒞 ‖

Now, thanks to the use of ‘ground truth’ circumferences, it is possible to compare the obtained outcome with the ideal result. We consider that an adjustment is good enough if the estimation error of each circumference’s component $𝒞$ is less than 0.1 μm. This value is approximately two-fold smaller than the theoretical Abbe’s limit ( $\approx 250 n m$ ), and taking into account that the synthetic images resolution vary between 4.6 and 25.8 μm in mean (see Appendix 1—figure 5), we are demanding that the algorithm has a nanoscopic precision (less than 100 nm) even when the images have a very bad resolution. Combining the threshold explained above with Equation (21), we obtain the final condition based on the euclidean norm between these two vectors:

ε = ‖ \hat{𝒞} - 𝒞 ‖ \leq ‖ (0.1, 0.1, 0.1) ‖ = 0.1732 μ m .

Appendix 1—figure 5

Download asset Open asset

Error in the adjustment of the algorithms DLSFC, GLSFC and ALSFC.

The error was quantified through Equation (21). In all panels, the horizontal black line represents the value $y = 0.1732 μ m$ (see Equation (21)). (A) Boxplot of the error distribution for each algorithm taking into account the partial occlusion angles (four sub-panels). The x-axis specifies the name of the algorithm and the y-axis the error in microns. The blue shadow in the sub-panel $\frac{3 π}{2}$ represents the occlusion angle in which the mean value of the errors are greater than $0.1732 μ m$ . (B) Mean error of the adjustment by the algorithms DLSFC, GLSFC and ALSFC. The bottom x-axis is the Mean distance of the corrupted points to the ‘ground truth’ circumference (see Equation (20)), and the up x-axis is the Standard Deviation. The figure is split out in four sub-panels in accordance with the occlusion angle. The black boxes show examples in which the algorithm GLSFC does not reach the convergence (extremely high error). The arrows mark out some examples where the algorithm ALSFC does not have have a good adjustment. (C) Zoom of the performance of the algorithms DLSFC and ALSFC. (D) Results of the algorithm DLSFC. The graphics in the panel (C)) and (D)) also shows the confidence interval around the mean (see green and blue shadows), it was computed as $\bar{d_{𝒞}} \pm σ (d_{𝒞})$ .

https://doi.org/10.7554/eLife.42906.013

The Appendix 1—figure 5(a) shows the descriptive analysis (through boxplots) of the errors generated by the algorithms DLSFC, GLSFC and ALSFC over the 40000 synthetic images. In some experiments we obtained extremely good adjusts ( $ε \approx 10^{- 15} μ m$ ), but in general the 75% of the errors are in the interval $[10^{- 3}, 1] μ m$ . For all occlusion angles, the algorithm GLSFC had a bad behaviour in a wide range of experiments with an error of $\approx 10^{10} μ m$ . This kind of situation was expected for big partial occlusion angles (for example $θ = 3 π / 2$ ), but in conditions where does not exist partial occlusion ( $θ = 0$ ), the bad adjustment of GSFC is evidence that this algorithm is not a good alternative in this kind of problems. Note that a similar situation happens with the algorithm ALSFC, but in this case the maximum error is $\approx 10^{3} μ m$ , which is high too, but significantly less than that obtained by GLSFC. The algorithm DLSFC does not show the previous problems; observe that the maximun error is under $1 μ m$ for all experimental conditions, even when $θ = 3 π / 2$ . Now, if we forget the outliers and take into account only the 75% of the experiments (boxes), a direct relationship between the error of each algorithm and the partial occlusion angles (increase occlusion angle imply a major error) is observed. However, the raise of the mean error is small in relationship with the occlusion angle, which suggests that these three methods are robust to partial occlusion. The blue shadow band highlights the occlusion angle $(θ = 3 π / 2)$ for which the algorithms have a median error greater than $0.1732 μ m$ . Therefore, it is necessary at least the 50% of the data to obtain a good adjustment (with an error under $0.1732 μ m$ ) in the 75% of the experiments.

The Appendix 1—figure 5(b) displays the mean error obtained by each algorithm in relationship with the resolution of the images (mean and the standard deviation of the points distance to the ‘ground truth’ circumference, Equation (20)). This graph shows again that the method GLSFC did not reach the convergence in some experiments (remember that GLSFC is based on an iterative optimization algorithm), even for small partial occlusion angles, but it is interesting to point out that in all these cases (at least for the occlusion angles $θ = {0, \frac{π}{2}, π}$ ), these high errors ( $\approx 10^{10} μ m$ ) seem to be related with bad initial values. Note that in these experiments the algorithm ALSFC do not has a good performance (remember that the result of ALSFC is the initial value for GLSFC), which is an evidence of the sensibility of the GLSFC to the initial values. For the occlusion angle $θ = \frac{3 π}{2}$ , a lot of bad fits were obtained through the algorithm GLSFC. In our opinion, even when the algorithm GLSFC can reach an extremely good fit, its sensitivity to initial values, the computational cost (iterative process), and the possibility of not convergence, are strongs reasons to consider that this alternative is not viable to use in this kind of problems.

In the Appendix 1—figure 5(c), the mean error of the algorithms DLSFC and ALSFC was splitted to see more clearly the performance of each one. For small partial occlusion angles $(θ = {0, \frac{π}{2}})$ the algorithm DLSFC has errors $\leq 0.1732 μ m$ , while ALSFC, in high noise scenes, get a poor adjustment (black arrows). When $θ = π$ , the method ALSFC has errors $\geq 0.1732 μ m$ in many cases, that increase when the level of noise grows. Both algorithms generate an error $\geq 0.1732 μ m$ when $θ = \frac{3 π}{2}$ , but in the case of DLSFC, in some experiments had errors $< 0.1732 μ m$ , even for high noise levels. The Appendix 1—figure 5(d) show only the performance of the algorithm DLSFC. As noted, the method DLSFC shows a behaviour extremely robust and stable to noise and partial acclusion; note that for $θ = {0, \frac{π}{2}, π}$ does not exist an increment in the mean error when the noise level is incremented. In extreme experimental conditions, when only the 25% of the data remain in the image $(θ = \frac{3 π}{2})$ , DLSFC has a mean error between $[0.3, 0.5] μ m$ with a 95% confidence interval equal to $[0.1, 0.9] μ m$ .

So far we have shown the behaviour of the three algorithms considering several noise levels and partial occlusion angles conditions. However, it is interesting to know through a hypothesis test (inferential statistics) the performance of DLSFC and ALSFC taking into account the experiments we developed (samples). Note that we are not going to study the algorithm GLSFC, because, as we mentioned above, its sensitivity to initial values and its computational cost (iterative process) are strong reasons to consider that this algorithm is not viable to use in this kind of problems. Taking into account the threshold in Equation (22), for each experiment we have two possible outcomes:

{\begin{matrix} success, & if ε \leq 0.1732 μ m \\ failure, & if ε > 0.1732 μ m \end{matrix} .

From this point of view, each experiment follows a Bernoulli’s distribution, and then, all the experiments for the same experimental condition (partial occlusion angle) follows a Binomial distribution. The generated samples are independent (random generation), the decision rule (Equation (23)) is dichotomous, and our experimental design is a fair representation of the population (we considered many experimental conditions in more than 40000 images); then, it is possible to develop a ‘Binomial Test’ (Prybutok, 1989; Howell, 1982) to evaluate the population proportion of success, which we will denote by $p$ . The null (H0) and alternative (H1) hypothesis for this test are:

{\begin{matrix} H 0 : & p = 0.7 \\ H 1 : & p > 0.7 \end{matrix} .

The null hypothesis is that the population proportion with an error less that 0.1732 is 0.7, and the alternative hypothesis is that the proportion is more that 0.7. The significance level for these tests is fixed at $α = 0.05$ . For more information about this test and its implementation review (R Development Core Team, 2017; Prybutok, 1989; Howell, 1982) Online documentation is available in url https://stat.ethz.ch/R-manual/R-devel/library/stats/html/00Index.html.

The Appendix 1—table 1 shows the results of the ‘Binomial Test’ for the algorithms DLSFC and ALSFC. For small occlusion angles $θ = {0, \frac{π}{2}}$ , the results of both methods are very similar and indicate a high performance. Note that in these cases the null hypothesis was rejected with a probability error of type I of $\approx 10^{- 16} \leq α$ , and the 95% confidence intervals are equal to $[0.99, 1]$ , which means that, at a population level, we hope that at least the 99% of the adjustments had an error $\leq 0.1732 μ m$ . For $θ = π$ , the algorithm DLSFC has a probability of success greater than 0.7 with a p-value of $\approx 10^{- 16} \leq α$ . On the contrary, for the same oclussion angle, the test revealed that the null hypothesis can’t be rejected in the case of the algorithm ALSFC. Finally, as was analyzed above, the performance of the algorithms decrease significally for $θ = \frac{3 π}{2}$ , and we can’t reject the null hypothesis, in others words, we don’t have enough evidence to assume that in this case the probability of success is greater than 0.7.

Appendix 1—table 1

Results of the Binomial Test.

Column 1: Names of the algorithms. Column 2: ‘P. Success’ is the Probability of Success; ‘P. Value’ denote the probability of the error type 1, that is, reject the null hypothesis when it is true; ‘C. Interval’ is the 95% confidence interval. Column 3–6: Partial Occlusion Angles. The blue shadowed area in the table cell indicates the conditions in which it is not possible to reject the null hypothesis.

https://doi.org/10.7554/eLife.42906.014

Algorithm	Statistics		Partial occlusion angles
		0	$π / 2$	$π$	$3 π / 2$
	P. Success	1	1	0.804	0.384
DLSFC	P. Value	2.2 × 10⁻¹⁶	2.2 × 10⁻¹⁶	2.2 × 10⁻¹⁶	1
	C. Interval	[0.999, 1]	[0.999, 1]	[0.797, 1]	[0.376, 1]
	P. Success	0.993	0.99	0.643	0.273
ALSFC	P-value	2.2 × 10⁻¹⁶	2.2 × 10⁻¹⁶	1	1
	C. Interval	[0.99, 1]	[0.99, 1]	[0.635, 1]	[0.265, 1]

The study carried up in this section shows that the algorithm GLSFC has a good performance and is capable to obtain a low error in the adjustment, but this alternative is sensitive to the initial values (Appendix 1—figure 5 (a,b)), and as any iterative process, consumes more computational resources. The method ALSFC proved to have a good behavior in many experimental conditions, although in diverse occasions a bad fit was obtained. Also, the behaviour of this algorithm was always inferior than that of DLSFC (Appendix 1—figure 5(C) and Appendix 1—table 1). Finally, the algorithm DLSFC had a great performance and stability in all the experimental conditions, even in extreme partial occlusion angles (Appendix 1—figure 5 and Appendix 1—table 1). This method proved to be robust in high noise conditions and it is a deterministic and not an iterative algorithm. For all these reasons we consider that the algorithm DLSFC is the best choice in comparison with GLSFC and ALSFC, to fitting the spatial distribution of the viral elements.

Model considerations

To proof the hypothesis that the viral elements of the VPs can be approximated through circumferences, we carried up a series of experiments based on the comparison between the circumference obtained by the algorithm DLSFC, and a least squared ellipse resulting of the ‘Direct Least Square Fitting Ellipse’ (DLSFE) algorithm (Fitzgibbon et al., 1999). The algorithm DLSFE has been used previously to adjust the viral replication centers of adenoviruses in fluorescence images (limited by diffraction) (Garcés et al., 2016), and has also been shown to be robust to noise, computationally efficient, and easy to implement (Fitzgibbon et al., 1999).

Let us denote by $(x_{i}, y_{i}), i = 1, 2, \dots, N$ the set of N points relative to the protein $𝒫$ , and by $C_{𝒫}$ and $E_{𝒫}$ the adjusted implicit form of the conics obtained by the algorithms DLSFC and DLSFE respectively. The variable $C_{𝒫} = (r, x_{c}, y_{c})$ is a vector where $r$ is the radius, and $(x_{c}, y_{c})$ is the center of the circumference. On the other hand, the implicit form of the ellipse can be represented as $E_{𝒫} = (a, b, x_{c}^{E}, y_{c}^{E}, θ)$ , where ${a, b}$ are the semi-major and semi-minor axis respectively, $(x_{c}^{E}, y_{c}^{E})$ is the center of the ellipse, and $θ$ is the rotation angle. Under the assumption that the protein $𝒫$ can be approximate by a circumference, we expect that does not exist significative statistical differences between the radius of the circumference $(r)$ and each one of the ellipse semi-axis $(a, b)$ .

Note 8 Note that this approach is independent of which protein we choose to test the circularity hypothesis.

For each protein combination, we accomplish a Shapiro-Wilk hypothesis test to study if any of the three distributions functions (circumference radius ( $r$ ), semi-major axis ( $a$ ) and semi-minor axis ( $b$ )) came from a normally distributed population (Shapiro and Wilk, 1965). The results revealed that, our data does not seem to follow a Gaussian distribution, and as consequence, it is impossible to use a parametric test (for example t-student). Based on our data conditions, we considered the Mann-Whitney test (Mann and Whitney, 1947; Hollander et al., 2013) to evaluate the differences between the radii of the circumferences and each of the ellipses semi-axis. This is a nonparametric test that can be applied when the observations are independent (our variables are independent because the radius and the semi-axes were obtained by different algorithms), the variables are ordinal (also our variables meet this requirement because they are numeric), and finally, we want to test if the radius of the circumferences and each one of the ellipses semi-axis, are equal (null hypothesis) or are not (alternative hypothesis).

The statistical analysis reveal that does not exist a significative statistical differences between $R_{C}$ and $a$ (see Appendix 1—table 2) with a significance level of $α = 0.05$ . The same results were obtained for $R_{C}$ and $b$ (see Appendix 1—table 3), and as consequence not matter if we use circumferences or ellipses to fitt the spatial distribution of the viral proteins.

Appendix 1—table 2

Two-sample Mann-Whitney hypothesis test between $a$ (major semi-axis of the ellipse ) and $r$ (circumference radius).

H0(H1): The radii of the circumferences and the values of the major semi-axis of the ellipses came from the same distribution function (different). Column 1: W represent the distribution value of the statistical test; Difference is the estimation of the location parameter (median difference between $a$ and $r$ ); P. Value is the p-value of the test; C. Interval is the 95% confidence interval. For each one of the nine viral elements combinations, we carry up the hyphotesis test taking into account NSP2 independenly in each combination, just for the semi-major axis this table shows the results of 18 Mann-Whitney hypothesis tests. More information about this test is available in Mann and Whitney (1947) and Hollander et al. (2013).

https://doi.org/10.7554/eLife.42906.015

	Proteins Combinations
	NSP2/NSP4		NSP2/NSP5		NSP2/VP1		NSP2/VP2
Statistics	NSP2	NSP4	NSP2	NSP5	NSP2	VP1	NSP2	VP2
W	900	902	1426	1415	559	539	174	184
Difference	0.0352	0.03369	0.0358	0.0357	0.0213	0.026	0.0529	0.0512
P. Value	0.1233	0.127	0.091	0.08025	0.1776	0.117	0.1137	0.1789
95% CI	[−0.082, 0.011]	[−0.08,0.013]	[−0.086, 0.007]	[−0.08,0.006]	[−0.054,0.012]	[−0.057, 0.006]	[−0.1168,0.0204]	[-0.15, 0.028]
	NSP2/VP4		NSP2/VP6		NSP2/VP7-Tri		NSP2/VP7-Mon
Statistics	NSP2	VP4	NSP2	VP6	NSP2	VP7-Tri	NSP2	VP7-Mon
W	771	760	119	134	1425	1421	551	528
Difference	0.0264	0.0221	0.0393	0.031	0.0301	0.032	0.0487	0.055
P. Value	0.1012	0.0833	0.0747	0.1814	0.1567	0.1503	0.1513	0.092
95% CI	[−0.0549, 0.0044]	[−0.051,0.003]	[−0.0909,0.0067]	[−0.103,0.034]	[−0.0675,0.0105]	[−0.073, 0.014]	[−0.1165,0.0221]	[−0.119,0.01]

Appendix 1—table 3

Two-sample Mann-Whitney hypothesis test between $b$ (minor semi-axis of the ellipse ) and $r$ (circumference radius).

H0(H1): The radii of the circumferences and the values of the minor semi-axis of the ellipses came from the same distribution function (different). This table is equivalent to Appendix 1—table 2. We considered to split the results for each semi-axis of the ellipse for better analysis and visualization.

https://doi.org/10.7554/eLife.42906.016

	Proteins Combinations
	NSP2/NSP4		NSP2/NSP5		NSP2/VP1		NSP2/VP2
Statistics	NSP2	NSP4	NSP2	NSP5	NSP2	VP1	NSP2	VP2
W	1263	1261	1966	1960	804	824	1135	1134
Difference	0.0325	0.041	0.0282	0.03	0.0308	0.0278	0.0185	0.0216
P. Value	0.1499	0.1936	0.2334	0.2394	0.2259	0.2385	0.1995	0.1335
95% CI	[−0.027,0.092]	[−0.026,0.108]	[−0.019,0.077]	[−0.0213,0.08]	[−0.018,0.085]	[−0.021,0.082]	[−0.015,0.05]	[−0.012,0.057]
	NSP2/VP4		NSP2/VP6		NSP2/VP7-Tri		NSP2/VP7-Mon
Statistics	NSP2	VP4	NSP2	VP6	NSP2	VP7-Tri	NSP2	VP7-Mon
W	227	213	1931	1895	798	814	291	279
Difference	0.038	0.029	0.0259	0.0288	0.0427	0.046	0.0435	0.038
P. Value	0.1814	0.3543	0.17	0.2407	0.2233	0.1641	0.2578	0.3953
95% CI	[−0.023,0.097]	[−0.031,0.106]	[−0.013,0.066]	[-0.02, 0.074]	[−0.022,0.107]	[−0.019,0.113]	[−0.0317,0.107]	[−0.058,0.135]

On the other hand, we carried up a study to validate the viral elements concentricity hypothesis based on the distance between the centers of the adjusted circumferences of both proteins. For all the proteins, the median value is very close or smaller than $0.05 μ m$ (Appendix 1—figure 6 and Appendix 1—table 4 (Column 2)), which is in accordance with the resolution limit of the algorithm 3B-ODE ( $0.04 - 0.05 μ m$ ) (Cox et al., 2011). Also, the 95% confidence interval (Appendix 1—table 4, Column 3) shows that the distance between the centers in each case is, 95% of the times in the order of $10^{- 2} μ m$ , which we consider a small error taking into account the limitations of the algorithm 3B-ODE.

Appendix 1—figure 6

Download asset Open asset

Boxplot of the differences in location between the center of the circumference that adjust NSP2 and the centers of the others nine viral elements.

The red line $y = 0.052 μ m$ represents approximately the maximun median value of distance between NSP2 and each of the others viral elements, while the second red line ( $y = ∥ (0.1, 0.1) ∥ = 0.141 μ m$ ) is the maximun error that we considered in the algorithm validation section (see Appendix 1 Section –Algorithm validation–). Each box contains the 50% of the observations, the bottom and the top of the box are the first and third quartiles (25% and 75% of the observations respectively). The line inside the box is the median (second quartiles (50% of the observations)). The upper whisker extends from the hinge to the largest value no further than $1.5 \times I Q R$ from the hinge (where IQR is the interquartile range, or distance between the first and third quartiles). The lower whisker extends from the hinge to the smallest value at most $1.5 \times I Q R$ of the hinge. The notches around the median extend to $\pm (1.57 \times I Q R / n)$ . This gives a roughly 95% confidence interval for comparing medians (McGill et al., 1978).

https://doi.org/10.7554/eLife.42906.017

Appendix 1—table 4

Difference in location between the centers of NSP2 and {NSP5, NSP4, VP1, VP2, VP6, VP4, VP7-Tri, VP7-Mon}.

The 95% Confidence Interval (Column 3) was computed in accordance with Chambers et al. (1983) as: $\hat{x} \pm (1.57 \times I Q R / \sqrt{n})$ , where, $\hat{x}$ is the median of $x$ , IQR is the interquartile range, and $n$ is the length of $x$ .

https://doi.org/10.7554/eLife.42906.018

Combination	Difference in location	95% CI
NSP2/NSP5	0.0367391	[0.03302618, 0.04045203]
NSP2/NSP4	0.02126683	[0.01708335, 0.02545032]
NSP2/VP2	0.03795674	[0.03043561, 0.04547787]
NSP2/VP1	0.04006871	[0.03508308, 0.04505433]
NSP2/VP6	0.03787043	[0.03153263, 0.04420824]
NSP2/VP4	0.0432802	[0.03410000, 0.05246041]
NSP2/VP7-Tri	0.05184999	[0.04138659, 0.06231340]
NSP2/VP7-Mon	0.05184999	[0.03922319, 0.06447680]

Note 9 The Mann-Whitney test was used to study the differences in population medians (difference in location) (Mann and Whitney, 1947).

Supplementary exploratory analysis

The algorithm VPs-DLSFC allows the quantification of the radius of two proteins in the same VP, and also the relative distance between them. At a first stage, we analyzed approximately 590 images (see histogram in Appendix 1—figure 7A) that were obtained by the pre-segmentation step of the algorithm VPs-DLSFC (Appendix 1—figure 1). In agreement with previous reports (Eichwald et al., 2012; Eichwald et al., 2004; Carreño-Torres et al., 2010), we found a wide heterogeneity in the size of VPs measured by NSP2, which exhibited radii ranging from 0.2 to 0.75 μm. In order to compare the distribution of each protein taking NSP2 as reference, we selected a subset of the individual VPs, in a way that there were no significant statistical difference between the radii of NSP2 from each condition.

Appendix 1—figure 7

Download asset Open asset

Supplementary exploratory analysis of the results obtained by the algorithm VPs-DLSFC.

(A) Histogram with the numbers of VPs that were obtained as part of the pre-segmentation step of the algorithm described in Appendix 1—figure 1A. Note that because each image contains one and only one VP, this histogram represents also the numbers of pre-segmented images that we had in our study. (B) Graphical representation of the Mann-Whitney hypothesis test in two-by-two comparison between the radii distributions of NSP2 in each experiment (Mann and Whitney, 1947). To avoid confusions, for each box, under the NSP2 name in x-axis, we included the name of the accompanying protein that specifies in which experiment we obtain that distribution of NSP2. Each combination is linked by a line, and the result of the test is up of the line. The red dashed square highlights the only two combinations (VP1 and VP2) in which the distribution of NSP2 had a significant statistical difference with the other NSP2 distributions. (C) Two-sample Mann-Whitney hypothesis test, considering as variables the radius of NSP2 in contrast to the radius of the others seven proteins. In each combination, we show the difference in location between NSP2 and the other protein (x-axis) and the confidence interval at a level of 95%. The numbers over the intervals confidence are the p-value of the statistical test.

https://doi.org/10.7554/eLife.42906.019

In accordance with this criterion, we were able to adjust the size of NSP2, such that the mean in the distribution of the radius was similar when this protein was paired with NSP5, NSP4, VP6, VP4 and VP7 (Appendix 1—figure 7B). Unfortunately, in the case of NSP2~VP1 and NSP2~VP2, even though we could selected a group of VPs in which the mean radius of NSP2 (0.35 μm) was closer to the size of the NSP2 from the other conditions ( $0.4 μ m$ ), the analysis of the distributions in the radius of NSP2 by the Mann-Whitney hypothesis test (Appendix 1—figure 7B), revealed that the VPs of VP1 and VP2 were significative smaller than the mean size of viroplasms of the others proteins. As consequence, was not possible to compare directly the position of VP1 and VP2 with the others viral elements through an exploratory analysis. However, this problem did not limit the inferential study that we developed.

The use of super-resolution microscopy and the algorithm VPs-DLSFC allows the detection of small differences (resolution of nanometers) in the relative position of the viral elements. To validate that the different viral elements were in fact located in a different region from NSP2, we developed a Mann-Whitney test to study the existence of significative differences between the radii of NSP2 and the radii of each of the other viral elements. According with our previous results (see main text), this test revealed that only NSP4 has a similar spatial distribution that of NSP2 (Appendix 1—figure 7C and Appendix 1—table 5), while the others viral elements have different radii. Moreover, the small negative differences in location of NSP5 turned out to be significative different with respect to NSP2 (the p-values of the test are available in the (Appendix 1—figure 7C and Appendix 1—table 5). These results support that NSP5 is located in the innermost part of the viroplasm and that NSP4 shares the same layer with NSP2. The structural proteins VP1, VP2 and VP6 seem to be in the same layer of the viroplasm, with similar values, but with a significative statistical difference in location with NSP2 at a level $α = 0.01$ . Also, this test suggests that VP4 and VP7 form independent layers in the most external zone of the VPs (Appendix 1—figure 7C and Appendix 1—table 5), which is in accordance with our proposed model.

Appendix 1—table 5

Two-sample Mann-Whitney hypothesis test, considering as variables the radii of NSP2 in contrast with the radii of the others seven viral proteins.

Under the null hypothesis, both samples come from the same distribution (H0: true location is equal to zero), while the alternative hypothesis (H1: true location is not equal to zero) establishes that exist a difference between the median of the distributions. Protein: Name of the viral element compared with NSP2; W: value of the Mann-Whitney statistical hypothesis test; Difference: Estimation of the location parameter (median difference between the radii of NSP2 and the radii of the other viral element); C. Interval: 95% confidence interval; P. Value: p-value of the test.

https://doi.org/10.7554/eLife.42906.020

Protein	W	Difference in location	95% CI	p-value
NSP5	1327	−0.048972049	[−0.086040620,–0.01563190]	$2.968772 \times 10^{- 3}$
NSP4	4711	−0.003348142	[−0.032696239, 0.02632726]	$8.197094 \times 10^{- 1}$
VP1	1699	0.044239569	[0.012214263, 0.07804365]	$7.728702 \times 10^{- 3}$
VP2	3557	0.040956503	[0.010802771, 0.06990018]	$5.166044 \times 10^{- 3}$
VP6	4316	0.063282322	[0.032020596, 0.09793399]	$1.407904 \times 10^{- 4}$
VP4	2637	0.150597251	[0.115162890, 0.18499337]	$1.824306 \times 10^{- 11}$
VP7-Mon	4280	0.392334895	[0.350192949, 0.43852366]	$1.130338 \times 10^{- 21}$
VP7-Tri	1814	0.400145108	[0.324450625, 0.46457274]	$1.611397 \times 10^{- 14}$

Appendix 1—figure 8

Download asset Open asset

Examples of small zones of colocalization between differents viral proteins.

The interations between the viral elements could be explained throught the spikes that come from the central distribution of the viral elements which colocalize with other proteins. A more specific study it is necessary in order to explain these interactions.

https://doi.org/10.7554/eLife.42906.022

Linear dependency between the viral components

The spatial distribution of the viral elements in the VPs is conserved regardless of their size; that is, the radius of each protein shows a linear dependency with the radius of NSP2.

Let $P_{i} = (x_{i}, y_{i}) \in ℝ^{2}, i = 1, 2, \dots, N$ a set of points, where $x_{i}$ and $y_{i}$ are the radii of NSP2 and the acompayning viral element Y, respectively. The adjustment of the linear model can be solved through the following optimization problem:

\min_{β} \sum_{i = 1}^{N} {(y_{i} - β x_{i})}^{2} .

The model (Equation (25)) does not include an offset term because we don’t have any quantification about the radius of the accompanying viral element when NSP2 have been silenced.

The solution of this problem is given by Kiefer (1987):

\begin{aligned} \hat{β} & = c o r (x, y) \times \frac{σ (y)}{σ (x)} \\ = \frac{c o v (x, y)}{σ (x) σ (y)} \times \frac{σ (y)}{σ (x)} \\ = \frac{\frac{1}{n} \sum (x_{i} - \bar{x}) (y_{i} - \bar{y})}{V a r (x)}, \end{aligned}

where $c o r (x, y)$ is the linear correlation between x and y, $c o v (x, y)$ is the covariance between x and y, Var(x) is the variance of x, and $\bar{x}$ is the mean value of the variable $X$ .

The ‘Residual Standard Error’ (RSE or $(σ^{2})$ ) is a measure of fit of the linear regression model.

σ^{2} = \frac{1}{N - 2} \sum_{i = 1}^{N} ϵ_{i}^{2},

where $ϵ_{i} = y_{i} - \hat{β} x_{i}$ . The ‘Standard Error’ $(σ_{\hat{β}}^{2})$ in the estimation of the slope $\hat{β}$ is:

σ_{\hat{β}}^{2} = \frac{σ^{2}}{\sum {(x_{i} - \bar{x})}^{2}} .

Consider the statistic $t_{β} = \frac{\hat{β} - β}{σ_{\hat{β}}}$ and the null (H0) and alternative hypothesis (H1):

{\begin{matrix} H 0 & : β = 0 \\ H 1 & : β \neq 0 \end{matrix}

Under the null hypothesis, $t_{0} = \frac{\hat{β}}{σ_{\hat{β}}}$ follow a t-Student distribution with N-1 degrees of freedom. The p-value of this tests is computed as:

p . v a l u e = 2 \times (1 - P (T \leq t_{0})),

where $P (T \leq t_{0})$ is the probability that a t-Student variable with N-1 degrees of freedom be less than $t_{0}$ . Note that through this hypothesis test it is possible to evaluate if the slope $β$ is significative to explain the linear relationship between the variables.

The variance of the dependent variable $y$ is:

\underset{Full Variance}{\underset{⏟}{\sum_{i = 1}^{N} {(y_{i} - \bar{y})}^{2}}} = \underset{Residual Variance}{\underset{⏟}{\sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}}} + \underset{Regression Variance}{\underset{⏟}{\sum_{i = 1}^{N} {({\hat{y}}_{i} - \bar{y})}^{2}}},

where, ${\hat{y}}_{i} = \hat{β} x_{i}$ . In accordance with the Equation (31), the variance rate that could be explained by the regression model is the regression variance divided by the full variance, this is:

R^{2} = \frac{\sum {({\hat{y}}_{i} - \bar{y})}^{2}}{\sum {(y_{i} - \bar{y})}^{2}} \in [0, 1] .

Let $Ψ$ be the lineal regression model that adjusts the data $(x_{i}, y_{i})$ , and be $R^{2}$ the associated r-square value, then, $100 \times R^{2}$ is the percent of the data variance that it was explained by the model. For strong linear dependencies, the expected $R^{2}$ value should be close to 1. For more information about linear regression models consult (Kiefer, 1987).

The results of the regression models are available in Appendix 1—table 6. In all regression models, the standard errors are in the order of $10^{- 2} μ m$ , with a $p - v a l u e < 10^{- 33}$ which is a strong evidence of the linear relationship that exist between NSP2 with the other seven viral proteins. Also, the RSE have values in the order of $10^{- 2} μ m$ , except for VP7 which have a RSE of $0.15 μ m$ , which could suggest that exist a greater dispersion of VP7 in comparison with the others viral proteins, something expected since, according to our model, VP7 is the most external protein.

Appendix 1—table 6

Results and validation of the linear regression models.

NSP2 was considered as the independent variable and the other viral elements as the dependent variable (see Equation (25)). The slope and standard error for each regression model (Column 2 and 3) were computed in accordance with the Equation (26) and (28) respectively. The t-value of the t-Student distribution function with N-1 degrees of freedom are in Column 4 (see Appendix 1 Section – Linear Regression Model– for details). The p-value for each linear regression model are shows in Column five and were measure as was described in (Equation (29)) and (Equation (30)). Finally, the RSE and the $R^{2}$ summarize the adjustment of the data through the proposed lineal models. As was advised, the RSE (Column 6) is a fit measure of the linear model to the data (see Equation (27)). The ‘% Error’ is the average error in the prediction and it is computed as: $100 \times R S E / β$ , while the $100 \times R^{2}$ is the percent of the data variance that it is explained by the model (see Equation (32)).

https://doi.org/10.7554/eLife.42906.021

Model	Slope (β)	Std.error $(σ_{\hat{β}}^{2})$	t-value	p-value	RSE	% Error	R²
$N S P 5 = β_{N S P 5} \times N S P 2$	0.8667494	0.017947738	48.29296	2.550154 × 10⁻⁵⁰	0.07	8%	0.974
$N S P 4 = β_{N S P 4} \times N S P 2$	0.9911517	0.006973701	142.12708	2.271571 × 10⁻¹¹⁴	0.035	3.5%	0.995
$V P 2 = β_{V P 2} \times N S P 2$	1.1243931	0.014817985	75.88030	6.242800 × 10⁻⁷²	0.04	3.5%	0.987
$V P 1 = β_{V P 1} \times N S P 2$	1.1506138	0.019069730	60.33718	2.238867 × 10⁻⁴⁸	0.044	3.8%	0.986
$V P 6 = β_{V P 6} \times N S P 2$	1.1806335	0.011218918	105.23595	1.087989 × 10⁻⁸⁶	0.038	3.2%	0.992
$V P 4 = β_{V P 4} \times N S P 2$	1.3887083	0.024319983	57.10153	5.757257 × 10⁻⁵⁰	0.07	5%	0.983
$VP7-Mon = β_{VP7-Mon} \times N S P 2$	1.9407204	0.040354225	48.09212	1.560719 × 10⁻⁵²	0.142	7.3%	0.972
$VP7-Tri = β_{VP7-Tri} \times N S P 2$	1.9416542	0.054656327	35.52478	5.778910 × 10⁻³³	0.153	7.8%	0.967

The average errors are under 8% in all models (Appendix 1—table 6), which confirm the capacity of our linear models to predict with high accuracy the radius of the described viral elements. On the other hand, the percentage variance rate that could be explained by the regression models ( $100 \times R^{2}$ , see Equation (32)), was greater than the 96% in all cases (Appendix 1—table 6), demonstrating again the high linear dependency between NSP2 and the others viral elements into the viroplasm.

Appendix 1—figure 9

Download asset Open asset

Residuals errors for each linear regression model.

Gray line represent the regression model, the points over the line are the predicted values $\hat{y_{i}}$ (see Equation (31)) by the models, and the dots filled with a color gradient are the real values $y_{i}$ . The errors between the predicted and real values, $ϵ = | \hat{y_{i}} - y_{i} |$ , are represented as a gradient of colors as follows (from lowest/coldest to highest/warmest). For each model, the RSE and $R^{2}$ (R-squared) were included, note that this statistics are the same that were presented in Appendix 1—table 6. (A) $N S P 5 \approx 0.8667 \times N S P 2$ , (B) $N S P 4 \approx 0.9912 \times N S P 2$ , (C) $V P 1 = 1.1506 \times N S P 2$ , (D) $V P 2 \approx 1.1244 \times N S P 2$ , (E) $V P 6 \approx 1.1806 \times N S P 2$ , (F) $V P 4 \approx 1.3887 \times N S P 2$ , (G) $VP7-Tri \approx 1.9417 \times N S P 2$ , (H) $VP7-Mon \approx 1.9407 \times N S P 2$ .

https://doi.org/10.7554/eLife.42906.023

NSP5 and NSP4 as reference proteins

As was mentioned in the main text, we carried out a similar study using NSP5 and NSP4 as reference proteins. Since these experiments were developed to validate our previous results with NSP2, they were not as deep and extensive as in the case of the NSP2 model.

The number of VPs studied was 38 and 23 for NSP5~VP6 and NSP5~VP4 respectively (Appendix 1—figure 10A.1). The distribution of NSP5 relative to VP6 and VP4 did not show significative statistical differences according to the Mann-Whitney test, and the radius of VP6 was smaller than VP4, which is in accordance with our model of the viroplasm (Appendix 1—figure 10A). Furthermore, the distance of VP6 and VP4 towards NSP5, reflects that VP6 is closer to NSP5 than VP4, with a significance level $\leq 0.001$ (Appendix 1—figure 10B). On the other hand, we found that exists a linear relationship between the radii of NSP5 and {VP6, VP4} (Appendix 1—figure 10 C-E). The linear models of both, VP6 and VP4, explain approximately 97% of the data variability and exhibit a RSE in the order of $10^{- 2} μ m$ (see Appendix 1—figure 10 (D,E) and Appendix 1—table 7). Additionally, we performed a Mann-Whitney test to study the differences of the radii of NSP5 and each of the accompanying proteins (VP6 and VP4). This test demonstrated that the differences in location between NSP5 and VP6 is $\approx 0.1 μ m$ , while in the case of VP4 is $\approx 0.23 μ m$ , with a p-value in the order of $10^{- 7}$ and $10^{- 8}$ respectively (see Appendix 1—table 8), indicating that although these protein are near to each other, they occupy different layers on the VPs.

Appendix 1—figure 10

Download asset Open asset

VP6 and VP4 spatial distribution taking NSP5 as reference protein.

(A) Boxplot for the radii of the fitting circumferences for NSP5 and {VP6,VP4}. Mann-Whitney hypothesis test for the radius distribution of NSP5 shows that does not exist a significative statistically differences between the reference protein (NSP5) in the two experiments. A.1) Histogram of the numbers of viroplasm by combination. (B) Boxplot for the distance of the viral proteins VP6 and VP4 to NSP5, and result of the Mann-Whitney test between these two distributions. (C) Linear regression model taking the radius of NSP5 as independent variable and the radius of {VP6, VP4} as dependent variable. The gray shadow represent the 95% confidence interval for the regression adjustment. (**D and E**) Residual error for the VP6 and VP4 linear models, respectively. The details about this kind of representation can be consulted in Appendix 1—figure 9.

https://doi.org/10.7554/eLife.42906.024

Appendix 1—table 7

Linear regression results with NSP5 used as independent variable.

For more information about the variables involved (columns) consult Appendix 1—table 6 or Appendix 1 Section – Linear Regression Model.

https://doi.org/10.7554/eLife.42906.025

Model	Slope (β )	Std.error $(σ_{\hat{β}}^{2})$	t-value	p-value	RSE	R-squared
$V P 6 = β_{V P 6} \times N S P 5$	1.273246	0.03278467	38.83663	1.364503 × 10⁻³¹	0.071	0.976
$V P 4 = β_{V P 4} \times N S P 5$	1.623226	0.05248750	30.92596	1.262448 × 10⁻¹⁹	0.093	0.977

In the case of the model based on NSP4, we studied 60 viroplasms through the algorithm VPs-DLSFC. The results show that the radius of VP6 were bigger than the radius of NSP4 in approximately 0.04 μm (Appendix 1—figure 11 A-B). This difference is statistically significative ( $p - v a l u e \leq 0.05$ ) according with the Mann-Whitney test (Appendix 1—figure 11A). Like the models based on NSP2 and NSP5, we found a linear relationship between NSP4 and VP6 (Appendix 1—figure 11C). This linear model was validated through a residual analysis (Appendix 1—figure 11D and Appendix 1—table 9), obtaining a $R S E = 0.079 μ m$ , and a $R^{2} = 0.97$ .

Appendix 1—figure 11

Download asset Open asset

VP6 spatial distribution taking NSP4 as reference protein.

(A) Boxplot for the radii of the fitting circumference of NSP4 and VP6. The inside panel shows the results of the Mann-Whitney hypothesis test (see the Appendix 1—table 5 for details). (B) Boxplot for the distance between NSP4 and VP6. (C) Linear regression fitting taking the radius of NSP4 as independent variable and the radius of VP6 as dependent variable. The gray shadow represents the confidence interval at a level of 95%. (D) Residual error analysis of the model. This graph is analogous to Appendix 1—figure 9; the details about this kind of representation can be consulted in that figure.

https://doi.org/10.7554/eLife.42906.027

Consistency between models

As was described, our study involves three differents reference proteins (NSP2, NSP5 and NSP4). Since the three models should respond to an unique spatial organization of the viral elements into the VP, it is expected to obtain similar results independenly of the selected model.

The equivalence of the differents models (models based on NSP2 and NSP5) can be proven through a comparison of the distances between NSP5 and VP6. If the two models describe the same distribution of the VPs, we expect that the distance of NSP5 to VP6 computed by the NSP5 model be similar to the the distance between NSP5 and NSP2, plus the distance of NSP2 to VP6 that were computed by the model based on NSP2. Hence:

d_{N S P 2} (N S P 5, N S P 2) + d_{N S P 2} (N S P 2, V P 6) \approx d_{N S P 5} (N S P 5, V P 6)

The subindex (‘name’) in $d_{n a m e} (\cdot)$ specifies the model employed to compute the distance. Therefore, replacing the corresponding values (available in Appendix 1—table 5 and Appendix 1—table 8) in the above equation, we obtain that:

\begin{aligned} d_{N S P 2} (N S P 5, N S P 2) & + d_{N S P 2} (N S P 2, V P 6) \\ = 0.048972049 + 0.063282322 = 0.1122544 \\ \Rightarrow ε & = | d_{N S P 5} (N S P 5, V P 6) - 0.1122544 | \\ = | 0.096555 - 0.1122544 | \\ = 0.0156994 μ m . \end{aligned}

Appendix 1—table 8

Two-sample Mann-Whitney hypotheses test, considering the radius of NSP5 in contrast with the radius of VP6 and VP4.

For more information about the parameters (columns) see the Appendix 1—table 5.

https://doi.org/10.7554/eLife.42906.026

Protein	W	Difference in location	95%	p-value
VP6	1174	0.096555	[0.05986,0.13473]	9.359731 × 10⁻⁰⁷
VP4	502	0.227230	[0.17693,0.28280]	3.574209 × 10⁻⁰⁹

Since the level of precision in our radii estimation of the viral components are limited by the resolution of the algorithm 3B ( $0.04 - 0.05 μ m$ ), we set this parameter as a threshold for the errors when we compared the results obtained for each model. For example, the error between the models based on NSP2 and NSP5 for VP6 is $\approx 0.016 μ m$ (see Equation (34)), which is under the resolution limit that can be reached by the algorithm 3B, and as consequence the error is asociated to the limitations of the optic and not to the reference protein that we selected for the estimation of the VP6’s radius.

This same approach was applied for VP4. In this case, and considering again the results in Appendix 1—table 5 and Appendix 1—table 8 we obtain:

\begin{aligned} d_{N S P 2} (N S P 5, N S P 2) & + d_{N S P 2} (N S P 2, V P 4) \\ = 0.048972049 + 0.150597251 = 0.1995693 \\ \Rightarrow ε & = | d_{N S P 5} (N S P 5, V P 4) - 0.1995693 | \\ = | 0.227230 - 0.1995693 | \\ = 0.0276607 μ m, \end{aligned}

which again is an error under the resolution limit of the alorithm 3B.

On the other hand, taking into account the models based on NSP2 and NSP4, and the results in Appendix 1—table 5 and Appendix 1—figure 11A, we obtain that:

\begin{aligned} d_{N S P 2} (N S P 4, N S P 2) & + d_{N S P 2} (N S P 2, V P 6) \\ = 0.003348142 + 0.063282322 = 0.06663046 \\ \Rightarrow ε & = | d_{N S P 4} (N S P 4, V P 6) - 0.06663046 | \\ = | 0.04070526 - 0.06663046 | \\ = 0.0259252 μ m . \end{aligned}

As before, the error is under the limit of resolution of the algorithm 3B. Altogether, the comparison of the three models shows up that different experimental approaches describe the same spatial distributions of the viral elements inside the viroplasm.

The previous analysis only considers the consistency of the models based on the difference in the radius of the viral elements, but besides this, we developed a similar study using the lineal regression models.

Again, consider the model NSP5 and NSP2 as predictors of VP6:

V P 6 = β_{NSP2}^{VP6} \times N S P 2

V P 6 = β_{NSP5}^{VP6} \times N S P 5,

where $β_{NSP2}^{VP6}$ and $β_{NSP5}^{VP6}$ are the slope associated with VP6 in the model based on NSP2 and NSP5 respectively. Also, the radius of the NSP5 can be estimated with the model based on NSP2, this is:

N S P 5 = β_{NSP2}^{NSP5} \times N S P 2.

Substituting Equation (39) into Equation (38) we obtain:

V P 6 = (β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5}) \times N S P 2,

and taking into acount Equation (37), we can assert that:

β_{NSP2}^{VP6} \approx (β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5})

\Rightarrow ϵ = | β_{NSP2}^{VP6} - (β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5}) | (absolute error)

\Rightarrow δ ϵ = \frac{| β_{NSP2}^{VP6} - (β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5}) |}{β_{NSP2}^{VP6}} (relative error) .

Note 10 The relative error was computed under the assumption that the slope of the model based on NSP2 is the ideal value. This consideration was based in the fact that we have more evidence of a linear relationship in the NSP2 model (see the $R^{2}$ and RSE in Appendix 1—table 6) that in the models based on NSP5 and NSP4 (Appendix 1—table 7 and Appendix 1—table 9 respectively).

Appendix 1—table 9

Linear regression results with NSP4 as independent variable.

For more information about the variables involved (columns) consult Appendix 1—table 6 or Appendix 1 Section – Linear Regression Model.

https://doi.org/10.7554/eLife.42906.028

Model	Slope (β)	Std.error $(σ_{\hat{β}}^{2})$	t-value	p-value	RSE	R-squared
$V P 6 = β_{V P 6} \times N S P 4$	1.103683	0.0212345	51.97592	5.626321×10⁻⁵¹	0.079	0.978

For the prediction of VP6 through the models NSP2 and NSP5, the relative error is (Equation (43)):

δ ϵ = \frac{| 1.1806335 - (1.273246 \times 0.8667494) |}{1.1806335} = 0.06526013.

The relative error asociated with the prediction of the radius of VP4 through the models NSP2 and NSP5 is:

δ ϵ = \frac{| β_{NSP2}^{VP4} - (β_{NSP5}^{VP4} \times β_{NSP2}^{NSP5}) |}{β_{NSP2}^{VP4}} = \frac{| 1.3887083 - (1.623226 \times 0.8667494) |}{1.3887083} = 0.01312145.

Finally, in the case of VP6 considering the models NSP2 and NSP4 the relative error is:

δ ϵ = \frac{| β_{NSP2}^{VP6} - (β_{NSP4}^{VP6} \times β_{NSP2}^{NSP4}) |}{β_{NSP2}^{VP6}} = \frac{1.1806335 - (1.103683 \times 0.9911517)}{1.1806335} = 0.07344889.

Since the relative errors represent the average error of the slopes computed with the models NSP5 and NSP4 in relation with the NSP2 model. These results indicate that the model based on NSP5 has an error of 6.5% for the prediction of the radius of VP6, while for VP4 the error is around 1.3% compared to the prediction of the model based on NSP2. On the other hand, if we use the model based on NSP4, we get approximately a 7.3% of differences in the radius of VP6. As in the previous analysis, these errors are small too, and could be associated with experimental variations and with the resolution limit of the algorithm 3B.

Even when the relative error provide important information about the differences in the prediction between differents models, we are not taking into account the standard error associated with the slopes of each model. In the case of the prediction of VP6 through the models NSP2 and NSP5 (all others cases have the same deduction), if we consider $β_{NSP5}^{VP6}$ and $β_{NSP2}^{NSP5}$ as random variables with a standard error $σ (β_{NSP5}^{VP6})$ and $σ (β_{NSP2}^{NSP5})$ respectively, the variance of the variable $(β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5})$ could be computed through the ‘Delta Method’ (Oehlert, 1992; Ver Hoef, 2012). The ‘Delta Method’ makes possible to obtain an approximation of the variance of a function $f (X_{1}, X_{2}, \dots, X_{n})$ as:

V a r (f) = \sum_{i} {(\frac{\partial f}{\partial X_{i}})}^{2} V a r (X_{i}) + \sum_{i} \sum_{j \neq i} (\frac{\partial f}{\partial X_{i}}) (\frac{\partial f}{\partial X_{j}}) C o v (X_{i}, X_{j}),

were $C o v (X_{i}, X_{j})$ denote the covariance between $X_{i}$ and $X_{j}$ (Lawrence, 1953).

Note 11 The standard error is defined as:

Standard Error = Standard Deviation / \sqrt{n},

but, in our case $n = 1$ because we only have the final slope estimation and the associated standard error, hence the standard error is equal to the standard deviation (Altman and Bland, 2005).

Because $β_{NSP5}^{VP6}$ and $β_{NSP2}^{NSP5}$ are independent (obtained by the linear regression analysis of the segmentation results in differents experiments and images) the covariance $C o v (β_{NSP5}^{VP6}, β_{NSP2}^{NSP5}) = 0$ . This point can be proved easily because $C o v (X, Y) = 𝐄 (X Y) - 𝐄 (X) 𝐄 (Y)$ , were $𝐄 (\cdot)$ is the expected value. If X and Y are independents $𝐄 (X Y) = 𝐄 (X) 𝐄 (Y) \Rightarrow C o v (X, Y) = 0$ . Therefore, Equation (47) can be simplified to:

V a r (f) = \sum_{i} {(\frac{\partial f}{\partial X_{i}})}^{2} V a r (X_{i}) .

Now, considering the function $f (p_{1}, p_{2}) = p_{1} p_{2}$ , we can rewrite Equation (48) as:

V a r (f) = p_{2}^{2} V a r (p_{1}) + p_{1}^{2} V a r (p_{2}),

and substituting the variance for the standard deviation (same that the standard error in this case, see Note 11), we have:

V a r (f) = p_{2}^{2} σ^{2} (p_{1}) + p_{1}^{2} σ^{2} (p_{2}) .

The 99% confidence interval for $p_{1} p_{2}$ can be computed as:

C I = p_{1} p_{2} \pm 2.576 \sqrt{V a r (p_{1} p_{2})} .

Set $p_{1} = β_{NSP5}^{VP6}$ and $p_{2} = β_{NSP2}^{NSP5}$ . We can compute the confidence interval of $β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5}$ through Equation (49) and Equation (50).

\begin{array}{ll} V a r (β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5}) & = {(β_{NSP2}^{NSP5})}^{2} σ^{2} (β_{NSP5}^{VP6}) + {(β_{NSP5}^{VP6})}^{2} σ^{2} (β_{NSP2}^{NSP5}) \\ = (0.8667494)^{2} (0.03278467)^{2} + (1.273246)^{2} (0.017947738)^{2} \\ = 0.001329683 \end{array}

\Rightarrow σ (β_{NSP 5}^{VP6} \times β_{NSP2}^{NSP5}) = \sqrt{0.001329683} = 0.03646482.

Now, considering Equation (50) we obtain:

\begin{array}{ll} C I_{β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5}} & = (β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5}) \pm 2.576 (σ (β_{NSP5}^{VP6} \times β_{NSP2}^{NSP5})) \\ = (1.273246 \times 0.8667494) \pm (2.576 \times 0.03646482) \\ = [1.009652, 1.197519] . \end{array}

The slope of VP6 in the NSP2 model ( $β_{NSP2}^{VP6}$ , see Appendix 1—table 6) is $1.1806335 \in [1.009652, 1.197519]$ , which proves that, in reference to VP6, the models based on NSP2 and NSP5 are consistent.

The rest of combinations runs as before. In the case of VP4 and the models based on NSP2 and NSP5, we obtained:

\begin{array}{ll} V a r (β_{NSP5}^{VP4} \times β_{NSP2}^{NSP5}) & = {(β_{NSP2}^{NSP5})}^{2} σ^{2} (β_{NSP5}^{VP4}) + {(β_{NSP5}^{VP4})}^{2} σ^{2} (β_{NSP2}^{NSP5}) \\ = (0.8667494)^{2} (0.05248750)^{2} + (1.623226)^{2} (0.017947738)^{2} \\ = 0.002918405. \end{array}

\begin{array}{ll} \Rightarrow σ (β_{NSP 5}^{VP4} \times β_{NSP2}^{NSP5}) & = \sqrt{0.002918405} = 0.05402226. \\ \Rightarrow C I_{β_{NSP5}^{VP4} \times β_{NSP2}^{NSP5}} & = (β_{NSP5}^{VP4} \times β_{NSP2}^{NSP5}) \pm 2.576 (σ (β_{NSP5}^{VP4} \times β_{NSP2}^{NSP5})) \\ = (1.623226 \times 0.8667494) \pm (2.576 \times 0.05402226) \\ = [1.267769, 1.546092] . \end{array}

Again, the slope of VP4 in the NSP2 model is $1.3887083 \in [1.267769, 1.546092]$ (see Appendix 1—table 6), and as consequence the models are consistent between them.

Finally, the stability between the models NSP4 and NSP2 based on the protein VP6 was evaluated.

\begin{array}{ll} V a r (β_{NSP4}^{VP6} \times β_{NSP2}^{NSP4}) & = {(β_{NSP2}^{NSP4})}^{2} σ^{2} (β_{NSP4}^{VP6}) + {(β_{NSP4}^{VP6})}^{2} σ^{2} (β_{NSP2}^{NSP4}) \\ = (0.9911517)^{2} (0.0212345)^{2} + (1.103683)^{2} (0.006973701)^{2} \\ = 0.0005021999 \end{array}

\begin{array}{ll} \Rightarrow σ (β_{NSP4}^{VP6}) & = \sqrt{0.0005021999} = 0.02240982 \\ \Rightarrow C I_{β_{NSP4}^{VP6}} \times_{β_{NSP2}^{NSP4}} & = (β_{NSP4}^{VP6} \times β_{NSP2}^{NSP4}) \pm 2.576 (σ (β_{NSP4}^{VP6} \times β_{NSP2}^{NSP4})) \\ = (1.103683 \times 0.9911517) \pm (2.576 \times 0.02240982) \\ = [1.036190, 1.151645] . \end{array}

In this case the slope $β_{NSP2}^{VP6} = 1.1806335 \notin [1.036190, 1.151645]$ , which is surprising taking into account that the relative error of NSP4 as predictor of VP6 instead of the model based on NSP2 is around the 7.3% (see Equation (46)). Even when this percent doesn’t represent a great difference, it could be enough to generate small variations that cause that the slope $β_{NSP2}^{VP6}$ be out of the confidence interval given in Equation (56). Moreover, note that the difference between the upper endpoint of the confidence interval of the slope between NSP2 and VP6 is approximately 0.03 μm, which is under the resolution limit of the 3B algorithm. A deeper study with different antibodies for NSP4 and the contrast of NSP4 with others viral elements would help to clarify our results.

Stochastic model fitted for 3B super resolution microscopy.

The algorithm 3B (Cox et al., 2011) creates a super resolution image from experimental data (hundreds of images acquired by the microscope). The 3B algorithm generates a probability map of fluorophore location through Markov Chains and Bayesian inference. The implementation of the algorithm requires a priori information on the dynamics of the photophysical properties of the fluorophores under study; however, the software does not provide a method to calculate it. In this section we propose a method to obtain this information from the experimental data.

3B microscopy generates a probability map of fluorophore localization by deriving a weighted average over all possible models; the set of models includes varying numbers of emitters, emitter localizations and temporal dynamics of the states of the fluorophores. The information stored in the image sequence is iteratively compared with the models using Bayesian inference, producing a SRM image. This technique allows the reduction of the number of necessary images, so it reduces the photo damage in the sample. However, the necessary computation time is significant (Hu et al., 2013); therefore, we employed 3B parallelization techniques for a cluster and PC from Hernández et al. (2016) in order to decrease the time to enhance the SR images.

The dynamics of the photophysical properties of a fluorophore is modeled in 3B-algorithm as a Markov chain, where, for each time, the fluorophore can be in three posible states: the emitting state $S_{0}$ , the non-emitting state $S_{1}$ , and the bleached state $S_{2}$ (Appendix 1—figure 12). Hence, in the complete sequence of images a fluorophore can be transiting between these states with a probability $P_{i}$ .

Appendix 1—figure 12

Download asset Open asset

Diagram of probabilities for the transitions between the states of a fluorophore for the 3B-algorithm.

From left to right are represented the emitting state, non-emitting state and bleached state for a fluorophore, with their respective transition probabilities among the states. As an example, a fluorophore can transit from the emission state to the non-emission state or remain in this state with a probability $P_{2}$ and $P_{1}$ , respectively.

https://doi.org/10.7554/eLife.42906.029

In this way, the temporary stack of images acquired by the microscope contains information on the dynamics of an unknown number of fluorophores transiting between these three states, with a probability $P_{i}$ . This information is represented as a transition matrix:

P := (\begin{matrix} P_{1} & P_{2} & 0 \\ P_{3} & P_{4} & P_{5} \\ 0 & 0 & 1 \end{matrix}),

where the entry in the $i$ th row and $j$ th column is the probability that a fluorophore in the state $S_{i}$ moves to the state $S_{j}$ .

The initial probability distribution

{\bar{ϕ}}_{0} := (ϕ_{0} (0), ϕ_{0} (1), ϕ_{0} (2)),

describes the probability of a fluorophore to be in each energy state in the beginning of the process: $ϕ_{0} (i)$ corresponds to the state $S_{i}$ .

The 3B algorithm considers the Markov chain determined by the transition matrix (Equation (57)) and the initial probability distribution (Equation (58)). Then,

\begin{array}{ll} {\bar{ϕ}}_{1} & = {\bar{ϕ}}_{0} P \\ {\bar{ϕ}}_{2} & = {\bar{ϕ}}_{1} P \\ ⋮ \\ {\bar{ϕ}}_{n} & = {\bar{ϕ}}_{n - 1} P, \end{array}

are the probabilities that a fluorophore is in the states $S_{0}, S_{1}$ and $S_{2}$ in time $1, \dots, n$ ; that is, for all $i = 1, \dots, n$ , ${\bar{ϕ}}_{i}$ is the 3-vector whose entries are the probabilities that the fluorophore is in each energy state at time $i$ .

The 3B algorithm has the transition matrix (Equation (57)) as an input, together with the initial probability distribution (Equation (58)). However, the software does not provide a method to calculate the transition matrix from the experimental data.

We deal with this issue by proposing a method to determine $P$ from the data. Namely, we propose to adjust the experimental data (the stack of images) with an ordinary differential equation (ODE) model that allows us to calculate the probabilities of the system.

Following the Jablonski diagram for a given fluorophore, we propose a model with three states ( $S_{0}, S_{1}$ and $S_{2}$ ) and three transition probabilities ( $k_{i s c}, k_{T}$ and $k_{b}$ ) between them (Appendix 1—figure 13).

Appendix 1—figure 13

Download asset Open asset

Left panel: The reduced Jablonski diagram for the fluorophore model (Appendix 1—figure 12).

Right panel: Diagram of probabilities for the transitions between the states of a fluorophore for our fluorophore’s model. In our model, the basal ( $E_{1}$ ) and the excited ( $E_{2}$ ) states, from the reduced Jablonski diagram, are collapsed in a new excited state $S_{0}$ (green). The justification for this lies in the fact that the fluorescence phenomenon ( $k_{e x}, k_{e m}$ ) occurs on the scale of nanoseconds or less; however, an image collected with an EM-CCD camera regularly has millisecond exposure times, therefore, it involves the integration of $105 - 106$ cycles of photon emission; as a consequence, the $E_{1}$ state is never detected. The entrance into the triplet excited state ( $k_{i s c}, k_{T}$ ) happens on the scale of seconds; if the emission process occurs, it releases photons of lower energy that are not detected and, therefore, the triplet state is considered as a dark state $S_{1}$ (violet). Finally, the photobleaching state $S_{2}$ (gray) is a irreversible process ( $k_{b})$ culminating with the destruction of the coordinating center of resonant electrons (orbitals $π$ ) which is responsible for absorbing photons.

https://doi.org/10.7554/eLife.42906.030

For $i = 0, 1, 2$ , we define the dynamics of the energy state $S_{i}$ as the time evolution of the concentration $[E_{i}]$ of molecules (fluorophores) emitting photons:

\begin{aligned} \frac{d [S_{0}]}{d t} & = k_{T} [S_{1}] - k_{i s c} [S_{0}], \\ \frac{d [S_{1}]}{d t} & = k_{i s c} [S_{0}] - (k_{T} + k_{b}) [S_{1}], \\ \frac{d [S_{2}]}{d t} & = k_{b} [S_{1}] . \end{aligned}

The sequence of images taken through the time keeps track of the state the fluorophore is in. If in the $i$ th image the fluorophore has intensity (the value of the pixel approaches the maximum value), then the fluorophore is in its emitting state; in any other case, the fluorophore is off, and it can be in two states: the non-emitting state or the bleached state. We normalize the data and compute the mean intensity at time t of the fluorophores in the emitting state: by definition, the initial mean value is one; after that, it is going to decay depending on the acquiring protocol.

Following the classical regression methods, we fit the experimental data to the proposed differential equations system. The estimate of the rate constant $k_{i s c}$ , $k_{T}$ and $k_{b}$ from the stack of images is obtained following the example taken from Rawlings and Ekerdt (2013).

Let us consider the matrix $P$ from Equation (57) as the transition matrix of our model. Then, by the definition of our model, $P_{2} = k_{i s c}$ , $P_{3} = k_{T}$ and $P_{5} = k_{b}$ . Also, by the general properties of the probability function, the sum of the entries of each row of this matrix is one; so we obtain the transition matrix as follows:

P = (\begin{matrix} P_{1} & P_{2} & 0 \\ P_{3} & P_{4} & P_{5} \\ 0 & 0 & 1 \end{matrix}) = (\begin{matrix} ? & k_{i s c} & 0 \\ k_{T} & ? & k_{b} \\ 0 & 0 & 1 \end{matrix}) = (\begin{matrix} 1 - k_{i s c} & k_{i s c} & 0 \\ k_{T} & 1 - (k_{T} + k_{i s c}) & k_{b} \\ 0 & 0 & 1 \end{matrix})

As we mentioned earlier, the estimate of the transition matrix depends on the acquisition protocol. There are image sequences for which the fitting does not converge; this happens when the average intensity decays exponentially. However, when the average intensity decreases linearly, the adjustment of the data to the converging model results in a transition matrix that fits the data.

After the fitting to the experimental data we simulate a Markov chain for the calculated transition matrix (Appendix 1—figure 14). We also observe that greater number of fluorophores leads to better concordance between the simulation and the experimental data.

Appendix 1—figure 14

Download asset Open asset

Markov Chain simulations with the new transition matrix.

(**A–C**) shows the simulation of the dynamics for 1, 10 and 1000 fluorophores respectively. A.1, B.1 and C.1 indicates the state of each fluorophore in the time; A.2, B.2 and C.2 point out the amount of fluorophores that are in the state $S_{0}$ in the time.

https://doi.org/10.7554/eLife.42906.031

To corroborate that the proposed fitting is adjusted properly we use the experimental data provided by the authors of the 3B SRM microscopy algorithm. The podosomes data is available at the ThreeB plugin for ImageJ (Rosten et al., 2013), is a time serie of 300 images. Taking this stack as initial experimental data, we simulate Markov chains for 1000 fluorophores with the original transition matrix and the transition matrix fitted by our model. Although we found that our model fits better to the experimental data (Appendix 1—figure 15); the comparision between the SR images of the podosomes are not conclusive about the resolution enhancement with our model (Appendix 1—figure 16).

Appendix 1—figure 15

Download asset Open asset

Simulation estimation chart of 1000 fluorophores.

The $y$ -axis is the normalization of fluorophores in the $S_{0}$ state (emission) in the time $i$ ( $x$ -axis). The normalized data extracted from the images are shown in black. The purple and the brown dots represent the simulation of 1000 fluorophores as a Markov chain with the transition matrix that we propose and with the original 3B matrix, respectively. The transition matrix that we propose is shown in the bottom of the figure.

https://doi.org/10.7554/eLife.42906.032

Appendix 1—figure 16

Download asset Open asset

Super-resolution images generated with different transition matrices.

(A) SR image using the transition matrix constructed by our model. (B) SR image with the original transition matrix. (C) Composite image comparing the two SR images in green (A) and in red (B).

https://doi.org/10.7554/eLife.42906.033

Furthermore, in order to validate our method, we use Gatta-paint nanorulers, DNA molecules label with 488, 550 and 655 fluorophores at a distance of $40 n m$ (GATTA-PAINT 40RG/40B, immobilized in buffer on glass-slide) to resolve the resolution enhacement of our method.

As in the previous analysis, we find that our model fits better to the experimental data obtained from the different ATTOS. Because our model involves a greater number of fluorophores, there is more information of the fluorophores localization. For example, in the Appendix 1—figure 17A.1, the information in the SR image with the original matrix, resolves only one fluorophore; however, with our method, three fluorophores in the same area can be resolved. In this way, we observed that in all cases the distance between the fluorophores (each peak) of the grahps shown in panels A.1-A.3 from Appendix 1—figure 17 is approximately 40 nm; this coincides with the information provided by the manufacturers of the samples. All together, these results demonstrate that the new proposed method improves the resolution capacity of 3B algorithm by reflecting better the photophysical behavior of the fluorophores.

Appendix 1—figure 17

Download asset Open asset

Comparison between the original model of 3B against the obtained with the transition matrix that we propose.

Columns A-C represent the nanorulers GATTA-PAINT labelled with ATTO 488, 550 and 655 respectively. Panels A-C show the normalized experimental data (black dots) and the simulation for 1000 fluorophores with the matrix that we propose (purple dots) and with the original 3B matrix (burgundy dots); each point represents the mean value of the fluorescence. The time is indicated in frames, acquisition time between images is 100 ms. A complete experiment consists of 300 images collected in a CellTIRF microscope with a 160× magnification. Panels A.1- C.1 show two ROI’s from the complete SR images: 3B) canonical reconstruction and 3B-ODE) reconstruction with the ODE’s model. The graph shows a line profile from the two reconstructions, in green the results from original 3B and in black with the transition matrix that we propose, the x-label is the distance in nm and the y-label is the intensity pixel value. The peaks from the graph denote the localization of a fluorophore.

https://doi.org/10.7554/eLife.42906.034

Once the transition matrix for the nanorulers was calculated and taking into account that the data consists of hundreds of images, we parallelized the data on a cluster and on a personal computer (Hernández et al., 2016) (the cluster has 314 nodes with RedHat Enterprise Linux Version 6.2, 2 Intel Xeon Sandy Bridge E5-2670 processors at 2.6 GHz with 8 cores each 64 GB of RAM) to reduce the computation time.

https://doi.org/10.7554/eLife.42906.008

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files.

References

(1998) Rotavirus NSP5 phosphorylation is up-regulated by interaction with NSP2
Journal of General Virology 79 Pt 11:2679–2686.
https://doi.org/10.1099/0022-1317-79-11-2679
- Google Scholar
(1980) Ultrastructural study of Rotavirus replication in cultured cells
Journal of General Virology 46:75–85.
https://doi.org/10.1099/0022-1317-46-1-75
- PubMed
- Google Scholar
1. Altman DG
2. Bland JM
(2005) Standard deviations and standard errors
BMJ 331:903.
https://doi.org/10.1136/bmj.331.7521.903
- PubMed
- Google Scholar
(1996)
Recovery and characterization of a replicase complex in rotavirus-infected cells by using a monoclonal antibody against NSP2

Journal of Virology 70:985–991.
- PubMed
- Google Scholar
1. Au KS
2. Chan WK
3. Burns JW
4. Estes MK
(1989)
Receptor activity of Rotavirus nonstructural glycoprotein NS28

Journal of Virology 63:4553–4562.
- PubMed
- Google Scholar
(2006) Rotavirus NSP4 induces a novel vesicular compartment regulated by calcium and associated with viroplasms
Journal of Virology 80:6061–6071.
https://doi.org/10.1128/JVI.02167-05
- PubMed
- Google Scholar
1. Berois M
2. Sapin C
3. Erk I
4. Poncet D
5. Cohen J
(2003) Rotavirus nonstructural protein NSP5 interacts with major core protein VP2
Journal of Virology 77:1757–1763.
https://doi.org/10.1128/JVI.77.3.1757-1763.2003
- PubMed
- Google Scholar
(2017) Protein-mediated RNA folding governs sequence-specific interactions between Rotavirus genome segments
eLife 6:e27453.
https://doi.org/10.7554/eLife.27453
- PubMed
- Google Scholar
(2018) Genome packaging in multi-segmented dsRNA viruses: distinct mechanisms with similar outcomes
Current Opinion in Virology 33:106–112.
https://doi.org/10.1016/j.coviro.2018.08.001
- PubMed
- Google Scholar
Website
1. Brown R
(2007) Fits circles to 2D data using nonlinear least squares to minimise geometric error
Accessed January 20, 2017.

https://la.mathworks.com/matlabcentral/fileexchange/15060-fitcircle-m
1. Cabral-Romero C
2. Padilla-Noriega L
(2006) Association of Rotavirus viroplasms with microtubules through NSP2 and NSP5
Memórias Do Instituto Oswaldo Cruz 101:603–611.
https://doi.org/10.1590/S0074-02762006000600006
- Google Scholar
(2005) RNA interference of Rotavirus segment 11 mRNA reveals the essential role of NSP5 in the virus replicative cycle
Journal of General Virology 86:1481–1487.
https://doi.org/10.1099/vir.0.80598-0
- PubMed
- Google Scholar
(2013) Rotavirus viroplasm proteins interact with the cellular SUMOylation system: implications for viroplasm-like structure formation
Journal of Virology 87:807–817.
https://doi.org/10.1128/JVI.01578-12
- PubMed
- Google Scholar
(2010) Characterization of viroplasm formation during the early stages of Rotavirus infection
Virology Journal 7:350.
https://doi.org/10.1186/1743-422X-7-350
- PubMed
- Google Scholar
Book
(1983)
Graphical Methods for Data Analysis (Statistics)

Chapman and Hall.
- Google Scholar
1. Chasey D
(1980) Investigation of immunoperoxidase-labelled Rotavirus in tissue culture by light and electron microscopy
Journal of General Virology 50:195–200.
https://doi.org/10.1099/0022-1317-50-1-195
- PubMed
- Google Scholar
1. Cheung W
2. Gill M
3. Esposito A
4. Kaminski CF
5. Courousse N
6. Chwetzoff S
7. Trugnan G
8. Keshavan N
9. Lever A
10. Desselberger U
(2010) Rotaviruses associate with cellular lipid droplet components to replicate in Viroplasms, and compounds disrupting or blocking lipid droplets inhibit viroplasm formation and viral replication
Journal of Virology 84:6782–6798.
https://doi.org/10.1128/JVI.01757-09
- PubMed
- Google Scholar
(2010) Rotavirus NSP5 orchestrates recruitment of viroplasmic proteins
Journal of General Virology 91:1782–1793.
https://doi.org/10.1099/vir.0.019133-0
- PubMed
- Google Scholar
(2011) Bayesian localization microscopy reveals nanoscale podosome dynamics
Nature Methods 9:195–200.
https://doi.org/10.1038/nmeth.1812
- PubMed
- Google Scholar
1. Criglar JM
2. Hu L
3. Crawford SE
4. Hyser JM
5. Broughman JR
6. Prasad BV
7. Estes MK
(2014) A novel form of Rotavirus NSP2 and phosphorylation-dependent NSP2-NSP5 interactions are associated with viroplasm assembly
Journal of Virology 88:786–798.
https://doi.org/10.1128/JVI.03022-13
- PubMed
- Google Scholar
1. Dempsey GT
2. Vaughan JC
3. Chen KH
4. Bates M
5. Zhuang X
(2011) Evaluation of fluorophores for optimal performance in localization-based super-resolution imaging
Nature Methods 8:1027–1036.
https://doi.org/10.1038/nmeth.1768
- PubMed
- Google Scholar
(2014) Precisely and accurately localizing single emitters in fluorescence microscopy
Nature Methods 11:253–266.
https://doi.org/10.1038/nmeth.2843
- PubMed
- Google Scholar
1. Dhillon P
2. Tandra VN
3. Chorghade SG
4. Namsa ND
5. Sahoo L
6. Rao CD
(2018) Cytoplasmic relocalization and colocalization with viroplasms of host cell proteins, and their role in Rotavirus infection
Journal of Virology 92:e00612-18.
https://doi.org/10.1128/JVI.00612-18
- PubMed
- Google Scholar
(2004) Characterization of Rotavirus NSP2/NSP5 interactions and the dynamics of viroplasm formation
Journal of General Virology 85:625–634.
https://doi.org/10.1099/vir.0.19611-0
- PubMed
- Google Scholar
1. Eichwald C
2. Arnoldi F
3. Laimbacher AS
4. Schraner EM
5. Fraefel C
6. Wild P
7. Burrone OR
8. Ackermann M
(2012) Rotavirus viroplasm fusion and perinuclear localization are dynamic processes requiring stabilized microtubules
PLOS ONE 7:e47947.
https://doi.org/10.1371/journal.pone.0047947
- PubMed
- Google Scholar
Book
1. Estes M
2. Greenberg H
(2013)
Rotaviruses

In: Knipe D, Howley P, Cohen J, Griffin D, Lamb R, Martin M, Racaniello V, Roizman B, editors. Fields Virology (Sixth Edition). Philadelphia: Wolters Kluwer Health/Lippincott Williams & Wilkins. pp. 1347–1401.
- Google Scholar
(1999) Two non-structural Rotavirus proteins, NSP2 and NSP5, form viroplasm-like structures in vivo
Journal of General Virology 80 Pt 2:333–339.
https://doi.org/10.1099/0022-1317-80-2-333
- Google Scholar
(1999) Direct least square fitting of ellipses
IEEE Transactions on Pattern Analysis and Machine Intelligence 21:476–480.
https://doi.org/10.1109/34.765658
- Google Scholar
(1994) Least-squares fitting of circles and ellipses
Bit 34:558–578.
https://doi.org/10.1007/BF01934268
- Google Scholar
(2016) Automatic detection and measurement of viral replication compartments by ellipse adjustment
Scientific Reports 6:36505.
https://doi.org/10.1038/srep36505
- PubMed
- Google Scholar
Software
1. Garcés Suárez Y
(2019) Nanoscale organization of rotavirus replication machineries, version 959a366
GitHub.

https://github.com/Yasel88/Nanoscale_organization_of_rotavirus_replication_machineries
(1998) In vivo interactions among Rotavirus nonstructural proteins
Archives of Virology 143:981–996.
https://doi.org/10.1007/s007050050347
- PubMed
- Google Scholar
(2000) Relative localization of viroplasmic and endoplasmic reticulum-resident Rotavirus proteins in infected cells
Archives of Virology 145:1963–1973.
https://doi.org/10.1007/s007050070069
- PubMed
- Google Scholar
1. Grant MJ
2. Loftus MS
3. Stoja AP
4. Kedes DH
5. Smith MM
(2018) Superresolution microscopy reveals structural mechanisms driving the nanoarchitecture of a viral chromatin tether
PNAS 115:4992–4997.
https://doi.org/10.1073/pnas.1721638115
- PubMed
- Google Scholar
1. Greenberg H
2. McAuliffe V
3. Valdesuso J
4. Wyatt R
5. Flores J
6. Kalica A
7. Hoshino Y
8. Singh N
(1983)
Serological analysis of the subgroup protein of Rotavirus, using monoclonal antibodies

Infection and Immunity 39:91–99.
- PubMed
- Google Scholar
(2009) Super-resolution imaging with small organic fluorophores
Angewandte Chemie International Edition 48:6903–6908.
https://doi.org/10.1002/anie.200902073
- PubMed
- Google Scholar
Book
(2016) Parallelizing the Bayesian Analysis of Blinking and Bleaching for Super-Resolution Microscopy
In: Gitler I, Klapp J, editors. High Performance Computer Applications, 595. Springer. pp. 356–366.
https://doi.org/10.1007/978-3-319-32243-8_25
- Google Scholar
Book
(2013)
Nonparametric Statistical Methods (Third Edition)

Wiley.
- Google Scholar
Book
1. Howell D
(1982)
Statistical Methods for Psychology (Eigth Edition)

Wadsworth Publishing.
- Google Scholar
(2013) Accelerating 3B single-molecule super-resolution microscopy with cloud computing
Nature Methods 10:96–97.
https://doi.org/10.1038/nmeth.2335
- PubMed
- Google Scholar
1. Jiang X
2. Jayaram H
3. Kumar M
4. Ludtke SJ
5. Estes MK
6. Prasad BV
(2006) Cryoelectron microscopy structures of Rotavirus NSP2-NSP5 and NSP2-RNA complexes: implications for genome replication
Journal of Virology 80:10829–10835.
https://doi.org/10.1128/JVI.01347-06
- PubMed
- Google Scholar
(1988)
Two forms of VP7 are involved in assembly of SA11 Rotavirus in endoplasmic reticulum

Journal of Virology 62:2929–2941.
- PubMed
- Google Scholar
(1988) Snakes: active contour models
International Journal of Computer Vision 1:321–331.
https://doi.org/10.1007/BF00133570
- Google Scholar
Book
1. Kiefer J
(1987)
Introduction to Statistical Inference (First Edition)

Springer Texts in Statistics.
- Google Scholar
(2015) Structural analysis of herpes simplex virus by optical super-resolution imaging
Nature Communications 6:2058–2066.
https://doi.org/10.1038/ncomms6980
- Google Scholar
Book
1. Lawrence R
(1953)
A Textbook of Econometrics

Lawrence R, editors. Peterson & Co.
- Google Scholar
1. López T
2. Camacho M
3. Zayas M
4. Nájera R
5. Sánchez R
6. Arias CF
7. López S
(2005a) Silencing the morphogenesis of Rotavirus
Journal of Virology 79:184–192.
https://doi.org/10.1128/JVI.79.1.184-192.2005
- PubMed
- Google Scholar
(2005b) Reduced expression of the Rotavirus NSP5 gene has a pleiotropic effect on virus replication
Journal of General Virology 86:1609–1617.
https://doi.org/10.1099/vir.0.80827-0
- PubMed
- Google Scholar
1. Maass DR
2. Atkinson PH
(1990)
Rotavirus proteins VP7, NS28, and VP4 form oligomeric structures

Journal of Virology 64:2632–2641.
- PubMed
- Google Scholar
1. Manetsberger J
2. Manton JD
3. Erdelyi MJ
4. Lin H
5. Rees D
6. Christie G
7. Rees EJ
(2015) Ellipsoid localization microscopy infers the size and order of protein layers in Bacillus spore coats
Biophysical Journal 109:2058–2066.
https://doi.org/10.1016/j.bpj.2015.09.023
- Google Scholar
1. Mann HB
2. Whitney DR
(1947) On a test of whether one of two random variables is stochastically larger than the other
The Annals of Mathematical Statistics 18:50–60.
https://doi.org/10.1214/aoms/1177730491
- Google Scholar
(2011) Structural organisation of the Rotavirus nonstructural protein NSP5
Journal of Molecular Biology 413:209–221.
https://doi.org/10.1016/j.jmb.2011.08.008
- PubMed
- Google Scholar
(2008) Endoplasmic reticulum chaperones are involved in the morphogenesis of Rotavirus infectious particles
Journal of Virology 82:5368–5380.
https://doi.org/10.1128/JVI.02751-07
- PubMed
- Google Scholar
(1978) Variations of box plots
The American Statistician 32:12–16.
https://doi.org/10.1080/00031305.1978.10479236
- Google Scholar
1. Mehnert A
2. Jackway P
(1997) An improved seeded region growing algorithm
Pattern Recognition Letters 18:1065–1071.
https://doi.org/10.1016/S0167-8655(97)00131-1
- Google Scholar
(2016) Rotavirus replication and reverse genetics
Viral Gastroenteritis pp. 121–143.
https://doi.org/10.1016/B978-0-12-802241-2.00007-9
- Google Scholar
1. Nejmeddine M
2. Trugnan G
3. Sapin C
4. Kohli E
5. Svensson L
6. Lopez S
7. Cohen J
(2000) Rotavirus spike protein VP4 is present at the plasma membrane and is associated with microtubules in infected cells
Journal of Virology 74:3313–3320.
https://doi.org/10.1128/JVI.74.7.3313-3320.2000
- PubMed
- Google Scholar
1. Oehlert G
(1992) A note on the Delta method
The American Statistician 46:27–29.
https://doi.org/10.2307/2684406
- Google Scholar
1. Osher S
2. Fedkiw R
(2003)
Level Set Methods and Dynamic Implicit Surfaces. Applied Mathematical Sciences

79–86, Particle Level Set Method, Level Set Methods and Dynamic Implicit Surfaces. Applied Mathematical Sciences, Springer.
- Google Scholar
1. Pando V
2. Isa P
3. Arias CF
4. López S
(2002) Influence of calcium on the early steps of Rotavirus infection
Virology 295:190–200.
https://doi.org/10.1006/viro.2001.1337
- PubMed
- Google Scholar
(1982) Localization of Rotavirus antigens in infected cells by ultrastructural immunocytochemistry
Journal of General Virology 63:457–467.
https://doi.org/10.1099/0022-1317-63-2-457
- PubMed
- Google Scholar
(1984) Ultrastructural localization of Rotavirus antigens using colloidal gold
Virus Research 1:133–152.
https://doi.org/10.1016/0168-1702(84)90069-8
- PubMed
- Google Scholar
(1997)
In vivo and in vitro phosphorylation of Rotavirus NSP5 correlates with its localization in viroplasms

Journal of Virology 71:34–41.
- PubMed
- Google Scholar
1. Prybutok VR
(1989) An introduction to statistical methods and data analysis
Technometrics 31:389–390.
https://doi.org/10.1080/00401706.1989.10488572
- Google Scholar
Software
1. R Development Core Team
(2017) R: A Language and Environment for Statistical Computing
R Foundation for Statistical Computing, Vienna, Austria.

https://www.R-project.org
Book
1. Rawlings J
2. Ekerdt J
(2013)
Chemical Reactor Analysis and Design Fundamentals (Second Edition)

Nob Hill Pub.
- Google Scholar
1. Reznikov N
2. Bilton M
3. Lari L
4. Stevens MM
5. Kröger R
(2018) Fractal-like hierarchical organization of bone begins at the nanoscale
Science 360:eaao2189.
https://doi.org/10.1126/science.aao2189
- PubMed
- Google Scholar
(1986) Intracellular localization of rotaviral proteins
Archives of Virology 88:251–264.
https://doi.org/10.1007/BF01310879
- PubMed
- Google Scholar
1. Rosten E
2. Jones GE
3. Cox S
(2013) ImageJ plug-in for bayesian analysis of blinking and bleaching
Nature Methods 10:97–98.
https://doi.org/10.1038/nmeth.2342
- PubMed
- Google Scholar
(2017) Super-resolution microscopy with DNA-PAINT
Nature Protocols 12:1198–1228.
https://doi.org/10.1038/nprot.2017.024
- PubMed
- Google Scholar
1. Shapiro SS
2. Wilk MB
(1965) An analysis of variance test for normality (complete samples)
Biometrika 52:591–611.
https://doi.org/10.1093/biomet/52.3-4.591
- Google Scholar
1. Shaw RD
2. Vo PT
3. Offit PA
4. Coulson BS
5. Greenberg HB
(1986) Antigenic mapping of the surface proteins of rhesus Rotavirus
Virology 155:434–451.
https://doi.org/10.1016/0042-6822(86)90205-9
- PubMed
- Google Scholar
(2013) Genome-wide RNAi screen reveals a role for the ESCRT complex in Rotavirus cell entry
PNAS 110:10270–10275.
https://doi.org/10.1073/pnas.1304932110
- PubMed
- Google Scholar
(2004) Rotavirus replication: plus-sense templates for double-stranded RNA synthesis are made in viroplasms
Journal of Virology 78:7763–7774.
https://doi.org/10.1128/JVI.78.14.7763-7774.2004
- PubMed
- Google Scholar
(2012)
Image segmentation using seeded region growing

International Conference on Computing, Electronics and Electrical Technologies 2012:576–583.
- Google Scholar
1. Trask SD
2. Dormitzer PR
(2006) Assembly of highly infectious Rotavirus particles recoated with recombinant outer capsid proteins
Journal of Virology 80:11293–11304.
https://doi.org/10.1128/JVI.01346-06
- PubMed
- Google Scholar
(2004) Effects of intrabodies specific for Rotavirus NSP5 during the virus replicative cycle
Journal of General Virology 85:3285–3290.
https://doi.org/10.1099/vir.0.80075-0
- PubMed
- Google Scholar
1. Ver Hoef JM
(2012) Who invented the Delta method?
The American Statistician 66:124–127.
https://doi.org/10.1080/00031305.2012.687494
- Google Scholar
1. Walther TC
2. Farese RV
(2012) Lipid droplets and cellular lipid metabolism
Annual Review of Biochemistry 81:687–714.
https://doi.org/10.1146/annurev-biochem-061009-102430
- PubMed
- Google Scholar

Article and author information

Author details

Yasel Garcés Suárez

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico

Contribution
Data curation, Software, Formal analysis, Supervision, Validation, Investigation, Visualization, Methodology, Writing—original draft, Project administration, Writing—review and editing

Contributed equally with
Jose L Martínez

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0001-8401-8352
Jose L Martínez

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico

Contribution
Conceptualization, Supervision, Investigation, Visualization, Methodology, Writing—original draft, Project administration, Writing—review and editing

Contributed equally with
Yasel Garcés Suárez

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-0529-7558
David Torres Hernández
1. Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico
2. Laboratorio Nacional de Microscopía Avanzada, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico
Contribution
Investigation, Visualization, Methodology

Competing interests
No competing interests declared
Haydee Olinca Hernández
1. Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico
2. Laboratorio Nacional de Microscopía Avanzada, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico
Contribution
Software, Validation, Investigation, Visualization, Methodology

Competing interests
No competing interests declared
Arianna Pérez-Delgado

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico

Contribution
Investigation, Methodology

Competing interests
No competing interests declared
Mayra Méndez

Centro de Investigación en Ciencias, Instituto de Investigación en Ciencias Básicas y Aplicadas, Universidad Autónoma del Estado de Morelos, Cuernavaca, Mexico

Contribution
Software, Validation, Investigation, Writing—review and editing

Competing interests
No competing interests declared
Christopher D Wood
1. Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico
2. Laboratorio Nacional de Microscopía Avanzada, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico
Contribution
Investigation, Methodology, Writing—review and editing

Competing interests
No competing interests declared
Juan Manuel Rendon-Mancha

Centro de Investigación en Ciencias, Instituto de Investigación en Ciencias Básicas y Aplicadas, Universidad Autónoma del Estado de Morelos, Cuernavaca, Mexico

Contribution
Supervision, Investigation

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-9629-7050
Daniela Silva-Ayala

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico

Present address
Harvard Medical School, Boston, United States

Contribution
Investigation, Methodology

Competing interests
No competing interests declared
Susana López

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico

Contribution
Conceptualization, Supervision, Funding acquisition, Investigation, Methodology, Project administration, Writing—review and editing

Competing interests
No competing interests declared
Adán Guerrero
1. Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico
2. Laboratorio Nacional de Microscopía Avanzada, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico
Contribution
Conceptualization, Software, Supervision, Validation, Investigation, Methodology, Writing—review and editing

For correspondence
adanog@ibt.unam.mx

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-4389-5516
Carlos F Arias

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico City, Mexico

Contribution
Conceptualization, Supervision, Funding acquisition, Investigation, Methodology, Writing—original draft, Project administration, Writing—review and editing

For correspondence
arias@ibt.unam.mx

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-3130-4501

Funding

Dirección General de Asuntos del Personal Académico, Universidad Nacional Autónoma de México (IG200317)

Susana López
Carlos F Arias

Dirección General de Asuntos del Personal Académico, Universidad Nacional Autónoma de México (IA202417)

Adán Guerrero

Universidad Nacional Autónoma de México (SC15-1-IR-89)

Adán Guerrero

Consejo Nacional de Ciencia y Tecnología (252213)

Adán Guerrero

Dirección General de Asuntos del Personal Académico, Universidad Nacional Autónoma de México (IN202312)

Haydee Olinca Hernández

Universidad Nacional Autónoma de México (SC16-1-IR-102)

Adán Guerrero

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

YG received a postdoctoral fellowship from DGAPA-UNAM at the Institute of Biotechnology (IBt-UNAM). AG thanks DGTIC-UNAM for generous computing time on the Miztli supercomputer (Grant numbers: SC15-1-IR-89; SC16-1-IR-102). JLM and DTH are recipients of scholarships from CONACyT. HOH received a grant from the Programa de Apoyo a Proyectos de Investigación e Innovación Tecnológica (PAPIIT-UNAM), IN202312. Microscopy equipment was provided and maintained through CONACYT grants 123007, 232708, 260541, 280487, 293624. AG thanks CONACyT (No. 252213) and DGAPA-PAPIIT (No. IA202417), SL and CFA thank DGAPA-PAPIIT grant IG200317 for funding. We are thankful to IBt-UNAM for providing access to the computer cluster and to Jerome Verleyen for his support while using it. We are also thankful to Arturo Pimentel, Andrés Saralegui and Xochitl Alvarado from LNMA-UNAM for their helpful discussions. The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Version history

Received: October 17, 2018
Accepted: July 22, 2019
Accepted Manuscript published: July 25, 2019 (version 1)
Version of Record published: August 13, 2019 (version 2)
Version of Record updated: August 22, 2019 (version 3)
Version of Record updated: September 17, 2019 (version 4)

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

2,524

views
399

downloads
18

citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Article PDF

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Yasel Garcés Suárez
Jose L Martínez
David Torres Hernández
Haydee Olinca Hernández
Arianna Pérez-Delgado
Mayra Méndez
Christopher D Wood
Juan Manuel Rendon-Mancha
Daniela Silva-Ayala
Susana López
Adán Guerrero
Carlos F Arias

(2019)

Nanoscale organization of rotavirus replication machineries

eLife 8:e42906.

https://doi.org/10.7554/eLife.42906

Categories and tags

Research organism

Virus

Share this article

Cite this article

Relative distribution of viral components in rotavirus-VPs.

Exploratory analysis of the results obtained by the algorithm VPs-DLSFC.

The organization of VPs scales with its size.

Relative structural distribution of VP components.

Scheme of the ‘Viroplasm Direct Least Square Fitting Circumference’ algorithm (VPs-DLSFC).

Simulation of the viral proteins.

Generation of partial occlusion in the angle θ=π (red line).

Data adjustment through the algorithms DLSFC (solid blue line), ALSFC (solid green line) and GLSFC (solid red line).

Error in the adjustment of the algorithms DLSFC, GLSFC and ALSFC.

Results of the Binomial Test.

Two-sample Mann-Whitney hypothesis test between a (major semi-axis of the ellipse ) and r (circumference radius).

Two-sample Mann-Whitney hypothesis test between b (minor semi-axis of the ellipse ) and r (circumference radius).

Boxplot of the differences in location between the center of the circumference that adjust NSP2 and the centers of the others nine viral elements.

Difference in location between the centers of NSP2 and {NSP5, NSP4, VP1, VP2, VP6, VP4, VP7-Tri, VP7-Mon}.

Supplementary exploratory analysis of the results obtained by the algorithm VPs-DLSFC.

Two-sample Mann-Whitney hypothesis test, considering as variables the radii of NSP2 in contrast with the radii of the others seven viral proteins.

Examples of small zones of colocalization between differents viral proteins.

Results and validation of the linear regression models.

Residuals errors for each linear regression model.

VP6 and VP4 spatial distribution taking NSP5 as reference protein.

Linear regression results with NSP5 used as independent variable.

VP6 spatial distribution taking NSP4 as reference protein.

Two-sample Mann-Whitney hypotheses test, considering the radius of NSP5 in contrast with the radius of VP6 and VP4.

Linear regression results with NSP4 as independent variable.

Diagram of probabilities for the transitions between the states of a fluorophore for the 3B-algorithm.

Left panel: The reduced Jablonski diagram for the fluorophore model (Appendix 1—figure 12).

Markov Chain simulations with the new transition matrix.

Simulation estimation chart of 1000 fluorophores.

Super-resolution images generated with different transition matrices.

Comparison between the original model of 3B against the obtained with the transition matrix that we propose.

Author details

Yasel Garcés Suárez

Contribution

Contributed equally with

Competing interests

Jose L Martínez

Contribution

Contributed equally with

Competing interests

David Torres Hernández

Contribution

Competing interests

Haydee Olinca Hernández

Contribution

Competing interests

Arianna Pérez-Delgado

Contribution

Competing interests

Mayra Méndez

Contribution

Competing interests

Christopher D Wood

Contribution

Competing interests

Juan Manuel Rendon-Mancha

Contribution

Competing interests

Daniela Silva-Ayala

Present address

Contribution

Competing interests

Susana López

Contribution

Competing interests

Adán Guerrero

Contribution

For correspondence

Competing interests

Carlos F Arias

Contribution

For correspondence

Competing interests

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism

Further reading

Generation of partial occlusion in the angle $θ = π$ (red line).

Two-sample Mann-Whitney hypothesis test between $a$ (major semi-axis of the ellipse ) and $r$ (circumference radius).

Two-sample Mann-Whitney hypothesis test between $b$ (minor semi-axis of the ellipse ) and $r$ (circumference radius).