Nanoscale organization of rotavirus replication machineries

Rotavirus genome replication and assembly take place in cytoplasmic electron dense inclusions termed viroplasms (VPs). Previous conventional optical microscopy studies observing the intracellular distribution of rotavirus proteins and their organization in VPs have lacked molecular-scale spatial resolution, due to inherent spatial resolution constraints. In this work we employed super-resolution microscopy to reveal the nanometric-scale organization of VPs formed during rotavirus infection, and quantitatively describe the structural organization of seven viral proteins within and around the VPs. The observed viral components are spatially organized as five concentric layers, in which NSP5 localizes at the center of the VPs, surrounded by a layer of NSP2 and NSP4 proteins, followed by an intermediate zone comprised of the VP1, VP2, VP6. In the outermost zone, we observed a ring of VP4 and finally a layer of VP7. These findings show that rotavirus VPs are highly organized organelles.


Introduction
Rotavirus is a non-enveloped virus composed of three concentric layers of proteins that enclose a genome constituted by eleven segments of double stranded RNA (dsRNA) that encode six structural proteins (VP1 to VP4, VP6 and VP7) and six non-structural proteins (NSP1 to NSP6). The inner layer is formed by dimers of VP2 that enclose the viral genome and small numbers of molecules of the viral RNA-dependent RNA polymerase (RdRp), VP1, and the capping enzyme, VP3. This nucleoprotein complex constitutes the core of the virus, which is surrounded by an intermediate protein layer of trimers of VP6, to form double-layered particles (DLPs). The surface of the virion is occupied by two polypeptides, VP7, a glycoprotein, and VP4, which forms spikes that protrude from the VP7 shell (Estes and Greenberg, 2013). Replication of the rotavirus genome and assembly of DLPs take place in cytoplasmic electron dense inclusions termed viroplasms (VPs) (Estes and Greenberg, 2013). Once the double-shelled particles are assembled, they bud from the cytoplasmic VPs into the adjacent endoplasmic reticulum (ER). During this process, which is mediated by the interaction of DLPs with the ER transmembrane viral protein NSP4, the particles acquire a temporary lipid bilayer, modified by VP7 and NSP4, which after being removed in the lumen of the ER by an unknown mechanism, yields the mature triple-layered virions (Estes and Greenberg, 2013). It has been reported that VP4 is located between the VP and the ER membrane and it is incorporated into triple-layered particles (TLPs) during the budding process and maturation of the virus particle inside the ER (Estes and Greenberg, 2013;Navarro et al., 2016).
The viral non-structural proteins NSP2 and NSP5 serve a nucleation role that is essential for the biogenesis of VPs (Fabbretti et al., 1999;Silvestri et al., 2004;Vascotto et al., 2004;Campagna et al., 2005). In addition to viral proteins and genomic dsRNA, cellular proteins such as ER chaperones (Maruri-Avidal et al., 2008), proteins associated with lipid droplets (Cheung et al., 2010), and ribonuclear proteins (Dhillon et al., 2018), have been shown to colocalize with VPs. Several studies have characterized the intracellular distribution of the rotavirus proteins (González et al., 2000;Petrie et al., 1982;Petrie et al., 1984;Richardson et al., 1986). Immunofluorescence studies, based upon epifluorescence or confocal microscopy, have described the viral proteins that conform the VPs, however the images are inherently diffraction-limited to a spatial resolution in the range of hundreds of nanometer, precluding the identification of the nanoscopic molecular scale organization of VPs (González et al., 1998;González et al., 2000;Eichwald et al., 2004;Ló pez et al., 2005b;Criglar et al., 2014;Martin et al., 2011;Contin et al., 2010). On the other hand, transmission electron microscopy (TEM) studies often provide images with nanometric resolution, nevertheless, immunoelectron microscopy is challenging when looking for the localization of more than a single protein (Altenburg et al., 1980;Petrie et al., 1982;Petrie et al., 1984). Over the past 15 years, a variety of super-resolution microscopy (SRM) techniques have been developed to observe subcellular structures beneath the diffraction limit of optical microscopes, with resolutions in the tens of nanometers (Schnitzbauer et al., 2017;Deschout et al., 2014;Cox et al., 2011). In this work, we determined the organization of rotaviral proteins within and around VPs through the 'Bayesian Blinking and Bleaching' (3B) SRM technique. We developed a segmentation algorithm to automatically analyze and quantify the relative distribution of seven viral proteins, and propose a model that describes their relative spatial distribution. Also, we present a dependency model that explains the relationship between the viral proteins. This work establishes a structural framework for VP organization that future mechanistic and functional studies must take into account, and establishes key methodologies for future investigations on this subject. eLife digest Rotaviruses are small viruses that can infect cells in the intestine. They are responsible for most cases of severe infectious diarrhea, the most common cause of death among young children in developing countries. Controlling the spread of rotavirus infections is difficult, even with high levels of hygiene, so effective treatments are essential to curtail the virus' infections. Understanding how new rotaviral particles are made in infected cells is one of the first steps toward developing new therapies.
Once rotaviruses enter the cells, proteins from the virus and the cell aggregate into compact spheres called viroplasms to make new viral particles. Studying these viroplasms used to be difficult because they are too small to see with the resolution of standard microscopes. In recent years, advances in microscopy and mathematical methods have focused on breaking the existing resolution limits, leading to the development of super-resolution microscopy. This new technique has made it possible to study objects with sizes in the order of a billionth of a meter, known as nanoscopic structures, including viroplasms.
Garcé s et al. use super-resolution microscopy to determine how viral proteins are arranged in the viroplasm and gain a better understanding of how the viruses are assembled. The images revealed that, in infected monkey kidney cells, rotavirus proteins inside the viroplasm form highly organized concentric layers. This arrangement is reliably repeated in viroplasms of different sizes, indicating that the organization of the proteins is likely set up when the viroplasm starts to form.
These findings make use of new microscopy, image analysis and statistical tools to study rotaviruses, providing a new framework to understand many aspects of rotaviral biology. Additionally, the result showing that proteins organize consistently in viroplasms is a first step towards understanding how the machinery that makes new rotaviruses works, which could lead to future treatments for severe infectious diarrhea.

Results
Qualitative analysis of VP morphology and structure through SRM Rotavirus VPs are complex signaling hubs composed of viral and cellular proteins, packed together with viral RNAs. By TEM, they roughly resemble circular electrodense structures whose internal components lack an obvious degree of spatial organization (Altenburg et al., 1980;Eichwald et al., 2012). In this work, we determined the relative spatial distribution of VPs components by immunofluorescence and SRM in MA104 cells infected with the rhesus rotavirus strain RRV at 6 hr post-infection (hpi), using protein-specific antibodies. Due to their important role as nucleating factors during VP biogenesis, we selected either NSP2 or NSP5 as spatial relative reference for the distribution of the VP1, VP2 and VP6 proteins. VPs were optically sectioned through total internal reflection fluorescence microscopy (TIRF), with an excitation depth of field restricted to 200 nm from the coverslip. This approach avoids excitation of fluorophores marking structural components located away from this plane, that is towards the inner cellular milieu. Additionally, NSP2 was also co-immunostained with the viral outer layer protein VP4 as well as with the ER resident proteins NSP4 and VP7, all of which have been reported to form separate ring-like structures that closely associate with VPs (González et al., 2000). In order to gain more insight into the morphogenesis of rotavirus, we analyzed the distribution of both VP7 monomers (VP7-Mon) and trimers (VP7-Tri) since this protein is assembled into virus particles in the latter form (Kabcenell et al., 1988). The nanoscale distribution of VPs was then analyzed through 3B-SRM, with improvements in the technique, developed in the present work, to solve nanoscopic structures ('Stochastic model fitted for 3B super resolution microscopy'Appendix 1). By different methods of analysis VPs exhibit roughly a circular shape ( Figure 1A-E). However, unlike the diffraction-limited image ( Figure 1B), in super-resolution microscopy structural details of VP are appreciated, like the different layer distributions of viral components with respect to NSP2 ( Figure 1C-E). In addition to VPs, by diffraction-limited TIRF microscopy we detected in the cytoplasm several small and dispersed puncta of fluorescence ( Figure 1B), and in these images it is also sometimes possible to differentiate the distribution of NSP2 from that of VP4, a closely viroplasm-associated viral protein (see also González et al., 2000); in this case, VP4 is detected as a ring-like structure that surrounds the VP. Nevertheless, the small size of the VPs effectively precludes measurement of component distribution for the majority of its structural elements, as their separation is below the spatial resolution of typical optical microscopes. In contrast, images obtained by 3B-SRM do allow the study of the relative distribution of the VP components ( Figure 1C-E). In the case of SRM images of VP4 ( Figure 1C), we observed that this protein forms a ring-like structure that does not colocalize with NSP2, and also ribbon-like projections that extend towards the cytoplasm, details that were not apparent in images captured with conventional fluorescence microscopy ( Figure 1B). Additionally, we observed that the small puncta of proteins detected in the cytoplasm were in fact ribbon-like structures composed of various viral proteins that may represent different organization forms of the viroplasmic proteins ( Figure 1C). In this regard, it is interesting to note that both NSP2 and VP4 have been reported to have at least two different intracellular distributions (González et al., 2000;Nejmeddine et al., 2000;Criglar et al., 2014). An examination of 3B-SRM images of VPs ( Figure 1C-E) revealed that the viral components form ring like structures within the VPs and are arrayed as rather discrete concentric layers. As seen in Figure 1C-E, we find that although the structural proteins VP1, VP2 and VP6 partially overlap in position with NSP2, the bulk of the proteins form separate and distinct layers. Also, the monomeric as well as the trimeric forms of VP7 are clearly distinguished from NSP2, forming an outer ring. Of interest, the spatial distribution of NSP4 colocalized with that of NSP2, an unexpected result since, as mentioned, NSP4 is an ER integral membrane protein (see the Discussion section), and as such it was expected to colocalize with VP7 rather than with an internal viroplasmic protein (Petrie et al., 1984). With regard to NSP5, it was observed distributed inside the ring formed by NSP2 ( Figure 1E).

Quantitative characterization of VPs structure by a novel segmentation algorithm
A qualitative analysis of the distribution of the VP components through 3B-SRM suggested that these are arranged as concentric spherical shells; thus, we set out to quantitatively validate the circularity of the VP shape. For this, we developed a segmentation algorithm based on a least squares approach, which we called 'Viroplasm Direct Least Squares Fitting Circumference' (VP-DLSFC) (see 'Segmentation Algorithm' in Appendix 1), to measure the spatial distribution of the components within individual VPs by adjusting concentric circumferences. This method is automatic, deterministic, easy to implement, and has a linear computational complexity. The performance of VP-DLSFC was tested on approximately 40,000 'ground truth' (GT) synthetic images, showing a high robustness to noise and partial occlusion scenarios. Additionally, we compared our method with two other alternative methods (Gander et al., 1994), and our approach displayed an improved performance (see 'Algorithm Validation' in Appendix 1). Based on this new algorithm, we find that the mean radius of the NSP5 distribution was smaller than that of NSP2, suggesting that NSP5 is located in the innermost section, as a component of the core of VPs (Figure 2A). On the other hand, the distribution of the structural proteins VP1, VP2 and VP6 exhibit slightly larger mean radii than that of NSP2, and are thus primarily localized in a zone surrounding NSP2. Continuing further towards the outer regions of the VP, we observed a region occupied by the spike protein VP4. Finally, the two different forms of VP7 (VP7-Mon and VP7-Tri) were located together, close to the most external region of the VPs (Figure 2A). The distribution of the glycoprotein NSP4 showed a similar mean radius to that of NSP2 (around 0:4m) suggesting, as described above, that these two proteins are located in the same structural layer of the VP (Figure 2A). In order to confirm our preliminary observations and clarify the nanoscopic organization of the VPs, we evaluated the relative separation between NSP2 and each accompanying protein. Again, the results show a remarkable degree of organization in the structure of the VP ( Figure 2B). As predicted from Figure 2A, we found that NSP5 is located in the internal part of the VP, in close proximity ( » 0:05m) to the area occupied by proteins NSP2 and NSP4, which themselves show the closest association. After the NSP2-NSP4 region, VP6 occupies a middle region at » 0:05m from NSP2, followed by the VP4 protein, which were located at a distance of » 0:18m. Finally, the VP7-Mon and VP7-Tri were situated at » 0:38m from NSP2 ( Figure 2B). A Mann-Whitney test showed that the distances of the various viral components in relation to NSP2 were significantly different ( Figure 2B), suggesting that they are situated in specific areas of the VPs. The two forms of VP7 were located at the same distance to NSP2, suggesting that the formation of trimers of VP7 takes place at the ER membrane, where the VP7 monomers should also be located. Note that in Figure 2B  (µm) *** *** *** *** *** *** *** *** *** *** *** *** *** NS.
NS.= Not Significant "*" = p-value < 0.05 "**" = p-value < 0.01 "***" = p-value < 0.001 distance of VP1 and VP2 to NSP2 was not included, since the radii obtained for NSP2 in these two combinations were significantly smaller than those found when it was determined in combination with the other VP components (see 'Supplementary Exploratory Analysis'). In addition to this, we found no significant differences between the distance of both VP1 and VP2 to NSP2 ( Figure 2C). Nonetheless, based on the inferential analysis, we could place these two proteins in the same layer as VP6 (see below). Next, through a hierarchical cluster analysis, we studied the relationship between the components of the VP, taking into account multiple variables at the same time, like the mean distance to NSP2 ['Mean(Dist)"], the standard deviation of the distance to NSP2 ['Std(Dist)"], the mean radius of NSP2 ['Mean(NSP2)"], and the radii of the other proteins ['Mean(Other)"] ( Figure 2D). Note that the proteins within a cluster should be as similar as possible and proteins in one cluster should be as dissimilar as possible from proteins in another. Because our variables are related with the distance to NSP2 and the radii of the proteins, this is a no-parametric analysis that should provide evidence about the spatial distribution/order of the viral proteins into the VP. As we are considering the distance to NSP2, VP1 and VP2 were not included in this analysis. The first level of the hierarchical agglomerative cluster ( Figure 2D, left) partitioned the VPs and the surrounding proteins in five clusters, composed by NSP4, NSP5, VP6, VP4 and {VP7-Mon, VP7-Tri}, which suggest that these five proteins compose different layers of the VP. The second agglomerative level merged into the same group the proteins NSP4 and NSP5, meanwhile VP6 and VP4 continue as independent clusters, which indicate that NSP5 and NSP4 are closer to each other than to VP6 and VP4 in the VP. In the third level, VP6 and VP4 are clustered in the same group, and as consequence are more related between them than with the others viral proteins. The subsequent groups in the clustering analysis indicate that VP7 remains as an independent layer with respect to the other proteins. Based on this analysis, the viral proteins seem to be highly organized, with VP7 conforming the most external layer, while NSP5, NSP4, VP6 and VP4 are distributed very close but as independent layers. The clusters between NSP5-NSP4 and VP6-VP4 suggest that these two pairs of proteins (in each cluster) conform continuous layers in the VP.

The relative spatial organization of VPs is maintained regardless their size
The scatterplot between the radius of the spatial distribution of NSP2 (independent variable, x-axis) and the radius of the distribution of other viral components (response variable, y-axis) showed a strong linear relationship ( Figure 3A). The distribution of NSP5 grows 0:87m for each 1m increase in the radius of NSP2 (slope interpretation), whereas the radius of the distribution of NSP4 increases 0:99m ( Figure 3B). These findings indicate that NSP5 is distributed in a proportionally smaller region than NSP2 regardless of the absolute size of the VP, supporting our observation that NSP5 is a constituent of the core of the VP. Moreover, the fact that the increase in the radius of the fitted distribution of NSP4 is directly proportional to the same parameter measured for NSP2 supports the idea that these proteins are both constituents of a putative second layer. VP1, VP2 and VP6 exhibit similar slopes which diverge between 0.03 and 0.05 mm ( Figure 3B and Appendix 1-table 6); thus, these results confirm that VP1, VP2 and VP6 are components of the same layer in the VPs which, from the data in Figure 3, is located just after the layer of NSP2 and NSP4. Finally, as noted in our quantitative analysis, VP4 and VP7 form consecutive external layers with a slope of 1.39 and 1.94 mm, respectively ( Figure 3B and Appendix 1-table 6). These findings indicate that the spatial distribution of the viral components in the VPs and in the surrounding areas is conserved regardless of their absolute size, and also form the basis of a predictive model, where, for a given radius of distribution of NSP2, it is possible to predict the radii of the remaining VP components (NSP5, NSP4, VP1, VP2, VP6) and of VP4 and VP7 proteins. This predictive model is available as a web app at https://yasel.shinyapps.io/Nanoscale_organization_of_rotavirus_replication_machineries/. The mathematical details and the residual analysis that validate these linear models are available in the Appendix 1, section 'Linear dependency between the viral components', Appendix 1-table 6 and Appendix 1-figure 9.
The structural organization of VPs is independent of the reference protein chosen for pairwise comparison In order to confirm the observed structural organization of VPs, we analyzed two more experimental conditions in which we chose a different reference protein for pairwise comparisons. The first was based on the distribution of NSP5 and its comparison with the relative localizations of VP6 and VP4, and the second considered NSP4 as the reference protein to compare with the distribution of VP6. We found that both analyses produced an identical structural organization for the VPs, with a comparative localization error of approximately 0:05m between models (close to the effective resolution limit of the 3B algorithm; see 'NSP5 and NSP4 as reference proteins' in Appendix 1). An extensive quantitative validation regarding the congruence between the NSP2, NSP5 and NSP4 models is available in the Appendix 1.
Based on our extensive quantitative, descriptive and inferential statistical analyses, we propose that the VP and the surrounding viral proteins form an ordered biological structure composed of at least five concentric layers organized as depicted in Figure 4. In this structure, NSP5 constitutes the innermost layer, followed by a {NSP2-NSP4} layer. Then, there is a layer composed by {VP1-VP2-VP6} and two consecutive external layers formed by VP4 and VP7. The different layers of proteins are most likely highly porous to allow the entry of positive-sense single-stranded viral RNA (+RNA) during genome replication and also of the antibodies used for VP staining.

Discussion
VPs have been previously studied using electron and fluorescence microscopy, however, due to the limited resolution of classic fluorescence microscopy techniques, and the difficulty of analysis of immunoelectron microscopy, the existence of any complex structural organization of the viral elements inside the VPs has not been reported. In recent years, the development of SRM has facilitated research into the nanoscale organization of a diverse range of cellular structures (Grant et al., 2018; Reznikov et al., 2018), however, until now SRM had not been applied to study the replication cycle of rotavirus. In this work, thanks to the use of the 3B SRM algorithm, we visualized and determined quantitatively the location of several viroplasmic proteins, leading us to propose a detailed model of the VP that should be of great value for understanding virus morphogenesis. Other SRM algorithms had been used to study the organization of viral and cellular structures showing concentric arrangements, as those proposed by Laine et al. (2015) and Manetsberger et al. (2015). The main similarities between those studies and our approach is the use of conics, such as circles or ellipses, to fitting structures showing concentric organization. The method provided by Manetsberger et al. (2015) could actually be implemented to analyze our data set, which as outcome will produce similar results. This method could also provide information about the degree of asymmetry within the VP, which may be valuable to establish functional relationships between the protein distribution belts that shape these intriguing structures. The selection of the 3B SRM algorithm over other super-resolution approaches was based on the fact that this method allows to deal with samples with high density of labeling, obtaining data with a reasonable resolution, although at the cost of higher computational effort.
The quantification of the viral protein distribution within the VPs was possible thanks to a novel segmentation algorithm (VPs-DLSFC) that was proven to be robust and efficient in noisy and partial  occlusion scenarios. The manual pre-segmentation step of this algorithm was necessary in our case because we did not want to introduce any bias in the isolation of the VPs through an automatic approach. Setting aside the manual pre-segmentation step, the VPs-DLSFC algorithm is automatic, deterministic, non-iterative and has a linear computational complexity.
Previous reports have suggested that VPs have a spherical-like structure (Eichwald et al., 2004;Cabral-Romero and Padilla-Noriega, 2006;Campagna et al., 2013); in this study we confirmed this suggestion by comparing the VPs-DLSFC approach with a similar approach based on an ellipse adjustment (Garcés et al., 2016). The results showed no significant statistical differences between these two models, and as consequence we can confidently model the structure of the VPs as a circumference. We also ratified that the center displacement of the circumferences that adjust two paired proteins are not statistically different.
Our study indicates that the viral components in the VPs, as well as VP4 and VP7, are arranged as largely discrete concentric layers (note that we are describing the structure of viroplasms, not of virus particles). This organization, however, does not preclude the interaction among the VP components proposed in this model as being located in separate layers since, for instance, NSP5 has been shown by different biochemical methods to interact with NSP2 (Eichwald et al., 2004;Poncet et al., 1997;Afrikanova et al., 1998;Jiang et al., 2006), VP1 (Afrikanova et al., 1998) andVP2 (Berois et al., 2003). In this regard, based on the super resolution microscopy images, it seems clear that there is some overlapping between different protein layers, as is the case for NSP2 and NSP5 in Figure 1E, but also between NSP2 and VP1, VP2, VP6 and NSP4 in Figure 1. These general overlapping zones between different proteins most likely are relevant for coordinating the genome replication and virion assembly, as suggested. Of interest, we observed the presence of protein projections ('spikelike') from different viral shells that could also contribute to the interaction of proteins mapped to different layers (Appendix 1-figure 8). Although our present analysis is limited to a general characterization of the spatial distribution of the viral proteins within VPs, and not to understand specific details about the interactions between proteins in different layers, it could be used as departure point to analyze these interactions. Taking as initial solution the result of the algorithm VPs-DLSFC and the SRM image, it is possible to employ other segmentation approaches, like deformable/active contours (snakes) (Kass et al., 1988), level-set (Osher and Fedkiw, 2003), or region growing methods (Mehnert and Jackway, 1997;Synthuja et al., 2012), to evolve the circular contour and fit precisely the spatial distribution of the viral proteins. Then, establishing a polar coordinate system in the VP, and considering the results of both segmentation algorithms, it would be possible to quantify the radial angle in which a spike from the central distribution of a viral protein that interacts with a different protein exists. It would be also possible to determine how strong these interactions are (intersection between two segmentation curves), and to study whether the spikes are randomly distributed between layers or a specific pattern in the connection between different protein layers exists. In the latter case, this could allow us to explore whether these patterns influence the assembly of the virus-like particles or only provide a skeleton that maintain the structure of the VP. The results obtained could also be used to study topological changes that the VP might experience at different times post infection, and associate these changes with maturation of the subviral particles. In this regard, in preliminary experiments carried out at three hpi, the viral proteins in he VP have been found to have a similar 'layered' organization as shown for the mature VP at six hpi (data not shown). This observation indicates that this organization is already present when the formation of viral particles has not yet taken place, suggesting that it might be important for the assembly of DLPs within VPs. In an additional application, SRM could also be used to observe the assembly of the virus particles and the interactions that may occur of these particles in different layers of the VP. Nevertheless, to develop this idea it would be important to establish an experimental protocol to observe the viral particles during the early stages of the assembly process, to distinguish simultaneously the layers of the viroplasm and the viral particles, and to collect the SRM images with a very short acquisition rate and a very high resolution (25-30 nm), which makes this experimental plan a challenge.
Previous studies based on conventional microscopy techniques have reported that NSP5 and NSP2 colocalize (González et al., 2000;Eichwald et al., 2004;Fabbretti et al., 1999); in contrast, we found that although NSP5 and NSP2 are located in close proximity, their positions in the VP were separable. This difference is attributable to the increased spatial resolution in the final image created by the super-resolution techniques employed in our study. Here, NSP5 was found to represent the innermost layer of the VPs, suggesting that this protein might serve as the core scaffold upon which the subsequent viroplasmic proteins are assembled to form the VPs. This finding contrasts with a report by Eichwald et al. (2004), who described that NSP5 locates to a region external to NSP2. In addition to the superior spatial resolution obtainable through 3B-SRM, compared to the traditional confocal microscopy employed in the previous report, the difference might be due to the fact that in our study we characterized the endogenous structures produced during virus replication, while Eichwald et al. characterized VP-like structures formed by transiently expressed proteins fused to GFP.
Immediately outside the NSP5 core, we observed a layer composed of NSP2 and NSP4 proteins. The finding that NSP4 is located in the inner part of VPs was unexpected, since it is known that NSP4 is an integral membrane protein of the ER and since it has been reported that functions as a receptor for the new DLPs located at the periphery of the VPs, during their budding towards the lumen of the ER (Chasey, 1980;Petrie et al., 1982;Petrie et al., 1984;Au et al., 1989). Furthermore, it has been shown that NSP4 associates with VP4 and VP7 to form a hetero-oligomeric complex that could be involved in the last steps of rotavirus morphogenesis (Maass and Atkinson, 1990). Based on these findings, NSP4 was expected to locate close to VP4 and VP7, in the surroundings of the VP. On the other hand, and in line with our observations, previous confocal microscopy studies have shown that a portion of NSP4 also shows a limited colocalization with NSP2 (González et al., 2000).
The dual location of NSP4 as an integral glycoprotein of the ER membrane and as internal to VPs, as our results indicate, is not easy to reconcile; however, in a previous work it was suggested that there are three pools of intracellular NSP4 molecules. The first pool is represented by NSP4 localized in the ER, a second minor pool localized in the ERGIC compartment, and the third pool distributed in cytoplasmic vesicular structures associated with the autosomal marker LC3 (Berkova et al., 2006). Furthermore, in that work the authors suggested that NSP4 and autophagic marker LC3-positive vesicles may serve as a lipid membrane scaffold for the formation of large VPs by recruiting early VPs or VP-like structures formed by NSP2 and NSP5 (Berkova et al., 2006). This observation is in line with our model that NSP4 lies in an internal protein shell within VPs.
An additional, and very interesting possibility to explain the internal location of NSP4 in VPs is the hypothesis that VP morphogenesis occurs on the surface of lipid droplets (LDs) (Cheung et al., 2010). In that work, it was proposed that LDs serve as a platform to which NSP2 and NSP5 proteins attach to form VP-like structures; NSP2 octamers, in turn, associate with the viral polymerase VP1 and rotavirus +RNAs. The assorted RNA complex containing NSP2, VP1, the capping enzyme VP3 and viral +ssRNA is predicted to nucleate VP2 core assembly. In this model, core assembly results in the displacement of +RNA-bound NSP2 octamers, while VP1 within new formed cores direct dsRNA synthesis, using +RNAs as templates (Cheung et al., 2010;Borodavka et al., 2017;Borodavka et al., 2018). These events are followed by incorporation of the middle virus capsid protein VP6 to form DLPs. At some stage, these assemblies become VPs containing cores and DLPs and may lose some or all of their lipids (Cheung et al., 2010). In this regard, it is important to have in mind that the currently accepted model for the LD biogenesis is that neutral lipids are synthesized between the leaflets of the ER membrane, and the mature LD is then thought to bud from the ER membrane to form an independent organelle that is contained within a limiting monolayer of phospholipids and associated LD proteins (Walther and Farese, 2012). Thus, during budding of the LDs from the ER membrane they could take along rotavirus NSP4 (topologically oriented towards the cell's cytoplasm) which could help as a scaffold on the surface of LDs for the assembly of other rotavirus viroplasmic proteins, localizing then to the interior of VPs.
Further support for our model of localization of at least one pool of NSP4 molecules inside of the VPs is the observation that knocking-down the expression of NSP4 by RNA interference significantly reduces the number and size of VPs present in the cell, as well as the production of DLPs (Ló pez et al., 2005a). That study also showed that during RNAi inhibition of NSP4 expression the NSP2 and NSP5 proteins maintained an intracellular distribution restricted to VPs, while the VP2, VP4, VP6 and VP7 proteins failed to locate to VPs. Based on these observations, it is tempting to suggest that, in addition to the role NSP4 has on the budding of DLPs into the ER lumen, it may also play an important role as a regulator of VP assembly.
After the NSP2/NSP4 layer, we observed a middle zone composed of the structural proteins VP1, VP2 and VP6. Their location in the same zone is expected given their close association in the assembled DLPs (Estes and Greenberg, 2013). Also, the fact that VP1, VP2 and VP6 form a complex with NSP2 that has replicase activity (Aponte et al., 1996), suggests that the production of new DLPs could take place in this zone of the VP.
Finally, we found that VP4 and VP7 conform independent layers just external to the viroplasmic proteins. The position of these two proteins agrees with the proposed model of rotavirus morphogenesis in which VP4 is assembled first on DLPs, and subsequently VP7 binds the particles and locks VP4 in place (Trask and Dormitzer, 2006). Furthermore, the fact that VP7-Mon and VP7-Tri occupied the same layer in our model indicates that in the ER sites into which the DLPs bud, VP7 is already organized as trimers, which are subsequently assembled into the virus particles. Of interest, VP4 has been reported to exist in two different forms in infected cells. One of them is associated with microtubules (Nejmeddine et al., 2000), while the other one has been reported to be found between the VP and the ER membrane (González et al., 2000). In this regard, based on our findings, we suggest that the latter form of VP4 can be actually considered as an integral component of the VP. Since several studies have found the presence of different cellular proteins and lipids in association to VPs (Maruri-Avidal et al., 2008;Cheung et al., 2010;Dhillon et al., 2018), it will be interesting to study the relative localization of this components using the methodologies described here.

Cell and virus
The rhesus monkey kidney epithelial cell line MA104 (ATCC) was grown in Dulbecco's Modified Eagle Medium-Reduced Serum (DMEM-RS) (Thermo-Scientific HyClone, Logan, UT) supplemented with 5% heat-inactivated fetal bovine serum (FBS) (Biowest, Kansas City, MO) at 37˚C in a 5% CO 2 atmosphere. The cells were confirmed to be free of mycoplasm by testing with the INTRON Mycoplasma PCR Detection Kit (#25234). Rhesus rotavirus (RRV) was obtained from H. B. Greenberg (Stanford University, Stanford, Calif.) and propagated in MA104 cells as described previously (Pando et al., 2002). Prior to infection, RRV was activated with trypsin (10 mg/ml; Gibco, Life Technologies, Carlsbad, CA) for 30 min at 37˚C.

Antibodies
Monoclonal antibodies (MAbs) to VP2(3A8), VP4 (2G4), VP6 (255/60), VP7 (60) and VP7 (159) were kindly provided by H. B. Greenberg (Stanford University, Stanford, CA) (Shaw et al., 1986;Greenberg et al., 1983). The rabbit polyclonal sera to NSP2, NSP4 and NSP5, and the mouse polyclonal serum to NSP2 were produced in our laboratory (González et al., 1998). The hyperimmune serum to NSP4 (C-239) was generated in our laboratory by immunizing New Zealand white rabbits with a recombinant protein expressed in E. coli with a histidine-tail, representing the carboxy-terminal end (amino acids 120 to 175) of the rhesus rotavirus RRV NSP4 protein; see also Maruri-Avidal et al. (2008), in which this serum was used. The hyperimmune serum to VP1 was also generated in our laboratory by immunizing BALB/c mice with a recombinant protein expressed in E. coli with a histidine-tail, representing amino acids 227 to 539 of the rhesus rotavirus RRV VP1 protein.

Immunofluorescence
MA104 cells grown on glass coverslips were infected with rotavirus RRV at a multiplicity of infection (MOI) of 1. Six hours post infection, the cells were fixed with and processed for immunofluorescence as described (Silva-Ayala et al., 2013). Finally, the coverslips were mounted onto the center of glass slides with storm solution (1.5% glucose oxidase þ100 mM b-mercaptoethanol) to induce the blinking of the fluorophores (Dempsey et al., 2011;Heilemann et al., 2009).

Transmission electron microscopy
Cells grown in 75-cm 2 flasks were infected with rotavirus RRV at an MOI of 3 as described above. Six hours postinfection the cells were fixed in 2.5% glutaraldehyde-0.1 M cacodylate (pH 7.2), postfixed with 1% osmium tetroxide, and embedded in Epon 812 resin. The ultrathin sections obtained were stained with 2% uranyl acetate-1% lead citrate (Reynolds mix). The grids were examined with a Zeiss EM-900 electron microscope at 80 kV.

Set up of the optical microscope
All super-resolution imaging measurements were performed on an Olympus IX-81 inverted microscope configured for total internal reflection fluorescence (TIRF) excitation (Olympus, cellTIRFTM Illuminator). The critical angle was set up such that the evanescence field had a penetration depth of 200 nm (Xcellence software v1.2, Olympus soft imaging solution GMBH). The samples were continuously illuminated using excitation sources depending on the fluorophore in use. Alexa Fluor 488 and Alexa Fluor 568 dyes were excited with a 488 nm or 568 nm diode-pumped solid-state laser, respectively. Beam selection and modulation of laser intensities were controlled via Xcellence software v.1.2. A full multiband laser cube set was used to discriminate the selected light sources (LF 405/488/561/635 A-OMF, Bright Line; Semrock). Fluorescence was collected using an Olympus UApo N 100x=1:49 numerical aperture, oil-immersion objective lens, with an extra 1.6x intermediate magnification lens. All movies were recorded onto a 128 Â 128-pixel region of an electron-multiplying charge coupled device (EMCCD) camera (iXon 897, Model No: DU-897E-CS0-#BV; Andor) at 100 nm per pixel, and within a 50 ms interval (300 images per fluorescent excitation).

Bayesian analysis of the blinking and bleaching
Sub-diffraction images were derived from the Bayesian analysis of the stochastic Blinking and Bleaching of Alexa Fluor 488 dye (Cox et al., 2011). For each super-resolution reconstruction, 300 images were acquired at 20 frames per second with an exposure time of 50 ms at full laser power, spreading the bleaching of the sample over the length of the entire acquisition time. The maximum laser power coming out of the optical fiber measured at the back focal plane of the objective lens, for the 488 nm laser line, was 23.1 mW. The image sequences were analyzed with the 3B algorithm considering a pixel size of 100 nm and a full width half maximum of the point spread function of 270 nm (for Alexa Fluor 488), measured experimentally with 0.17 mm fluorescent beads (PS-SpeckTM Microscope Point Source Kit, Molecular Probes, Inc). All other parameters were set up using default values. The 3B analysis was run over 200 iterations, as recommended by the authors in Cox et al. (2011), and the final super-resolution reconstruction was created at a pixel size of 10 nm with the ImageJ plugin for 3B analysis (Rosten et al., 2013), using parallel computing as described in Hernández et al. (2016). The resolution increase observed in our imaging set up by 3B analysis was up to five times below the Abbe's limit (~50 nm). The resolution provided by 3B was improved by computing the photo-physical properties of Alexa Fluor 488, and Alexa Fluor 568 dyes, which were provided to 3B algorithm, as an input parameter which encompass the probability transition matrix between fluorophore's states. The method was validated with 40 nm gattapaint nanorules (PAINT 40RG, gattaquant, Inc) labeled with ATTO 655/ATTO 542 dyes (see '3B Algorithm' in Appendix 1).
Verleyen for his support while using it. We are also thankful to Arturo Pimentel, André s Saralegui and Xochitl Alvarado from LNMA-UNAM for their helpful discussions. The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication. The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication. This appendix is divided in seven sections. In -Segmentation algorithm-, we discuss the mathematical details of the algorithm 'Viroplasm Direct Least Square Fitting Circumference' (VPs-DLSFC). In Section -Algorithm validation-, we compare the algorithm VPs-DLSFC with other two classic methods for the adjustment of circumferences; this analysis was done thanks to the use of approximately 40000 synthetic 'ground truth' images. In Section -Model Considerations-, we prove that the distribution of the studied viral elements follow a concentric circumference spatial distribution. Sections -Exploratory Analysis-and -Linear Regression Model-contains a supplementary exploratory analysis about the spatial distribution of the viral elements, and details about the results and the residual error analysis of the linear regression models, respectively. In Section -NSP5 and NSP4 as reference proteins-we developed a similar study that was made with NSP2, but considering NSP5 and NSP4 as reference proteins. Also, we shown that there is no statistically significative differences between the distributions of the viral elements when we changed the reference protein. Finally, in Section -3B Algorithm-we explain the fitting model for the 3B algorithm where the transition matrix is modeled by ordinary differential equations (ODE).

Segmentation algorithm
The use of primitive models for the segmentation of the SRM images has many benefits, like computational efficiency, simple programmation, and understandable information about the objects that were segmented. In this regard, we developed a simple method for fitting a circumference to scattered data, which we called 'Direct Least Squares Fitting Circumference' (DLSFC). This approach considers basic mathematical analysis tools for the computation of the extreme value of a continuous function with N variables.

Direct least squares fitting circumference (DLSFC)
The spatial distribution of a viral proteins into the VP can be seen as a set of points P ¼ ðx i ; y i Þji ¼ 1 : N È É 2 R 2 (scattered data in the plane). Taking into account the implicit equation of a circumference ðx À x c Þ 2 þ ðy À y c Þ 2 ¼ r 2 , where ðx c ; y c Þ is the center, and r is the radius. The problem of adjust a circumference to P, is a optimization problem given by: The center of mass of P is given by: Then, considerering Equation (2), the problem (Equation (1)) can be rewritten as: . . . ; NÞ, a ¼ r 2 , and ðu c ; v c Þ is the center of the circumference after the change of variable.
For simplicity, we will define as f uc;vc;a ðu i ; v i Þ :¼ ðu i À u c Þ 2 þ ðv i À v c Þ 2 À a the distance function, and by Fðu c ; v c ; aÞ :¼ P N i¼1 f uc;vc;a ðu i ; v i Þ 2 the objective function of our minimization problem.
Note 1 Note that, with the new convention of notation, the problem (Equation (3)) is equivalent to: min uc;vc;a Fðu c ; v c ; aÞ (4) A simple alternative to resolve Equation (4) is to consider the extreme values of Fðu c ; v c ; aÞ, this is: The partial derivates of the function Fðu c ; v c ; aÞ are: 1. For a: Taking into account the condition given in Equation (5) for qF qa , we obtain that: 2. For u c : For v c : It is the same process than u c , the final result is, Expanding the Equation (7) we obtain: and taking into account that (9) can be written as: and considering the extreme condition qF quc ¼ 0 (see Equation (5)) we obtain: Following the same process, but now considering the Equation (8), we obtain: The extreme values b u c and b v c of the function Fðu c ; v c ; aÞ are obtained as solution of the linear system formed by Equation (10) and Equation (11).
Note 2 The coordinate system translation to the mass center of the original data, allow the simplification of the equation (Equation (9)).
The radius of the circumference is obtained developing the Equation (6): and writing the equation in function of a, we obtain that: Þ is a critical point of the function Fðu c ; v c ; aÞ. Finally, we need to compute the Hessian matrix of Þ is a maximum, a minimum, or an inflection point: The principal minors of H are:

>0:
Note 3 The Cauchy-Schwarz inequality is equal to zero if and only if exist a real value r such that u i r þ v i ¼ 0, for all i ¼ 1; 2; . . . ; N, that is, all the points are in a straight line. In our problem, the proteins do not follow a straight line distribution in the space, then, we can guarantee the strict inequality. Note that H 2 ¼ 0 if and only if u i ¼ 0; for all i ¼ 1; 2; . . . ; N, which is a particular case of a straight line.
The principal minors of the matrix H are greater than zero, then, the point ð b aÞ is a local minimum, but this point is the only critical point of the function Fðu c ; v c ; aÞ, and as consequence it is a global minimum.
The center of the fitted circumference in the original coordinate system is Note 4 This method is based on a simple and very popular mathematical techniques (specifically Differential Calculus). Even when we do not found a published paper with this approach, we want to take cares and not adjudicate this technique to our work.
Note 5 Note that, by definition, the distance function f uc;vc;a ðu i ; v i Þ is not a geometric distance to the circle, which is probably more intuitive to use in this kind of problems. Let consider for a second the geometric distance of a set of points P to a circle, which is given by: where ðx c ; y c Þ is the center of the circumference and r is the radius. Considering the extreme condition for the function Gðx c ; y c ; rÞ, it is obtained that: where x and y are the mean values of the variables x and y respectively. Simultaneously equating these partials to zero does not produce closed form solutions for x c ; y c ; and r. With numerical methods it is possible to carry out the optimization of these parameters, but these alternatives are iterative, no-deterministic process, which also are highly sensitives to the initial values, while our proposal not have these kind of unwished characteristics. In any case, in Appendix 1 Section -Algorithm validation-we compared our approach with an algorithm based on geometric distance and explained in details the benefits of each one. The final segmentation algorithm is composed by three steps (Appendix 1-figure 1). The first is a manual pre-segmentation that guarantees the existence of one and only one viroplasm per image (Appendix 1- figure 1A). In the second step, we adjust a circumference to the reference protein (remember that these are paired experiments, where NSP2 is present in all the combinations as reference protein) through the algorithm DLSFC (Appendix 1figure 1B), and finally the radius of the accompanying protein is adjusted using the Equation (12). Note that, the center of the accompanying protein is the same that the center of the reference protein (concentric model). Out of the manual pre-segmentation step, this algorithm is automatic, deterministic, not iterative and with a linear computational complexity.
Step 2 Step 3 * * A DLSFC Pre-Segmentation Step Note 6 The hypothesis that the viral proteins in the viroplasm are distributed like as concentric circumferences was proved in Appendix 1 Section -Model Considerations-('Model Considerations').

Algorithm validation
The algorithm DLSFC was compared with other two approaches proposed by Gander et al. (1994). For a more comprehensive and clear comparison we named these others two methods as 'Algebraic Least Square Fitting Circle' (ALSFC) and 'Geometric Least Square Fitting Circle' (GLSFC).
The algorithm ALSFC considers the algebraic representation of a circle in the plane: . . . ; N the set of points that we want to adjust. The objective function for this minimization problem is P n i¼1 Fðp i Þ, which can be represented as matrix form: The final optimization problem is given by: where kuk ¼ 1 is a contrain to avoid the trivial solution u ¼ ð0; 0; 0; 0Þ. The represent the Euclidean norm. Finally, the problem (Equation (16)) is solved considering the right singular eigenvector associated with the smallest singular eigenvalue of É (Gander et al., 1994).
On the other hand, the algorithm GLSFC deals with the minimization of the geometric distance: Note that kðx c ; y c Þ À p i k À r ð Þ 2 is the geometric distance of the point p i to the circle ðx À x c Þ 2 þ ðy À y c Þ 2 ¼ r 2 . This problem is nonlinear, and in Gander et al. (1994), the authors consider to resolve it trought a Gauss-Newton algorithm. The Gauss-Newton method is an iterative algorithm that depends of an initial approximation, and in this case, Gander and collaborators consider the solution of the problem (Equation (16)) as the initial value for the iterative process. More details about the algorithms ALSFC and GLSFC can be consulted in Gander et al. (1994). In summary, the method ALSFC is very simple and computational efficient, but the algebraic distance might not reflect with accuracy the real distance between the points and the circumference and, as consequence, the parameters of the model might be biased. On the contrary, the algorithm GLSFC is highly precise, but it is an iterative process (demand more computation time), and depends of good initial values to achieve the convergence. The Matlab codes for the algorithms GLSFC and ALSFC were obtained from Brown (2007).
For the validation, we simulated the spatial distribution of viral elements as circumferences, for which we know their parametric form ('ground truth' dataset) (Appendix 1- figure 2A). The use of 'ground truth' allows us to quantify the error in the adjustment of the algorithms DLSFC, ALSFC and GLSFC at different noise levels (Appendix 1-figure 2B) and partial occlusion conditions (Appendix 1-figure 2C). We generated over 40 000 images (size 512 Â 512 pixels) taking into account different levels of additive white Gaussian noise (AWGN) in order to consider auto-fluorescence and the error in the localization of the fluorophores by the algorithm 3B-ODE (see Appendix 1 Section -3B Algorithm-), and partial occlusion angles. Several radii and position of the synthetic viroplasms were generated randomly through a uniform distribution function. The codes for this simulation are availables at https://github. com/Yasel88/Nanoscale_organization_of_rotavirus_replication_machineries (Garcés Suárez, 2019; copy archived at https://github.com/elifesciences-publications/Nanoscale_organization_ of_rotavirus_replication_machineries).

Noise generation
Let I a synthetic image and p i 2 R 2 ; i ¼ 1; 2; . . . ; N points of the 'ground truth' circunference C 2 I , the AWGN over p i ; 8i ¼ 1; N was generated as: where p c i is the corrupted point after the contamination with AWGN, s is a scalar that represents the level of noise, and X i is a trial of a random variable X~Nð0; 1Þ (standard gaussian distribution). In our experimental design we included 20 different levels of noise s ¼ 0 : 0:5 : 10 (increase by 0.5 from 0 to 10). Note that the noise was added to the points of the circumferences and not to the images. This is because we are working with SRM images, and as consequence does not exist background noise.

Partial occlusion generation
The partial occlusion can be seen as the existence of incomplete information of the objects of interest, for example, an image where we have only information about the half of the VPs. In our perspective, it is important that a segmentation algorithm have a good performance even in these situations.
Definition 1 We called an angle as a partial oclussion angle for a set of points if not exist j 2 f1; 2; . . . ; Ng; r 2 R and b 2 ½a; a þ such as: where a is an angle in ½0; 2p, and ðx; yÞ ¼ 1 y i is the center of mass of the points p i . Note 7 Note that, in the Definition 1 we considered the interval ½a; a þ as a partial oclusion angle , but it is possible to obtain the same oclussion angle with ½a; a À . For our validation we took a ¼ 0 for simplicity, but this has not consequences in our experiments, because we generate the points as a circumference.
The relationship between cartesian and polar coordinates is given by: where, (a, b) is the circumference center, r is the radius and b is the angular coordinate. Then, for example, to provoke a partial occlusion of , it is just needed to avoid the generation of points in the angle ½0; or ½2p; 2p À through the Equation (19). Appendix 1- figure 3 shows this strategy, in this example, a half of the information is removed from the 'ground truth' circumference. In general, the average of the points that will be conserved after the generation of partial occlusion is given by: In the validation of the algorithm DLSFC, we considered four different partial occlusion conditions: 1-Not partial occlusion: Conserves all the points of the 'ground truth' circumference; 2-25% of partial occlusion: Partial occlusion angle ¼ p=2; 3-50% of partial occlusion: Partial occlusion angle ¼ p; 4-75% of partial occlusion: Partial occlusion angle ¼ 3p=2.
In the experiments, we represent the noise through the mean and the standard deviation of the distance of the points p c i ; i ¼ 1; 2 . . . ; n to the 'ground truth' circumference C, this is: where d iC ¼ k À p c i k À r is the geometric distance between the point p c i and the circumference C, Á j j represent the absolute value, ¼ ðx c ; y c Þ is the center of the circumference and r is the radius. The mean and standard deviation are an useful alternative to understand the amount of dispersion generated by a noise s in a circumference C.
The Appendix 1- figure 4 shows the adjustment through the algorithms DLSFC, GLSFC and ALSFC of one 'ground truth' circumference (solid black line) that was corrupted by different levels of AWGN and partial occlusion angles. As was expected, the performance of the algorithms seems to get worse when increase the level of partial occlusion, but the three algorithms shows good results for high AWGN levels. This example suggests that the algorithm GLSFC (solid red line) has a better performance than DLSFC (solid blue line) and ALSFC (solid green line) when the partial occlusion is less than p (columns A-C), but when ¼ 3p=2, this method breaks down, and this observation is more relevant when increase the level of noise. The algorithm ALSFC shows relative good results for all the experimental conditions, but it is interesting to note that, in most cases, the adjustment is worse than in the DLSFC and GLSFC alternatives, probably as consequence of the algebraic distance (see Appendix 1 Section -Algorithm validation-). This example is just for a first visual analysis, but a complete study of the results obtained by the three algorithms over the 40 000 synthetic images will be presented below. A circumference can be represented as a vector C ¼ ðr; x c ; y c Þ 2 R 3 , where r is the radius, and ðx c ; y c Þ the center. LetĈ ¼ ðr;x c ;ŷ c Þ and C ¼ ðr; x c ; y c Þ the 'ground truth' and the adjusted circumference, respectively. The fit error committed by the algorithms can be quantified as: " ¼Ĉ À C Now, thanks to the use of 'ground truth' circumferences, it is possible to compare the obtained outcome with the ideal result. We consider that an adjustment is good enough if the estimation error of each circumference's component C is less than 0.1 m. This value is approximately two-fold smaller than the theoretical Abbe's limit ( » 250nm), and taking into account that the synthetic images resolution vary between 4.6 and 25.8 m in mean (see Appendix 1- figure 5), we are demanding that the algorithm has a nanoscopic precision (less than 100 nm) even when the images have a very bad resolution. Combining the threshold explained above with Equation (21), we obtain the final condition based on the euclidean norm between these two vectors: " ¼Ĉ À C ð0:1; 0:1; 0:1Þ k k ¼ 0:1732m: Appendix 1- figure 5. Error in the adjustment of the algorithms DLSFC, GLSFC and ALSFC. The error was quantified through Equation (21). In all panels, the horizontal black line represents the value y ¼ 0:1732m (see Equation (21) The bottom x-axis is the Mean distance of the corrupted points to the 'ground truth' circumference (see Equation (20) The Appendix 1- figure 5(a) shows the descriptive analysis (through boxplots) of the errors generated by the algorithms DLSFC, GLSFC and ALSFC over the 40000 synthetic images. In some experiments we obtained extremely good adjusts (" » 10 À15 m), but in general the 75% of the errors are in the interval 10 À3 ; 1 ½ m. For all occlusion angles, the algorithm GLSFC had a bad behaviour in a wide range of experiments with an error of » 10 10 m. This kind of situation was expected for big partial occlusion angles (for example ¼ 3p=2), but in conditions where does not exist partial occlusion ( ¼ 0), the bad adjustment of GSFC is evidence that this algorithm is not a good alternative in this kind of problems. Note that a similar situation happens with the algorithm ALSFC, but in this case the maximum error is » 10 3 m, which is high too, but significantly less than that obtained by GLSFC. The algorithm DLSFC does not show the previous problems; observe that the maximun error is under 1m for all experimental conditions, even when ¼ 3p=2. Now, if we forget the outliers and take into account only the 75% of the experiments (boxes), a direct relationship between the error of each algorithm and the partial occlusion angles (increase occlusion angle imply a major error) is observed. However, the raise of the mean error is small in relationship with the occlusion angle, which suggests that these three methods are robust to partial occlusion. The blue shadow band highlights the occlusion angle ¼ 3p=2 ð Þ for which the algorithms have a median error greater than 0:1732m. Therefore, it is necessary at least the 50% of the data to obtain a good adjustment (with an error under 0:1732m) in the 75% of the experiments.
The Appendix 1-figure 5(b) displays the mean error obtained by each algorithm in relationship with the resolution of the images (mean and the standard deviation of the points distance to the 'ground truth' circumference, Equation (20)). This graph shows again that the method GLSFC did not reach the convergence in some experiments (remember that GLSFC is based on an iterative optimization algorithm), even for small partial occlusion angles, but it is interesting to point out that in all these cases (at least for the occlusion angles ¼ 0; p 2 ; p È É ), these high errors ( » 10 10 m) seem to be related with bad initial values. Note that in these experiments the algorithm ALSFC do not has a good performance (remember that the result of ALSFC is the initial value for GLSFC), which is an evidence of the sensibility of the GLSFC to the initial values. For the occlusion angle ¼ 3p 2 , a lot of bad fits were obtained through the algorithm GLSFC. In our opinion, even when the algorithm GLSFC can reach an extremely good fit, its sensitivity to initial values, the computational cost (iterative process), and the possibility of not convergence, are strongs reasons to consider that this alternative is not viable to use in this kind of problems.
In the Appendix 1-figure 5(c), the mean error of the algorithms DLSFC and ALSFC was splitted to see more clearly the performance of each one. For small partial occlusion angles ¼ 0; p 2 È É À Á the algorithm DLSFC has errors 0:1732m, while ALSFC, in high noise scenes, get a poor adjustment (black arrows). When ¼ p, the method ALSFC has errors ! 0:1732m in many cases, that increase when the level of noise grows. Both algorithms generate an error ! 0:1732m when ¼ 3p 2 , but in the case of DLSFC, in some experiments had errors <0:1732m, even for high noise levels. The Appendix 1- figure 5(d) show only the performance of the algorithm DLSFC. As noted, the method DLSFC shows a behaviour extremely robust and stable to noise and partial acclusion; note that for ¼ 0; p 2 ; p È É does not exist an increment in the mean error when the noise level is incremented. In extreme experimental conditions, when only the 25% of the data remain in the image ¼ 3p 2 À Á , DLSFC has a mean error between ½0:3; 0:5m with a 95% confidence interval equal to ½0:1; 0:9m.
So far we have shown the behaviour of the three algorithms considering several noise levels and partial occlusion angles conditions. However, it is interesting to know through a hypothesis test (inferential statistics) the performance of DLSFC and ALSFC taking into account the experiments we developed (samples). Note that we are not going to study the algorithm GLSFC, because, as we mentioned above, its sensitivity to initial values and its computational cost (iterative process) are strong reasons to consider that this algorithm is not viable to use in this kind of problems. Taking into account the threshold in Equation (22), for each experiment we have two possible outcomes: From this point of view, each experiment follows a Bernoulli's distribution, and then, all the experiments for the same experimental condition (partial occlusion angle) follows a Binomial distribution. The generated samples are independent (random generation), the decision rule (Equation (23)) is dichotomous, and our experimental design is a fair representation of the population (we considered many experimental conditions in more than 40000 images); then, it is possible to develop a 'Binomial Test' (Prybutok, 1989;Howell, 1982) to evaluate the population proportion of success, which we will denote by p. The null (H0) and alternative (H1) hypothesis for this test are: The null hypothesis is that the population proportion with an error less that 0.1732 is 0.7, and the alternative hypothesis is that the proportion is more that 0.7. The significance level for these tests is fixed at a ¼ 0:05. For more information about this test and its implementation review (R Development Core Team, 2017; Prybutok, 1989;Howell, 1982) Online documentation is available in url https://stat.ethz.ch/R-manual/R-devel/library/stats/html/ 00Index.html.
The Appendix 1-table 1 shows the results of the 'Binomial Test' for the algorithms DLSFC and ALSFC. For small occlusion angles ¼ 0; p 2 È É , the results of both methods are very similar and indicate a high performance. Note that in these cases the null hypothesis was rejected with a probability error of type I of » 10 À16 a, and the 95% confidence intervals are equal to ½0:99; 1, which means that, at a population level, we hope that at least the 99% of the adjustments had an error 0:1732m. For ¼ p, the algorithm DLSFC has a probability of success greater than 0.7 with a p-value of » 10 À16 a. On the contrary, for the same oclussion angle, the test revealed that the null hypothesis can't be rejected in the case of the algorithm ALSFC. Finally, as was analyzed above, the performance of the algorithms decrease significally for ¼ 3p 2 , and we can't reject the null hypothesis, in others words, we don't have enough evidence to assume that in this case the probability of success is greater than 0.7.
Appendix 1-table 1. Results of the Binomial Test. Column 1: Names of the algorithms. Column 2: 'P. Success' is the Probability of Success; 'P. Value' denote the probability of the error type 1, that is, reject the null hypothesis when it is true; 'C. Interval' is the 95% confidence interval. Column 3-6: Partial Occlusion Angles. The blue shadowed area in the table cell indicates the conditions in which it is not possible to reject the null hypothesis. The study carried up in this section shows that the algorithm GLSFC has a good performance and is capable to obtain a low error in the adjustment, but this alternative is sensitive to the initial values (Appendix 1- figure 5 (a,b)), and as any iterative process, consumes more computational resources. The method ALSFC proved to have a good behavior in many experimental conditions, although in diverse occasions a bad fit was obtained. Also, the behaviour of this algorithm was always inferior than that of DLSFC (Appendix 1-figure 5(C) and Appendix 1-table 1). Finally, the algorithm DLSFC had a great performance and stability in all the experimental conditions, even in extreme partial occlusion angles (Appendix 1- figure 5 and Appendix 1-table 1). This method proved to be robust in high noise conditions and it is a deterministic and not an iterative algorithm. For all these reasons we consider that the algorithm DLSFC is the best choice in comparison with GLSFC and ALSFC, to fitting the spatial distribution of the viral elements.

Model considerations
To proof the hypothesis that the viral elements of the VPs can be approximated through circumferences, we carried up a series of experiments based on the comparison between the circumference obtained by the algorithm DLSFC, and a least squared ellipse resulting of the 'Direct Least Square Fitting Ellipse' (DLSFE) algorithm (Fitzgibbon et al., 1999). The algorithm DLSFE has been used previously to adjust the viral replication centers of adenoviruses in fluorescence images (limited by diffraction) (Garcés et al., 2016), and has also been shown to be robust to noise, computationally efficient, and easy to implement (Fitzgibbon et al., 1999).
Let us denote by ðx i ; y i Þ; i ¼ 1; 2; . . . ; N the set of N points relative to the protein P, and by C P and E P the adjusted implicit form of the conics obtained by the algorithms DLSFC and DLSFE respectively. The variable C P ¼ ðr; x c ; y c Þ is a vector where r is the radius, and ðx c ; y c Þ is the center of the circumference. On the other hand, the implicit form of the ellipse can be represented as E P ¼ a; b; x E c ; y E c ; À Á , where fa; bg are the semi-major and semi-minor axis respectively, x E c ; y E c À Á is the center of the ellipse, and is the rotation angle. Under the assumption that the protein P can be approximate by a circumference, we expect that does not exist significative statistical differences between the radius of the circumference ðrÞ and each one of the ellipse semi-axis ða; bÞ.
Note 8 Note that this approach is independent of which protein we choose to test the circularity hypothesis.
For each protein combination, we accomplish a Shapiro-Wilk hypothesis test to study if any of the three distributions functions (circumference radius (r), semi-major axis (a) and semi-minor axis (b)) came from a normally distributed population (Shapiro and Wilk, 1965). The results revealed that, our data does not seem to follow a Gaussian distribution, and as consequence, it is impossible to use a parametric test (for example t-student). Based on our data conditions, we considered the Mann-Whitney test (Mann and Whitney, 1947;Hollander et al., 2013) to evaluate the differences between the radii of the circumferences and each of the ellipses semi-axis. This is a nonparametric test that can be applied when the observations are independent (our variables are independent because the radius and the semi-axes were obtained by different algorithms), the variables are ordinal (also our variables meet this requirement because they are numeric), and finally, we want to test if the radius of the circumferences and each one of the ellipses semi-axis, are equal (null hypothesis) or are not (alternative hypothesis).
The statistical analysis reveal that does not exist a significative statistical differences between R C and a (see Appendix 1-table 2) with a significance level of a ¼ 0:05. The same results were obtained for R C and b (see Appendix 1-table 3), and as consequence not matter if we use circumferences or ellipses to fitt the spatial distribution of the viral proteins.
On the other hand, we carried up a study to validate the viral elements concentricity hypothesis based on the distance between the centers of the adjusted circumferences of both proteins. For all the proteins, the median value is very close or smaller than 0:05m (Appendix 1- figure 6 and Appendix 1-table 4 (Column 2)), which is in accordance with the resolution limit of the algorithm 3B-ODE (0:04 À 0:05m) (Cox et al., 2011). Also, the 95% confidence interval (Appendix 1-table 4, Column 3) shows that the distance between the centers in each case is, 95% of the times in the order of 10 À2 m, which we consider a small error taking into account the limitations of the algorithm 3B-ODE.

(µm)
Appendix 1- figure 6. Boxplot of the differences in location between the center of the circumference that adjust NSP2 and the centers of the others nine viral elements. The red line y ¼ 0:052m represents approximately the maximun median value of distance between NSP2 and each of the others viral elements, while the second red line (y ¼ 0:1; 0:1Þ ð ¼ 0:141m) is the maximun error that we considered in the algorithm validation section (see Appendix 1 Section -Algorithm validation-). Each box contains the 50% of the observations, the bottom and the top of the box are the first and third quartiles (25% and 75% of the observations respectively). The line inside the box is the median (second quartiles (50% of the observations)). The upper whisker extends from the hinge to the largest value no further than 1:5 Â IQR from the hinge (where IQR is the interquartile range, or distance between the first and third quartiles). The lower whisker extends from the hinge to the smallest value at most 1:5 Â IQR of the hinge. The notches around the median extend to AEð1:57 Â IQR=nÞ. This gives a roughly 95% confidence interval for comparing medians (McGill et al., 1978). Note 9 The Mann-Whitney test was used to study the differences in population medians (difference in location) (Mann and Whitney, 1947).

Supplementary exploratory analysis
The algorithm VPs-DLSFC allows the quantification of the radius of two proteins in the same VP, and also the relative distance between them. At a first stage, we analyzed approximately 590 images (see histogram in Appendix 1-figure 7A) that were obtained by the presegmentation step of the algorithm VPs-DLSFC (Appendix 1-figure 1). In agreement with previous reports (Eichwald et al., 2012;Eichwald et al., 2004;Carreño-Torres et al., 2010), we found a wide heterogeneity in the size of VPs measured by NSP2, which exhibited radii ranging from 0.2 to 0.75 mm. In order to compare the distribution of each protein taking NSP2 as reference, we selected a subset of the individual VPs, in a way that there were no significant statistical difference between the radii of NSP2 from each condition. Histogram with the numbers of VPs that were obtained as part of the pre-segmentation step of the algorithm described in Appendix 1- figure 1A. Note that because each image contains one and only one VP, this histogram represents also the numbers of pre-segmented images that we had in our study. (B) Graphical representation of the Mann-Whitney hypothesis test in two-by-two comparison between the radii distributions of NSP2 in each experiment (Mann and Whitney, 1947). To avoid confusions, for each box, under the NSP2 name in x-axis, we included the name of the accompanying protein that specifies in which experiment we obtain that distribution of NSP2. Each combination is linked by a line, and the result of the test is up of the line. The red dashed square highlights the only two combinations (VP1 and VP2) in which the distribution of NSP2 had a significant statistical difference with the other NSP2 distributions. (C) Two-sample Mann-Whitney hypothesis test, considering as variables the radius of NSP2 in contrast to the radius of the others seven proteins. In each combination, we show the difference in location between NSP2 and the other protein (x-axis) and the confidence interval at a level of 95%. The numbers over the intervals confidence are the p-value of the statistical test. In accordance with this criterion, we were able to adjust the size of NSP2, such that the mean in the distribution of the radius was similar when this protein was paired with NSP5, NSP4, VP6, VP4 and VP7 (Appendix 1-figure 7B). Unfortunately, in the case of NSP2~VP1 and NSP2~VP2, even though we could selected a group of VPs in which the mean radius of NSP2 (0.35 mm) was closer to the size of the NSP2 from the other conditions (0:4m), the analysis of the distributions in the radius of NSP2 by the Mann-Whitney hypothesis test (Appendix 1-figure 7B), revealed that the VPs of VP1 and VP2 were significative smaller than the mean size of viroplasms of the others proteins. As consequence, was not possible to compare directly the position of VP1 and VP2 with the others viral elements through an exploratory analysis. However, this problem did not limit the inferential study that we developed.
The use of super-resolution microscopy and the algorithm VPs-DLSFC allows the detection of small differences (resolution of nanometers) in the relative position of the viral elements. To validate that the different viral elements were in fact located in a different region from NSP2, we developed a Mann-Whitney test to study the existence of significative differences between the radii of NSP2 and the radii of each of the other viral elements. According with our previous results (see main text), this test revealed that only NSP4 has a similar spatial distribution that of NSP2 (Appendix 1- figure 7C and Appendix 1-table 5), while the others viral elements have different radii. Moreover, the small negative differences in location of NSP5 turned out to be significative different with respect to NSP2 (the p-values of the test are available in the (Appendix 1- figure 7C and Appendix 1-table 5). These results support that NSP5 is located in the innermost part of the viroplasm and that NSP4 shares the same layer with NSP2. The structural proteins VP1, VP2 and VP6 seem to be in the same layer of the viroplasm, with similar values, but with a significative statistical difference in location with NSP2 at a level a ¼ 0:01. Also, this test suggests that VP4 and VP7 form independent layers in the most external zone of the VPs (Appendix 1- figure 7C and Appendix 1-table 5), which is in accordance with our proposed model. Appendix 1-table 5. Two-sample Mann-Whitney hypothesis test, considering as variables the radii of NSP2 in contrast with the radii of the others seven viral proteins. Under the null hypothesis, both samples come from the same distribution (H0: true location is equal to zero), while the alternative hypothesis (H1: true location is not equal to zero) establishes that exist a difference between the median of the distributions. Protein: Name of the viral element compared with NSP2; W: value of the Mann-Whitney statistical hypothesis test; Difference: Estimation of the location parameter (median difference between the radii of NSP2 and the radii of the other viral element); C. Interval: 95% confidence interval; P. Value: p-value of the test. NSP2-VP7-Mon NSP2-VP7-Tri Appendix 1- figure 8. Examples of small zones of colocalization between differents viral proteins. The interations between the viral elements could be explained throught the spikes that come from the central distribution of the viral elements which colocalize with other proteins. A more specific study it is necessary in order to explain these interactions.

Linear dependency between the viral components
The spatial distribution of the viral elements in the VPs is conserved regardless of their size; that is, the radius of each protein shows a linear dependency with the radius of NSP2.
Let P i ¼ ðx i ; y i Þ 2 R 2 ; i ¼ 1; 2; . . . ; N a set of points, where x i and y i are the radii of NSP2 and the acompayning viral element Y, respectively. The adjustment of the linear model can be solved through the following optimization problem: The model (Equation (25)) does not include an offset term because we don't have any quantification about the radius of the accompanying viral element when NSP2 have been silenced.
The solution of this problem is given by Kiefer (1987): where corðx; yÞ is the linear correlation between x and y, covðx; yÞ is the covariance between x and y, Var(x) is the variance of x, and x is the mean value of the variable X.
The 'Residual Standard Error' (RSE or s 2 ð Þ) is a measure of fit of the linear regression model.
in the estimation of the slope b b is: Consider the statistic t b ¼ b bÀb s b b and the null (H0) and alternative hypothesis (H1): Under the null hypothesis, follow a t-Student distribution with N-1 degrees of freedom. The p-value of this tests is computed as: where PðT t 0 Þ is the probability that a t-Student variable with N-1 degrees of freedom be less than t 0 . Note that through this hypothesis test it is possible to evaluate if the slope b is significative to explain the linear relationship between the variables. The variance of the dependent variable y is: In accordance with the Equation (31), the variance rate that could be explained by the regression model is the regression variance divided by the full variance, this is: Let É be the lineal regression model that adjusts the data x i ; y i ð Þ, and be R 2 the associated r-square value, then, 100 Â R 2 is the percent of the data variance that it was explained by the model. For strong linear dependencies, the expected R 2 value should be close to 1. For more information about linear regression models consult (Kiefer, 1987).
The results of the regression models are available in Appendix 1-table 6. In all regression models, the standard errors are in the order of 10 À2 m, with a p À value<10 À33 which is a strong evidence of the linear relationship that exist between NSP2 with the other seven viral proteins. Also, the RSE have values in the order of 10 À2 m, except for VP7 which have a RSE of 0:15m, which could suggest that exist a greater dispersion of VP7 in comparison with the others viral proteins, something expected since, according to our model, VP7 is the most external protein.
Appendix 1-table 6. Results and validation of the linear regression models. NSP2 was considered as the independent variable and the other viral elements as the dependent variable (see Equation (25)). The slope and standard error for each regression model (Column 2 and 3) were computed in accordance with the Equation (26) and (28) respectively. The t-value of the t-Student distribution function with N-1 degrees of freedom are in Column 4 (see Appendix 1 Section -Linear Regression Model-for details). The p-value for each linear regression model are shows in Column five and were measure as was described in (Equation (29)) and (Equation (30)). Finally, the RSE and the R 2 summarize the adjustment of the data through the proposed lineal models. As was advised, the RSE (Column 6) is a fit measure of the linear model to the data (see Equation (27)). The '% Error' is the average error in the prediction and it is computed as: 100 Â RSE=b, while the 100 Â R 2 is the percent of the data variance that it is explained by the model (see Equation (32)).

Model
Slope ( The average errors are under 8% in all models (Appendix 1-table 6), which confirm the capacity of our linear models to predict with high accuracy the radius of the described viral elements. On the other hand, the percentage variance rate that could be explained by the regression models (100 Â R 2 , see Equation (32)), was greater than the 96% in all cases (Appendix 1-table 6), demonstrating again the high linear dependency between NSP2 and the others viral elements into the viroplasm.   (31)) by the models, and the dots filled with a color gradient are the real values y i . The errors between the predicted and real values, ¼ jb y i À y i j, are represented as a gradient of colors as follows (from lowest/coldest to highest/warmest). For each model, the RSE and R 2 (Rsquared) were included, note that this statistics are the same that were presented in

NSP5 and NSP4 as reference proteins
As was mentioned in the main text, we carried out a similar study using NSP5 and NSP4 as reference proteins. Since these experiments were developed to validate our previous results with NSP2, they were not as deep and extensive as in the case of the NSP2 model.
The number of VPs studied was 38 and 23 for NSP5~VP6 and NSP5~VP4 respectively (Appendix 1- figure 10A.1). The distribution of NSP5 relative to VP6 and VP4 did not show significative statistical differences according to the Mann-Whitney test, and the radius of VP6 was smaller than VP4, which is in accordance with our model of the viroplasm (Appendix 1- figure 10A). Furthermore, the distance of VP6 and VP4 towards NSP5, reflects that VP6 is closer to NSP5 than VP4, with a significance level 0:001 (Appendix 1- figure 10B). On the other hand, we found that exists a linear relationship between the radii of NSP5 and {VP6, VP4} (Appendix 1-figure 10 C-E). The linear models of both, VP6 and VP4, explain approximately 97% of the data variability and exhibit a RSE in the order of 10 À2 m (see Appendix 1-figure 10 (D,E) and Appendix 1-table 7). Additionally, we performed a Mann-Whitney test to study the differences of the radii of NSP5 and each of the accompanying proteins (VP6 and VP4). This test demonstrated that the differences in location between NSP5 and VP6 is » 0:1m, while in the case of VP4 is » 0:23m, with a p-value in the order of 10 À7 and 10 À8 respectively (see Appendix 1-table 8), indicating that although these protein are near to each other, they occupy different layers on the VPs.  In the case of the model based on NSP4, we studied 60 viroplasms through the algorithm VPs-DLSFC. The results show that the radius of VP6 were bigger than the radius of NSP4 in approximately 0.04 mm (Appendix 1-figure 11 A-B). This difference is statistically significative (p À value 0:05) according with the Mann-Whitney test (Appendix 1- figure  11A). Like the models based on NSP2 and NSP5, we found a linear relationship between NSP4 and VP6 (Appendix 1- figure 11C). This linear model was validated through a residual analysis (Appendix 1- figure 11D and Appendix 1-table 9), obtaining a RSE ¼ 0:079m, and a R 2 ¼ 0:97.  figure 9; the details about this kind of representation can be consulted in that figure.

Consistency between models
As was described, our study involves three differents reference proteins (NSP2, NSP5 and NSP4). Since the three models should respond to an unique spatial organization of the viral elements into the VP, it is expected to obtain similar results independenly of the selected model. The equivalence of the differents models (models based on NSP2 and NSP5) can be proven through a comparison of the distances between NSP5 and VP6. If the two models describe the same distribution of the VPs, we expect that the distance of NSP5 to VP6 computed by the NSP5 model be similar to the the distance between NSP5 and NSP2, plus the distance of NSP2 to VP6 that were computed by the model based on NSP2. Hence: The subindex ('name') in d name ðÁÞ specifies the model employed to compute the distance. Therefore, replacing the corresponding values (available in Appendix 1- Since the level of precision in our radii estimation of the viral components are limited by the resolution of the algorithm 3B (0:04 À 0:05m), we set this parameter as a threshold for the errors when we compared the results obtained for each model. For example, the error between the models based on NSP2 and NSP5 for VP6 is » 0:016m (see Equation (34)), which is under the resolution limit that can be reached by the algorithm 3B, and as consequence the error is asociated to the limitations of the optic and not to the reference protein that we selected for the estimation of the VP6's radius.
This same approach was applied for VP4. In this case, and considering again the results in Appendix 1- which again is an error under the resolution limit of the alorithm 3B.
On the other hand, taking into account the models based on NSP2 and NSP4, and the results in Appendix 1-table 5 and Appendix 1- figure 11A, we obtain that: ¼ 0:003348142 þ 0:063282322 ¼ 0:06663046 ) " ¼ d NSP4 ðNSP4; VP6Þ À 0:06663046 j j ¼ 0:04070526 À 0:06663046 j j ¼ 0:0259252m: As before, the error is under the limit of resolution of the algorithm 3B. Altogether, the comparison of the three models shows up that different experimental approaches describe the same spatial distributions of the viral elements inside the viroplasm.
The previous analysis only considers the consistency of the models based on the difference in the radius of the viral elements, but besides this, we developed a similar study using the lineal regression models.
Again, consider the model NSP5 and NSP2 as predictors of VP6: where b VP6 NSP2 and b VP6 NSP5 are the slope associated with VP6 in the model based on NSP2 and NSP5 respectively. Also, the radius of the NSP5 can be estimated with the model based on NSP2, this is: Substituting Equation (39) into Equation (38) we obtain: and taking into acount Equation (37), we can assert that: Note 10 The relative error was computed under the assumption that the slope of the model based on NSP2 is the ideal value. This consideration was based in the fact that we have more evidence of a linear relationship in the NSP2 model (see the R 2 and RSE in Appendix 1-table 6) that in the models based on NSP5 and NSP4 (Appendix 1-table 7 and Appendix 1-table 9 respectively).
Appendix 1-table 9. Linear regression results with NSP4 as independent variable. For more information about the variables involved (columns) consult Appendix 1-table 6 or Appendix 1 Section -Linear Regression Model.

Model
Slope For the prediction of VP6 through the models NSP2 and NSP5, the relative error is (Equation (43) The relative error asociated with the prediction of the radius of VP4 through the models NSP2 and NSP5 is: Finally, in the case of VP6 considering the models NSP2 and NSP4 the relative error is: Since the relative errors represent the average error of the slopes computed with the models NSP5 and NSP4 in relation with the NSP2 model. These results indicate that the model based on NSP5 has an error of 6.5% for the prediction of the radius of VP6, while for VP4 the error is around 1.3% compared to the prediction of the model based on NSP2. On the other hand, if we use the model based on NSP4, we get approximately a 7.3% of differences in the radius of VP6. As in the previous analysis, these errors are small too, and could be associated with experimental variations and with the resolution limit of the algorithm 3B.
Even when the relative error provide important information about the differences in the prediction between differents models, we are not taking into account the standard error associated with the slopes of each model. In the case of the prediction of VP6 through the models NSP2 and NSP5 (all others cases have the same deduction), if we consider b VP6 NSP5 and b NSP5 NSP2 as random variables with a standard error s b VP6 NSP5 and s b NSP5 NSP2 respectively, the variance of the variable b VP6 NSP5 Â b NSP5 NSP2 could be computed through the 'Delta Method' (Oehlert, 1992;Ver Hoef, 2012). The 'Delta Method' makes possible to obtain an approximation of the variance of a function f ðX 1 ; X 2 ; . . . ; X n Þ as: were CovðX i ; X j Þ denote the covariance between X i and X j (Lawrence, 1953).

Note 11
The standard error is defined as: but, in our case n ¼ 1 because we only have the final slope estimation and the associated standard error, hence the standard error is equal to the standard deviation (Altman and Bland, 2005).
In this case the slope b VP6 NSP2 ¼ 1:1806335 6 2 ½1:036190; 1:151645, which is surprising taking into account that the relative error of NSP4 as predictor of VP6 instead of the model based on NSP2 is around the 7.3% (see Equation (46)). Even when this percent doesn't represent a great difference, it could be enough to generate small variations that cause that the slope b VP6 NSP2 be out of the confidence interval given in Equation (56). Moreover, note that the difference between the upper endpoint of the confidence interval of the slope between NSP2 and VP6 is approximately 0.03 m, which is under the resolution limit of the 3B algorithm. A deeper study with different antibodies for NSP4 and the contrast of NSP4 with others viral elements would help to clarify our results.
Stochastic model fitted for 3B super resolution microscopy.
The algorithm 3B (Cox et al., 2011) creates a super resolution image from experimental data (hundreds of images acquired by the microscope). The 3B algorithm generates a probability map of fluorophore location through Markov Chains and Bayesian inference. The implementation of the algorithm requires a priori information on the dynamics of the photophysical properties of the fluorophores under study; however, the software does not provide a method to calculate it. In this section we propose a method to obtain this information from the experimental data.
3B microscopy generates a probability map of fluorophore localization by deriving a weighted average over all possible models; the set of models includes varying numbers of emitters, emitter localizations and temporal dynamics of the states of the fluorophores. The information stored in the image sequence is iteratively compared with the models using Bayesian inference, producing a SRM image. This technique allows the reduction of the number of necessary images, so it reduces the photo damage in the sample. However, the necessary computation time is significant (Hu et al., 2013); therefore, we employed 3B parallelization techniques for a cluster and PC from Hernández et al. (2016) in order to decrease the time to enhance the SR images.
The dynamics of the photophysical properties of a fluorophore is modeled in 3B-algorithm as a Markov chain, where, for each time, the fluorophore can be in three posible states: the emitting state S 0 , the non-emitting state S 1 , and the bleached state S 2 (Appendix 1- figure  12). Hence, in the complete sequence of images a fluorophore can be transiting between these states with a probability P i . fluorophore for the 3B-algorithm. From left to right are represented the emitting state, nonemitting state and bleached state for a fluorophore, with their respective transition probabilities among the states. As an example, a fluorophore can transit from the emission state to the non-emission state or remain in this state with a probability P 2 and P 1 , respectively. DOI: https://doi.org/10. 7554/eLife.42906.029 In this way, the temporary stack of images acquired by the microscope contains information on the dynamics of an unknown number of fluorophores transiting between these three states, with a probability P i . This information is represented as a transition matrix: where the entry in the ith row and jth column is the probability that a fluorophore in the state S i moves to the state S j .
The initial probability distribution describes the probability of a fluorophore to be in each energy state in the beginning of the process: f 0 ðiÞ corresponds to the state S i .
The 3B algorithm considers the Markov chain determined by the transition matrix (Equation (57)) and the initial probability distribution (Equation (58)). Then, (59) are the probabilities that a fluorophore is in the states S 0 ; S 1 and S 2 in time 1; . . . ; n; that is, for all i ¼ 1; . . . ; n, f i is the 3-vector whose entries are the probabilities that the fluorophore is in each energy state at time i.
The 3B algorithm has the transition matrix (Equation (57)) as an input, together with the initial probability distribution (Equation (58)). However, the software does not provide a method to calculate the transition matrix from the experimental data.
We deal with this issue by proposing a method to determine P from the data. Namely, we propose to adjust the experimental data (the stack of images) with an ordinary differential equation (ODE) model that allows us to calculate the probabilities of the system.
Following the Jablonski diagram for a given fluorophore, we propose a model with three states (S 0 ; S 1 and S 2 ) and three transition probabilities (k isc ; k T and k b ) between them (Appendix 1- figure 13). (Appendix 1-figure 12). Right panel: Diagram of probabilities for the transitions between the states of a fluorophore for our fluorophore's model. In our model, the basal (E 1 ) and the excited (E 2 ) states, from the reduced Jablonski diagram, are collapsed in a new excited state S 0 (green). The justification for this lies in the fact that the fluorescence phenomenon (k ex ; k em ) occurs on the scale of nanoseconds or less; however, an image collected with an EM-CCD camera regularly has millisecond exposure times, therefore, it involves the integration of 105 À 106 cycles of photon emission; as a consequence, the E 1 state is never detected. The entrance into the triplet excited state (k isc ; k T ) happens on the scale of seconds; if the emission process occurs, it releases photons of lower energy that are not detected and, therefore, the triplet state is considered as a dark state S 1 (violet). Finally, the photobleaching state S 2 (gray) is a irreversible process (k b Þ culminating with the destruction of the coordinating center of resonant electrons (orbitals p) which is responsible for absorbing photons. For i ¼ 0; 1; 2, we define the dynamics of the energy state S i as the time evolution of the concentration ½E i of molecules (fluorophores) emitting photons: The sequence of images taken through the time keeps track of the state the fluorophore is in. If in the ith image the fluorophore has intensity (the value of the pixel approaches the maximum value), then the fluorophore is in its emitting state; in any other case, the fluorophore is off, and it can be in two states: the non-emitting state or the bleached state. We normalize the data and compute the mean intensity at time t of the fluorophores in the emitting state: by definition, the initial mean value is one; after that, it is going to decay depending on the acquiring protocol.
Following the classical regression methods, we fit the experimental data to the proposed differential equations system. The estimate of the rate constant k isc , k T and k b from the stack of images is obtained following the example taken from Rawlings and Ekerdt (2013).
Let us consider the matrix P from Equation (57) as the transition matrix of our model. Then, by the definition of our model, P 2 ¼ k isc , P 3 ¼ k T and P 5 ¼ k b . Also, by the general properties of the probability function, the sum of the entries of each row of this matrix is one; so we obtain the transition matrix as follows: As we mentioned earlier, the estimate of the transition matrix depends on the acquisition protocol. There are image sequences for which the fitting does not converge; this happens when the average intensity decays exponentially. However, when the average intensity decreases linearly, the adjustment of the data to the converging model results in a transition matrix that fits the data.
After the fitting to the experimental data we simulate a Markov chain for the calculated transition matrix (Appendix 1-figure 14). We also observe that greater number of fluorophores leads to better concordance between the simulation and the experimental data. To corroborate that the proposed fitting is adjusted properly we use the experimental data provided by the authors of the 3B SRM microscopy algorithm. The podosomes data is available at the ThreeB plugin for ImageJ (Rosten et al., 2013), is a time serie of 300 images. Taking this stack as initial experimental data, we simulate Markov chains for 1000 fluorophores with the original transition matrix and the transition matrix fitted by our model. Although we found that our model fits better to the experimental data (Appendix 1- figure  15); the comparision between the SR images of the podosomes are not conclusive about the resolution enhancement with our model (Appendix 1-figure 16). Furthermore, in order to validate our method, we use Gatta-paint nanorulers, DNA molecules label with 488, 550 and 655 fluorophores at a distance of 40nm (GATTA-PAINT 40RG/40B, immobilized in buffer on glass-slide) to resolve the resolution enhacement of our method.
As in the previous analysis, we find that our model fits better to the experimental data obtained from the different ATTOS. Because our model involves a greater number of fluorophores, there is more information of the fluorophores localization. For example, in the Appendix 1- figure 17A.1, the information in the SR image with the original matrix, resolves only one fluorophore; however, with our method, three fluorophores in the same area can be resolved. In this way, we observed that in all cases the distance between the fluorophores (each peak) of the grahps shown in panels A.1-A.3 from Appendix 1- figure  17 is approximately 40 nm; this coincides with the information provided by the manufacturers of the samples. All together, these results demonstrate that the new proposed method improves the resolution capacity of 3B algorithm by reflecting better the photophysical behavior of the fluorophores. Appendix 1- figure 17. Comparison between the original model of 3B against the obtained with the transition matrix that we propose. Columns A-C represent the nanorulers GATTA-PAINT labelled with ATTO 488, 550 and 655 respectively. Panels A-C show the normalized experimental data (black dots) and the simulation for 1000 fluorophores with the matrix that we propose (purple dots) and with the original 3B matrix (burgundy dots); each point represents the mean value of the fluorescence. The time is indicated in frames, acquisition time between images is 100 ms. A complete experiment consists of 300 images collected in a CellTIRF microscope with a 160Â magnification. Panels A.1-C.1 show two ROI's from the complete SR images: 3B) canonical reconstruction and 3B-ODE) reconstruction with the ODE's model. The graph shows a line profile from the two reconstructions, in green the results from original 3B and in black with the transition matrix that we propose, the x-label is the distance in nm and the y-label is the intensity pixel value. The peaks from the graph denote the localization of a fluorophore. Once the transition matrix for the nanorulers was calculated and taking into account that the data consists of hundreds of images, we parallelized the data on a cluster and on a personal computer (Hernández et al., 2016) (the cluster has 314 nodes with RedHat Enterprise Linux Version 6.2, 2 Intel Xeon Sandy Bridge E5-2670 processors at 2.6 GHz with 8 cores each 64 GB of RAM) to reduce the computation time.