A swiftly increasing amount of available genomic information allures scientists to systematize the data and embed it into evolutionary scenarios. The recent analysis of viral RNA-dependent RNA polymerase (RdRp), superfamily 3 helicase and chymotrypsin-like protease sequences by Koonin and co-workers (Big Bang of picorna-like virus evolution antedates the radiation of eukaryotic supergroups. Nature Rev. Microbiol. 6, 925–939 (2008))1 clusters a number of RNA viruses into the picorna-like superfamily and led to the proposal that this group has a monophyletic origin. This superfamily, which includes viruses in the order Picornavirales, is now proposed to include other viruses from ten viral families as well as several as-yet-unclassified virus species. The characteristic feature of classical icosahedral picorna-like viruses is the jelly-roll capsid protein fold2. The new proposal by Koonin et al. combines both icosahedral (single-stranded RNA (ssRNA) and double-stranded RNA (dsRNA)) and helical viruses that have no similarities in virion architecture. The common nominator for all these viruses is the presence of a gene that encodes a related RdRp.

But how do we define a virus? It is widely accepted that a replicon is a virus only when it can produce a virion. Consequently, virion architecture, as well as the structures of the major components of the virion, are crucial when considering the origins and evolution of viruses3. In other words, a replicon transforms to a virus only after acquisition of genetic determinants for the virion. It follows that when we analyse relationships between viruses, and not only replicons (for example, plasmids and transposons), a virion-centred approach must dominate. It is evident that replicases can be horizontally exchanged between unrelated viruses. For example, tailed phi29-like bacteriophages (which have an HK97 capsid protein fold4,5 (Fig. 1a); Podoviridae family) and tailless membrane-containing bacteriophages of the Tectiviridae family (which have a double β-barrel capsid protein fold6 (Fig. 1b)) use homologous protein-primed type B polymerases for genome replication7. However, these two viruses have no similarities in virion structure and assembly8, a strictly regulated process that requires a concerted interplay between a number of different proteins. Similarly, different corticoviruses seem to use four non-homologous replication proteins9. Therefore, relationships between viral replication proteins do not necessarily reflect the relationships between viruses that use these proteins (Fig. 2). The extent of horizontal gene exchange between unrelated viruses and between viruses and their hosts10 emphasizes the choices to use a range of replication strategies that viruses have at their disposition.

Figure 1: Comparison of the main capsid protein X-ray structures of bacteriophages.
figure 1

The structures of HK97 (a; protein data bank code 1OHG) and PRD1 (b; protein data bank code 1HX6) are shown. These two viruses assemble unrelated virion structures in distinct and different ways.

Figure 2: Evolution of polymerases does not necessarily reflect the evolution of viruses.
figure 2

a | Three independent instances in which virogenesis gave rise to three distinct viruses that, nevertheless, replicate their genomes with the aid of related proteins. b | Horizontal exchange of the replicase genes between virion types 1 and 2 does not transform virion type 1 into virion type 2. HGT, horizontal gene transfer.

We note that the jelly-roll fold is common to most of the icosahedral ssRNA eukaryotic viruses and is likely to reflect a monophyletic origin for these viruses2. However, the jelly-roll fold, as well as the virion assembly principles of picornaviruses, is unrelated to the capsid proteins of dsRNA viruses, such as Saccharomyces cerevisiae L-A virus (Totiviridae family), which in turn was found to share architectural principles with other dsRNA viruses that infect eukaryotes (Reoviridae family) and bacteria (Cystoviridae family)8.

The phylogenetic analysis presented by Koonin et al.1 reflects the relatedness of viral RdRps. However, owing to reasons presented above, the obtained trees do not necessarily represent relationships between viruses.