The contrasting phylodynamics of human influenza B viruses

A complex interplay of viral, host, and ecological factors shapes the spatio-temporal incidence and evolution of human influenza viruses. Although considerable attention has been paid to influenza A viruses, a lack of equivalent data means that an integrated evolutionary and epidemiological framework has until now not been available for influenza B viruses, despite their significant disease burden. Through the analysis of over 900 full genomes from an epidemiological collection of more than 26,000 strains from Australia and New Zealand, we reveal fundamental differences in the phylodynamics of the two co-circulating lineages of influenza B virus (Victoria and Yamagata), showing that their individual dynamics are determined by a complex relationship between virus transmission, age of infection, and receptor binding preference. In sum, this work identifies new factors that are important determinants of influenza B evolution and epidemiology. DOI: http://dx.doi.org/10.7554/eLife.05055.001


Introduction
In addition to two subtypes of influenza A virus (H1N1 and H3N2), two lineages of influenza B viruses co-circulate in humans and cause seasonal influenza epidemics (Klimov et al., 2012). Influenza B causes a significant proportion of influenza-associated morbidity and mortality, and in some years is responsible for the major disease burden (Burnham et al., 2013;Paul Glezen et al., 2013). Although type A and B influenza viruses are closely related and have similarities in genome organization and protein structure , they exhibit important differences in their ecology and evolution (Chen and Holmes, 2008;Tan et al., 2013). While new influenza A viruses periodically emerge from animal reservoirs to become endemic in humans (Neumann et al., 2009;Smith et al., 2009), influenza B viruses, first recognized in 1940, have circulated continuously in humans alongside influenza A viruses and are presumably derived from a single, as yet unknown, source (Francis, 1940;Chen and Holmes, 2008). Unlike influenza A viruses that can infect a wide range of species, influenza B infections are almost exclusively restricted to humans with only sporadic infections reported in wildlife (Osterhaus et al., 2000;Bodewes et al., 2013). While the evolutionary and epidemiological dynamics of human influenza A H1N1 and H3N2 viruses have been well documented at the global scale (Rambaut et al., 2008;Russell et al., 2008;Bedford et al., 2010;Bahl et al., 2011), the equivalent dynamics of the two influenza B virus lineages-B/Yamagata/16/88-like and B/Victoria/2/87-like, henceforth termed the Yamagata and Victoria viruses-are poorly understood. eLife digest To develop new therapies against infections caused by a virus, it is important to understand the virus's history-where, when, and why it has caused disease and how it has changed over time. For example, new human strains of the influenza type A virus originate from strains that infect animals and rapidly can become common in human populations. In contrast, influenza type B virus strains almost exclusively infect humans and are continuously present in human populations. Both types have a detrimental impact on global health, but the type B viruses are less well understood, partly because outbreaks have not been as extensively documented. Vijaykrishna et al. have now investigated the history of the two strains of the influenza type B virus-called Victoria and Yamagata-that currently circulate in humans. To do this, they inspected the genetic sequences of 908 viruses taken from samples of confirmed type B infections collected across Australia and New Zealand over 13 years.
Individual virus particles of the same strain have genetic sequences that are very similar, but not completely identical. Vijaykrishna et al. showed that the diversity of the genetic sequences from the Victoria strain fluctuated between seasons, and particular genetic variants of Victoria only persisted in the population for 1-3 years. This indicates that Victoria viruses are under a lot of pressure to evolve, which results in so-called 'bottlenecks' whereby only the viruses carrying particular varieties of genetic sequence survive. This fluctuating pattern resembles that of the better-understood type A seasonal flu strain H3N2.
On the other hand, there was little change in the genetic diversity of the Yamagata strains sampled over the same 13-year period. The Yamagata viruses have diversified to a greater extent and several different 'varieties' of the virus tend to circulate together for long periods of time. For example, the three varieties of Yamagata virus circulating in 2013 evolved from a common parent virus that was circulating around 10 years ago. Vijaykrishna et al. found that between disease outbreaks, there was greater variation in the ability of Victoria viruses to be transmitted in humans, but that they were generally more easily transmitted than the Yamagata viruses. Victoria viruses tend to infect younger patients than Yamagata viruses, which is thought to be due to differences in the molecules that help the viruses enter the cells of the respiratory tract.
These findings suggests that it might be possible to eradicate the more slowly evolving influenza B Yamagata virus through mass vaccination programs using existing vaccines. This would then allow researchers to focus on developing effective vaccines to target the other strains of influenza virus.
Human influenza A H3N2 viruses exhibit limited genetic diversity at individual time-points due to periodic bottlenecks caused by strong selection-known as 'antigenic drift'-in the hemagglutinin (HA) and neuraminidase (NA) genes. This results in an HA phylogenetic tree with a characteristic slender 'trunk' (Fitch et al., 1997) appearance ( Figure 1A). H3N2 viruses also exhibit strong seasonal fluctuations in genetic diversity in temperate climate regions (such as Australia and New Zealand) (Rambaut et al., 2008), mainly due to the local extinction of viral lineages at the end of each influenza season (Rambaut et al., 2008). A similar but weaker evolutionary pattern is observed in the seasonal H1N1 viruses that have circulated in humans for the majority of the previous century (1918-1957 and 1977-2009), with short-term co-circulation of diverging virus populations (Nelson et al., 2008b) ( Figure 1B). The pandemic H1N1 (H1N1pdm09) viruses have to date also only exhibited limited antigenic evolution since they emerged in 2009 ( Figure 1C). In contrast, influenza B viruses are currently composed of two distinct lineages (Victoria and Yamagata) (Kanegae et al., 1990;Rota et al., 1990) ( Figure 1D) that diverged approximately 40 years ago and which have since co-circulated on a global scale, despite frequent reassortment among them (Chen and Holmes, 2008). Although the HA genes of influenza B viruses are thought to exhibit lower rates of evolutionary change (nucleotide substitution) than both influenza A viruses (Ferguson et al., 2003;Chen and Holmes, 2008;Bedford et al., 2014), their antigenic characteristics are not well understood.
The advent of global influenza surveillance and full genome sequencing over the past decade has shown that seasonal epidemic outbreaks of each influenza type are caused by the stochastic introduction of multiple virus lineages (Nelson et al., 2008a) and that the patterns of seasonal oscillation vary between temperate and tropical regions (Rambaut et al., 2008). Population genetic analysis (Rambaut et al., 2008), consistent with epidemiological data (Goldstein et al., 2011), suggests that the H3N2 and H1N1 subtypes of influenza A virus compete with each other resulting in the epidemic dominance of a single subtype. However, it is unclear whether the same dynamic patterns can be extended to influenza B viruses, or why the Victoria and Yamagata lineages have co-circulated for such an extended time period.
To understand the evolutionary and epidemiological dynamics of influenza B virus, we generated the full genomes of 908 influenza B viruses selected from over 26,000 laboratory confirmed influenza B cases in children and adults aged from birth to 102 years sampled during 2002-2013 in eastern  H1N1 virus, 1998-2009 (B), H1N1pdm09 virus, 2009-2013 (C), and influenza B Yamagata (red) and Victoria (black) lineage viruses, 2002-2013 (D). All phylogenetic trees were generated using approximately 1200 randomly selected full-length gene sequences sampled during 12 years. DOI: 10.7554/eLife.05055.003 Australia (Queensland,n = 275;New South Wales,n = 210;and Victoria,n = 207) and New Zealand (n = 216) ( Figure 2). These regions were selected because influenza surveillance was well established and continuous during the sampling period and included the co-circulation and periodic dominance of influenza A and both influenza B virus lineages. Of note is that the influenza B virus strain used for vaccination in the region did not match the dominant circulating strain during 7 of the 13 years studied ( Figure 2B). Our overall aim was to integrate, for the first time, data obtained from genetic, epidemiological, and immunological sources to better understand the evolution and interaction of these two lineages of influenza B virus.

Population dynamics of influenza B virus
We used the HA segment of both lineages to contrast their phylodynamics. First, to assess the changing patterns of genetic diversity of the two influenza B virus lineages in relation to their evolutionary histories, we used a flexible coalescent-based demographic model , which revealed stark differences in the epidemiological dynamics of the Victoria and Yamagata lineages ( Figure 3A,B). Whereas the Victoria lineage experienced strong seasonal fluctuations in Percentage prevalence of influenza B viruses isolated from the three eastern Australian states and New Zealand (A). Coloured lines represent the proportion of influenza viruses typed as influenza B in each country (blue) and each of the eastern Australian states; Queensland (yellow), New South Wales (orange), and Victoria (pink). Bars represent the percentage prevalence of Victoria (black) and Yamagata (red). Data based on National Notifiable Diseases relative genetic diversity, little change was observed over the same time period for the Yamagata lineage, and these observations were not heavily affected by differences in sampling density (Figure 3-figure supplement 1). While the almost invariant relative genetic diversity of the Yamagata lineage resembled that of seasonal H1N1 viruses ( Figure 3D), the stark and almost annual changes of diversity in the Victoria lineage were similar to those observed for H3N2 virus ( Figure 3C); although H3N2 viruses exhibited a greater frequency of oscillations than those estimated for Victoria lineage viruses. The strong seasonal fluctuations in diversity observed for Victoria lineage suggest that this lineage experiences strong bottlenecks between seasons similar to those described for H3N2 viruses (Bedford et al., 2011;Zinder et al., 2013), whereas the almost invariant relative genetic diversity for Yamagata suggests the continuous co-circulation of multiple lineages.
Marked differences between the Victoria and Yamagata lineages were apparent in phylogenetic trees of the HA (Figure 4). The phylogenetic analysis of the HA genes showed that the Victoria lineage was characterized by a single prominent tree 'trunk', with side branches that circulated for short periods of time (1-3 years) ( Figure 4). This evolutionary pattern parallels that observed for seasonal H3N2 viruses and is indicative of frequent selective bottlenecks due to the serial replacement of circulating strains, as would be expected under continual antigenic drift (Grenfell et al., 2004). In contrast, greater diversification was observed for the Yamagata lineage, with multiple clades co-circulating for extensive periods of time ( Figure 4). For example, the three clades of Yamagata viruses circulating in 2013 diverged approximately 10 years ago, again paralleling the evolutionary pattern seen in seasonal H1N1 viruses. These patterns are clearly identifiable in the genealogical diversity skyline (Figure 4) in which the average time to common ancestor between contemporaneous samples fluctuated from 0 to <5 years for Victoria lineage, except during 2010 and 2011 where the  genealogical diversity marginally increased to 7 years. In contrast, the genealogical diversity of Yamagata was consistently greater and gradually increased during the sampling period. The maintenance of genetic diversity through epidemic peaks and troughs as described for Yamagata ( Figure 3B) is expected to result in the gradual increase of divergence times of contemporaneous samples.

Transmission dynamics of influenza B virus
As each seasonal influenza epidemic provides important information on the epidemiological characteristics of both influenza B virus lineages, we utilized a birth-death susceptible-infectedremoved (BDSIR) (Kühnert et al., 2014) phylodynamic model that simultaneously co-estimates seasonal phylogenies along with the basic reproductive number, R 0 , for each lineage. However, because the infected population contains both susceptible and non-susceptible hosts we report the effective reproductive number, R e . This analysis showed a greater variation in R e (median values, 1.1-1.3) between epidemics caused by the Victoria lineage, whereas the R e of Yamagata epidemics, were generally lower, varied only slightly, around 1.1 (1.08-1.14) ( Figure 5A  cumulative number of all influenza B positive cases through time for each season ( Figure 5B) reveals significant differences in the exponential growth phase between the lineages, where Victoria lineage exhibited significantly higher initial growth rate resulting in a faster epidemic with larger number of infections. These results also show that in 2008 the Victoria lineage exhibited a significantly faster growth rate, in correlation with the high R e , coinciding with the year in which a new antigenic variant of the Victoria lineage was first detected (B/Brisbane/60/2008-like viruses) in Australia and New Zealand. This antigenic variant emerged as the globally dominant influenza B strain in the following years and has been continuously recommended (2009)(2010)(2011)(2012)(2013)(2014)(2015) as the influenza B vaccine component since that period in both the Northern and Southern Hemispheres (Klimov et al., 2012).
The BDSIR model assumes a closed epidemic, but the large-scale phylogenies generated using all available global data indicated that each of the annual epidemics were caused by the introduction of multiple viral lineages that went extinct locally by the end of the seasonal epidemic (data not shown). We therefore investigated the effect of virus migration on the estimates of R e . First, we identified lineages that conformed to the assumption of a closed epidemic (i.e., lineages resulting from a single introduction into Australia and New Zealand) and with a sufficiently large local transmission for analysis (i.e., Victoria lineage viruses in 2005, 2006 and 2008). An independent estimation of R e for each of these lineages produced a minor but non-significant variation to those observed for the entire epidemic ( Figure 5-figure supplement 1B), indicating that, on average, the R e estimates for lineages resulting from multiple introductions were similar. Next, we used a continuous-time Markov chain (CTMC) phylogeographic process  to estimate the number of migration events into and from Australia and New Zealand during the same period ( Figure 6). This indicated that the number of introductions per year was greater for the Yamagata lineage (15-22, mean state transition count in all years) than for Victoria (3-8, except 16 and 14 during 2010 and 2011, respectively) ( Figure 6), further suggesting an inverse relationship between R e ( Figure 5) and the number of introduction events. Indeed, our results show that introductions of viruses with greater transmission efficiency (i.e., high R e ), such as Victoria/2008, resulted in the epidemic dominance of such single strains, whereas epidemics of the Yamagata lineage with lower R e values likely resulted in slower and shorter transmission chains with reduced competition, in turn allowing the co-circulation (and detection) of multiple introduced lineages. Additionally, we identified that, combined, Australia and New Zealand were net importers of influenza viruses, except during 2002 and 2008 when the net export of the Victoria lineage was similar to the import observed during the same years ( Figure 6). The higher transmission rate for Victoria/2008 viruses (i.e., B/Brisbane/60/2008like viruses) may have also caused the successful seeding of these viruses globally (as described above). Taken together, our results support the concept of a global metapopulation seeding subsequent epidemics elsewhere (Bedford et al., 2010;Bahl et al., 2011), provided the virus is transmitted efficiently as observed during 2008 in this study.

Genome-wide evolutionary dynamics of influenza B viruses
To understand the genome-wide evolutionary dynamics of the two influenza B virus lineages, we inferred temporal changes in genetic diversity for all remaining gene segments (Figure 7). These analyses showed that the patterns observed for the NA and internal gene segments were similar to those observed for the HA genes described above. The single exception was the NP genes of both lineages where substantial differences occurred throughout their history. During 2002-2007, the peaks of relative genetic diversity of the Victoria NP was higher than all remaining gene segments following which this lineage was not identified in our surveillance, whereas the Yamagata NP showed additional peaks during 2010 and 2011 that corresponded to the NP peaks observed for the Victoria genes.
As genomic reassortment impacts levels of genetic diversity, we conducted phylogenetic analyses of all eight genome segments of the 908 viruses. Comparison of these phylogenies revealed frequent reassortment within the two lineages of influenza B virus (data not shown) and a few instances of reassortment between them (Figure 8). During the sampling period, the Victoria lineage HA gene repeatedly acquired internal gene segments from Yamagata lineage viruses to form novel reassortants. In particular, during 2004 a subpopulation (approximately 15%) of Victoria-like viruses acquired all internal gene segments (PB2, PB1, PA, NP, MP, and NS) from the Yamagata lineage viruses. Interestingly, all remaining inter-lineage reassortment events of the Victoria HA lineages involved the acquisition of the Yamagata NP gene during 2007 and 2008 ( Figure 8E), which resulted in the extinction of the previously circulating Victoria lineage NP gene. These patterns were consistent with the reconstruction of the population genetic history for the NP gene where we observed additional peaks in genetic diversity following 2007/2008 when the Yamagata NP was acquired by Victoria viruses (Figure 7), indicating a major genome-level transition for Victoria lineage viruses. In contrast, the only inter-lineage reassortment events for the virus carrying the Yamagata HA occurred during 2002 and 2004 (red arrows in Figure 8A,F), when the NA and MP genes were derived from the Victoria lineage viruses, but these viruses went extinct within the same influenza season. In sum, these results show that the HA gene of Victoria viruses is placed in different genetic backgrounds at a higher rate and this is likely to have important fitness consequences. Phylogenies also suggest that the PB2 and PB1 gene trees ( Figure 8B,C) exhibit deep divergence, similar to the HA gene where co-circulating viruses contain distinct Victoria and Yamagata genes. In contrast, the other gene segments exhibit relatively recent divergence indicating that the prevailing diversity of these genes originates from a single lineage. These results are consistent with a detailed investigation of long term reassortment patterns of influenza B virus lineages that revealed genetic linkage between the PB2, PB1 and HA protein genes (Dudas et al., 2015). Specifically, we observe that the PB2, PB1 and HA genes were consistently derived from a single lineage, except for the short-lived subpopulation in 2004.

Differential selection pressure between lineages
Despite the marked differences in their epidemiological and evolutionary dynamics, the HA genes of the two influenza B lineages both evolved at a rate of approximately 2.0 × 10 −3 subs/site/year (Table 1), comparable to those previously estimated for a smaller (n = 102) global sample of influenza B viruses collected during 1989-2006 (Chen and Holmes, 2008) (mean nucleotide substitution rate of 2.15 × 10 −3 subs/site/year). These rates were considerably lower than those estimated for influenza A H3N2 and H1N1 viruses (5.5 × 10 −3 subs/site/year, 4.0 × 10 −3 subs/site/year, respectively) (Rambaut et al., 2008). In contrast, analysis of the ratio of the number of nonsynonymous and synonymous substitutions per site (d N /d S ) revealed significant differences between the influenza B virus lineages, with the Victoria lineage viruses having accumulated more nonsynonymous substitutions (d N /d S = 0.19) than the Yamagata lineage (d N /d S = 0.13) (p-value, <0.05). In addition, two amino acid residues in the Victoria HA (positions 212 and 214) were revealed to have experienced positive selection (p < 0.05), whereas no positively selected sites were observed in the Yamagata lineage over the time period studied. Similarly, the Victoria lineage exhibited a greater d N /d S (ratio = 1.37) on internal vs external branches of the HA phylogeny compared to the Yamagata lineage (ratio = 0.98), indicating that amino acid changes have been fixed more frequently in Victoria than Yamagata lineage viruses ( Table 1). Taken together, these results indicate that the Victoria lineage is under greater positive selection pressure, and hence likely to experience greater antigenic drift, than the more conserved Yamagata lineage.

Antigenic evolution
We constructed antigenic maps (Smith et al., 2004) using hemagglutination inhibition (HI) assay measurements for 87 Victoria and Yamagata viruses isolated during 2002-2013 and using 20 reference antigens and antisera ( Figure 9A). These revealed that Victoria lineage viruses exhibited antigenic variation that generally clustered according to the year of isolation and phylogenetic distance, indicative of ongoing antigenic drift, and similar to that previously reported for H3N2 viruses (Smith et al., 2004;Bedford et al., 2014). In contrast, the antigenic distances for the Yamagata viruses had no correlation with time or phylogenetic distance and showed greater levels of antigenic cross-reactivity between antisera raised to both earlier and more recent viruses. Structural modeling showed that the degree of antigenic distance between strains of Victoria viruses was often linked to the proximity of single amino acid substitutions to the receptor binding pocket (RBP) of the HA ( Figure 9B; see structural differences section below), in agreement with recent work on H3N2 (Koel et al., 2013). Importantly, the closer the amino acid change between two strains was to the RBP, the greater the antigenic difference between them.

Heterogeneous age distributions of the lineages
In addition to genetic, antigenic, and evolutionary differences, we found a notable difference in the age distribution of infected cases for the two influenza B virus lineages ( Figure 10) that was generally consistent throughout our sampling period ( Figure 10-figure supplement 1). On average, Victoria viruses infected a younger population (mean 16.8 years, median 11 years) compared to Yamagata viruses (mean 26.6 years, median 18 years). Although the proportion of cases under 6 years were similar in both lineages (28.8% of Victoria and 26.8% of Yamagata), there were 1.7 times more cases aged 6-17 years infected with a Victoria lineage virus (39.0% Victoria vs 22.7% Yamagata), while this ratio was almost reversed for those aged 18 years and over (32.2% Victoria vs 50.0% Yamagata; χ 2 , p < 0.0001) ( Table 2). Thus, nearly 70% of Victoria lineage viruses were identified in children <18 years, whereas the Yamagata lineage exhibited a bimodal age distribution with a significant shift toward infections in individuals aged >25 years ( Figure 10). These differences in age distribution are significant and unlikely to be explained by systematic bias because the same pattern was observed in both countries, and are consistent with data from Guangdong, China (Tan et al., 2013), and Slovenia (Sočan et al., 2014) during the 2009-2010 and 2010-2013 epidemic seasons, respectively.
A direct consequence of antigenic drift is the possibility for previously infected individuals to become reinfected. Subsequently, higher rates of antigenic drift in the Victoria lineage should lead to a more even age distribution of cases, whereas lower rates of antigenic drift should lead to an age distribution of cases that are skewed towards younger individuals. Although viruses of the Victoria lineage were consistently reported at a higher frequency during our surveillance period, the observed skew towards children runs counter to this expectation ( Figure 10). One possible explanation is that the higher R e of the Victoria viruses reduces the mean age of infection, as expected in the case of a disease like influenza that imparts some immunity following infection (Anderson and May, 1992). Alternatively, the inability of Victoria viruses to infect an equivalent proportion of other age groups may mean that the relatively older population is better protected against this virus because of a broader immune response. The former scenario is supported by an increase in the mean age of infection from 15 years (median, 12) in 2008 to 20.5 years (median, 14) in 2011 for the B/Brisbane/60/ 2008-like antigenic variant of the Victoria lineage, which coincided with a gradual drop in R e from its peak in 2008 ( Figure 5A).

Structural differences among influenza B viruses
Finally, we sought to determine whether differences in the evolutionary and epidemiological dynamics between the two influenza B lineages resulted from variation in HA structure and binding preferences. First, we compared amino acid substitutions per site within and between influenza virus lineages from 2002 to 2012 and mapped these onto structural models of representative influenza B virus strains ( Figure 11). The higher rates of amino acid change observed in the Victoria HA ( Figure 11A) were consistent with the stronger selective pressures on this viral lineage. Importantly, these changes occurred in three major clusters situated around 21, 29, and 37Å to the RBP of the HA domain that Figure 9. Antigenicity of influenza B viruses. Antigenic map showing relative antigenic differences of Victoria and Yamagata lineage viruses (circles) measured using the hemagglutinin inhibition (HI) assay for each strain and coloured by year of isolation (A). Residues contributing to HI titer changes (B). Among the nine amino acid changes that we detected between antigenically different Victoria viruses, three changes produced strong HI titer change (>100) (red), 3 medium (≈50) (orange) and 3 low (<20) (yellow). Changes that produced the strongest HI titer change were the closest to the receptor binding pocket (blue arrow), highlighting the significance of their proximity to HI titer change. Amino acids were mapped on previously resolved influenza B virus structure (PDB:4FQM). Detailed HI titer values and reference antigens used are provided in the Dryad source data (Vijaykrishna et al., 2015). DOI: 10.7554/eLife.05055.014  Table 2 for comparison by age categories and Dryad source data for mean and median age for each year. DOI: 10.7554/eLife.05055.015 The following figure supplement is available for figure 10: also comprises potential antigenic sites. Notably, all changes in the closest cluster (21Å) were comprised exclusively of Victoria lineage amino acid changes, while the few changes observed in Yamagata lineage viruses were distant to the RBP ( Figure 11C). Overall, however, amino acid changes in both influenza B virus lineages were less frequent than those in influenza A viruses sampled over a similar time period, with the H3N2 viruses showing more extensive structural change (Figure 11-figure supplement 1).
Notably, we also observed fundamental structural differences between the lineages ( Figure 11B). Crystal structures showed extensive backbone differences around amino acid sites 165 and 180 that lie near the RBP as well as residue differences in the helix close to where α-2,3 and α-2,6 ligands differ structurally, thereby potentially influencing receptor binding ( Figure 11D). Previous experiments suggest that Yamagata viruses bind predominantly to α-2,6-linked sialic acid host receptors while Victoria viruses have both α-2,3 and α-2,6 binding capacities (Wang et al., 2012;Velkov, 2013). Binding differences may also originate in part from differences in N-glycosylation patterns between the lineages ( Figure 11E, 12). While both lineages share a possible glycan at Asn 160, only Victoria has a functional N-glycosylation site at Asn 248, although its distance from the receptor may account for only a limited role in binding differences. On the other hand, N-glycosylation at Asn 212 occurs in both lineages but has a lower overall frequency in Victoria strains. In light of the positive selection acting on codon sites 212 and 214 in the Victoria lineage, it is interesting to note that amino acid changes in either site would abolish the N-glycosylation at 212, thereby highlighting a possible functional consequence of gain or loss of a glycan at this site. Furthermore, position 212 is located at the exit of the RBP which is used by α-2,3-linked sialic acid host receptors, and loss of N-glycosylation at 212 consequently adds capacity to bind α-2,3 and not just α-2,6-linked sialic acid host receptors ( Figure 11E). Importantly, all our sequenced viruses have been passaged in MDCK cells to avoid egg adaptation artifacts in this context (Gambaryan et al., 1999). Interestingly, we observed that loss of N-glycosylation at site 212 was associated with an increased proportion in the younger (0-5 years) age group (Figure 12). We therefore hypothesize that subtle differences in the prevalence of α-2,3and α-2,6-linked glycans on the cells of the respiratory tract of young children compared to adults (Nicholls et al., 2007;Walther et al., 2013), combined with partial changes in glycosylation patterns, could account for the observed differential age distribution of the two influenza B lineages.

Conclusions
The genomic and epidemiological data analyzed here provide important insights into the phylodynamics of the two lineages of influenza B virus currently circulating in humans. In particular, we find significant differences in the evolutionary and epidemiological dynamics between the Victoria and Yamagata lineages ( Table 3). Central to this is the observation that the phylodynamic pattern of the Victoria lineage HA gene is indicative of a virus population under greater selection pressure that escapes host immunity by accruing beneficial amino acid substitutions in the HA gene. Indeed, theory predicts that the highest rate of viral adaptation occurs at intermediate levels of immune pressure (Grenfell et al., 2004) which may characterize the Victoria lineage. Such an evolutionary pattern ensures that there is a constant supply of susceptible individuals for Victoria lineage viruses-both naïve and reinfected individuals which in turn increases R e -which then exhibit a pattern of genomic diversity and lineage turnover that is significantly faster and more periodic than Yamagata lineage viruses.
In contrast, the phylodynamic patterns exhibited by Yamagata viruses are indicative of a virus population that exhibits slower and less periodic dynamics, reflected in a lower and more consistent R e , in turn suggesting that these viruses are under weaker immune selection pressure and accordingly experience weaker antigenic drift. Interestingly, clinical trials of influenza B virus vaccination in children (Skowronski et al., 2011) and experimental infection of mice (Skowronski et al., 2012) showed that the Yamagata antigens produced a stronger immune response than the Victoria antigens. If natural infection with influenza B virus was similar, this would imply that Yamagata viruses are less able to evolve through antigenic drift and therefore escape the immune response (Grenfell et al., 2004). We propose that these fundamental differences in evolutionary and epidemiological dynamics are driven by differences in hemagglutinin binding preferences. Specifically, Victoria viruses appear to have both α-2,3and α-2,6-linked sialic acid binding capacities (Wang et al., 2012;Velkov, 2013), Figure 11. Structural view of the HA showing mutational accumulation and lineage differences. Amino acid changes observed within and between influenza B virus lineages (A). Arrow colours in (A) correspond to inter-(B) or intra-(C) lineage amino acid changes, based on previously resolved crystal structure (PDB:4FQM). Amino acids in red represent differences between the two lineages that were retained over all sampling years; yellow represents differences that are newly observed in 2012 compared to 2002; and magenta represents changes lost in 2012 compared to 2002. Amino acids in blue and green represent changes that occurred in Victoria and Yamagata viruses between 2002 and 2012, respectively; whereas cyan represents difference between 2002 and 2012 shared between both lineages. These amino acid changes occur in regions that cluster around 21, 29, and 37Å distant from the RBP (C). Structural differences in RBP among recent Victoria (B/Brisbane/60/2008) and Yamagata (B/Florida/4/2006) strains with a human-like α-2,6 host receptor analogue (magenta) modeled within the viral RBP (D). D was based on crystal structures PDB:4FQM and PDB: 4FQJ with side-chains minimized after addition of ligand from PDB:2RFU through superposition. Regions differing in backbone conformation are shown in orange for Victoria and cyan for Yamagata, while conserved regions are shown in gray. Residues with conserved backbone structure but different amino acid side-chains are shown in red for Victoria and blue for Yamagata. Side-chains are shown only for residues within 5Å of the receptor ligand and differing between the lineages. Structural view of receptor binding pocket with α-2,6-(green) and α-2,3-linked (red) host receptor and glycans (blue) (E). E was based on crystal structure PDB:4FQM, with the addition of ligands from PDB:2RFU and PDB:2RFT through superposition and no minimization. The presence of a glycan on site 212 allows binding only to 2,6-linked receptors, while loss of the glycan allows binding to both α-2,3and α-2,6-linked receptors. Brown arrows (B and C) indicate relative position of receptor binding pocket (RBP), whereas black arrow heads (C and D) point to site of known antigenic cluster transition (Koel et al., 2013). DOI: 10.7554/eLife.05055.018 The following figure supplement is available for figure 11: while Yamagata viruses predominantly bind to α-2,6-linked glycans on cells in the human respiratory tract. Experimental studies in children (aged up to 7) (Nicholls et al., 2007) and adults have shown that the respiratory tissue of children mainly have α-2,3-linked receptors with a lower level of α-2,6linked receptors than adults, and these differences among the different age groups may in part account for the different age distribution of the two B lineages. In turn, the greater propensity to infect children will increase R e , initiating the epidemiological and evolutionary pattern that characterizes the Victoria lineage. It remains to be determined whether the broadly equivalent phylodynamic differences between the H3N2 and seasonal H1N1 types of influenza A virus are similarly due to basic differences in the structure of their respective HA proteins. Furthermore, to better understand the bimodal age distribution in Yamagata, where a significant reduction of infection was observed among the older children-young adult group (<25 years), additional experimental studies of the receptor distribution in all age groups are necessary. Figure 12. Glycosylation at Asn 212 and correlation with age groups for Victoria viruses. Yamagata viruses showed five instances of glycosylation loss at 212, compared to 71 instances in Victoria, hence Victoria lineage strains have been analyzed in detail here. Temporal distribution of age groups and glycosylation at 212 for all Victoria strains (A). Summary of odds ratio (OR) for association of glycosylation loss at 212 with the different age groups (B). OR values >1 indicate that it is more likely to find a 212 loss in the respective age group, whereas values <1 indicate that 212 losses are less likely to be found in the respective groups. The following guideline helps judging significance of OR: strong positive association >3; moderate positive association 1.5-3; moderate negative association 0.33-0.66; strong negative association <0.33. DOI: 10.7554/eLife.05055.020 These observations have implications for the future control of influenza B virus in the human population. While the co-circulation of divergent Yamagata viruses reported here has and can confound the accurate selection of vaccine strains, our analyses also indicate that the Yamagata viruses are under weaker positive selection and antigenic drift, and, on average, infect an older group of people who are more likely to have a higher level of cross-reactive antibodies to the B lineage viruses compared to children. As a consequence, there is a greater chance that, given sufficient coverage, Yamagata viruses might experience a major drop in prevalence over time through targeted control methods, such as the extensive use of quadrivalent influenza vaccines containing both B lineages, in contrast to the more adaptable Victoria viruses.

Materials and methods
Surveillance Influenza B positive samples collected between 2002 and 2013 from subjects in eastern Australia (Victoria, New South Wales and Queensland) and from New Zealand and associated metadata, including date of isolation and age of host, were sent to the WHO Collaborating Centre for Reference and Research on Influenza, Melbourne, from National Influenza Centres and other laboratories as part of the World Health Organization Global Influenza Surveillance and Response System (WHO GISRS). Data deposited in Dryad data repository under DOI: 10.5061/dryad.n940b (Vijaykrishna et al., 2015).

Virus isolation
Influenza B viruses were isolated or re-isolated in MDCK cells (ATCC-CCL 34) from original clinical samples or virus isolates and typed as B/Yamagata or B/Victoria using HI analysis or by molecular assay (Deng et al., 2013). Viruses were stored at −80˚C until sequenced.

Sequencing of viral RNA genome
We sequenced the complete genomes of 908 laboratxory confirmed influenza B virus MDCK or MDCK-SIAT cell propagated isolates passaged 1-4 times from eastern Australia and New Zealand using a novel methodology (Zhou et al., 2014). Influenza B virus genomes were amplified using the universal influenza B genomic amplification strategy that enables amplification of the complete genome of any influenza B virus in a one-step single tube/well reaction. Specifically, RNA was isolated from 130 μl of culture supernatant using ZR-96 Viral RNA Kit (Zymo Research, Irvine, CA) and eluted in 30 μl of RNase-free water. 3 μl of the RNA was mixed with FluB Universal Primer Cocktail (Zhou et al., 2014) and converted to cDNA and amplified with the SuperScript III One-Step RT-PCR System (Life Technologies, Grand Island, NY). The amplicons were fragmented, flanked by sequencing adaptors, clonally amplified onto IonSphere particles, and sequenced on the Ion Torrent PGM platform following manufacturer's instruction. The sequence reads were sorted by bar code to separate different viruses and used to assemble viral genomes (sequence accession numbers are available in the Dryad data repository under DOI: 10.5061/dryad.n940b).

Phylogenetic analysis
Sequences were curated, and maximum likelihood (ML) phylogenetic trees were inferred for each gene segment independently from the samples described above. ML trees were estimated using iqtree v0.9.5 (Minh et al., 2013) using the best-fit nucleotide substitution model, chosen by the Bayesian Information Criterion (BIC). The data were further divided into separate lineages (i.e., Victoria and Yamagata) and time-scaled phylogenies and rates of nucleotide substitution for each were inferred using a relaxed molecular clock model in a Bayesian Markov Chain Monte Carlo (MCMC) framework with the program BEASTv1.8 (Drummond et al., 2012) that incorporates virus sampling dates to concurrently estimate phylogenetic trees, rates of nucleotide substitution, and the dynamics of population genetic diversity using a coalescent based approach. The analysis was conducted with a General Time Reversible (GTR) model with a gamma (Γ) distribution of among-site rate variation and a time-aware linear Bayesian skyride coalescent tree prior . We performed at least two independent analyses per data set for 100 million generations sampled every 10,000 runs. After the appropriate removal of burn-in (10-20% of samples in most cases), a summary Maximum Clade Credibility (MCC) tree was inferred and visualized with Figtree v1.4 (Rambaut, 2014). Support for individual nodes is reflected in posterior probability values, and statistical uncertainty is given by 95% Highest Posterior Density (HPD) intervals.
The MCC trees were also used to estimate the genealogical pairwise diversity by averaging the time distance between contemporaneous sample pairs with a 1 month window (Zinder et al., 2013).
The past population dynamics of each linage were compared using a Bayesian skyride analysis in BEAST, which utilizes a Gaussian Markov Random Field (GMRF) smoothing prior to estimate the changes in relative genetic diversity in successive coalescent intervals . In the absence of natural selection (i.e., under a strictly neutral evolutionary process), the genetic diversity measure obtained reflects the change in effective number of infections over time (N et , where t is the average generation time). However, because natural selection can play a major role in the evolution of the influenza HA, these are interpreted as 'relative genetic diversity', and which is consistent with previous studies of influenza A virus (Rambaut et al., 2008). Sequence alignments with input parameters are available under Dryad data repository under DOI: 10.5061/dryad.n940b.

Phylogeography and migration rate estimates
We used a continuous-time Markov chain (CTMC) phylogeographic process Lemey et al., 2009) to estimate counts of migration to and from Australia and New Zealand, similar to previous studies (Nunes et al., 2012;Bahl et al., 2013). Briefly, global influenza B virus HA sequences and their associated spatial locations and isolation dates were downloaded from GenBank for the years for which we estimated an effective reproductive number in the phylodynamic analysis (see below). Spatial locations of the isolates were transformed to represent two discrete states: the region of interest (Australia and New Zealand) and the rest of the world. Phylogeographic events were estimated independently for each of the identified years using an asymmetric CTMC process , with the estimated state transition counts (import and export) between the two discrete states estimated using a Markov Jump count approach. This phylogeographic inference was implemented in BEAST 1.8 (Drummond et al., 2012) similar to the temporal phylogenies described above. The resulting log files were used in extracting the net migration counts and mean non-zero transition rates.

Phylodynamic analysis
To estimate epidemiological parameters (specifically the effective reproductive number, R e ) for each epidemic of virus lineages in Australia and New Zealand, we used the birth-death susceptibleinfected-removed (BDSIR) model (Kühnert et al., 2014). The BDSIR analysis was also conducted with a GTR + Γ substitution model, with epidemiological dynamics estimated jointly with the phylogenies for each virus lineage. The model assumes a closed SIR epidemic in each season for the underlying host population. The initial number of susceptible individuals S 0 could not be estimated and was therefore initially fixed to 4,000,000 (results reported in the main text). Analysis under different S 0 values, ranging from 40,000 to 10 million, showed that the estimates of reproductive numbers (R e ) are robust to the choice of S 0 . The BDSIR analyses utilized m = 100 intervals for the approximation of the SIR dynamics. Incidence and prevalence were computed from the posterior distributions of the SIR trajectories, and the relevant plots show their median values.

Molecular adaptation
Selection pressures for each gene segment, lineage, and individual codon were estimated as the ratio of the number of nonsynonymous substitutions per nonsynonymous site (d N ) to the number of synonymous substitutions per synonymous site (d S ). Estimates were obtained using the Single Likelihood Ancestor Counting (SLAC) (Kosakovsky Pond and Frost, 2005) and Fast Unconstrained Bayesian AppRoximation (FUBAR) (Murrell et al., 2013) methods, accessed through the Datamonkey webserver of the HyPhy package (Delport et al., 2010). In addition, the d N /d S ratio for the internal and external branches of the Victoria and Yamagata HA phylogenies was estimated separately using the CODEML program (tworatio model) available in the PAML suite (Yang, 2007).

HI assay and antigenic cartography
Representative viruses from each lineage were sub-sampled and tested for antigenic reactivity by a hemagglutination inhibition (HI) assay using a panel of reference ferret antisera that were available for each influenza B lineage (raw HI titers are available in the Dryad data repository under DOI: 10.5061/dryad.n940b) and the subsequent antigenic profile was used to generate antigenic maps (Cai et al., 2010) for each lineage. HI assays were performed as described previously (WHO Global Influenza Surveillance Network, 2011) using panels of post-infection ferret sera raised against representative viruses from both B/Victoria lineage or the B/Yamagata lineage collected from 2000 to 2013. Turkey red blood cells were used to detect unbound virus and the HI titer was determined as the reciprocal of the last dilution that contained non-agglutinated RBC. Normalized titers from the HI assay were compiled for antigenic cartography analysis. The HI matrix was used in a multi-dimensional scaling (MDS) plot algorithm to chart the antigenic distances between isolates tested in a twodimensional map (Cai et al., 2010), through the AntigenMap webserver (Wan, 2010). To identify residues contributing most to HI titer changes, pairwise comparison of sequences with a single amino acid difference were conducted.

Computational structural modeling
Finally, sequence data of the HA segment from each lineage were used to construct structural models (Krieger et al., 2009;Webb and Sali, 2014). To identify those residues that contribute most to antigenic drift in Victoria viruses, we compared the HA amino acid sequences of all pairs of HI assay tested strains using the Smith-Waterman algorithm. If only a single mutation difference was found, we calculated the respective average HI titer change for occurrences of this mutation. These amino acid sites were then mapped on the crystal structure PDB:4FQM (Dreyfus et al., 2012) and visualized using YASARA (Krieger et al., 2009).
Amino acid substitutions per site between pairs of HA sequences were calculated using MEGA5 (Tamura et al., 2011) under the Jones-Taylor-Thornton (JTT) amino acid substitution model. We constructed structural models using MODELLER (Webb and Sali, 2014) (five models each with and without ligand, best model selected by DOPE quality score), structural alignments were conducted using MUSTANG (Konagurthu et al., 2006) and visualized using YASARA (Krieger et al., 2009). To identify structural changes occurring on the HA proteins of influenza A subtypes and influenza B virus lineages over a 10-year period, we selected the HA protein sequences of the following virus strains: Differences in the receptor binding pocket region of the two influenza B lineages were visualized using B/Brisbane/60/2008 (PDB:4FQM [Dreyfus et al., 2012]) and B/Florida/4/2006 (PDB:4FQJ [Dreyfus et al., 2012]) with the addition of an α-2,6-linked host receptor analogue ligand from a known complex (PDB:2RFU [Wang et al., 2007]) and targeted side-chain minimization of residues within 8Å of the ligand through short simulated annealing molecular dynamic simulations in YASARA (Krieger et al., 2009) as previously benchmarked to ensure realistic results.
We also used YASARA (Krieger et al., 2009) to visualize the role of glycosylation on Asn at position 212 for α-2,3vs α-2,6-linked host receptor ligands by schematically superimposing both ligands (PDB:2RFT [Wang et al., 2007] and PDB:2RFU [Wang et al., 2007]) into their respective positions within the receptor binding pocket of a fully glycosylated influenza B HA head (PDB:4FQM [Dreyfus et al., 2012] The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Author contributions DV, ECH, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article; UJ, Y-MD, ACH, DEW, Acquisition of data, Analysis and interpretation of data; MF, Phylogenetic analysis and interpretation; YCFS, Genetic data curation, Phylogenetic analysis and interpretation; RH, Oversaw logistical and technical aspects of the viral sequencing; RTCL, VG, Structural data analysis and interpretation; XL, Viral genome purifications and amplification; TBS, Directed viral sequence assembly and informatics; NBF, Viral genome sequence finishing and closure; BZ, Viral genome sequencing technical oversight; NS, A-MC, DED, QSH, LCJ, WR, Collected and