Comparing the Change in R 0 for the COVID-19 Pandemic in Eight Countries Using an SIR Model for Specific Periods

: The reproduction number, R 0 , is an important parameter in epidemic models. It is interpreted as the average number of new cases resulted from each infected individual during the course of infection. In this paper, the R 0 estimates since the outbreak of COVID-19 till 10 August 2020 for eight countries were computed using the package R{eSIR}. The computed values were examined and compared with the daily R 0 estimates obtained by a static SIR model by aligning the days of infection, assuming a fixed number of days for the infected person to become confirmed/recover/die. The results showed that running R{eSIR} to obtain R 0 estimates provided an easy mean of exploring epidemic data. Care must be taken in the interpretation of R 0 as a measure of severity of the spread of an epidemic. Other factors, such as imported cases, need to be considered.


Introduction
Since the outbreak of COVID-19 cases in Wuhan, China in December 2019, the disease has swept through over 50 countries in the world, affecting the lives and activities of at least half of its population.As of 5:01 p.m. CEST, 18 July 2020, the number of confirmed cases and deaths were 13,876,441 and 593,087, respectively (https://www.who.int/emergencies/diseases/novel-coronavirus-2019 (accessed on 22 June 2024)).Although the death rates were not high for the young, the infectivity of the virus was exceptionally high compared with other diseases caused by coronaviruses like SARS.Moreover, some young patients with good health history suddenly experienced severe symptoms in around 2 weeks' time.
To combat the pandemic, many countries had to lock down cities, stop transports, close schools and shops, etc. Social distancing, hand washing, and wearing masks were practised.These measures were shown to be effective in lowering the transmission rate, albeit at a great cost to the economy and the normal living of people.Up to the present moment, no effective treatments for the severe COVID-19 cases has been documented, although in some research, Remdesivir and a few other anti-inflammatory drugs, and some anti-virus drugs have been found to have beneficial effects in severe cases.The hope for the epidemic to stop lies on the successful production of vaccines and/or effective treatments.
Data on the daily reported number of confirmed, recovered, and death cases in many countries are available from the World Health Organization (https://www.who.int/(accessed on 22 June 2024)), though there are discrepancies on the exact methods or tests for diagnosis and the definition of death cases.There may also be a time lag between the occurrence and the confirmation of diagnosis.Due to the lack of resources for diagnostic tests, the number of cases may have been under-estimated or confirmed with much time delay in some regions.The death cases could possibly be over-estimated.Patients might have died of other conditions like heart diseases, although they had COVID-19.This may possibly cause an over-estimation of death cases.On the other hand, some patients might have died at home or in nursery without being diagnosed with COVID-19, and this might have caused an under-estimation of death cases for some age groups.
When will the pandemic ends?Will it cause as severe consequences as the 1918 Spanish flu pandemic?
There has been much research on modelling the epidemic data of COVID-19.The data for some countries/cities have been analysed.The reproduction number (R 0 ), average transmission rate (β 0 ), average removal rate (γ 0 ), and other parameters in infection models have been found to be varying quite a lot from regions to regions, and so have the predictions of their courses of outbreak [1]; for instance, the estimation of the asymptomatic proportion of cases on board the Diamond Princess cruise ship, Yokohama, Japan [2], report on the pandemic in Italy [3], the prediction of COVID-19 spreading profiles in South Korea, Italy, and Iran by data-driven coding [4], the assessment of the lockdown effect in India [5], an epidemiological forecast model and software assessing interventions on COVID-19 epidemic in China [6], the COVID-19 epidemic trend in Malaysia under movement control order (MCO) [7], the case of Ethiopia [8], controlling the COVID-19 pandemic in Vietnam [9], analysing South Korean recovered and death data [10], overall statistics and research on COVID-19 data [11], predicting the COVID-19 epidemic in Algeria [12], assessing the effects of interventions [13][14][15], etc.All these efforts in modelling and predicting the pandemic are valuable tools in helping governments to set up effective measures to get the COVID-19 pandemic, as well as future epidemics, under control.
How to model epidemic data?At the very early stage of the outbreak of COVID-19 in Wuhan, China in December 2019, with data only up to 31 January 3020, Liu et al. [16] built an SIR model with the compartment infected (I) going through two stages: asymptomatic and symptomatic.Further, for those cases with symptoms, some were reported (R) while some were not reported (U).Together with susceptible (S), it formed a four-compartment SIRU model.By numerical simulation using data on symptomatic infected cases, they estimated f , the fraction of asymptomatic infectious that became reported symptomatic infectious cases, 1/υ, the average time during which asymptomatic infectious were asymptomatic, and 1/η, the average time during which the symptomatic infectious had symptoms.Assuming that f = 0.8, they estimated that 1/υ = 7 days and 1/η = 7 days.These values were useful in the early-stage prevention and control of COVID-19 in Wuhan.
Again using the SIRU model, Liu et al. [17] estimated the parameters which best fit the data available from January 2022 to March or April 2022.(Note: the data available from different countries have different start and end dates).For instance, for China, the estimated parameters were 1/υ = 5 days, 1/η = 6 days, and f = 0.6; for South Korea, the estimated parameters were υ = 4 days, η = 16 days, and f = 0.9; and so on.The results were compatible with results from the WHO: the estimated latent period for the asymptomatic phase was around 5-6 days but could be up to 14 days.
Using a similar SIRU model, Griette et al. [18] estimated the instantaneous reproduction number for eight geographic areas: California, France, India, Israel, Japan, Peru, Spain, and the UK, using data till January 2021.They started by defining the epidemic model SEIRU (S: susceptible, E: exposed, I: infected, R: reported infectious, U: unreported infectious), then fit the data to Bernoulli-Verhulst curves [19], using the Levenberg-Marquardt algorithm [20].In order for their model not to just fit any value of ascertainment rate (i.e., the proportions of infectious that were reported through diagnostic testing) and transmission rate, they needed to pre-set some parameters, using estimates from other sources.These included: (1) The average length of the non-infectious incubation period: 1 day; (2) The average length of the infectious incubation period: 3 days; (3) The average length of the symptomatic period: 7 days; (4) The ascertainment rate: 0.8.
Then, the transmission rate was computed, which was then used to estimate the instantaneous reproduction number R e (t).They found that the instantaneous reproduction number was dominated by 3.5.With this result, they postulated that when the portion of vaccinated or immunized population reached p = 1 − 1/3.5 = 0.71, the instantaneous reproduction numbers would become less than one.The epidemic would then be contained.
Based on the SI model, they started by fitting the early cumulative number of confirmed cases using the exponential function.But soon, they found that they could neither estimate the duration of infectiousness nor the fraction of infected cases reported.The reason was obvious: there were always many possible combinations of parameter values which would give an exact fit to the data.
Later, they used the Bernoulli-Verhulst model as a phenomenological model and obtained τ(t) and I 0 expressed as a function of the parameter of the Bernoulli-Verhulst model.Assuming a day-by-day piecewise constant rate of transmission, they could reproduce well the number of reported cases by simulation using the following set of parameters: f = 0.5, υ = 0.1 or υ = 0.2 with the cumulative number of reported cases CR ∞ = 67,102, the number of susceptible cases at start = the total population of China S 0 = 1.4 × 10 9 , and the initial number of reported cases I 0 = 954.The daily reproduction number could be obtained by R 0 (t) = τ(t)S(t)/υ.
They highlighted in the conclusion that there were four challenges in epidemic modelling: (1) the estimation of the average transmission rate; (2) the estimation of the mean duration of the infectious period for the infected; (3) the quality of the reported cases; and (4) the estimation of the average transmission rate for each day of the infectious period.
In 2021, the SIRU model was also adopted by Griette et al. [21] to model early data in France.They used the data between February 2020 and January 2021.During the period, there were two obvious epidemic waves.They divided the epidemic waves into two phases: (1) the endemic phase, when the number of new cases fluctuated around an average value; and (2) the epidemic phase, when the number of new cases changed with time.They used a standard curve-fitting algorithm to find the parameters which would fit the data best.Their estimated duration for infectious period was (1) 1/υ = 12.5 days during the first epidemic wave and (2) 1/υ = 3.5 days during the second epidemic wave.By assuming the average length of asymptotic infectious period to 3 days, they fit the time-dependent reproduction number R 0 = τ(t)S(t)/υ.At the first epidemic wave, R 0 decreased from around three to 3 than 1, and in the first part of the second epidemic wave, R 0 was almost constant and equalled 1.11.Nevertheless, they found that R 0 obtained in their model was too sensitive to the number of active cases and was not a reliable indicator of the severity of the epidemic.
In the meantime, more mathematical models were built [22].Starting from the modelling of a single epidemic wave, modelling multiple epidemic waves became important.More compartments were introduced in the SI models.For instance, the infected (I) component could further be split into various components, such as reported (R)/unreported (U).Whatever the model, the estimation of the reproduction number and the transmission rate were always of core importance.
Most of the time, numerical methods have been used to estimate the parameters of the models.However, analytical solutions or exact solutions for the models under specific conditions are possible.Schlickeriser et al. [30] provided the detailed construction of general solutions for some of these SIR derived models.Also, an analytical solution of the SIRV-Model (where V stands for vaccinated) was provided by Kroger et al. [31].
The comparison and evaluation of methods have been prevalent.In 2021, Yang et al. [32] conducted an evaluation of various epidemic models based on COVID-19 data from China.They compared the forecasting performance based on (1) the model evaluation criteria used, the Akaike information criteria (AIC)/root mean square error (RMSE)/robustness coefficient(RC); (2) the mathematical functions used for model fitting; (3) the statistical inference methods used; (4) the underlying dynamical models used.
Concerning model evaluation methods, AIC and its various modified versions are widely used in comparing SIR and SEIR models, and researchers have found that the model with the least RMSE can be picked out based on AIC and RC.Moreover, researchers have found that among different mathematical functions, the sigmoid function is a good choice for epidemic modelling.Apart from that, it is always difficult to make long-term forecast at an early stage; while in later stages, sequential Bayesian and time-dependent reproduction number can provide useful predictions.
In this analysis, the focus was on the reproduction number.The changes in R 0 for the COVID-19 pandemic in the period from 22 January 2020 to 10 August 2020 for eight countries/cities, including Brazil, China, India, Italy, South Korea, Singapore, the United Kingdom, and the United States of America were analysed.These countries/cities were selected because there experienced outbreaks as well as changes in the control measures during the selected periods.R 0 was estimated for comparison.
An SIR model with time-dependent transmission rate was adopted by using the R{eSIR} package, assuming no prior knowledge on the different phases of quarantine measures and no prior value for the transmission rate, except that it was between zero and one.An exponential distribution for the transmission rate was not adopted.Instead, parameter estimates were obtained for four different time frames for comparison.

The SIR Model
The SIR model [23][24][25] is one of the simplest model for epidemic modelling.There are many variant forms of the SIR model, such as the SEIR and double epidemic models [33].Such models are built on the daily number or proportion of confirmed cases, deaths, and recovered, with the assumption of a constant/varying transmission rate and removal rate, where removal can be from deaths or from recovery.R 0 , which is equal to the ratio of the transmission rate to the removal rate, is an indicator of the direction of an epidemic: if R 0 > 1, the epidemic is in the outbreak stage; on the other hand, if R 0 < 1, the epidemic is under control.
How to estimate R 0 from daily figures?One simple way is to use packages like R{eSIR} [34].R 0 output figures can provide insights for the stage of outbreak for a particular country/city.As R{eSIR} is easy to use, it provides a user-friendly exploratory tool for the public.While direct plots of the figures can provide visible trends, R 0 estimates provide a snapshot of the hidden underlying trend.

SIR Modelling
Based on a Dirichlet distribution [35] and assuming a constant population of size N, at any time t (in days, without loss of generality), define

N:
Population size; S(t): Susceptible (the number of cases in the population who are without immunity against the infection); I(t): Infected (the number of cases in the population who are infected); R(t): Removed (the number of cases in the population who died or recovered from the infection, or vaccinated with immunity). where In the SIR model, R 0 = β(t)/γ(t) is the average number of new cases resulted from an infected individual throughout his/her course of infection, where β(t) is the transmission rate (the number of effective contacts per day made by an individual); γ(t) is the removal rate (on average, an infected will recover or die in 1/γ days after infection).
For instance, if β(t) is 0.2, then the number of effective contacts per day per infected (which means once infected, a individual will on average infect 0.2 others per day since the time he/she became infected and before the time he/she dies/recovers.If γ(t) is 0.1 = 1/10 (which means on average an infected will have 10 "actively infective" days to infect others before he/she dies/recovers.Then, it is easy to obtain R 0 = 0.2 * 10 = 2. Thus, on average, each infected individual will infect two others during his/her course of infection. The dynamics of S(t), I(t), and R(t) are given by the three differential equations: R 0 is an important parameter in predicting the direction of the epidemic.
Case 1: R 0 (t) > 1: the epidemic will go on with more and more infected; Case 2: R 0 (t) = 1: The epidemic will maintain its present condition, with number of infected = number of removed.However, R 0 (t) will not stay at one during the course of the epidemic.Case 3: R 0 (t) < 1: the epidemic will be contained with fewer and fewer cases.
Note that in order to control an epidemic, the aim is to reduce β(t) and to increase γ(t), so as to achieve R 0 (t) < 1.By measures like lockdown, social distancing, hand washing, wearing masks, the effective number of contacts per individual per day is greatly reduced.Effective treatments like drugs so that patients can recover earlier increase the rate of recovery.Moreover, vaccines which help people to acquire immunity without becoming infected have the effect of directly increasing the removed portion in the SIR model.

The R{eSIR} Package
The R{eSIR} was built on a state-space SIR model, with an extension of time-dependent transmission rate π(t) ∈ [0, 1].It can be a step-wise function or follow an exponential function π(t) = exp(−λ 0 t).
In this analysis, the package R{eSIR} was used.The R{eSIR} was adopted by scholars in predicting the epidemic trend of COVID-19 in Italy and Hunan, China [34].The basic reproductive numbers for the time-series data of COVID-19 data from 22 January 2020 to 16 March 2020 were found to be 4.10 (95% CI: 2.15-6.77)for Italy and 3.15 (95% CI: 1.71-5.21)for Hunan.They further predicted that with rigorous blockage measures maintained in Italy, there would be 30,086 (95% CI: 7920-81869) infected cases in total, and the epidemic would reach an endpoint by 25 April (30 March-7 August).

The tvt.eSIR Method in the R{eSIR} Package
One of the three models in the R{eSIR} package is the tvt.eSIR().This model was adopted in this analysis.It allows for a time-varying transmission rate, which can be step-wise or exponential.It is the simplest model in the package and requires only the time-series data of the number of confirmed and removed cases.As the other two models would need quarantine or vaccination data, respectively, they were not considered in this analysis.
Compared with the standard SIR model, the tvt.eSIR model introduces a function π(t) ∈ [0, 1] which reflects time-dependent changes in the transmission function β(t) due to public health policies, e.g., quarantine measures, mutation or changes in the virulence of the variants, environmental changes like changes in temperature and humidity, changes in data collection or case definition policies (e.g., how a confirmed or death case is defined), etc.
This modification function π(t) can be a step function or a continuous function.In this analysis, a step function approach was adopted.
The dynamics of S(t) , I(t), and R(t) are given by the three differential equations: When a step function is assumed, π(t) = constant value between zero and one; When a continuous function is assumed, π(t) = exp(−λ 0 t).
Note that the extra π(t) functions is providing an extra "factor" for the transmission rate to accommodate the changes in transmission rate over time due to human intervention or virus evolution.
In the demonstration example provided in the package document, a step function of π 0 = (1, 0.9, 0.5, 0.1) at change_time = ("01/23/2020", "02/04/2020", "02/08/2020") is used to model early data from China.The change_time variable reflects when there was a major change in quarantine policies in China.The decline in the initial values used reflects the tightening of quarantine measures.
When an exponential function is used, it reflects a gradual decrease in transmission when public awareness, isolation policies, and personal protection measures gradually increase in intensity.
Both π 0 and change_points can be set to NULL, reflecting no prior information about the change in quarantine practices in the region.
In this package, the compartments are defined by a time series of proportions, not by counts.Let Y I t and Y R t denote the proportions of infected and removed at time t, respectively, then the proportion of susceptible cases at time t is given by 1 − Y I t − Y R t .Assume that Y I t and Y R t follows a Beta-Dirichlet state-space model (BDSSM) [36].[Note: In the formulae below, Beta, Gamma, Dirichlet, LogN stand for the Beta, Gamma, Dirichlet and lognormal distributions, respectively.These are common probability functions used to describe the underlying distributions.]The observations are and the latent process is where θ t = (θ S t , θ I t , θ R t ) T is the vector of the underlying prevalence of susceptible, infectious, and removed populations, and τ = (β, γ, θ T 0 , λ, κ) T with λ I , λ R , and κ are parameters controlling the respective variances for the observation and latent processes, and β and γ denote the transmission and removal rates, respectively.
To solve the derivative functions of the SIR process, we use: The solution f (θ t−1 , β, γ) is obtained by the fourth-order Runge-Kutta(RK4) approximation [37].
Initially, the proportions of infected and removed is given by the corresponding first data value from the dataset, denoted by Y I 1 and Y R 1 , respectively, and the initial proportion of susceptible individuals is therefore 1 The initial parameters and prior distributions were specified according to the SARS data from Hong Kong as follows: with E(γ) = 0.0117, SD(γ) = 0.1 and JAGS [38] was then be used to run MCMC [39] chains to obtain the posterior estimates of the parameters.

Initial Values and Hyper-Parameters Used in the tvt.eSIR Function
Some default initial values and hyper-parameters used in the tvt.eSIR function are given as follows: (1) death_in_R = 0.02 (death_in_R refers to the average of cumulative deaths in the removed compartments.When it was within Hubei, 0.4 was used.When it was outside Hubei, 0.02 was used).
(5) gamma0_sd = 0.1 (gamma0_sd refers to the standard deviation for the prior distribution of the removed rate γ.This value was chosen as a relatively large variance.This allowed more flexibility at the start so as to achieve an easier fit of the data.When more prior knowledge is available, a smaller value can be used for reaching more accurate estimates of parameters).( 6) R0_sd = 1 (R0_sd refers to the standard deviation for the prior distribution of R 0 .Similarly, this value was chosen as a relatively large variance).( 7) eps = 1 × 10 −10 (This is a non-zero controller so that all the input Y and R values would be bounded above 1 × 10 −10 .(8) time_unit = 1 day (It can be set to other values, e.g., 7 days for weekly data.But as the data used in this analysis were the daily numbers, the default of time_unit = 1 day was appropriate).
As there is sufficient variability allowed for the MCMC chains, the prior distributions and initial values did not restrict the final estimated parameter values obtained.

Source of Data
The daily numbers of confirmed, deaths, and recovered cases for the eight countries were obtained from the Coronavirus Disease (COVID-19) Dashboard, WHO (https://covi d19.who.int/(accessed on 22 June 2024)).
The data from the sources were packed into three data files (see the three .csvfiles in the "Supplementary Files") and processed with R.
As the most updated figures from these three sources for each country varied, an estimate for 2019 taking into account all these three sources was adopted.Only estimated approximate values were used in this analysis, as small differences in the population size would affect the results very little.

Method
In this analysis, the R{eSIR} package was used to obtain the R 0 estimates.The tvt.eSIR method was adopted.
In the tvb.eSIR method, the time-dependent transmission rate π(t) ∈ [0, 1] can either be a step function with initial values and change_time specified, or it can follow an exponential function π(t) = exp(−λ 0 t).
As an exploratory analysis, here, π(t) was simply assumed to be a constant between 0 and 1.There was no assumption on any known change_time, too.Therefore, both were set to NULL.
In order to compare the changes in the reproduction numbers in the periods from January to July 2020, the data were extracted into 4 datasets with different time frames: A sample R code for running Brazil data for time frame 1 (i.e., d1) is included in the Appendix A. Codes for d2, d3, and d4 and those of other countries are similar (see the code samples in the "Supplementary Files").Each run took around 20 min to one hour and 20 min.
Note that the variable begin_str specified when the period begins (i.e., day 1 in this analysis).Both pi0 and change_time were not specified.This is different from the demonstrated use given in the package R{eSIR}, where the changing_time is specified as 3 dates when the quarantine policies changed, and prior values for function π(t) are specified to be 1.0, 0.9, 0.5 and 0.1, respectively, for the four periods concerned.For the method used in this analysis, on the other hand, no prior knowledge or assumption concerning the changes in government policies, quarantine measures, or virus mutation was needed.Thus, the method here can easily be applied to any epidemic data as long as the numbers of confirmed/deaths/recovered are available.

Results
The estimated R 0 values are summarized in Table 1.For the eight countries concerned, the R 0 values changed throughout the four time frames.They ranged from 0.97 to 5.28.The trend was lower values for d2, then there was a gradual increase in values for d3 and d4, showing the pandemic situation was, in general, becoming less serious in April and May 2020, but it gained momentum in May and continued to get worse in June, July and August.The R 0 estimates were calculated using the beta and gamma estimates in the MCMC chains, using the formula R 0 (t) = β(t)/γ(t).The R 0 mean values and the plot on how it changed throughout the iterations were obtained from the outputs every time the R codes was run.
Besides the R 0 estimates obtained, the automatic figures generated by the package (see Figures 1-5 and "sample_outputs" folder in the "Supplementary Files") provided more insights on the predicted trends.As an illustration, here are some plots obtained from analysing the Brazil data.From Figure 1, the mean R 0 estimates for Brazil were around 1.3, 1.2, 1.3, and 1.4 for d1, d2, d3, and d4, respectively.Similarly, the mean R 0 estimates for other countries can be obtained visually from the stepR0_p charts.
From Figure 2, the pink regions show the projected regions of possible outcomes of the proportions of population infected at future dates, as generated by the model.
From Figure 3, the pink regions show the projected regions of possible outcomes of the proportions of population removed (i.e., either dead or recovered) at future dates, as generated by the model.
Figure 4 shows the spaghetti plots of infection prevalence as generated by the model.The infection prevalence is the proportion of people infected in the population at a specific time.It is a useful indicator to predict how the epidemic may evolve and for making comparisons on the survival and impact of a transmission disease.The value of its firstorder derivative indicates the direction and momentum of change.The spaghetti plots show the possible "flows" of the first derivative of the infection prevalence.This first derivatives is important.A high positive value means the epidemic is getting worse quickly, e.g., due to the emergence of a new variant.On the other hand, a high negative value means the situation is improving quickly, e.g., due to quarantine measures or effective drugs.
Figure 5 shows the estimated and projected first-order derivative of the infection prevalence.For Brazil, a positive predicted value for most of the time from February to August 2020 meant that the percentage of population becoming infected was getting higher and higher with positive momentum.With the projected value even higher after August 2020, it meant the speed for the rate of increase would be even higher in the following months.

Discussion
What did the estimated R 0 from the eSIR_tvt step reflect?Why was the R 0 estimates in China and South Korea as high as 3.66 or 3.41?COVID-19 was well under control in these countries by 10 August, 2020.Why was the R 0 estimates in Brazil as low as 1.39?The daily numbers of confirmed cases were high in early August in Brazil.Does the R 0 estimate comply with the results from other studies?Are the R 0 estimates from the eSIR_tvt step model to be trusted?First of all, there are many relevant studies on data over similar periods, with a whole range of R 0 estimates obtained.A review from February 2020 by the WHO showed that the initial estimates of R0 ranged from 1.5 to 6.68 [41].For studies on individual country/region, an R 0 estimate of 3.1 was obtained for Brazil using data up to 31 May 2020 [42].For China, an early-stage R 0 value was estimated to be 5.7 from data in late January 2020, while studies using different models and methods for data till 7 February 2020 from China resulted in R 0 estimates ranging from 1.5 to 6.49 [43].For India, research on data from February 2020 to March 2021 by districts resulted in R 0 estimates varying from two to three (in most districts) to over seven (in eight districts) [44].In a study on data till 31 March 2020 from the United States and eight European countries, R 0 values for Italy, the United Kingdom, and the US were estimated to be 4.6, 3.9, and 5.9, respectively [45].For Singapore, an R 0 estimate from a study using local data up until 17 March 2020 was 0.7 [46].For Korea, a study from data as of 6 March 2020 resulted in a R 0 estimate of 1.5 [47].
The estimates from this study are similar in range to those obtained from other studies except for China and South Korea.Actually, the resulting R 0 estimates for China and Korea from this analysis seem to be against the "intuitive" belief that a high R 0 means an "uncontrolled" condition, while a low R 0 means a "controlled" condition.What is wrong?
In this study, R 0 estimates fluctuated with time.The fluctuation is greater when the numbers of new infected, new dead, and new recovered individuals are comparatively low.This was the case for China and South Korea.In late 2020, the situations in these two countries were more or less under control, and there were a lower number of active cases as compared with before.Examining the data revealed that for these two countries, there were often days when the number of removed cases (i.e., deaths + recovered) exceeded the number of newly confirmed cases.The "negative" or "near zero" new active cases might be due to delays, say, around 30 days for deaths and around 21 days after the infection day.Moreover, there might be the issue of undiscovered cases in the background, which affected estimates significantly when the number of confirmed cases were very low.
These "negative" or "near zero" new cases might be affecting both the transmission rate and the removal rate estimation.Both the rate of transmission and the rate of removal were subjected to high variability.High variability for the removal rate is especially troublesome.As it is the denominator in the calculation of R 0 , any slight error would affect the model and the R 0 estimates significantly.
The number of new cases in China and South Korea were comparatively very low on the days before 10 August 2020.It was of the order of around 100 for China and 50 for South Korea and remained around the same level for many days.Compared with the data in earlier days, the number of confirmed cases in China was in the thousands in February 2020, while the number of confirmed cases for South Korea was in the hundreds in April 2020.In the SIR model, the estimated R 0 value from a comparatively low and flat number of new confirmed cases may not be low.Moreover, it is highly unstable.If the number of new confirmed cases randomly drifts to a slightly larger number for a few days, then R 0 could appear even higher.
Further, in this analysis, local and "imported" cases were not analysed separately.In a real situation, "imported" cases are not "infected" from the local cases and should be excluded from the R 0 estimates or analysed separately.
On the other hand, for Brazil, although the epidemic was growing at a fast rate with the number of new cases per day around 20,000 or more, the number of new cases "infected" (i) Time frame 1 (dataset labelled as d1): from 22 January 2020 to 21 March 2020; (ii) Time frame 2 (dataset labelled as d2): from 22 January 2020 to 21 May 2020; (iii) Time frame 3 (dataset labelled as d3): from 22 January 2020 to 21 July 2020; (iv) Time frame 4 (dataset labelled as d4): from 22 January 2020 to 10 August 2020.

Figure 1 .
Figure 1.R 0 estimates (Brazil) from the eSIR_tvt step for data from 22 January to 10 August 2020.

Figure 2 .
Figure 2. step_forecast (Brazil) from the eSIR_tvt step for data from 22 January to 10 August 2020.

Figure 3 .
Figure 3. step_forecast2 (Brazil) from the eSIR_tvt step for data from 22 January to 10 August 2020.

Figure 4 .
Figure 4. step_spaghetti (Brazil) from the eSIR_tvt step for data from 22 January to 10 August 2020.

Figure 5 .
Figure 5. stepderiv (Brazil) from the eSIR_tvt step for data from 22 January to 10 August 2020.

Table 1 .
R 0 estimates obtained from the eSIR_tvt step for the 8 countries considered.