FormalPara Highlights
  • We compare the effectiveness of two types of contact tracing: one that requires contacting individuals (manual contact tracing) and one that requires identifying locations (bulletin board contact tracing).

  • Bulletin board contact tracing is less resource intensive and easier to implement than manual contact tracing while offering greater privacy options and possibly improving participation.

  • Manual contact tracing performs only slightly better than bulletin board contact tracing at reducing the reproductive number, duration, prevalence and incidence of SARS-CoV-2.

  • Hybrid contact tracing using both manual and bulletin board methods performs significantly better than manual contact tracing alone.

  • This suggests opportunities to improve implementations of manual contact tracing with location-based, bulletin board contact tracing.

1 Introduction

Contact tracing has been a large component of battling disease [7]. It works by identifying and alerting individuals who have been in contact with a contagious individual, so that they can isolate [18]. In the classical form of contact tracing, a contact tracer interviews the infected individual to help them remember who they may have contacted in the recent past. The tracer then relays a message to isolate to each of these individuals. This is the most common type of contact tracing and it is a manual process [14]. Recently, contact tracing has seen a large push from the tech community to see how they can help [2]. These more recent automated forms of contact tracing rely on technology to help the process of detecting contacts. However, they are essentially an automated form of classical contact tracing [1]. The main weakness of classical contact tracing is that it has a single point of failure: it depends entirely on the ability of an individual to accurately remember and identify people they contacted. Analogously, the technology-assisted forms require a high degree of device adoption. Second, it requires the sharing and use of personally identifying information because the intention is to identify people uniquely [35]. Third, it assumes that a sufficient number of these contacts can be, or are willing to be, contacted by the contact tracer for the spread to be slowed [12].

A different type of contact tracing is also used, although less frequently. Countries like Israel have used a bulletin board style approach for contact tracing. Bulletin board contact tracing differs from the manual approach because it uses location-identifying information rather than individual-identifying information. It consists of identifying locations that an infectious person has visited, and then disseminating that information anonymously using a public bulletin board rather than trying to uniquely identify and contact particular individuals. On a smaller scale, schools use a similar approach to notify parents of classroom infections while preserving the privacy of the individual infected.

This raises the question: how effective are the different forms of contact tracing? There have been no evaluations of the effectiveness of bulletin board contact tracing compared to classical contact tracing. Evaluating them using simulations is challenging without a natural way to conceptualize their components. To address this challenge, we divide the contact tracing problem into two components: contact set reconstruction and message routing. The former consists of figuring out who was in contact with the infected individual and the latter consists of getting a message to each of them to isolate. This enables us to implement simulations to compare manual contact tracing and bulletin board contact tracing.

We implement the simulation by developing a stochastic discrete-time model based on the Susceptible, Exposed, Infected and Recovered (SEIR) framework of disease spread. We calibrate this model against what is known about SARS-CoV-2. We then implement a prototype manual contact tracing method, a bulletin board contact tracing method and a hybrid contact tracing method that combines both. And we compare their effectiveness in terms of the overall reproductive number, the incidence and the prevalence of disease.

Our experiments show that bulletin board contact tracing performs similarly to manual contact tracing, or slightly worse in some cases. It does not alter the overall reproductive number or the duration of disease compared to manual contact tracing. Bulletin board tracing reduces prevalence initially, but then increases it. Incidence follows a similar trajectory. A hybrid approach that adds bulletin board tracing to manual tracing significantly reduces the reproductive number, incidence, duration, and prevalence of disease compared to manual tracing alone. Considered together with the efficiency gains from using a bulletin board approach to contact tracing, this suggests that bulletin board tracing can potentially improve the effectiveness of manual contact tracing.

Overall, the bulletin board approach is relatively easier to implement and requires many fewer resources than manual contact tracing. Even if the latter is automated it will likely still generate resistance for privacy reasons that will raise further accuracy issues. Our model demonstrates that bulletin board contact tracing performs as well as (or slightly worse than) manual contact tracing. Therefore, from a practical point of view, it can be more efficient than manual contact tracing.

2 Types of contact tracing

Contact tracing is a type of race against the spread of contagious disease [22]. The disease spreads when one person infects at least one other. The number of people that a person directly infects under the assumption that everyone is equally susceptible is called the basic reproduction number(R0). Contact tracing is a method of finding out who has been exposed to and potentially infected by an infected individual [31]. Once they are identified, they can be tested, treated and isolated to reduce the number of infections that would have otherwise occurred. All forms of contact tracing proceed backwards in time from the discovery of this index individual to isolate other potentially exposed individuals [6]. It is able to slow disease spread when this backward tracing is faster than the forward disease spread [19, 27]. Contact tracing is an important tool for combating pandemics [14]. It is more effective against diseases like SARS, smallpox [15] and Tuberculosis [3] than influenza [25]. Contact tracing effectiveness is significantly affected by individual behavior [12].

Recently, there have been several public proposals for implementing contact tracing using technologies such as Bluetooth Low Energy (BLE). PACT [10], PEEP-PT [38], BlueTrace [4], TCN [41], DP-3T [13] and Whisper [37] are among the most prominent. A major design goal of all of these proposals is user location-data privacy. Of these, PACT is the most developed in terms of implementation and testing. It uses Bluetooth enabled devices to determine when contact has occurred, and depends on how well the hardware permits assessing proximity. Because many factors in addition to hardware features affect the accuracy of proximity detection, all implementations find it necessary to choose a mathematical model that will make this estimation with the desired level of accuracy. Existing bluetooth hardware implementations cannot achieve greater than 60 percent accuracy in proximity detection [10], while GPS-based implementations have trouble differentiating different floors of the same building. Because of similar technological limitations and the privacy concerns with location tracking, none of these proposals use GPS. Some contact tracing systems, such as the Whisper Protocol, allow users to report symptoms, as positive COVID-19 test results. The PACT system proposes some solutions to potential issues with Bluetooth-based contact tracing having to do with infected surfaces. PACT proposes putting “echoing” devices on public surfaces that re-broadcast the chirps they collect from others.

These implementations make certain assumptions about device adoption and use. The primary assumption is that the reporting and collection of location information is automatic, that is, the human does not input the locations. Instead, proximity and location is recorded by the device and considered to be more accurate than human assessments. All of these implementations assume that there is adequate adoption and that the device and the person are always close enough for the protocol to work. They also assume that people don’t share devices, kids don’t use their parents’ phones even in the house etc. (i.e. the mapping from device to owner is not one to many). It is also not clear what happens when a person carries multiple id-generating devices, but they give conflicting assessments of proximity. Finally, the question of interoperability of these different protocols remains unaddressed. Early evaluations have shown digital contact tracing can be potentially useful for containing COVID-19 [40]. However, some policies intended to increase self-isolation can decrease cooperation with contact tracing [30]. While we focus on location-based contact tracing, we do not make any assumptions about the specific implementation aside from separating the location recall and the message passing stages. The former can be achieved with the assistance of devices or not, and the latter requires a bulletin board system that is accessible to possible contacts. This generality permits implementations in which humans can correct previous reports that they posted about their locations.

These recent proposals are variations of classical contact tracing which take location into account in order to identify proximal individuals, find a route to them and pass a message to isolate. These implementations remove humans from the loop of contact tracing i.e. they do not require that a contact tracer route a message to isolate to the contact as this is done automatically, nor do they require human intervention in deciding what information to share. Ultimately, the bottleneck for these proposals is mobile device adoption. Finally, the guarantees of trust are provided by device manufacturers for whom location tracking and information collection may be integral to their business model.

The bulletin board approach is entirely location-based, and need not be completely automated - this depends on the implementation. For example, although they can use whatever location-based technologies they want to aid their memory, the infected individual (or the contact tracer) ultimately decides whether and what information to post. No other information is collected, or shared about this individual for the purposes of contact tracing. Similarly, it is not necessary to know who was in the vicinity of this infected individual: this is determined by the concerned individuals themselves. The bulletin board approach requires only that a person who tests positive push an anonymous message to a public bulletin board that is then pulled by interested individuals. As a result, it also offers more nuanced privacy options to tested individuals - they do not need to share any identifying information, nor is any identifying information necessarily collected. And it allows for more nuanced human-in-the-loop implementations depending on context. As an added benefit, we suspect that the ability to calibrate privacy to ones preferences, in particular, is likely to improve cooperation with this approach compared to manual contact tracing. It is important to note, however, that our models suggest that bulletin board tracing may not perform as well as manual contact tracing towards the end of the disease lifetime.

2.1 Classical (manual) contact tracing

Classical contact tracing begins with a contact tracer interviewing the infected individual. The interview focuses on mining the individual’s memory of people they may have been in contact with in the recent past, a time range equal to the period of infectiousness of the disease. The interviews may cover a variety of topics, such the individual’s recent travels, in the interest of jogging their memory of individuals they may have contacted. All of these individuals could have been potentially exposed. So, the tracer then attempts to contact them directly through available means to request taking measures to prevent further spread. Classical contact tracing assumes that this set of contacts can be identified accurately enough, and that they can be contacted fast enough, to win the race against forward spread. Aided by communication technologies, this race seems more winnable than ever.

Classical contact tracing has three major weaknesses. First, it has a single point of failure because it depends entirely on the ability and willingness of an individual to accurately identify who they contacted. Second, it requires the sharing and use of personally identifying information. Indeed, the more specific and individually identifiable the information the better for the purposes of manual contact tracing. Third, it assumes that a sufficient number of contacts can be, or are willing to be, contacted that the spread can be slowed. A wide range of factors have been found to affect cooperation with manual contact tracing [33]. These concerns have led to early reports of mixed effectiveness of classical contact tracing [36] with half [20] of those contacted lying and two thirds [32] refusing to cooperate.

2.2 Bulletin board tracing

Bulletin board based contact tracing addresses some of the problems with classical contact tracing. First, instead of reconstructing the set of individuals that a person has contacted, it relies on identifying the locations that they have visited. It does not rely on memories of individuals but on memories of locations. And second, instead of relying on the infected individual to identify people who may have been exposed, it relies on these people identifying themselves. More concretely, a message is publicly posted online that an infected person, without saying who, visited a particular set of locations. The general public are instructed to regularly check the bulletin boards for locations of interest to them. There are different ways to implement this, for instance with or without a contact tracing interview. And a variety of apps could be built on this platform to automate this process of checking or even posting messages. For example, Israel has a nationwide bulletin board system (Fig. 1) but it is relatively uncommon in general. Implementations of contact tracing have focused, instead, on the manual approach. The bulletin board contact tracing described here here shares similarities with the “Narrowcasting” feature of the PACT protocol. Prominent differences include that the posting of messages and the broadcasting of messages need not be automated as in PACT [10].

Fig. 1
figure 1

Israel’s bulletin board for contact tracing. URL: https://coronaupdates.health.gov.il/corona-updates/grid/place (Accessed 9/23/20 20:20)

Bulletin board contact tracing thus relies on the location memories of the infected individuals and those who may have been exposed. Locations visited are typically easier to recall than people, many of whom are probably strangers. Next, because the message is broadcast, there is no need to route a message to a particular individual. This permits a wider range of privacy options both for the infected individual and their contacts. Individuals choose which locations to receive messages about. So, personally identifiable information beyond what might be gleaned from these choices is irrelevant to bulletin board contact tracing. A major weakness of bulletin board tracing is that it relies crucially on the voluntary cooperation of a larger number of people than manual contact tracing. Beyond the cooperation of the the infected individuals, it additionally relies on the cooperation of all those who may have been infected. With the bulletin board approach, for instance, there is no notion of knocking on physical doors of private residences to convey a message: for it to work, these people will have to either frequently check the message boards or sign up to receive relevant alerts. Conceivably, an implementation of bulletin board contact tracing could automate both the posting of messages to the bulletin board and the pulling of messages from it. But this human-out-of-the-loop implementation would require accurate records of location history and a willingness to be tracked.

3 Research questions

Our primary research question is, how does the effectiveness of manual contact tracing at curtailing the spread of the SARS-CoV-2 virus compare to bulletin board contact tracing? For each of these two models of contact tracing, we further ask: (i) how efficient it has to be in order to be effective and (ii) what variables determine its effectiveness.

4 Methodology

We answer these questions using repeated agent-based simulations that combine a location-based model of disease spread with models of contact tracing. We use the CDC’s “best estimate” scenario (Scenario 5) to calibrate this model [8]. As more is learned about real world spread, these parameters will need to be updated [17]. We define location as any physical space a person occupies for more than 15 minutes in a day so that two people in the same location would be considered in “close contact” using the CDC’s definition [9]. We conduct multiple simulations using a range of values for model parameters. We measure effectiveness in terms of (i) the time between first infection and the end of the last infection, (ii) the maximum number of people infected during this time period, and (iii) the average effective reproduction number.

5 Mathematical approach

The problem of classical contact tracing can be separated into two problems: that of reconstructing the set of contacts and that of routing the message that a person is in a contact set to them. Analogously, for bulletin board contact tracing these two problems are: the problem of reconstructing the set of locations and of relaying that a positive person had been to each location to others who were also there at that time. These problems can be generalized as a reconstruction problem and a message routing problem. Our method to compare the performance characteristics of contact tracing under various assumptions using simulations relies on treating these two problems separately. From this perspective the two general types of contact tracing can be described as follows:

  1. 1.

    Manual: The individual i gets tested, if positive, a contact tracer asks them who they have been in contact with in the day range R, say Ci(R), and contacts each of the individuals in the set. The reconstruction problem consists of reconstructing the set of contacts for a relevant time period. This is done with the help of a contact tracer. The message passing problem consists of finding a way to contact these individuals and then getting a message to them. Both of these are done by the contact tracer.

  2. 2.

    Bulletin board: The individual i gets tested, if positive anonymously posts their locations and test result to a bulletin board. All other individuals check this bulletin board to learn that they were in proximity. There are variations of this model with different privacy properties. Here the reconstruction problem consists of reconstructing the set of locations visited by the infected person over a relevant time period. This can be done with the help of a contact tracer, any physical or digital artifacts that one might have (such as receipts, photos or location histories) or independently. The message passing problem consists of posting (pushing) an anonymous message to a bulletin board, again which can be done with assistance or independently. The other part of the problem is for the relevant individuals to “pull” relevant messages from the bulletin board. Note that it is not necessary for the contact tracer to find a route to the recipient.

Two key parameters characterize the reconstruction problem and the message routing problem: recall accuracy of individuals and devices and the message routing method.

5.1 Recall accuracy (A)

Recall refers to the ability of a person to remember a past experience. Recall accuracy (A) refers to the accuracy of a person’s memory of a past experience. That is, how similar what they remember is to what was experienced. There are two types of recall relevant here: contact recall and location recall. The former refers to the ability to recall who one has come into contact with. The latter refers to the ability to recall where one has been.

In the classical model, the index agent’s recall accuracy determines who gets a message. Index agent i’s contact recall is used to populate the set Ci(R) and it determines accuracy of the set Ci(R). Location recall of the index agent does not matter. Recall accuracy of individuals in Ci(R) may or may not be relevant. One way that it might be relevant is as a way to augment the agent’s recall and make the set Ci(R) more accurate. For example, for the contact tracer to ask the contacted person whether they remember being in contact with the index person.

In the bulletin board model, recall accuracy of the index agent i as well as non-index agents determines who gets a message. Here, the memory about index agent’s location and non-index individuals location determines who gets a message. The contact recall of the index individual does not matter, but their location recall does. Contact recall and location recall of non-index individuals also matters. In human out of the loop implementations, recall accuracy of the machine determines who gets contacted since the recall of the machine is used to populate Ci(R). A machine’s recall accuracy, such as that of a mobile phone, is not guaranteed to be perfect.

Finally, the results of contact tracing can also be expressed in terms of the completeness of Ci(R). For instance, the percentage in the set that are also in the real set is an indication of contact recall accuracy (c). Or, when we know that Ci(R) is completely accurate (as in the case of flawless automation) then it is an indication of the effectiveness of message routing. We use the proportion of contacts recalled, and the proportion of recalled contacts who are reachable as parameters in our models.

5.2 Message routing method

In the manual model of contact tracing, the message is passed to the correct recipient only through a contact tracer who must use available means to route the message. In the bulletin board model, the signal is effectively broadcast to all possible recipients by posting in a public location, and the recipients need to pull the messages and read those that are relevant to them. If we assume that recall accuracy for all individuals is the same, then the two models of contact tracing are distinguished by the message passing mechanism. That is, they are distinguished by the way that the message reaches the elements of Ci(R) from index agent i.

There are two processes within the signal routing mechanism that can alter its effectiveness:

  1. 1.

    route finding (f ): How one finds the route that the message should take to reach the destination, in this case an element of Ci(R) for index agent i.

  2. 2.

    message passing (p): The mechanism by which the message is passed to the destination, in this case an element of Ci(R).

For agent-based contact tracing, route finding is dominant because these individuals have to be tracked down after they are identified, but once they have been tracked down passing the message to them is easier. For bulletin board, message passing is more dominant because the message is pulled from the bulletin board by each individual. The contact tracer does not have to find a route and instead posts to one location. In other words, the tracer only has to find a route to the bulletin board which is trivial by design. It is important to note that the processes for reconstructing the contact set and routing the message have to both be completed within a reasonable amount of time. In the case of SARS-CoV-2, an upper bound is 11 days because those to be contacted are infectious for that time. So if the contact tracing processes take longer than that to implement, it is ineffective at stopping the spread of the virus. A second constraint in manual contact tracing comes from the availability of contact tracers. We assume that there is always an adequate supply of contact tracers to be able to route the message to recalled individuals within the instantiation time.

6 Generalization

Let Tc be the time between infection and the time that the message to get tested reaches an individual. Consistent with [11] T is the number of days before isolation and after infection. So, T = Tc + TE where TE > 0 because the steps following message receipt such as testing, getting the result and then entering isolation are not instantaneous.

The models of tracing alter Tc. Hence,

$$ T_{c}=g(A, R, f, p) $$

For SARS-Cov-2, the relevant time period is 11 days (R = 11). So,

$$ T_{c}=g(A, f, p) $$
(1)

6.1 Early transmission

Under the assumptions of the standard SEIR model, a newly infected but not yet infectious person enters an exposed state for an exponentially distributed length of time τE with mean 1/μE, after which they become infectious for an exponentially distributed duration τI of mean 1/μI during which transmission again occurs at constant rate β [11].

In the standard SEIR model describing an outbreak in a population of size N, the expected number of secondary infections generated by an infected person early in an outbreak, the reproductive number R0, is given by

$$ R_{0}=N\beta/\mu_{I} $$
(2)

where β is the transmission rate per unit time. Given epidemiological estimates of the reproduction number, this enables the transmission rate to be written as [22].

$$ \beta=R_{0}\mu_{I}/N $$
(3)

In the agent based model to follow, each individual i in the population is assigned the same initial transmission rate (and basic reproduction number R0) as explained in Section 7 below. They end up with different effective reproduction numbers as a result of the simulation.

6.2 Two reconstruction problems in contact tracing

Before explaining the computational approach that we use, it is important to note that the reconstruction problem at the heart of of contact tracing can be formulated as centered on individuals or locations.

6.2.1 Individual based reconstruction

Let Ω be the set of all humans in a locality such that |Ω| = N. Let ωi be the subset ever contacted by human iΩ. Given a time range R, Ci(R) is the set of people contacted by i in that range. Then, Ci(R) ⊂ ωi. Let \({C}_{i}^{t}(R)\) denote this set at time t. Then, \(\omega _{i}=\bigcup \limits _{t=0}^{t_{L}} {C}_{i}^{t}(R)\) where tL is the time of death. Next, ∀i,j, ωiωj, if one cannot be in contact with oneself. The recall problem for every person i at time t is the problem of assigning every other element jΩ a value in {0,1} such that they are assigned a 0 if jCi(R) and a 1 if it is. Next, let Ci(R) ⊂Ω be the set of people j such that iCj(R). And, let mi(R) ⊂Ω be the set of people j such that i believes iCj(R) to be true. The reconstruction problem for each person i is the problem of keeping mi(R) as similar to Ci(R) as possible. To see why the reconstruction problem is non-trivial, consider the case where two people come into contact with each other but only one knows it (they may have their backs turned or eyes closed, or they might come in contact with a common surface in quick succession). At any given time t and fixed R, three sets define each human being i: Ci(R), \({C}_{i}^{t}(R)\) and mi(R).

6.2.2 Location based reconstruction

Another way to formulate the reconstruction problem is to assume that every location records who has visited in every time period. That is, let ml{l,t} be the set of humans who have visited location l in time period t. This might be the case, for instance, when a facial recognition device continuously records all who have visited a location. The reconstruction problem is the problem of reconstructing all ml{l,t}.

7 Computational approach

As we have implemented it, there are three components of the computational models of manual and bulletin board contact tracing. Two components are common to the models: the model of disease spread and the testing model. The third component, the model of contact tracing, is allowed to vary. This permits us to develop time-dependent agent based simulations to compute indicators of disease spread under the three contact tracing conditions without changing the models of disease spread and testing.

7.1 The disease spread model

We use a location based SEIR-type model of spread that assumes homogeneous random mixing in the population and discrete time (in days). This type of mixing is appropriate for modeling local outbreaks, and represents a worst-case scenario. Each community is assumed to consist of a population of size N and a set of L locations. We assume there is no importing of infections into the community. The basic reproduction number R0 over the population is held constant for each simulation, and every individual is thereby given a constant infection rate. We assume that each individual’s days in the exposed state \(\left ({\tau _{i}^{E}}\right )\) after infection is drawn from an exponential distribution with mean 1/μE. We assume that each individual’s days in the infectious state \(\left ({\tau _{i}^{I}}\right )\) after the exposed state is drawn from an exponential distribution with mean 1/μI.

Each individual is assigned an infection age, which represents the number of days since infection. If the infection age is 0, they have never been infected. If (infection age)\(\leq {\tau _{i}^{E}} \) then they are exposed but not infectious. If \( {\tau _{i}^{E}} < \) (age of infection)\(\leq {\tau _{i}^{E}} + {\tau _{i}^{I}}\) then they are infectious. A person i who is infectious can spread the infection to everyone in the same location who is not already infected. The number of people ni that i actually infects in the simulation depends on R0, \({{\tau }_{i}^{I}}\), and their movement between locations (and hence their total number of interactions). Each time period, the total number of infections is counted.

7.2 Population mixing

To enable a comparison of the bulletin-board approach, we model interactions using location assignment rather than stochastic pair formation and separation [26]. Each individual i is assigned an integer value from [1...L] using a discrete uniform probability distribution over the set. All individuals assigned the same non-zero value are assumed to be in contact on day t. The location assignment model has two desirable characteristics. First, the parameter determining disease spread can be specified in terms of location. And second, it incorporates the idea of movement in space. Both of these are important for modeling bulletin board contact tracing. As we show below, we calibrate the combined population mixing and disease spread model to be equivalent to what is expected from more common mass-action models.

7.3 Location dependence

In a location-based simulation, the exact mix of Susceptible \(\left ({S_{t}^{l}}\right )\), Exposed \(\left ({E_{t}^{l}}\right )\), Infected \(\left ({I_{t}^{l}}\right )\) and Recovered \(\left ({R_{t}^{l}}\right )\) people at each location is determined by the probabilistic model of population mixing. The variables that characterize this model of mixing are the number of locations they can travel to (L), and the number of people (N). Hence, the effective reproduction number for an individual i depends on these two variables.

$$ R_{0i}=f(L,N) $$
(4)

7.4 Approximating mass-action epidemic models

In common mass action models, \(|{S_{t}^{l}}| \geq {n_{l}^{i}}\) and L = 1 with homogeneous random mixing and a constant rate of infection. To generate an infection rate that is compatible with this in a simulation with L > 1 and known R0, we assign each individual a constant scaled transmission probability. To retain stochasticity, we draw the infectious and exposed durations from probability distributions.

Let the expected number of infections at time t in our model be given by

$$ \beta|S_{t}||I_{t}|=b\sum\limits_{l=1}^{L}|{S}_{t}^{l}||{I}_{t}^{l}| $$
(5)

Where

$$ |{S}_{t}^{l}|=|S_{t}|q_{l} $$
(6)

and

$$ |{I}_{t}^{l}|=|I_{t}|q_{l} $$
(7)

ql is the probability that the individual i will be at location l ∈{1,2..L}. b is the scaled transmission rate and β is the constant transmission rate in an ordinary SEIR model with a single location. Note that we are assuming that b is also constant in our model, which is what permits us to pull it out of the sum. More generally, using Eq. 3 we can rewrite the the scaled infection rate for an individual i as

$$ b_{i}=\frac{\beta}{{\sum}_{l=1}^{L}{q_{l}^{2}}} =\frac{\mu_{I}R_{0i}}{N{\sum}_{l=1}^{L}{{{q}_{l}^{2}}}} $$
(8)

In our experiments there is no heterogeneity in the underlying disease spread model. So, R0i = R0 and does not vary at the individual level. In order to give everyone the same value of bi, we use the mean value 1/μI for \({\tau _{i}^{I}}\) above.

Since we assign an equal probability of being at any location, ql = 1/L, to each individual, we have,

$$ b_{i} =\frac{\mu_{I}R_{0i}L}{N} = \frac{\mu_{I}R_{0i}}{\rho} $$
(9)

Where ρ = N/L is the density of the model and 0 ≤ bi ≤ 1. To spread the infection at location l, a susceptible person at location l is infected with probability

$$ p_{l}=1-\underset{i\in {I_{t}^{l}}}{\prod}(1-b_{i}) $$
(10)

Thus, the expected number of infections at location l is given by

$$ {n_{t}^{l}}=|{{S}_{t}^{l}}|\left( 1-\underset{i\in {I_{t}^{l}}}{\prod}(1-b_{i})\right) $$
(11)

These new infections are then assigned at random to infectious people at the location of infection. Note that Eq. 9 makes explicit the effect of density on the transmission probability in our simulation. Finally, it is now easier to see that if L = 1, we have a mass action model with a single location.

Note that this formulation requires that \(P\left \{|{S_{t}^{l}}| < {n_{t}^{l}}\right \}=0\). To achieve this condition, it must be that μIR0iρ. In our implementation, we choose simplicity over generality and fix R0i for each simulation, use μI in place of \({\tau _{i}^{I}}\) when computing the scaled transmission rate, and use a sampled \({\tau _{i}^{I}}\) to determine how many days an individual remains infectious. As we have calibrated the model below, max(μIR0i) < 0.228, and min(ρ) = 1

7.5 Testing model

Testing in this model assumes symptomatic testing because that is the type of testing that is associated with manual contact tracing. We assume that 60% show symptoms [8]. For those that show symptoms, we assume that the incubation period of the virus for each individual is drawn from a log-normal distribution with mean μP and standard deviation σP, after which they start to show symptoms. For simplicity, we assume that the individual’s symptoms, if they are symptomatic, coincides with their infectious period. During initialization, we repeatedly sample the incubation period until we get a value in the individuals infectious period, \(\left ({\tau _{i}^{E}},{\tau _{i}^{I}}\right )\). We assume that it takes every individual one day to recognize symptoms, and they do so accurately. There is a delay in receiving test results (td), and the test itself has a false negative rate of PN and a false positive rate of 0, as the false positive rate is only relevant to asymptomatic testing. Consistent with the use of a common antigen test we use PN = 0.2 [39]. We assume that individuals isolate immediately upon receiving a positive test result with probability PI. We use td = 1 as an optimistic estimate that emphasizes the effects of contact tracing parameters [27]. Similarly, we use PI = 0.7 assuming that compliance with self-isolation absent financial incentives to do so is similar to self-quarantine [5].

7.6 Implementation

We implement the disease spread model using time steps of one day as follows:

  1. 1.

    Initialization: Begin with a population of susceptible people (P) and a number of locations (L). Each individual (i) is initialized with values for exposed duration, infectious duration, whether or not they are symptomatic and incubation period from the corresponding distributions (Table 1) to be applied in the event that they become infected. We also assign each person a transmission probability bi and infection age of 0 since everybody is uninfected.

  2. 2.

    Initial infection: At time t0, we choose a randomly infected index person from the population. Initializing with more than one index person speeds up the infections, without changing the dynamics.

  3. 3.

    Movement: Every person is randomly assigned to one location with a uniform probability. In subsequent days, each location will have a mix of Susceptible, Exposed, Infectious and Recovered individuals.

  4. 4.

    Generate infections: For each location, infect each susceptible person with the probability shown in Eq. 10.

  5. 5.

    Repeat: Increase the infection age of each exposed or infectious individual by one day. Return to the Movement step until the number of exposed and infectious people in the population is zero.

Table 1 Model parameters

The testing model is combined with this model of disease spread by adding the following steps to each time period before the Repeat stage.

  1. 1.

    Testing: Test each symptomatic person in the population whose infection age is greater than their incubation period. Tests are ready after td days.

  2. 2.

    Isolation: For each person whose results are ready and positive, isolate them with probability PI. For each person who has been isolated for R days, unisolate them. R is the number of days after which an infected person is assumed to be no longer infected.

Finally, the two different types of contact tracing are added on top of this by adding the following steps to be followed if a person i tests positive. For manual contact tracing:

  1. 1.

    Contact Reconstruction: Choose Pc proportion of individuals from the set of people that i has contacted in the last RS days. S is the number of days an individual is assumed to be in the exposed but not infectious state.

  2. 2.

    Route Finding and Message Passing: Push a message to this set of contacts with some probability of reaching them (Pr).

  3. 3.

    Isolation: Isolate each person who receives the message with probability Pim.

For bulletin board contact tracing:

  1. 1.

    Location Reconstruction: Choose Pc proportion of locations from the set of locations that i has visited in the last RS days.

  2. 2.

    Message Passing: Ps proportion of those in each of these locations receive a message that an infected person visited a relevant area.

  3. 3.

    Isolation: Each person who receives the message isolates with probability Pib.

For hybrid contact tracing, the steps for both manual and bulletin board contact tracing are added on top of the testing model.

8 The five models

Five models are compared when R0 is 2.5. Refer to Table 1 for the typical values and symbols used.

  • m0: No Intervention: This is the location assignment model of spread with no testing or contact tracing given a population size (N) and number of locations (L). The infectious period for each person is assumed to end after \({\tau }_{I}^{i}\) of infection, and assumed to start \({\tau }_{E}^{i}\) days after infection. \({\tau }_{I}^{i}\) is drawn from an exponential distribution with mean 1/μI and \({\tau }_{E}^{i}\) is drawn from an exponential distribution with mean 1/μE. The main parameters are: {N, L, 1/μI, 1/μE}. The other models below build on this model.

  • m1: Testing Only: Every individual is tested symptomatically. 40% of the population is assumed to be asymptomatic [8]. For others, symptoms arise after an incubation period drawn from a log-normal distribution with mean (μp) 5.2 and standard deviation (σp) 3.9. PT proportion of people who show symptoms get tested. The result arrives td days later with a false negative rate of Pn. People isolate with probability PI upon receiving a positive test result for R days. The main parameters are: {N, L, 1/μI, 1/μE, td, R, PT, Pn, μp, σp}.

  • m2: Manual Contact Tracing: Tracing begins Tm days after an individual receives a positive test result. A contact tracer reconstructs Pc proportion of the individual’s contact set for the last RS days from the individual’s memory, with or without assistance. The tracer then sends a message to all of these people and it reaches each target individual with probability Pr. Recipients isolate with probability Pim upon receiving the message. It takes Tm days for the manual tracing steps to be implemented. Typically these are securing cooperation, interviewing, assigning volunteers and locating recipients. We assume that Pm proportion of infected individuals are willing to subject themselves to manual contact tracing. The main parameters are {N, L, 1/μI, 1/μE, td, PT, Pn, μp, σp, R, S, Pc, Pr, Pm, Tm, Pim}.

  • m3: Bulletin Board: An individual who receives a positive test posts a message to a bulletin board with probability Pb, which is the probability that they will cooperate with the requirements of bulletin board tracing. We assume that the message is posted and reaches the recipient within a day, whether or not it is automated so that instantiating bulletin board tracing takes 0 days. The message contains two pieces of information: (location, when visited). One message is sent to the bulletin board for each location they visited in their relevant location history. They do not share any personally identifying information. Cooperating individuals reconstruct Pl proportion of their location history for the last RS days either alone or with assistance. Ps proportion of people overall are signed up to get bulletin board messages concerning any given location. The message to isolate reaches an identified person with probability Pr in time Tb. We assume that the recipients are willing to isolate with probability Pib when they receive a message this way. The main parameters are {N, L, 1/μI, 1/μE, td, PT, Pn, μp, σp, R, Ps, Pl, Pb, Tb,Pib}.

  • m4: Hybrid (Manual + Bulletin Board): Both bulletin board contact tracing and manual contact tracing are performed. The main parameters of this model consist of all the main parameters of models m2 and m3.

9 Model calibration

The no intervention scenario is calibrated to be compatible with the currently known characteristics of SARS-CoV-2 as far as possible. When there is only a single location, the final sizes are compatible with what is expected from mass action SEIR models when R0 is 1.5 (0.583), 2.25 (0.854) and 2.5 (0.89). Figure 2 shows the time traces of infection for each run of the simulation in this scenario.

Fig. 2
figure 2

The sequence of infections (prevalence) of 1000 runs for a calibrated base model (m0) with 1 location

10 Results

This paper was motivated by the question of how bulletin board contact tracing compares to manual contact tracing. To test this, we developed a discrete-time SEIR model of disease spread calibrated to what is known about SARS-CoV-2 (Table 2) (Fig. 3). We then combined this with models of symptomatic testing and the three types of contact tracing. In discrete-time simulations, compared to manual contact tracing, we find that bulletin board tracing does not significantly (α = 0.05) change the duration of the disease (Table 3). It also does not significantly reduce the percentage of the population that is infected (Table 4). Instead, it is effective at reducing the overall number of outbreaks (Fig. 4). It also does not alter the overall R0 (Table 6) compared to manual contact tracing. Bulletin board tracing reduces prevalence initially (Fig. 5), but then permits a higher prevalence than manual contact tracing. Incidence follows a similar pattern (Fig. 6). It is, however, more effective at tracing a higher proportion of people who become infected (Table 5). When manual and hybrid approaches are combined, contact tracing is more effective in each of these measures (Table 6). All graphs in this section show the results of 1000 iterations of the model for each set of parameters. Each iteration consists of at most 1000 time periods. Parameter values are in Table 1.

Table 2 Calibration: The base model (m0) with a single location and no heterogeneity
Fig. 3
figure 3

Mean total infections over time for all models: Mean total infection curves for 1000 runs of the model, with model density of 100 (N = 1 × 103 and L = 1 × 101) and various parameters for other models

Table 3 Results[Duration (days)]: Results of comparing manual (m2), bulletin board (m3) and hybrid (m4) contact tracing
Table 4 Results[Infected (%)]:Results of comparing manual (m2), bulletin board (m3) and hybrid (m4) contact tracing
Fig. 4
figure 4

The cumulative fraction of runs by total number infected for all models

Fig. 5
figure 5

Prevalence of infection with manual, bulletin board and hybrid contact tracing. N = 1000. The graph shows mean values for each time period across all runs. The values are small because of the large number of runs that do not result in outbreaks but that are included in the divisor. The grey ribbon around each line shows the bootstrap 95% confidence interval

Fig. 6
figure 6

Incidence of infection with manual, bulletin board and hybrid contact tracing. N = 1000. The graph shows mean values for each time period across all runs. The values are small because of the large number of runs that do not result in outbreaks but that are included in the divisor. The grey ribbon around each line shows the bootstrap 95% confidence interval

Table 5 Results[Traced (%)]:Results of comparing manual (m2), bulletin board (m3) and hybrid (m4) contact tracing
Table 6 Results[R0]: Results of comparing manual (m2), bulletin board (m3) and hybrid (m4) contact tracing

10.1 m0: No Intervention

This is the baseline scenario in each of the comparisons. We assume that there are ten locations, one thousand people in the population (ρ = 100) and no heterogeneity.

10.2 m1: Testing only

This is the model of testing and subsequent isolation. We assume that 70% of individuals isolate when asked to, or after they get a positive result. We assume that 2 in 10 tests gives a false negative. Predictably, less of the population on average is infected in the model compared to m0.

10.3 m2: Manual contact tracing

In the manual contact tracing scenario, we keep the testing and false negative testing rates the same as model m1. We assume that individuals can recall 50% of all the people they contacted in the past 11 days, including strangers. And of these people, we assume that 50% can be contacted after identification. We assume that the contact tracing instantiation is highly efficient and that it only takes one day between the positive test result and the contacts receiving a message from the contact tracer. Finally, we assume that 50% of individuals cooperate with contact tracers. The results show a significant improvement over testing alone both at reducing duration, incidence and the effective reproduction number. Manual contact tracing leads to the contacting of slightly less than half of the people who would otherwise become infected being contacted (Table 5). The number of people contacted is more sensitive to variations in recall and message passing than cooperation with contact tracing and implementation delays, as the Supplementary Materials show (Online Resource 1).

10.4 m3: Bulletin board contact tracing

In the bulletin board contact tracing scenario, we keep the testing parameters the same as with manual contact tracing. We assume that individuals can recall 70% of all the locations they visited in the past 11 days. This is higher than the rate of recalling individual contacts because it is easier to remember locations than people. Of the people who visited these locations at the same time, we assume that 50% check the bulletin board at least once a day or have signed up to receive notifications. We assume that the bulletin board contact tracing implementation is highly efficient and that the message reaches the individuals the same day that it is posted. Finally, we assume that half of all individuals cooperate fully with the bulletin board system. The bulletin board approach leads to a shorter mean duration (Table 3), although not significantly so as well as a significantly larger number of total infected on average (Table 4). The effective reproduction number remains the same, however. Bulletin board contact tracing leads to the contacting of significantly more of the people who would otherwise become infected being contacted than manual contact tracing (Table 5). As the Supplementary Materials elaborate, bulletin board tracing does significantly worse in this metric when cooperation with contact tracing generally is low (10% of people cooperate) and does about as well with medium levels of cooperation (50% of people cooperate) (Online Resource 1). However, bulletin board contact tracing and manual contact tracing perform similarly overall in the low recall scenario (Table 7), and bulletin board significantly worse in the low cooperation scenario (Table 8).

Table 7 Results in the low recall scenario: Means with 95% confidence intervals of measures comparing manual (m2), bulletin board (m3) and hybrid (m4) contact tracing in the low recall scenario (Pc = 0.1,Pl = 0.1)
Table 8 Results in the low cooperation scenario: Means with 95% confidence intervals of measures comparing manual (m2), bulletin board (m3) and hybrid (m4) contact tracing in the low cooperation scenario (Pm = 0.1,Pb = 0.1)

10.5 m4: Hybrid contact tracing

In the hybrid contact tracing scenario, we keep the testing, manual contact tracing and bulletin board tracing parameters the same as in models m1, m2 and m3. The results show a significant improvement over either of the contact tracing approaches alone at reducing the duration and incidence, while keeping the effective reproductive number the same. Hybrid contact tracing is able to contact significantly more of the people who would otherwise become infected compared to the other approaches (Table 5). It is superior to the other two methods in both the low recall (Table 7) and low cooperation (Table 8) scenarios, as well as the other scenarios examined in the Supplementary Materials (Online Resource 1).

11 Sensitivity analysis

Our choice of values for parameters introduces some uncertainty into the results. Therefore, we report the duration, infected percentage, the effective reproduction number and the percentage traced when model parameters indicating cooperation with contact tracing (Pm, Pb) and recall ability (Pc,Pl) are much lower than the values used above (0.1). Figure 7 shows that while the values of the indicators changes in these low value scenarios, the relative relationships persist. Tables 7 and 8 reinforce this persistence more precisely. We refer the reader to the Supplementary Materials for a fuller sensitivity analysis of these results (Online Resource 1). There, we examine the effect of variations in four sets of parameters: those affecting the reconstruction problem (Pc,Pl), the message passing problem (Pr,Ps), cooperation with contact tracing (Pm,Pb) and implementation delay (Tm,Tb). The sensitivity results are presented for three scenarios - low, medium and high- for each sest of parameters.

Fig. 7
figure 7

Sensitivity: Sensitivity of models of Manual (m2), Bulletin Board (m3) and Hybrid Contact Tracing (m4) to low recall (a) and low cooperation (b) scenarios with 1000 runs. Other parameters as in Table 1. The gradients correspond to the density of observations

12 Discussion

Bulletin board contact tracing can improve manual contact tracing considering how easy it is to implement, the relative fewer resources it demands, and its nuanced privacy options. However, as our models show, under certain conditions, bulletin board contact tracing can be comparable or even slightly worse than manual contact tracing. When they are combined into a hybrid approach, contact tracing is uniformly more effective. Figure 3 shows that around the 45-day mark, the curves for manual contact tracing and bulletin board tracing cross with the former trending downwards faster. We suspect that this reflects the difference in expected numbers of infected people due to a missed location versus a missed individual. This is likely because locations typically have more than one person in it. So, missing a location that one had visited that was infected rules out a larger number of people from receiving the message to isolate. In comparison, missing one person that was contacted rules out only one person from receiving the message to isolate. Thus, it is a limitation of the model that it does not adequately scale the probabilities of recall. It also represents a significant handicap in favor of manual contact tracing in the head to head comparison of the two models of contact tracing.

We suspect this crossing is likely only when (i) the recall of locations is comparable to recall of contacts and (ii) the probability of a message reaching the recipient after they had been identified is about as good as, or slightly better than, a coin toss in both cases. This is not a realistic scenario in light of what we know from individual experiences. First, there are strong reasons to believe that our ability to remember locations is better than the ability to remember individuals (some of whom will be strangers). Second, if we visit locations with lots of people (such as concerts, or other super-spreading venues), we are even more likely to remember these locations. Third, technology to assist in identifying locations is far more accurate and ubiquitous than the technology to identify people. And finally, based on authors’ experiences, people are more willing to share anonymous location-identifying information than individually identifying information.Taken together, a contact tracing approach that also takes into account location information, and uses a bulletin board approach is likely to be more effective than manual contact tracing alone in all but the most uncommon circumstances.

We have been as generous as we could to manual contact tracing given recent reports of serious problems with the approach, and the limitations of technological fixes to the approach. On the other hand, we have been relatively severe to the bulletin board approach by assuming that the key parameters are similar to manual contact tracing when everyday experience suggests otherwise. Our results, therefore, should be understood as sketching a lower bound on the gains that can be achieved through bulletin board contact tracing. There are multiple avenues for improving its performance significantly compared to what we have assumed, without incurring similar costs as manual contact tracing. It is worth highlighting that the bulletin board approach can propagate messages much faster than the manual approach, and people are more likely to remember locations than people, particularly strangers. Compliance is also likely to be higher when there is greater privacy by default.

There are several extensions of our model that immediately suggest themselves. One is the addition of a probability of staying home. Another is the addition of a false positive rate, in order to better model different testing regimes. However, it should be noted that the false positive rate matters to the model only when individuals are tested before they exhibit symptoms (asymptomatic testing). Test sensitivity also depends on viral load, particularly for the antigen test that we assume here. This load varies between asymptomatic and symptomatic individuals [34]. Future work could use a model of disease spread and testing that better incorporates these variations in symptomatic and asymptomatic spread. One could also generate the individual basic reproductive number from a probability distribution to introduce heterogeneity into the model. We have also used a log-normal distribution to sample the infectious period from, however this could be generalized to choose any other distribution.

For compatibility with mass action models with a single location that are common in the literature we have assumed homogeneous random mixing with a constant rate of infection. These assumptions achieved a Poisson distribution for the number of secondary infections. These constraints can be relaxed in future work to achieve more accurate estimates of the relative effectiveness of manual and bulletin board contact tracing in particular localities. A key advantage of Agent Based Models is the ability to study the effects of contact heterogeneity, a major driver of epidemics [23], on contact tracing effectiveness [29, 31]. While a thorough assessment of various models of heterogeneity is beyond the scope of this paper, we briefly discuss two types here: heterogeneity across locations and across individuals.

Relaxing the assumption of homogeneous random mixing across locations by allowing for differences in individual travel behavior enables one to use more realistic spatial-temproal models. Different models would alter the mix of Susceptible (S), Exposed (E), Infected (I) and Recovered (R) individuals at each location at any given time, thereby leading to different effective reproductive numbers for each individual (R0i). Similarly, allowing for heterogeneity of behavior within locations, such as home, schools, parks etc. would also alter the transmission rate in each location (β). These modifications would allow, for instance, for modeling infection hotspots, superspreader events and overdispersion - three well known characteristics of SARS-CoV-2. Non-uniform travel behavior would also affect the performance of manual and bulletin board contact tracing by changing the impact of forgetting particular locations or individuals. For example, failing to remember coming into contact with a potential superspreader can disproportionally affect manual contact tracing, while failing to remember a superspreader event location would disproportionally affect bulletin board tracing. Given the significance of individual travel and social behavior to disease spread and contact tracing, these modifications present opportunities for improving this model.

A challenge for any model of travel and mixing heterogeneity is to parameterize the model properly. This can be challenging because, for instance, some of the interventions associated with pandemic control are travel restrictions and behavioral changes [16]. Automatically collected travel data from mobile devices, as well as other sources of place-visit data more generally, can be a promising way to approach these challenges [24, 31]. In order to use this type of data in our model, one would need to add the parameters determining travel behavior to Eq. 4. If within-location behavior data is available, these parameters can be added to Eq. 5. This will ultimately change the equation for the expected number of infections at a given location and time (\({n}_{t}^{l}\) in Eq. 11) as well. These modifications will likely be more realistic, but at the possible cost of breaking direct comparability with mass-action models that are common in epidemiology. The specific behavioral and mixing models used can fundamentally alter the performance of contact tracing and could significantly change the results presented here. A thorough investigation of the effects of behavioral heterogeneity on the effectiveness of contact tracing requires further work.

13 Conclusion

Based on these results, classical contact tracing should, in the least, incorporate location information alongside individual information. If greater privacy options or efficiency is desired, a greater reliance on location-based tracing should be considered compared to manual contact tracing as classically implemented in most circumstances. Bulletin board contact tracing provides an improved solution for the two main problems of classical contact tracing: reconstruction of the contact set for the infected individual and communicating with those who have been exposed. It improves contact set reconstruction by relying on individuals’ memory of locations not of people; and by combining the location memories of the index individual as well as the exposed individuals. It improves on the message passing by similarly requiring only that the index person pass a message to a public bulletin board. The recipients also have only to know how to get messages from this board. The contact tracer, thus, does not need to know how to individually reach every contact. However, the bulletin board approach makes these improvements at the cost of requiring greater cooperation with the contact tracing regime than classical contact tracing. This, as we have noted, may be easier to secure than classical contact tracing because individuals involved need to reveal a relatively negligible amount of personal information: they need only agree to the posting of an anonymous message associating location and time. Based on our models, we suggest that policy makers should consider bulletin board based approaches to contact tracing in addition to manual approaches.