Skip to content
ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Article

The relative incidence of COVID-19 in healthcare workers versus non-healthcare workers: evidence from a web-based survey of Facebook users in the United States

[version 1; peer review: 2 approved with reservations, 1 not approved]
PUBLISHED 27 Nov 2020
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Coronavirus (COVID-19) collection.

Abstract

Background: Healthcare workers are at the forefront of the COVID-19 pandemic and it is essential to monitor the relative infection rate of this group, as compared to workers in other occupations. This study aimed to produce estimates of the relative incidence ratio between healthcare workers and workers in non-healthcare occupations.
Methods: Analysis of cross-sectional data from a daily, web-based survey of 1,788,795 Facebook users from September 6, 2020 to October 18, 2020. Participants were Facebook users in the United States aged 18 and above who were tested for COVID-19 because of an employer or school requirement in the past 14 days. The exposure variable was a self-reported history of working in healthcare in the past four weeks and the main outcome was a self-reported positive test for COVID-19.
Results: On October 18, 2020, in the United States, there was a relative COVID-19 incidence ratio of 0.7 (95% UI 0.6 to 0.8) between healthcare workers and workers in non-healthcare occupations.
Conclusions: Currently in the United States, healthcare workers have a substantially and significantly lower COVID-19 incidence rate than workers in non-healthcare occupations.

Keywords

COVID-19, healthcare workers

Introduction

In August, the Peterson-KFF Health System Tracker published a collection of charts showing how healthcare utilization has declined during the COVID-19 pandemic in the United States1, showing that facility discharge volume dropped by over 25% and cancer screening volumes dropped by over 85% from levels in 2019. This decrease is consistent with evidence from other sources2,3, and could be driven by a perceived risk of interacting with workers at health facilities. It is yet to be seen how much this delayed and foregone care will reduce population health. Meanwhile, a Wall Street Journal analysis of Centers for Disease Control and Prevention (CDC) data found that at least 7,400 COVID-19 infections were transmitted in US hospitals in 20204. Access to adequate resources for infection prevention among health care workers (HCWs) remains a topic of urgent importance5.

There is currently no population-based evidence quantifying the relative COVID-19 incidence rate among HCWs as compared to workers in non-healthcare occupations (non-HCWs) in the US. We hypothesized that there is not a substantially elevated rate of COVID-19 infection among HCWs and that HCWs might even have lower incidence rate than non-HCWs, and we analyzed data from a large survey of Facebook users to investigate.

Methods

Study design

We analyzed individual participant data from a large, web-based survey of Facebook users aged 18 and above in the United States (around 300,000 respondents per week). Every day Facebook offered a random sample of US-based users a Qualtrics survey run by the Delphi lab at Carnegie Mellon University who made it rapidly available to other academic researchers6. Facebook also provided survey weights to adjust for the demographics of the active Facebook user population7,8. This sort of survey data has been used previously to perform population based analyses related to COVID-19, though never before at such large scale9,10. Our analysis relied on the responses to two lines of questions: (1) questions about recent work history, worded as, “In the past 4 weeks, did you do any kind of work for pay?” and if so, “[p]lease select the occupational group that best fits the main kind of work you were doing in the last four weeks”; and (2) questions about COVID-19 testing history, worded as, “Have you ever been tested for coronavirus (COVID-19)?”, “[h]ave you been tested for coronavirus (COVID-19) in the last 14 days?”, “[d]id this test find that you had coronavirus (COVID-19)”, and “[d]o any of the following reasons describe why you were tested for coronavirus (COVID-19) in the last 14 days? Please select all that apply.”

We analyzed the most recently available six weeks of data from September 6, 2020 to October 18, 2020, which provided more than 80% power to detect a 30% difference between COVID-19 prevalence in HCWs and non-HCWs (details below).

Variables

To quantify the relative risk of COVID-19 among healthcare workers (HCWs) versus workers in non-healthcare occupations (non-HCWs), we used the response to the occupational group question as our exposure variable (we coded respondents who selected option “Healthcare practitioners and technicians” or “Healthcare support” as HCWs, and all others, including those with a missing value, as non-HCWs). We identified individuals with COVID-19 as those who reported that they had tested positive for COVID-19 in the last 14 days.

Statistical methods

We calculated the endorsement rate of positive COVID-19 test (ER) for the HCW and non-HCW population as the survey-weighted percent of respondents in either group who reported COVID-19, and calculated the relative COVID-19 incidence ratio (RR) with the equation

   RR = (ER among HCWs) / (ER among non-HCWs).

We quantified the uncertainty in this ratio using non-parametric bootstrap resampling to obtain a 95% uncertainty interval11. To control for confounding due to differential access to COVID-19 testing, we restricted our analysis to only HCWs and non-HCWs who were tested in the last 14 days because their employer or school required it.

As sensitivity analyses, we considered also alternative inclusion criteria and more restrictive subsets of HCWs. The survey provided sample weights that adjust for non-response bias, which we used in our main analysis. As a sensitivity analysis, we repeated our calculation using the unweighted data. To investigate the possibility that workplace testing practices differ between HCW and non-HCW occupational settings, we also repeated our analysis with additional filtering based on the “why you were tested” question. In the main result we used the subset of individuals who responded that they were tested in the last 14 days because of employer/educational requirements, and this question has a “select all that apply” answer type, and also includes “I felt sick” as an option. As a sensitivity analysis, we used only those individuals who were tested because of a workplace requirement and did not feel sick.

Power calculation: To determine the sample size necessary to detect a difference of 30% between the COVID-19 prevalence of HCWs and non-HCWs, we developed a small simulation model where the fraction of HCWs in the general population and the COVID-19 prevalence in the general population both match that observed in the survey data.

Of respondents who were tested in the last 14 days because their employer or school required it, 33.9% were HCWs and 4.9% tested positive for COVID-19, so we simulated populations of size n with these fractions of HCWs and this positive rate among the non-HCW population. We made the positive rate among the HCW population 30% lower:

def sim_data(n_simulants):
    frac_hcw = .339
    frac_cli = .049
    rr_hcw = 0.7
    
    data = pd.DataFrame(index=range(n_simulants))
    data['hcw']  = np.random.uniform(size=n_simulants) < frac_hcw
    cli_pr = np.where(data.hcw, rr_hcw * frac_cli, frac_cli)
    data['cli'] = np.random.uniform(size=n_simulants) < cli_pr
    return data

Then for populations of ranging in size from n = 500 to 9,500, we repeatedly synthesized a simulated population, calculated the RR of COVID-19 between the HCWs and non-HCWs as described in the main text, and checked if the upper bound of the uncertainty interval was less than 1.0. We replicated this experiment 10,000 times for each population size n and found the n where at least 80% of the experimental replications where the uncertainty interval upper bound was less than one.

Ethical statement

These research activities used no identifiable private information and were therefore exempt from institutional board review.

Results

The survey data contained 40,552 respondents who were tested due to workplace requirements in the time period we focused on, 13,747 HCWs and 26,805 non-HCWs (see Table 1 for demographic details). There were 1,993 respondents who reported a positive test for COVID-19 in the last 14 days (527 among HCWs and 1,466 among non-HCWs).

Table 1. Characteristics of survey respondents.

Non- healthcare workersHealthcare workers
n(%)n(%)
Total 1,672,980 100.0 115,814 100.0
Tested in last 14 days 123,830 7.4 21,071 18.2
Test required by work or school 26,805 1.6 13,747 11.9
Among those with required test
Male gender 8,662 32.3 1,972 14.3
Age in years
18 to 24 3,356 12.5 761 5.5
25 to 34 4,648 17.3 2,374 17.3
35 to 44 4,784 17.8 3,058 22.2
45 to 54 4,797 17.9 3,377 24.6
55 to 64 3,983 14.9 3,141 22.8
65 to 74 1,204 4.5 920 6.7
75 and older 476 1.8 105 0.8

Among HCWs with a required test, 527 of 13,747 (3.8%) reported a positive test in the last 14 days, while among non-HCWs with a required test, 1,466 of 26,805 (5.5%) reported a positive test, for a relative COVID-19 prevalence ratio of 0.7 (95% UI 0.6 to 0.8) (Table 2).

Table 2. Relative COVID-19 incidence rate (RR) and counts of healthcare workers and non-healthcare workers and their crude prevalence counts and rates.

Healthcare workersNon-healthcare workers
TestedPositive%TestedPositive%RR95% UI
13,7475273.826,8051,4665.50.70.6 to 0.8

Our power calculation simulation results showed that 7,000 simulants provide 80% power to reject a null hypothesis that HCWs and non-HCWs have the same RR if, in truth, the RR is 0.7. Since the survey currently collects a weekly volume of around 7,000 individuals who report taking a required COVID-19 test, the simulation results imply that six weeks of data will provide more than sufficient power.

Sensitivity analyses

When we repeated our calculation using the unweighted survey responses to calculate the COVID-19 incidence ratio, we found an even smaller relative incidence ratio of 0.4 (95% UI 0.3 to 0.5).

When we repeated our analysis restricted to only specific types of HCWs, as afforded by the questionnaire, we found a range of risks, usually less than 1.0, with substantially less certainty due to small sample sizes (Table 3).

Table 3. Relative COVID-19 incidence rate (RR) and counts of healthcare workers (HCWs) and non-healthcare workers stratified by worker subtype.

Number of
non-HCWs
Number of
HCWs
Relative
risk
Lower
bound
Upper
bound
All HCWs26,80513,7470.70.60.8
Physician or surgeon40,2772752.61.83.5
Registered nurse (including nurse
practitioner)
37,5732,9790.60.60.8
Licensed practical or licensed vocational
nurse
38,5601,9920.60.50.8
Physician assistant40,4051470.70.41.3
Dentist40,518340.40.00.8
Any other treating practitioner40,1893630.50.30.9
Pharmacist40,473790.30.10.8
Any therapist39,3711,1810.50.40.7
Any health technologist or technician39,0621,4901.00.71.2
Veterinarian40,519330.30.01.1
Nursing assistant or psychiatric aide39,0451,5071.00.81.3
Home health or personal care aide39,9995530.80.51.0
Occupational or physical therapy
assistant or aide
40,477751.30.51.9
Massage therapist40,54934.60.08.1
Dental assistant40,534180.00.00.0
Medical assistant40,4151371.10.51.7
Medical transcriptionist40,526260.60.01.5
Pharmacy aide40,536160.00.00.0
Phlebotomist40,524283.40.74.8
Veterinary assistant40,54753.40.012.0
Any other healthcare support worker38,3792,1730.50.40.6

When we used only those individuals who were tested because of a workplace requirement and did not feel sick, we obtained a relative risk closer to 1.0. Using only those tested because of a workplace requirement who also did feel sick we still obtained a relative risk substantially smaller than 1.0 (Table 4). Although this finding could suggest that differences in testing patterns between healthcare and other work settings are partially responsible for the different positivity rates among HCWs and non-HCWs, it could also be driven by greater access to COVID-19 testing for confirmation of illness among HCWs experiencing symptoms. The recall period of 14 days provides ample time for an individual to receive a workplace test without symptoms, then develop symptoms, and then receive another test to determine if the symptoms are due to COVID-19, and HCWs might have more opportunity to access such a follow-up test, since they are visiting a healthcare setting for work already.

Table 4. Relative COVID-19 incidence rate (RR) and counts of healthcare workers and non-healthcare workers stratified by those who reported they felt/did not feel sick as an additional reason for getting tested.

Number of non-HCWsNumber of HCWsRelative riskLower boundUpper bound
Test required, did not feel sick 23,523 12,7891.11.01.2
Test required, felt sick 3,282 9580.80.70.9

Discussion

This study utilized a population-based approach to examine the relative risk of COVID-19 infection among HCW compared with non-HCW. Finding a relative COVID-19 incidence ratio substantially and significantly less than 1.0 is an unequivocally positive finding, indicating that infection control measures being taken by HCWs in total are effective.

Our findings are consistent with the limited other evidence available on the risk of COVID-19 in healthcare facility settings1215, and, taken together, this growing body of evidence suggests that providing and seeking healthcare at this point in the epidemic is quite safe. HCWs need not fear contracting or transmitting infections more than other workers do, and patients should not defer needed care at present over concern that they will be exposed to COVID-19 during their interactions with HCWs.

This outbreak and our understanding of it have both changed rapidly in the past, and may do so again, so we will continue to update this information.

Limitations

This work has at least three limitations. First, our results are based on self-reported data and therefore subject to both recall bias and social desirability bias, although the questions we relied on did not seem particularly at risk for either of these biases; the question “have you been tested for COVID-19 in the last 14 days?” likely included positive responses from individuals who received seroprevalence testing as well as PCR testing as well, which could also introduce a small amount of bias. Second, our approach required a large sample size to obtain a sufficiently precise estimate of RR, but this seems safer than including respondents who did not report receiving a required test, as that could introduce confounding. Third, it is possible that there was still uncontrolled confounding due to differential access to tests between HCWs and non-HCWs. Our sensitivity analysis found substantively similar results when restricted only to individuals who had workplace testing when they did not feel sick, but since we have only considered respondents with tests required by their employer or school, this might focus on non-HCW setting with better-than-average infection control policies (for example, they are doing asymptomatic testing) and therefore the relative risk for HCWs might be even lower than our method estimated.

Conclusion

As of October, 2020, in the United States the relative infection ratio of HCWs to non-HCWs is reassuringly low. Infection control remains essential and HCWs must continue to be protected as the COVID-19 pandemic continues, to ensure safety to themselves, their co-workers, and their patients.

Data availability

Underlying data

The underlying data used in this study are available to academic researchers for research purposes from Facebook at: https://www.facebook.com/research-operations/rfp/?title=covid19-symptom-survey-data-access. Conditions of access and instructions for applications can be found at https://dataforgood.fb.com/docs/covid-19-symptom-survey-request-for-data-access/.

Code availability

Reproducibility code available from: https://github.com/aflaxman/covid_hcw_rr

Archived code at time of publication: http://doi.org/10.5281/zenodo.427036816.

License: GNU General Public License v3.0

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 27 Nov 2020
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
Gates Open Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Flaxman AD, Henning DJ and Duber HC. The relative incidence of COVID-19 in healthcare workers versus non-healthcare workers: evidence from a web-based survey of Facebook users in the United States [version 1; peer review: 2 approved with reservations, 1 not approved] Gates Open Res 2020, 4:174 (https://doi.org/10.12688/gatesopenres.13202.1)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 27 Nov 2020
Views
21
Cite
Reviewer Report 01 Apr 2021
Tim Driscoll, School of Public Health, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW, Australia 
Approved with Reservations
VIEWS 21
This paper presents an analysis of data collected from United States’ respondents to a Facebook survey and focuses on a comparison of the rate of COVID-19 in health care workers compared to workers in other sectors. The main finding was ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Driscoll T. Reviewer Report For: The relative incidence of COVID-19 in healthcare workers versus non-healthcare workers: evidence from a web-based survey of Facebook users in the United States [version 1; peer review: 2 approved with reservations, 1 not approved]. Gates Open Res 2020, 4:174 (https://doi.org/10.21956/gatesopenres.14411.r30426)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 20 Jul 2021
    Abraham Flaxman, Institute for Health Metrics and Evaluation, University of Washigton, Seattle, 98195, USA
    20 Jul 2021
    Author Response
    In terms of the Methods, there are inconsistencies in the terminology and I can’t see any reason for this. Most particularly, there is mention of an “endorsement rate”, which is ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 20 Jul 2021
    Abraham Flaxman, Institute for Health Metrics and Evaluation, University of Washigton, Seattle, 98195, USA
    20 Jul 2021
    Author Response
    In terms of the Methods, there are inconsistencies in the terminology and I can’t see any reason for this. Most particularly, there is mention of an “endorsement rate”, which is ... Continue reading
Views
36
Cite
Reviewer Report 29 Mar 2021
Devan Hawkins, Department of Public Health Program, Schools of Arts and Sciences, MCPHS University, Boston, MA, USA 
Marcy Goldstein-Gelb, National Council for Occupational Safety and Health, Somerville, MA, USA 
Not Approved
VIEWS 36
Thank you for the invitation to review this paper. The paper addresses an important topic (the risk of acquiring COVID-19 among healthcare workers). The authors apply unique methods to study the problem. However, we have some concerns about how the ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Hawkins D and Goldstein-Gelb M. Reviewer Report For: The relative incidence of COVID-19 in healthcare workers versus non-healthcare workers: evidence from a web-based survey of Facebook users in the United States [version 1; peer review: 2 approved with reservations, 1 not approved]. Gates Open Res 2020, 4:174 (https://doi.org/10.21956/gatesopenres.14411.r30475)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 20 Jul 2021
    Abraham Flaxman, Institute for Health Metrics and Evaluation, University of Washigton, Seattle, 98195, USA
    20 Jul 2021
    Author Response
    Introduction:
    • The authors should provide some information about previous studies that have examined the risk for COVID-19 among healthcare workers and also justify why they hypothesized that healthcare
    ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 20 Jul 2021
    Abraham Flaxman, Institute for Health Metrics and Evaluation, University of Washigton, Seattle, 98195, USA
    20 Jul 2021
    Author Response
    Introduction:
    • The authors should provide some information about previous studies that have examined the risk for COVID-19 among healthcare workers and also justify why they hypothesized that healthcare
    ... Continue reading
Views
43
Cite
Reviewer Report 04 Dec 2020
Alex Reinhart, Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA, USA 
Approved with Reservations
VIEWS 43
This presents a timely and useful analysis of large-scale survey data. For an analysis like this, it's very important to clearly present the meaning of the data and the caveats in the survey design; the authors do a good job ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Reinhart A. Reviewer Report For: The relative incidence of COVID-19 in healthcare workers versus non-healthcare workers: evidence from a web-based survey of Facebook users in the United States [version 1; peer review: 2 approved with reservations, 1 not approved]. Gates Open Res 2020, 4:174 (https://doi.org/10.21956/gatesopenres.14411.r30079)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 20 Jul 2021
    Abraham Flaxman, Institute for Health Metrics and Evaluation, University of Washigton, Seattle, 98195, USA
    20 Jul 2021
    Author Response
    This presents a timely and useful analysis of large-scale survey data. For an analysis like this, it's very important to clearly present the meaning of the data and the caveats ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 20 Jul 2021
    Abraham Flaxman, Institute for Health Metrics and Evaluation, University of Washigton, Seattle, 98195, USA
    20 Jul 2021
    Author Response
    This presents a timely and useful analysis of large-scale survey data. For an analysis like this, it's very important to clearly present the meaning of the data and the caveats ... Continue reading

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 27 Nov 2020
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

Are you a Gates-funded researcher?

If you are a previous or current Gates grant holder, sign up for information about developments, publishing and publications from Gates Open Research.

You must provide your first name
You must provide your last name
You must provide a valid email address
You must provide an institution.

Thank you!

We'll keep you updated on any major new updates to Gates Open Research

Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.