Abstract
Here we provide data from an online survey of 639 people diagnosed with COVID-19 and resident in France, who were diagnosed with COVID-19 between 30th Jan 2020 and 29th August 2022. In addition to demographic information the survey includes questions about participants’ symptoms (by category), symptom onset and persistence, and the effects these symptoms had on their daily lives. Participants were able to include information related to their perceived medical, social and professional needs. These data are needed in order to create effective care policies addressing post-COVID sequelae. Information related to symptom association & dynamics is expected to be useful to clinicians and may also inform more fundamental studies.
Measurement(s) | Physical & Mental Health, Demographics, Perceptions & Needs |
Technology Type | Questionnaire |
Sample Characteristic - Organism | Adult Humans |
Sample Characteristic - Environment | Online Survey |
Sample Characteristic - Location | France |
Similar content being viewed by others
Background & Summary
This study was designed to provide data for scientists, clinicians and policy makers in order to assist them in the creation of effective policies caring for those affected by COVID-19. Specifically, we aimed to define the resources needed (in terms of financial support, social services or medical care) to meet people’s needs, and to plan for future pandemics. Despite the fact that there have been hundreds of millions of cases of COVID-19 worldwide since the start of the pandemic1 we still lack relevant data regarding the long-term consequences of this illness and the impact that chronic problems have on people’s everyday lives. Nevertheless it is becoming clear that disabling long-term symptoms arise even for people with apparently mild initial illness2,3 and it is therefore urgent to implement effective mitigation and rehabilitation strategies to minimise the impact of these symptoms on affected individuals and on society as a whole. We have collected data characterizing the wide-ranging symptoms of COVID-19, together with their effects on quality of life and the perceived needs of those affected. These data document symptom associations and dynamics and characterise the ways in which symptoms handicap individuals in their everyday lives. It is possible to analyse these data by age and/or gender, specific symptom categories or groups of symptoms, and use them to determine individuals’ perceived medical, social and professional needs. We expect the wide-ranging overview of COVID-19 symptoms to be useful to both clinicians and to researchers investigating fundamental questions, and the objective and subjective information to inform public policies.
Methods
Procedure and inclusion
The dataset4 is from a large-scale survey based on a questionnaire designed to be anonymous with no collection of IP addresses, written in French and posted online on the website https://project.crnl.fr/covid/. The title and introduction to the survey explained that the purpose of the survey was to understand better the effects of the disease on people’s daily lives and needs. It was advertised via our scientific network, and people were also able to find it via internet searching. Adults, resident in France, with an official diagnosis of COVID were invited to complete the survey, which was approved by the Institutional Review Board of INSERM (IRB00003888, IORG0003254, FWA00005831) of the French Institute of medical research and health, under number 21–805. Participants were not remunerated and were not monitored. All participants provided informed consent.
Data were collected between 15th July 2021 and 6th September 2022. Exclusion criteria were: persons under the age of 18 years, those without a diagnosis of COVID-19 (made either via an analytical test or by a doctor on the basis of symptoms alone), those who failed to complete the entire survey, or were not filling it in for the first time and those not resident in France. We have also excluded participants with diagnosis dates prior to the first cases of COVID-19 in France (3 participants), participants entering a recovery date prior to their diagnosis date (13 participants), and participants who omitted to provide a diagnosis date (5 participants). Combined these criteria excluded 415 of the 1054 participants. We did not use any criteria with respect to specific symptoms, diagnosis dates (other than the implausible dates as stated above), interval since diagnosis, duration of illness or perceived recovery.
Data collected
The questionnaire began with an explanation of the purpose of the study and consent. It continued with general compulsory questions and subsequently used a branching design. Figure 1 illustrates the structure of the questionnaire.
The first section of the questionnaire concerns demographic information: gender, age, height, weight, socio-professional category, education & profession, French department of residence and whether they lived in an urban, suburban or rural area. These questions were compulsory and there are no missing values.
General background COVID-19 and health information was then collected which included: their pregnancy status (if female), any chronic illnesses and regular medication, whether they are (or were) smokers, whether they had been vaccinated against COVID-19, and if so, with which vaccine.
There then followed a series of questions related to their illness with COVID-19. These data include how and when they were diagnosed with COVID-19, whether they were diagnosed with a specific variant, and if so which, whether they considered themselves recovered, and if so on what date, whether they were hospitalised, and if so whether they were given oxygen, and/or admitted to ICU, and what treatments (if any) they were given for their COVID-19 illness.
The following sections were specific questions relating to symptoms experienced by category. The categories were chosen with respect to WHO data available at the time of creating the survey namely: olfactory & gustatory symptoms (which included loss of smell and/or taste together with questions relating to changed or phantom smells and/or tastes); flu-like symptoms (fever, chills, dizziness, cough, sore throat, fatigue, shortness of breath, muscle weakness, muscle pain (aches) or joint pain, headache); gastro-intestinal symptoms (difficulty swallowing, difficulty eating/drinking, loss of appetite, nausea or vomiting, diarrhea); cognitive, neurological and psychiatric symptoms (migraines, memory problems, attention disorders, speech disorders, altered consciousness, confusion, serious neurological disorders, irritability, anxiety, depression, sleep disorders); cutaneous and inflammatory symptoms (rash, itching, hair loss, tooth loss, conjunctivitis); cardiac and renal symptoms (kidney problems, heart problems, chest pain) and finally they were asked about any other symptoms that had not already been covered. For each symptom category participants reported experiencing they were asked about symptom onset and whether or not the symptom was still present. If they had recovered they were asked after how long. They were systematically asked whether they found the symptom a handicap in their everyday lives and whether the symptom had affected their psychological health, diet, social & relational life and professional life. Finally, people were asked to describe their needs categorised as specialist medical, psychological/psychiatric and socio-professional. These questions were asked on a true/false basis, but participants were also able to provide verbatim free responses. Some additional questions were asked related to potential therapies people might accept to redress their olfactory and gustatory symptoms.
We also provide data related to the questionnaire completion itself, namely the date and time of submission, the time participants spent completing various sections of the questionnaire and the total time spent.
Data pre-processing
LimeSurvey, a tool that creates online questionnaires and surveys, was used to collect the data. The raw data were first exported from LimeSurvey in xlsx format. Preprocessing was then performed to filter out incomplete or duplicate responses, respondents not residing in France, minors, and responses with inconsistent dates. Although the questionnaire was completely anonymous, participants were given the opportunity to describe symptoms, needs, or other information in dedicated boxes. To ensure the anonymity of the database, this information has not been included. The final filtered database was saved in csv format.
Data Records
The dataset4 consists of a text readme file, a pdf with the survey questionnaire, three descriptive json files and a csv format database containing the 639 responses. In this data base each row corresponds to a single participant and each column to a variable. The descriptive json files (Table 1) put the study in context and provide a detailed description of the variables. The final json file lists the variables in the original French and in English translation. We also provide a copy of the questionnaire in pdf format.
Technical Validation
We provide the following analyses to facilitate evaluation of the validity and limitations of the data.
Diagnosis date
Recorded diagnosis dates closely follow the known waves of infection in France5 (see Fig. 2).
Correspondence between diagnosis date and Covid testing proportion
The lock-down imposed in France lasted from 17th March 2020 to 3rd May 2020. During this time very few people were tested for COVID infection nationally. The data from our survey reflect this. Indeed, among the 138 participants diagnosed before 3/05/2020, 65 (47%) were diagnosed by analytical tests and 73 (53%) by a medical doctor. In contrast, among the 501 participants diagnosed on or after 3/05/2020, 468 (93%) were diagnosed by analytical test and only 33 (7%) by a doctor on the basis of symptoms alone (see Fig. 3).
Geographical distribution
We have good geographical distribution of participants across metropolitan France (Fig. 4a) consistent with French government statistical data6 (Fig. 4c). The over-sampling of the Lyon, Toulouse & Paris areas reflects the impact of publicity from our academic partners (see Fig. 5). 137 people (21% of the survey population) live in a suburban area, 159 (25%) in a rural area and 343 (54%) in an urban area7. We have a small over sampling of the urban population, which is typical for online surveys (see Fig. 4b).
In Fig. 5 we provide a comparison between the proportion of survey participants by French department with the proportion on the general French population.
Average age of hospitalized people
The average age of hospitalized people is greater than that of un-hospitalized people (average hospitalized 50.2 [95% CI 46.7,53.7] years compared with average age not hospitalized 42.5 [95% CI 41.5,43.6] years) p < 0.001, calculated using the software package jamovi8) and the average age of those who reported experiencing flu-like and at least one other symptom (43.8 [95% CI 42.8,44.8] years) is higher than the average age of those who reported experiencing flu-like symptoms only (39.7 [95% CI 36.9,42.6] years) (p = 0.003), which agrees with known information relating to the vulnerability of people to COVID-19 increasing with age.
Prevalence of anosmia
We note that our data include participants diagnosed from the very beginning of the pandemic and also periods when the alpha, beta & gamma variants were circulating and the prevalence of anosmia may vary not only with variant but with vaccination status or other factors. Nevertheless, our data are consistent with the literature9. Only 7% of our survey participants (43) were diagnosed after 01/01/2022 when the omicron variant of COVID-19 became prevalent in France10 and the incidence of anosmia declined11. Excluding these participants, 61% of our participants reported olfactory loss which is comparable to the results of Menni et al.11 who quote a prevalence of 52.7% anosmia during the delta wave.
Percentage of smokers
110 people (17%) were either occasional or regular smokers, which compares to 18.5% of people over 18 years old in France12.
Preponderance of women
We note that online surveys typically have a preponderance of women13,14,15,16. Our population is 77% female which is comparable with the data of Ferdenzi et al.14 whose online survey of French COVID-19 sufferers was 78% female.
Usage Notes
Population studied
The survey focused on the symptoms and needs of patients who had a COVID-19 infection, regardless of any subjective persistence of symptoms. Information relates to individuals’ subjective perception of their health, with no independent verification either of the severity of the symptoms nor of their source. The data do not focus specifically on the so-called “long COVID-19” population (although many participants fall into this category).
We are only able to include data from people who chose to complete the online survey. The data therefore suffer from selection biases (over representation of urban, educated individuals, women, and symptomatic individuals, and exclude those unable to access the internet) (see Figs. 6, 7).
Age distribution of the survey population
Children were excluded from the survey and, although the survey was open to the elderly, general inability to use the internet (poor eyesight, computer illiteracy, lack of access etc) means that the survey is heavily biased towards working age (under 65 year old) persons. In Fig. 8 we compare the age distribution of the survey population with that of France in 2023 (data from17).
Delay between infection and reporting
This is variable between individuals (see Fig. 9). It is possible to filter data to focus on, or exclude, specific groups, based on the time elapsed between diagnosis and completing the survey.
Variants
The diagnosis dates cover waves of infection with different dominant variants, with an under-representation of the delta & omicron variants. We do not have sufficient confirmed diagnoses of the variant of infection (this was an open question) to enable a robust statistical analysis comparing the different variants. Although it is possible to match the diagnosis date to the predominance of variants at that time, we note that for the first six months of 2021 the alpha, beta and gamma variants circulated mostly concomitantly in France18.
Declared recovery
A significant proportion of participants declared themselves cured of COVID-19, yet later described a number of persistent symptoms (80/200). We consider these responses, which at first sight seem counterintuitive, to be linked to the fact that declaring oneself cured of COVID-19 depends in part on subjective factors, on the individual perception of each person. This individual perception, which our data show to be variable from one person to another, is possibly constructed on the basis of the appreciation of the severity of the persistent symptoms, or of the feeling that people have still not recovered their initial state of health. In fact, all of this suggests that there is no simple definition of who is considered cured or not cured.
Code availability
Data filtering and statistical analysis was performed using the open access software jamovi8. No custom code was used for the curation or validation of the dataset.
References
WHO COVID-19 Dashboard. World Health Organisation https://covid19.who.int/info (2020).
Al-Aly, Z., Xie, Y. & Bowe, B. High-dimensional characterization of post-acute sequelae of COVID-19. Nature 594, 259–264 (2021).
Xu, E., Xie, Y. & Al-Aly, Z. Long-term neurologic outcomes of COVID-19. Nat. Med. 28, 2406–2415 (2022).
Bensafi, M. & Stanley, H. B. COVID-19 - Symptoms - Impact on quality of life and needs of affected people. zenodo https://doi.org/10.5281/zenodo.7920303 (2023).
Dong, E., Du, H. & Gardner, L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 20, 533–534 (2020).
INSEE. Population statistics of France. https://www.insee.fr/fr/statistiques/1893198.
de Bellefon, M.-P. (PSAR A. territoriale-I., Eusebio, P. ((PSAR A. territoriale-I., Forest, J. ((PSAR A. territoriale-I. & Warnod, R. ((PSAR A. territoriale-I. 38% de la population française vit dans une commune densément peuplée. Statistiques et études https://www.insee.fr/fr/statistiques/4252859 (2019).
The_jamovi_project. jamovi. (2021).
Santos, R. E. A. et al. Onset and duration of symptoms of loss of smell/taste in patients with COVID-19: A systematic review. Am. J. Otolaryngol. 42, 102889 (2021).
Maisa, A. et al. First cases of Omicron in France are exhibiting mild symptoms, November 2021-January 2022. Infect. Dis. now 52, 160–164 (2022).
Menni, C. et al. Symptom prevalence, duration, and risk of hospital admission in individuals infected with SARS-CoV-2 during periods of omicron and delta variant dominance: a prospective observational study from the ZOE COVID Study. Lancet (London, England) 399, 1618–1624 (2022).
Eurostat. Daily smokers of cigarettes by sex, age and educational attainment level. https://ec.europa.eu/eurostat/databrowser/view/HLTH_EHIS_SK3E__custom_6000386/default/table?lang=en&page=time:2019 (2019).
Davis, H. E. et al. Characterizing long COVID in an international cohort: 7 months of symptoms and their impact. EClinicalMedicine 38, 101019 (2021).
Ferdenzi, C. et al. Recovery From COVID-19-Related Olfactory Disorders and Quality of Life: Insights From an Observational Online Study. Chem. Senses 46 (2021).
Bousquet, C., Bouchoucha, K., Bensafi, M. & Ferdenzi, C. Phantom smells: a prevalent COVID-19 symptom that progressively sets. Eur. Arch. oto-rhino-laryngology Off. J. Eur. Fed. Oto-Rhino-Laryngological Soc. Affil. with Ger. Soc. Oto-Rhino-Laryngology - Head Neck Surg. 280, 1219–1229 (2023).
Hopkins, C., Surda, P., Whitehead, E. & Kumar, B. N. Early recovery following new onset anosmia during the COVID-19 pandemic – an observational cohort study. J. Otolaryngol. - Head Neck Surg. 49, 26 (2020).
Papon Silvain. Bilan démographique 2022. INSEE https://www.insee.fr/fr/statistiques/6687000?sommaire=6686521 (2023).
Galmiche, S. et al. SARS-CoV-2 incubation period across variants of concern, individual factors, and circumstances of infection in France: a case series analysis from the ComCor study. The Lancet Microbe 4, e409–e417 (2023).
Ritchie, H. et al. Coronavirus Pandemic (COVID-19). Our World Data (2020).
INSEE. Diplômes - Formation en 2020. INSEE https://www.insee.fr/fr/statistiques/7633108 (2023).
Acknowledgements
This Project has received funding from both the European Union’s Horizon 2020 research and innovation programme under grant agreement No 964529 (Pathfinder Rose project) and the Human Chemosensation IRP project granted by the CNRS.
Author information
Authors and Affiliations
Contributions
Conceived, designed the study: M.B.; Data acquisition and curation: H.B.S., M.B.; Wrote the paper: H.B.S., M.B.; Edited and approved the final manuscript: H.B.S., M.B.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Stanley, H.B., Bensafi, M. A data set of symptoms and needs of individuals affected by COVID-19. Sci Data 11, 122 (2024). https://doi.org/10.1038/s41597-024-02961-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-024-02961-6