Introduction

Since the outbreak of the COVID-19 pandemic, millions of individuals have been infected. Many of them had to be quarantined and isolated from their family and friends. Some had to be hospitalized in COVID-19 care units. Roughly two in a hundred died (CSSEGIS n.d.). At the beginning, mostly elder citizens and those with preexisting diseases seemed to be affected; however, insights emerged that anyone could be infected. Even if the disease was mild, “long Covid” could be frustrating; long Covid refers to signs and symptoms that develop during or after an infection consistent with COVID-19, continue for >12 weeks, and are not explained by an alternative diagnosis (NICE guideline [NG188], 2020). Effective drugs do not yet exist for COVID-19, and breakthrough infections have been noted in some people receiving vaccines. In this environment of uncertainty and doubt, fear and anxiety thrive (Gu et al., 2020). Between January 1, 2020, and January 29, 2021, anxiety disorders increased by 25.6% worldwide (COVID-19 Mental Disorders Collaborators, 2021).

In the early stage of the COVID-19 outbreak, on January 26, 2020, the National Health Commission of China published “A notice on the issuance of guidelines for emergency psychological crisis intervention in pneumonia for novel coronavirus infections” (National Health Commission of China, 2020). The Chinese government intended to mitigate the consequences of psychological and mental diseases caused by the epidemic, such as loneliness, anger, fear, or anxiety, which could lead to attack, self-harm, or even suicide (Moitra et al., 2021), likely drawing from studies on the 2003 SARS outbreak, when fear, boredom, loneliness, and anger were observed (Maunder et al., 2003). Lee and Crunk (2020) reported that fear plays a substantial role in mental disorders caused by COVID-19; the risk of anxiety (e.g., COVID-19 anxiety) increases by more than 20% in the presence of neuroticism (John & Srivastava, 1999).

After the worldwide spread of COVID-19, psychologists, psychiatrists, and other mental health workers recommended timely mental health care, particularly for psychologically vulnerable patients (Xiang et al., 2020). Most health care starts with a diagnosis in which it is determined whether and to what extent an individual is affected by a disease. This determination often uses scales derived from constructs composed of several items that are related to a specific disorder. Until autumn 2021, more than 150 research papers were published regarding COVID-19 scales; most of them used the Fear of COVID-19 Scale (FCV-19S), which was developed by Ahorsu et al. (2020). FCV-19S has been verified in many countries (PubMed n.d.). Results are inconclusive as to how many factors FCS-19S should have. Caycho-Rodríguez et al. (2021) evaluated the model in seven Latin American countries and concluded that the two-factor model was the most reasonable. They were the first to analyze the cross-cultural validity of the FCV-19S by the existence of measurement invariance across countries. They could not find overall scalar invariance, which does not guarantee that the intercepts of the structural model are equal across all seven countries. Without invariance, however, comparisons between the countries could lead to doubtful results because the models were understood differently by respondents in the different cultures.

Another cross-cultural analysis of the FCV-19S was executed by Lin et al. (2021) in 11 countries (Bangladesh, UK, Brazil, Taiwan, Italy, New Zealand, Iran, Cuba, Pakistan, Japan, and France). They noted the one-dimensional factor solution to be the most suitable. Measurement invariance, however, could not be found for all countries.

Another scale—the five-item Coronavirus Anxiety Scale (CAS)—was presented in a timely manner after the outbreak of the pandemic (Lee, 2020a)Footnote 1. This more parsimonious scale was validated several times by Lee et al. (2020b) and Lee (2020a) and found to be a useful mental health screener. Lee (2020b) confirmed the importance of the CAS by demonstrating a significant increase in the explained variance in depression, generalized anxiety, and anxiety when the CAS was added to sociodemographic factors, COVID-19 factors (positively tested, knowledge about infections), and vulnerability factors (neuroticism, health anxiety, reassurance-seeking).

Burkova et al. (2021) analyzed anxiety due to COVID-19 in 23 countries by using the Generalized Anxiety Disorder Scale (GAD-7) and the State Anxiety Inventory. They reported scores on both scales to be higher in individualistic countries and those with a low power distance, which refers mostly to the Western hemisphere. However, Burkova et al. (2021) did not test the scales for measurement invariance across cultures, precluding cross-country comparisons.

Because of its parsimonious structure, the CAS can be an excellent choice when determining disorders caused by COVID-19 and may serve as a standard worldwide. However, as Lee et al. (2020b) indicated, the CAS should be validated in countries outside the US, in English or its translated versions whenever necessary, as has been performed with the FCV-19S. The CAS has been validated in Turkey (Evren et al., 2020), Pakistan (Ashraf et al., 2020), Mexico (García-Reyna et al., 2021), and Brazil (Padovan-Neto et al., 2021), with high internal consistency (Cronbach’s α > .82) and sufficiently high fit indices in confirmatory factor analyses (CFAs). However, to determine the worldwide applicability of the CAS with internationally comparable results, a multigroup analysis on measurement invariance across many countries is necessary, as has been performed for the FCV-19S (Caycho-Rodríguez et al., 2021; Lin et al., 2021). To test the CAS for measurement invariance across different groups is the contribution of this study. The rest of the paper is distributed as follows. The choice of countries and participants as well as the applied method will be described. Results, included regional differences, are reported. A final discussion concludes the article.

Methods

Design

The CAS was administered in online surveys in 25 countries. The countries were chosen as a convenient sample to cover all six continents, adding up to most world population (more than 50%) and to represent most global cultures (Australia, Austria, Belgium, Brazil, Canada, China, Denmark, Finland, France, Germany, India, Italy, Japan, Netherlands, Norway, Portugal, Russia, South Africa, South Korea, Spain, Sweden, Switzerland, Taiwan, the UK, and the US).

Participants

The participants were recruited by an international provider for data analyses and market research with several subsidiaries worldwide. They mostly have their own panels of potential survey participants (only in South Africa, the participants were recruited by an outside provider). The sample size was approximately 400 participants per country. Because statistical inferences are sensitive to sample size, the same number of cases in each country was chosen. To get a sample demographically as close as possible to the countries’ population, a first set of some dozen potential respondents was invited by email. Those who answered were evaluated on their fit to the overall demographics. The next set was then invited to complement the first set regarding the match with sex or age (as an example, in case the country’s distribution of sex is 50% female vs. 50% male and the first set of responses had 30% females and 70% males, the second set tried to compensate for this by including an extra number of female participants). Although this procedure is rather heuristic and not representative, it resulted in acceptable allocations in the assessed samples with only some major deviations from actual distributions (in Japan, Russia, and the US, females were underrepresented by approximately 15%). In the worldwide data, however, female participants accounted for up to 46.4%, whereas the total distribution is 49.3%. The average age of the samples in online surveys typically tends to be younger than the actual ages in any country. However, the demographic structure of the assessed data seemed to be sufficiently appropriate for the current research question.

Measure

The five items of the CAS (Lee, 2020a) were presented to the participants: dizziness, sleep disturbances, tonic immobility, appetite loss, and abdominal distress. Participants had to rate them on the same 5-point scale as in the original version (How often have you experienced any of the following in the past 2 weeks? 0 = not at all; 1 = rare, less than a day or two; 2 = several days; 3 = more than 7 days; 4 = nearly every day over the last 2 weeks). In addition to age and sex, three more variables were assessed: (1) the participant’s own history of COVID-19 infection; (2) a relative’s history of COVID-19 infection; and (3) a relative’s death from COVID-19.Footnote 2 The original CAS version in English was administered in the US, Australia, Canada, India, South Africa, and the UK. For the other countries, the translations of the questionnaires were conducted by a professional translation agency, which employs graduate translators who are native speakers of a particular language. The original English AS scale was translated into 14 languages: German (for Germany, Austria, and Switzerland), French (for France, Belgium, Canada, and Switzerland), Portuguese (for Portugal and Brazil), Chinese (for China and Taiwan), Danish (for Denmark), Finnish (for Finland), Italian (for Italy), Japanese (for Japan), Dutch (for The Netherlands and Belgium), Norwegian (for Norway), Russian (for Russia), Korean (for Korea), Spanish (for Spain), and Swedish (for Sweden). These 14 versions were translated back into English to assure that the correct meaning of the questions. Some other translations already exist on “The Coronavirus Anxiety Project” (n.d.), which were compared with the own versions. Only minor differences were noted, with no shifts in meaning. This may be due to the easily comprehensible everyday topics of the respective questions. Data collection was conducted during March 18–31, 2021.

Results

Descriptive Results

In total, 10,232 respondents participated, with an average of approximately 400 per country. The descriptive details are depicted in Table 1. The average CAS scores range in the middle of what had been reported in the past. In some studies, they were higher at 6.66 (Evren et al., 2020) or 8.62 (Lee, 2020a) and in some, they were lower at 2.15 (García-Reyna et al., 2021) or 2.66 (Padovan-Neto et al., 2021).

Table 1 Descriptive data per country

The appropriateness of comparisons across groups (countries, sex, age, etc.) depends on the invariance of the measurements, which guarantees that in all groups, the same parameters are measured using the same instruments. Since in CFA, the fit of the data to the measurement function can be assessed, it can be determined whether and how this fit aggravates when one or more other groups of data are added in the model. Thus, the data were first examined on their suitability to analyze them in exploratory factor analysis (EFA) and subsequently tested regarding invariance across countries, sex, age, and the groups that were influenced by the COVID-19 virus.

Exploratory Factor Analysis (EFA)

In the aggregated global sample, the reliability index was similarly high, as in Lee et al.'s (Lee, 2020c) analysis (Cronbach’s α = 0.918). The Kaiser–Meyer–Olkin test (= .895, p < .001) and Bartlett test indicated sufficient support for a factorable solution. Principal component analysis resulted in one single factor explaining 75% of the total variance. Factor loadings were ≥0.84. These figures were similar for nearly all countries, with all Cronbach’s α values > .9 and the average extracted variance (AVE) of the data > .720. In four countries (Brazil, India, Portugal, South Africa), all Cronbach’s α values were < .890 and the AVE was < .690; nevertheless, the results remain reasonable for a factor solution.

Confirmatory Factor Analyses (CFA)

A multigroup CFA with the 25 countries as groups was conducted. To assess comparability across countries, data were tested on invariances across the groups (Steenkamp & Baumgartner, 1998) using four models:

A An unconstrained model to assess configural invariance (or pattern invariance) to assess for the fit of the same one-factor structure with five items in all countries.

B A model with measurement weights (factor loadings) constrained to be equal across groups (metric or “weak” invariance) to assess that not only the same items load on the same factors for all groups but also the actual magnitudes of the factor loadings are the same across groups for each respective item. In case of metric invariance, different scores on the items can be meaningfully compared across groups.

C A model with factor covariances constrained to be equal; because in this CFA only one factor and no structural model exists, only the variance of the one-factor CAS will be constrained to be equal.

D A model where all variances of the residuals (errors) were constrained to be equal across all countries.

In case all items are metrically invariant, and the factor variance and the error variances are invariant cross-nationally as well, the model is equally reliable across countries (Steenkamp & Baumgartner, 1998).

The analyses were conducted with the AMOS analytic tool for structural equation modeling (Arbuckle, 2014), using the maximum likelihood estimatorFootnote 3. Three criteria had to be met to support the invariance hypotheses (Cheung & Rensvold, 2000). The decrease of the Tucker–Lewis index (TLI) from one model to the next had to be low (|ΔTLI| ≤ .05), and the difference of root-mean-square error of approximation (RMSEA) and the probability of close fit had to be nonsignificant (pclose ≥ .05). For a description of the fit indices and cutoff values, see the note below Table 2.

Table 2 Fit Indices and Factor Loads per Country

The unconstrained model A had a good fit (TLI = .963, RMSEA = .0228, pclose = 1.000; Hu & Bentler, 1999). Model B for metric invariance had an even higher TLI (.971), lower RMSEA (.020), and insignificant pclose (1.000). Thus, metric invariance was granted. Model C had TLI = .963, an RMSEA = .023, and pclose = 1.000. Thus, the variance of the CAS factor can be assumed to be invariant across countries. Finally, for Model D, TLI = .925, RMSEA = .033, and pclose = 1.000. Thus, the difference in TLI in the last step from model C to model D is lower than the threshold of .05 (|ΔTLI| = .038; ΔRMSEA = .010; pclose = 1.000, not significant). Thus, the model is supposed to be similarly reliable across all countries with the parameters presented in Figure 1. AVE extracted regarding convergent validity was sufficiently high for all items (AVE > 50%), thus meeting the Fornell–Larcker criterion (Fornell & Larcker, 1981).

Fig. 1
figure 1

The Coronavirus Anxiety Scale in 25 countries

A similar procedure for the age groups resulted in a slightly worse fit for the unconstrained model with an RMSEA of .062 and a significant pclose < .001, but still sufficient with a TLI = .978. Steps to models B, C, and D even increased the fit to TLI = .985 and RMSEA = .051, pclose = .347, not significant. For the sex groups, model A had a TLI = .973, which decreased to .942 in model D; thus, ΔTLI = .031, which is below the threshold. The pclose values for models A, B, and C were nonsignificant (1.000); however, model D had pclose < .001 but had RSMEA = .053, thus indicating a good fit. The same held for the groups where the individual was infected/not infected, the groups where a relative was not infected/not infected, and the groups where a relative had died/not died. In all group comparisons, factor loadings, factor variances, and error variances were the same (except some rounding effects): CAS → dizziness = .79, CAS → sleep disturbance = .78, CAS → sonic immobility = .85, CAS → appetite loss = .86, CAS → abdominal distress = .86; this added up to an AVE of 68.6%, which is higher than the required 50% (Fornell & Larcker, 1981). Thus, the CAS (Figure 1) was determined to be completely reliable across all 25 countries, sex, age, and infection experiences, and the respective CAS scores can be appropriately compared across the above groups. Detailed information about fit indices and factor loading for each country can be found in Table 2.

Cross-Group Comparisons

CAS Depending on Sex and Age

In total, CAS did not significantly differ regarding sex (male: 3.41, female: 3.43). This applied for 21 out of the 25 countries. CAS differed significantly between females and males in the US (females 3.05, males 4.90, p < .01), in Switzerland (females 4.11, males 2.85, p < .01), in Russia (females 2.01, males 1.13, p < .05), and in Germany (females 4.24, males 2.98, p < .01).

Elder participants had a lower CAS (r = –.200, p < .001). This applied significantly to 17 out of the 25 countries, see Table 3. This is consistent with the findings of Kowal et al. (2020) who found that higher levels of stress due to the pandemic were associated with younger age.

Table 3 CAS and age groups per country

CAS Depending on Concern of COVID-19

Participants who had been infected by COVID-19 (n = 850) had a significantly higher CAS of 5.66 than those who were not infected (CAS = 3.22, t = 14.803, p < .001). Participants whose relatives had been infected by COVID-19 (n = 2650) had a CAS of 4.40 compared with those who had not been infected (CAS = 3.08, t = 12.622, p < .001). Participants who had a death in their family because of COVID-19 (n = 651) had a higher CAS of 5.5 than those who had no COVID-19-related death in their family (CAS = 3.28, t = 11.852, p < .001). Stronger anxiety can also be seen from the correlations between the CAS and the three infection stages: individual infected (r = .408, p < .05), relative infected (r = .468, p < .05), and relative died (r = .577, p < .01).

CAS Compared with Incidence Rates Per Country

The variables in Sections 3.4.1 and 3.4.2 were individually assessed per respondent (age, sex, and concern of COVID-19). Several external country-specific key figures were included in the analyses. One of these numbers is the incidence rate per country during the COVID-19 crisis until the end of March 2021 (the month of the execution of this survey). Contrasting CAS with the incidence rate in a one-way analysis of variance resulted in a significant F(24,10207) = 22.389, p < .001. Tukey’s-b post hoc test revealed 10 homogeneous subgroups (Table 4).

Table 4 CAS scores and Incidences

Group 10 comprises only India. Its CAS value (7.03) was the highest of all, but the incidence was quite low until that time. India, thus, seems to be an outlier. The reason for the high CAS score will be analyzed in the discussion section. Removing India (Group 10) from the homogeneous subgroups resulted in a significant positive correlation (.931, p < .001), thus supporting the plausibility that countries that are strongly affected by COVID-19 display stronger Coronavirus anxiety.

CAS Compared with Hofstede’s Cultural Indices

Similar one-way ANOVAs were conducted with Hofstede’s cultural indices (Hofstede, 1980; Hofstede et al., 2010). The outlier India was again excluded from these analyses. Regarding the power distance index (PDI, low = low power distance), eight subgroups emerged with a negative correlation (r = –.855, p < .01); this implied that countries with a high power distance had a lower Coronavirus anxiety. The same analysis for Individualism vs. Collectivism index (IDV, low = collectivistic, high = individualistic) resulted in seven subgroups with r = .982, p < .001. Thus, individualistic countries show a higher Coronavirus anxiety than collectivistic countries. Similarly, the indulgence versus restraint index (IVR, low = restraint, high = indulgent) had a high positive correlation with CAS (five subgroups with r = .943, p < .05); thus, indulgent cultures had higher Coronavirus anxiety than restraint cultures. Regarding the uncertainty avoidance index (low = low uncertainty avoidance), seven subgroups emerged with r = –.893, p < .01. Thus, countries with high uncertainty avoidance displayed lower Coronavirus anxiety. The masculinity (MAS) and long-term orientation indices were not significantly correlated with Coronavirus Anxiety Index.

Discussion

The one-dimensional CAS fit the assessed data well. Fit indices (CFI and TLI) that are well above the cutoff value of .95 are rare. However, this excellent fit is in line with prior research (Evren et al., 2020; Lee, 2020a, 2020b; Lee & Crunk, 2020; Padovan-Neto et al., 2021). The lowest TLI was .971 in Brazil (Padovan-Neto et al., 2021). Thus, the CAS was concluded to be a parsimonious and valid instrument to assess individual anxiety about the Coronavirus.

Although the CAS could be successfully replicated separately in different countries, this study is the first to analyze cross-national validity. The CAS items were shown to be equally reliable across 25 countries across six continents. All invariances—configurational, metric, covariance, and error invariances— were approved, which is also rare. Since such a good fit is so unusual for an international dataset like the one used in this study, one might suspect that the model has unwanted properties, such as redundant items in the one-factor solution. The high reliability indices (Cronbach’s α) could be evidence for such redundancy. Except in Brazil, India, and South Africa, they are well above the suggested level of .90 (Tavakol & Dennick, 2011). However, high reliability indices can be expected from larger scales with many items evaluating the same concept. Next, redundancy is more probable. The item scores (Table 1) correlate significantly, which is a prerequisite to combine them with a factor. However, pairwise t-tests in each country resulted only in a minority of cases assuming that two items had to be treated as equal. Thus, redundancy is not a reason for the good fit of the model. A more likely reason might be the balanced and easy-to-understand questions that make respondents answer consistently, fostering good reliability.

Scores differ significantly across countries. CAS scores ranged from 1.51 to 7.03, consistent with recent research from other countries—6.66 in Turkey, 2.15 in Mexico, and 2.66 in Brazil (Evren et al., 2020; García-Reyna et al., 2021; Padovan-Neto et al., 2021). The reason underlying the high CAS value of 7.03 in India can only be assumed from another recent study (Lieven & Hildebrand, 2016) demonstrating that Indian respondents tended to an extreme response style on the right side of the scale. While the average skewness across all countries was 1.45, it was only .325 in India, indicating that the distribution in India was much less right skewed because of higher ratings. The high CAS score of 8.62 in the inaugural study by Lee et al. (2020a) may be due to the fact that participants were recruited who had experienced significant anxiety, fear, or worry about the disease outbreak. This cohort was therefore likely to have higher Coronavirus-related anxiety than a broader cross-sectional cohort of the representative population. The more it has to be emphasized that the CAS model fits all data well, be it from highly COVID-19 affected persons or less affected respondents.

The differences between the scores across countries may be because of inappropriate translations. However, in India, the same original English questionnaire was used as in the US, but the CAS score was 7.03 and 4.27, respectively. Similarly, the same Portuguese translation was administered in Brazil and Portugal, but scores in Brazil were nearly twice as high as those in Portugal.

The translations of the questionnaire were conducted by professional translators, who were all native speakers in the respective language, and their names and qualifications are known to the authors of this study. A comparison with some available translations on "The Coronavirus Anxiety Project” (n.d.) and a backtranslation of both versions did not show serious differences; only minor ones that did not distort the meaning of the five questions. The questions themselves were comprehensible to most people. Diseases such as dizziness, faint, insomnia, lack of appetite, or nausea are common everyday topics that simply cannot be misunderstood. In any case, why scores differ significantly between countries cannot be analyzed from the perspective of this study. Cultural sensitivities or idiosyncrasies could lead to a different comprehension. These causes cannot be observed in a worldwide view, as was conducted in this study. It could be a promising venue for future research when local researchers take care of these matters.

Cultural comparisons revealed that individualistic and indulgent countries with a low PDI (i.e., Western countries) exhibited stronger COVID-19 anxiety. Asian countries (except India) had a CAS of 2.64; the CAS in the Western hemisphere was 3.51. This is consistent with previous findings from studies in 23 countries (Burkova et al., 2021); however, that study did not verify cross-cultural invariance. Perhaps the cohesion and solidarity with a clear hierarchal society, which is the characteristic of collectivistic cultures with high power distance, helps people overcome anxiety. Surprisingly, uncertainty avoidance did not facilitate anxiety. By contrast, countries with a high uncertainty avoidance index had lower anxiety. It might have been a rational risk aversion instead of psychological uncertainty that had been assessed by Hofstede (1980) and Hofstede et al. (2010). In general, one should approach Hofstede's scale with caution. It is 40 years old, and cultures have changed due to globalization. However, no new cultural scale is available.

Two findings in this study are counterintuitive. One is the stronger anxiety among young individuals. COVID-19 is more dangerous for elder persons, particularly those with preexisting diseases. Younger persons are not yet affected by such diseases and are said to have a stronger immune system. Consequently, one should expect elder individuals to have stronger Coronavirus anxiety. The reason for the unusual finding might be that old people above 70 or 80 were not sufficiently covered by this study. From 20 to 69 years, however, the above finding holds.

We did not use the Rasch DIF model for parsimonious reasons. This seemed to be appropriate because several studies have shown consistent results regarding confirmatory factor analysis and the Rasch DIF (Randall & Engelhard, 2010), particularly for the FCV-19S (Lin et al., 2021).

One limitation of the present study is the unbalanced choice of countries. Fifteen countries were chosen from Europe, and only one each from South America and Africa. However, our results revealed that there is a high probability that any CAS dataset from another country will fit in the existing panel of 25 countries as long as the individual fit indices of that country are sufficiently good.

Conclusion

Our data indicates that CAS is robust to be disseminated into many other cultures. Compared with the Obsession with COVID-19 scale (OCS, Choi et al., 2020; Lee, 2020c) and the FCV-19S (Ahorsu et al., 2020; Caycho-Rodríguez et al., 2021), the CAS seems to be at least equivalent and has cross-cultural reliability.