Introduction

A recent article from the New England Journal of Medicine questioned the use of race or ethnicity in assessment algorithms [1]. In the case of osteoporosis, the authors noted that the US FRAX calculator returns a lower fracture risk for women who are Black (by a factor of 0.43), Asian (0.50) or Hispanic (0.53). They conclude that the lower risk for Black, Asian and minority ethnic (BAME) women may delay intervention with osteoporosis therapy. The New York Times goes further in stating that with FRAX (sic) Black women end up having a score that makes them less likely to be prescribed osteoporosis medication than white women who are similar in all other respects [2].

In the case of osteoporosis and FRAX, the authors do not appear to have grasped the reality of fracture epidemiology and risk assessment. In this editorial, we set out the key messages from the epidemiology of fracture globally, key considerations in building risk assessment tools, the specific contribution of race/ethnicity and practical considerations related to any move to alter race/ethnicity categorisation or remove them entirely.

Heterogeneity of fracture risk

Fracture probability varies markedly in different regions of the world due to differences in fracture risk and mortality [3, 4]. In the case of hip fracture, there is a tenfold range in probability worldwide which far exceeds the differences in incidence between sexes within a country (Fig. 1). Ethnicity is not a direct input variable in the FRAX model; these differences therefore require that FRAX models for a specific country be calibrated to national fracture and mortality rates. Failure to do so would result in exceptionally large and avoidable errors in the stratification of risk. Indeed, it would negate the utility of risk assessment. In addition to 73 country-specific models, ethnicity-specific models are available in the USA, South Africa and Singapore. Variations in ethnicity-specific risk often exceed the differences in risk between sexes. Failure to calibrate for ethnicity would have adverse consequences greater than failure to calibrate for sex.

Fig. 1
figure 1

10-year probability of hip fracture in men and women age 65 years with a prior fragility fracture in countries where a FRAX model is available. Body mass index set at 25 kg/m2. The diagonal dotted line denotes the line of equality. Hip fracture probability in women ranged from 0.5 to more than 5%—a tenfold range. Probabilities in women were on average 65% higher than in men

Choosing a risk factor

There are a number of factors to be considered in the selection of risk factors for case finding. Of particular importance, in the setting of primary care, is the ease with which they might be used. For a globally applicable tool, the chosen risk factors should also be valid in an international setting and their predictive value stable over time. The use of risk factors for case finding requires that the risk so identified is responsive to a therapeutic intervention. It is important to draw the distinction between reversible risk and reversibility of risk. Age is an example of an irreversible risk factor, but the risk of fracture identified by age has reversibility. Thus, pharmacological intervention has an effect on fracture that is independent of age indicating reversibility of risk [5]. Levels of evidence for the suitability of risk factors have been developed [6, 7]. The efficacy of interventions has been tested worldwide in randomised controlled trials so that ethnicity, race and location have a high level of evidence indicating their suitability for inclusion in risk assessment.

Understanding ethnicity

Vyas et al. [1] contest that, if race does correlate with clinical outcomes, this does not necessarily justify its inclusion in diagnostic or predictive tools. Given the complexity of the determinants of race, it is insufficient to translate a data signal into a race adjustment without determining what race might represent in the particular context. Vyas et al. argue that most race corrections implicitly, if not explicitly, operate on the assumption that genetic difference tracks reliably with race. If the empirical differences seen between racial groups were actually due to genetic differences, then race adjustment might be justified: different coefficients for different bodies. Whilst the aspiration that genetics might replace the need for race or ethnicity is worthy [8], its potential in osteoporosis is presently limited and restricted to only a single component of fracture risk (bone mineral density) [9].

The counter argument is that risk factors should be chosen according to established criteria irrespective of our understanding of their basis or their accuracy. A good example is consumption of alcohol, which is notorious for being inaccurately reported. In general, people who drink alcohol tend to neglect or underestimate their alcohol consumption [10, 11]. It matters not whether the return is accurate—only that it provides a consistent indication of risk, which it does. Thus, we are more interested in association than causality. The same goes for race, location and ethnicity.

It is important to recognise that the significance of ethnicity will vary by location. For example, Black people in the USA have lower fracture probabilities than Caucasians [12], but the probability of fracture in USA Black people is much higher than in African Black people [4] in part due to the higher fracture rates and lower mortality risks in those from the USA (Fig. 2). The same holds true for Chinese from Hong Kong, mainland China and Singapore.

Fig. 2
figure 2

Ethnic-specific 10-year probabilities of a major osteoporotic fracture in men and women age 65 years with a prior fragility fracture in the USA and South Africa. Body mass index set at 25 kg/m2

Intervention thresholds

The purpose of FRAX is to characterise fracture risk so that decisions can be facilitated on the need for treatment and, in some instances, the type of treatment [6, 13]. This demands the consideration of intervention thresholds which, in the case of FRAX, is the 10-year probability of fracture above which pharmacological intervention should be considered. Several methods have been used to define intervention thresholds [14] but, in the case of ethnicity-specific models in the USA and Singapore, use thresholds that apply to all ethnicities [15, 16]. Thus, we challenge the view of Vyas et al. [1] that the incorporation of race can exacerbate inequities. Indeed, the converse is true when the gateway to risk assessment is based on BMD testing rather than fracture probability. If the intervention threshold is set at 20% as in North America [16, 17], then the equivalent T-score at age 75 years is − 2.8 for Caucasian women but − 3.8 for Hispanic and Asian women and − 4.2 for Black women. Thus, the use of FRAX as a gateway for intervention helps to resolve, rather than exacerbate, racial inequalities.

The elephant in the room

A useful measure of health service uptake is the treatment gap which is defined in its simplest form as the number of people with a condition or disease who need treatment for it but who do not get it. The quantification afforded by FRAX has allowed inequalities in the treatment gap to be identified. Shortly after the release of FRAX, it was noted that the treatment gap was substantially wider in Black than white patients at high risk even after adjustment for fracture probability [18, 19]. More recently in the Women’s Health Initiative, those at high risk and Asian ethnicity had a much higher likelihood (by 45%) of being on appropriate treatment compared with white women, whilst in Black/African American, there remained almost half the likelihood of appropriate medication use [20]. In addition to racial inequalities, the treatment gap is higher in men than in women [18, 21, 22]. Paradoxically, the therapeutic care gap may be particularly wide in the elderly in whom the importance and impact of treatment are high [23, 24] and particularly in individuals with fracture who reside in long-term care [25]. It is perhaps ironic that FRAX has permitted these inequalities to be recognised.

In the context of osteoporosis, the major issue is disease discrimination. Many surveys indicate that a small minority of men and women at high fracture risk actually receive treatment [18, 21, 22, 25, 26]. Fewer than 20% of individuals receive therapies to reduce the risk of future fracture within the year following a fracture. Moreover, the treatment gap is increasing with time [22, 26]. The under-treatment of osteoporosis globally has led societies such as the International Osteoporosis Foundation and the American Society for Bone and Mineral Research to come together to urgently address this global crisis in the management of osteoporosis [27,28,29]. This crisis of undertreatment contrasts with the situation following myocardial infarction, for which condition a significant care gap has been overcome in the past 15 years: 75% of such individuals now receive beta blockers to help prevent recurrent myocardial infarction [30].

Studies to date provide little insight into the causes underlying the substantial and increasing treatment gap. Factors that may play a role in the USA include a decline in BMD testing owing to reimbursement issues and lack of intensive detailing by pharmaceutical companies. Others point the finger at the lay press for raising awareness over the last decade of the potential side effects of the bisphosphonates, such as osteonecrosis of the jaw, atypical femoral fractures and atrial fibrillation [26, 28]. Indeed, many doctors, dentists and patients are now more frightened of the rare but serious side effects than they are of the disease and the fractures that arise. Notwithstanding, the lay press is simply the messenger bringing news and opinion from the scientific community, some or much of which may be ill-judged. The paradox arises in that we seek to treat individual patients to the highest standards but at the same time disservice and disadvantage the wider osteoporosis community [31]. We should not make the same mistake with ethnicity and risk assessment

Conclusion

Despite the wide acceptance of the tool, FRAX should not be considered as a gold standard in patient assessment, but rather as a reference platform. Thus, the fracture risk estimates derived from FRAX should not be uncritically used in the management of patients without an appreciation of its limitations as well as its strengths. In some instances, limitations (e.g. to experts in bone disease) are perceived as strengths to others (e.g. primary care physicians) [32, 33]. Notwithstanding, the calibration of country-specific models and, where appropriate, ethnicity-specific intranational models, helps direct treatment to those most at need and avoids unnecessary intervention in those at low risk, amongst all segments of society. However, well intentioned, the NEJM article has the potential to do more harm than good to patients with osteoporosis.