Using Vanderbilt’s patient database, Batarache et al. found that constellations of billing codes could be used to identify patients with previously unidentified Mendelian (gene-borne) diseases.1,2 Artificial intelligence-informed, billing-record-based3 physician decision support at the point of care might enable earlier diagnosis and treatment. Among the fee-for-service Medicare population, we sought to examine the prevalence with which cases of newly diagnosed Mendelian conditions had phenotypically related diagnoses in previous years

METHODS

We used Medicare inpatient, outpatient, and part B files to identify individuals who were fully enrolled in fee-for-service Medicare between 2016 and 2019 and had a 2019 ICD-10 diagnosis of any of 12 Mendelian genetic conditions (each having at least 150 newly diagnosed cases between 2016 and 2019) listed in Table 1 that had not been recorded in 2016, 2017, or 2018.

Table 1 Number and Distribution Across Race and Age Categories of New Diagnoses of 12 Mendelian Genetic Conditions in 2019 that Had Not Been Diagnosed in 2016, 2017, or 2018. Blank Cells Represent < 11 Patients; CMS Does Not Allow Reporting Exact Numbers for Cell Sizes less than 11. Data for Ages 18–30 Are Not Reported as the Vast Majority of Cells Had < 11 Patients

For beneficiaries with any of these diagnoses, we examined 2016–2018 billing records to identify ICD-10 codes phenotypically associated with each specific condition, as described by Wu et al.4 and provided through the Phenome Wide Association Studies Resources website.5 For each condition, we enumerated condition-specific-related ICD-10 diagnostic codes (for example, macrocephaly for achondroplasia) and calculated the proportion of cases for which at least 5 and at least 10 phenotypically related codes were listed in billing records in 2018 and between 2016 and 2018. We also examined distributions of cases across white, black, and other race, limiting to those categories because black and white race constituted the majority of cases.

We had IRB and Centers for Medicare and Medicaid Services (CMS) approval to conduct this work through CareJourney’s Virtual Research Data Center.

RESULTS

In 2019, of 39,917,598 beneficiaries fully enrolled in fee-for-service Medicare between 2016 and 2019, 28,377 had a newly coded diagnosis of at least 1 of the Mendelian diseases that we studied (Table 1). Polycythemia vera was the most common, representing 44% of all conditions examined; achondroplasia was the least common. With the exceptions of hereditary hemochromatosis and polycythemia vera, older white women accounted for most cases.

Depending on the disorder, between 60.5 and 87.8% of patients with a newly diagnosed Mendelian disorder in 2019 had at least 5 phenotypically related diagnoses in the previous year; between 73.6 and 97.3% had at least 5 phenotypically related diagnoses coded across the previous 3 years (Table 2). In 2018, between 29.7 and 58.7% of patients newly diagnosed in 2019 had at least 10 phenotypically related diagnoses; aggregating data from 2016 to 2018, those proportions grew to between 50.4 and 84.8%.

Table 2 Proportion of Patients with Newly Coded Mendelian Disease in 2019 Who Had 5 or More (Left) or 10 or More (Right) Phenotypically Related Diagnoses in 2018 and in the 3-Year Period 2016–2018

DISCUSSION

We used 4 years of Medicare fee-for-service data to identify beneficiaries with any of 12 newly coded Mendelian diseases and examined the prevalence of phenotypically related diagnoses in the 3 preceding years’ billing records. For a given condition, up to 87.8% of identified patients had 5 or more related diagnoses in the year immediately preceding a new Mendelian condition diagnosis; up to 97.3% had 5 or more across the preceding 3 years.

Our findings suggest that—even in the older, Medicare-insured population that we studied—artificial intelligence-informed decision support might help providers identify patients with Mendelian disorders by aggregating constellations of diagnoses recorded in the recent past that suggest an overarching one.3 Surprisingly, a not insubstantial number of Medicare beneficiaries might be identified with the disorders we studied.

Our study has several limitations. First, it is possible that Mendelian disorder diagnostic codes were simply not recorded for 3 years before reappearing in 2019. While we recently found a fairly dramatic year-to-year drop-off in diagnostic coding of chronic conditions,6 Mendelian conditions tend to be life-long, disabling, and, frequently, visually apparent; it should be somewhat surprising for them not to be recorded. Second, we were not able to confirm the diagnoses we studied with genetic testing, as Batarache et al. were able to do.2 Third, our study was limited by its reliance on relatively recent administrative datasets wherein final reconciliation delays might trivially impact dataset completeness. Finally, future research should explore whether artificial intelligence-based decision support using recent phenotypically related diagnoses is appropriate for the Medicare-insured population.

Nonetheless, our analysis demonstrates that there are relatively large numbers of individuals in the fee-for-service Medicare beneficiary population that might be identified as having a Mendelian genetic disorder by screening phenotypically related diagnostic billing codes. Among those for whom the diagnosis is indeed novel, earlier genetic testing and diagnosis of these Mendelian disorders might lead to better treatment and outcomes.