Screening for breast, colorectal, and cervical cancer is an integral component of preventive health care in the United States. The importance of cancer screening is underscored by its mandated insurer coverage in the Affordable Care Act1 and the creation of screening guidelines by the United States Preventive Services Task Force (USPSTF), which provides evidence-based guidelines for average-risk populations.24

In 2011, the U.S. National Cancer Institute (NCI) established the Population-based Research Optimizing Screening through Personalized Regimens (PROSPR) consortium, funding seven research centers, with an overall goal of promoting coordinated, multidisciplinary research to evaluate and improve the cancer screening process across the care continuum from cancer screening to diagnosis and treatment. An essential premise in establishing PROSPR, initially comprised of three centers focused on breast cancer, three on colorectal cancer, and one on cervical cancer screening, is that variation at multiple levels—health system, clinic, provider, and patient—may be leveraged through comparative effectiveness research to elucidate best cancer screening practices. To advance research in this area, the consortium has developed conceptual models for studying screening processes of care57 that extend the work of Zapka and Taplin.8 These models detail the screening process from risk assessment through diagnosis and treatment as steps with interfaces requiring a transfer of information and/or responsibility across providers, and elucidate transitions where threats to timely follow-up may occur at multiple levels.5

The purpose of this paper is to describe observed variations in rates of screening abnormalities and their timely follow-up, with attention to similarities and differences across breast, cervical, and colorectal organ sites and to variations across primary care clinics. With primary care providers and health care systems increasingly taking responsibility for population health through the emergence of patient-centered medical homes and accountable care organizations,9 consideration of variation in cancer screening practices from a primary health care delivery perspective may shed light on areas where the quality of cancer screening can be improved.10,11 Focusing on the patient and their interaction with the entirety of cancer screening, beyond organ-centric cancer screening, has the potential to improve overall quality of care.5,12,13

METHODS

This study was conducted as part of the NCI-funded Population-Based Research Optimizing Screening through Personalized Regimens (PROSPR) consortium. The overall aim of PROSPR is to conduct multi-site, coordinated, trans-disciplinary research to evaluate and improve cancer screening processes. The seven PROSPR research centers reflect the diversity of U.S. health care delivery organizations. The three breast centers include an integrated health care delivery system affiliated with the University of Pennsylvania (UPenn), a statewide mammography and pathology registry housed at the University of Vermont (VT), and primary care practice networks in two states within the Dartmouth-Hitchcock health system in New Hampshire and Brigham and Women’s Hospital in Massachusetts (Dartmouth-BWH). The three integrated health care systems comprising the colorectal centers include two managed care plans, Kaiser Permanente Northern and Southern California (KP) and Group Health (GH), and a regional safety-net clinical provider network at the Parkland-University of Texas Southwestern Medical Center (Parkland-UTSW). The cervical center is a statewide registry housed at the University of New Mexico (NM). Each center submits common data elements obtained from several sources (e.g., electronic health records, administrative data) to a central data repository, which undergoes rigorous data quality assurance processes involving harmonization to account for differences in data sources.

Study Populations

Individuals undergoing screening with the tests of interest in 2011, 2012, or 2013 who met commonly used age criteria for screening were studied.24 For breast cancer screening, we report on women aged 40–75 years screened in 2012 or 2013, with no prior history of breast cancer, who received screening digital mammography with or without digital breast tomosynthesis and had no breast imaging in the prior 90 days. For cervical cancer screening, women aged 21–65 years who received Papanicolaou (Pap)-stained cervical cytology with or without human papillomavirus (HPV) tests in 2012 and had no Pap test in the prior 300 days were studied. For colorectal cancer screening, we report on men and women aged 50–75 years who received high-sensitivity guaiac-based fecal occult blood tests (FOBT) or fecal immunochemical tests (FIT) in 2011. Activities were approved by the institutional review boards at each research center and by the PROSPR Statistical Coordinating Center.

Screening Abnormalities and Timely Initial Follow-Up of Abnormalities

The screening abnormalities studied and expected time frame for timely follow-up are summarized in Table 1. Due to current PROSPR data availability, we restricted reporting for cervical cancer screening to the subset of abnormalities where follow-up is anticipated within 3 months. This includes the combination of HPV+ and atypical squamous cells of undetermined significance (ASC-US), and low-grade squamous intraepithelial lesion (LSIL) or higher-grade lesions (ASC-H: atypical squamous cells, cannot exclude high-grade squamous intraepithelial lesion, and HSIL: high-grade squamous intraepithelial lesion).

Table 1 Definitions for Abnormal Screening and Timely Follow-Up Testing

Statistical Analysis

We report screening abnormality rates per 1000 by organ site and center. For the centers that report screening data by primary care clinic (UPenn, Dartmouth-BWH, Parkland-UTSW, GH, NM), we describe the distribution of screening abnormalities across clinics within centers using medians and interquartile ranges (IQR), and portray these graphically using box plots. For cervical cancer screening, we included all submitted Pap tests regardless of provider type. Due to the instability of rates in light of the small sample sizes, clinic-level analyses are restricted to clinics with 25 or more screening exams for abnormality rates and to 11 or more abnormal screening exams for follow-up rates.

We portray time to initial follow-up of an abnormal screening test using Kaplan–Meier plots by center. To characterize variation in timely follow-up, we report both the overall percentage of cases with timely follow-up across centers within organ and by clinic, and the median days and IQR from abnormal result to follow-up.

We used generalized linear mixed models to determine whether observed variation in follow-up across primary care clinics was attributable to differences in observed patient characteristics, and whether variation was greater than expected by chance after adjustment for patient characteristics (SAS PROC GLIMMIX). In these mixed-effects logistic regression models, we treated center and individual patient characteristics as fixed effects and primary care clinic as a random effect. To facilitate comparisons across demographically diverse populations, all models were adjusted for age, colorectal and breast models were adjusted for race/ethnicity, and colorectal models were also adjusted for sex. Primary care clinic-level variation in the percentage of abnormal screens and percentage with timely initial follow-up is described as the range in the 25th to 75th percentile clinics estimated using a clinic variance component.

RESULTS

The characteristics of 1,024,566 individuals who underwent screening are reported in Table 2. Breast centers had comparable ages but differed in race/ethnicity. Colorectal centers differed in both age and race/ethnicity. The cervical center showed a smaller proportion screened at ages 60–65 years and included a sizable Hispanic population, as did one of the colorectal centers.

Table 2 Characteristics of Screened Populations and Clinics by Cancer Type and PROSPR Center

Screening Abnormality Rates

A range in screening abnormality rates was seen across centers and organ sites (Table 3), with some notable age-related patterns (Appendix Table, available online). For breast cancer screening, abnormality rates were higher for 40–49-year-olds than 50–75-year-olds. Colorectal cancer screening begins at age 50, and abnormality rates were lower for 50–64-year-olds than 65–75-year-olds. For cervical cancer screening, abnormality rates declined with age for all categories of abnormal results, with rates in the younger age group approaching those for breast cancer screening.

Table 3 Screening Abnormality Rates (Unadjusted), Percentage with Timely Follow-up*, and Median Time to First Follow-up (TTF) by Cancer Type and PROSPR Center

Timely Initial Follow-Up of Screening Abnormalities

The percentage with timely follow-up was highest for breast screening, at 95.6 % (Table 3). While abnormality follow-up was lower for both colorectal cancer (68.1 %) and cervical cancer screening (44.8 %), cervical abnormality follow-up tended to increase with more severe screening abnormalities, with the highest rates for ASC-H/HSIL (52.6 %).

Time to initial follow-up by center is shown in Figure 1. Time to follow-up was shortest for breast cancer screening and longest for cervical cancer screening. While the breast centers have tightly clustered time-to-follow-up curves, statistically significant variation in time to follow-up was observed across the colorectal centers (log-rank test, p value < 0.001).

Figure 1
figure 1

Kaplan–Meier curves for time to first follow-up by organ and PROSPR Center. Ticks (+) are shown at 3 and 9 months.

Variation in Screening Outcomes Across Primary Care Clinics

Analyses of clinic-level variation in abnormality rates included 33,236 women from 61 clinics for breast, 27,416 individuals from 37 clinics for colorectal, and 164,772 women from 427 clinics for cervical cancer screening. The variation in abnormality rates across primary care clinics by center is shown in Figure 2a. While median abnormality rates were similar across the breast sites—98.0 IQR (87.5–109.5) for UPenn and 93.7 IQR (81.5–102.9) for Dartmouth-BWH—UPenn had more clinics identified as outliers on both the upper and lower ends of the spectrum. Clinic-level abnormality rates differed somewhat among colorectal sites (medians of 32.1 IQR (27.6–46.4) for Parkland-UTSW and 49.6 IQR (37.5–56.3) for GH). The cervical center had more outliers than did breast or colorectal centers. Clinic-level variation in the percentage of abnormal screening exams receiving timely follow-up was least pronounced for breast cancer screening, intermediate for colorectal cancer screening, and most pronounced for cervical cancer screening (Fig. 2b).

Figure 2
figure 2

Box plots of clinic-level screening outcomes by organ and PROSPR Center, with number of clinics included shown for each center. (a) Screening abnormality rates. Limited to clinics with 25 or more screening exams. Cervical includes: Pap−/HPV+, ASC-US/HPV−, ASC-US/No HPV, ASC-US/HPV+, LSIL or worse (any HPV result). (b) Percentage with timely follow-up*. Limited to clinics with 11 or more abnormal screening exams. Cervical includes: ASC-US/HPV+ and LSIL or worse (any HPV result). *Timely follow-up is defined as initial follow-up within 3 months for BI-RADS 0, 4, and 5, and within 9 months for BI-RADS 3; within 3 months for FIT/FOBT+; and within 3 months for ASC-US/HPV+ and LSIL or worse (ASC-H, HSIL, LSIL).

Analyses of primary care clinic-level variation in the percentage with timely initial follow-up included 3144 women from 51 clinics for breast, 966 individuals from 29 clinics for colorectal, and 6697 women from 207 clinics for cervical cancer screening. There was significant variation across primary care clinics for both the percentage with an abnormal screen (25th to 75th percentile for breast, 8.5 to 10.3 %; for colorectal, 3.0 to 4.8 %; for cervical, 6.3 to 9.9 %) and the percentage having timely follow-up (25th to 75th percentile for breast, 90.2 to 95.8 %; for colorectal, 43.4 to 52 %; for cervical, 29.6 to 61.4 %).

DISCUSSION

Our study characterizes the variation observed across health care settings and patient populations in screening abnormalities for breast, colorectal, and cervical cancer screening and their timely follow-up—an important process measure for effective cancer screening.14 Timely follow-up was most frequently achieved for breast cancer screening, where the most severe abnormalities (BI-RADS 4 or 5) had 88.9 to 100 % follow-up at 3 months. In contrast, for colorectal cancer screening—where only two-thirds of individuals with FOBT/FIT+ received follow-up within 90 days—there is ample opportunity for improvement. Likewise, among women with cervical screening abnormalities for whom receipt of guideline-concordant colposcopy within 3 months was expected, only 42.9 to 52.6 % received such care.

Screening abnormality rates observed across PROSPR are consistent with prior studies,12,1522 yet the cross-organ perspective highlights the higher abnormality rate for breast cancer screening (10.5 % of individuals screened) relative to colorectal cancer screening using FOBT/FIT (4.7 % of individuals screened) or cervical cancer screening (7.9 % of individuals screened). Differences in rates of colorectal cancer screening abnormalities may reflect differences in the proportion of individuals undergoing first-time versus repeat FOBT/FIT screening. Since colorectal adenomas are detected more frequently in men and older patients, Parkland-UTSW’s younger population with a higher proportion of women would be expected to have a lower abnormal test rate than GH and KP.

Other factors may contribute to variation in the observed organ-specific abnormality rates, including the mix of breast imaging modalities with different operating characteristics, the sensitivity of different types of fecal occult blood tests, and the subjectivity of radiologist and cervical cytology interpretations, as well as differences in laboratory processing procedures. While UPenn was an early adopter of digital breast tomosynthesis, their abnormality rate was intermediate, between the rates observed for Dartmouth-BWH and VT. The health systems also differed in the types of fecal tests (FOBT and FIT) and processors used during the years studied.7 The lower abnormality rate found for GH in comparison to KP could reflect the use of the FOBT (used at some GH clinics through 2011), which is more specific and less sensitive than FIT (used at KP).23 For cervical cancer screening, the higher variation in abnormality rates may be attributable to the diverse populations served at specific clinics and to the fact that we did not restrict analysis to primary care clinics or providers.

From a primary care population health management perspective, variation in timely screening abnormality follow-up is concerning, and may be influenced by patient and system characteristics as well as differences in the underlying complexity of coordinating the next steps in clinical care.24 Inherent differences in the severity of abnormalities detected and the type of follow-up required may also affect timely follow-up. For example, the lower follow-up rates for colorectal cancer screening abnormalities may be due to the perceived inconvenience and invasive nature of the procedures involved. Patients undergoing colonoscopy typically need to take time off work, which may pose a barrier especially for lower-income individuals. Preparation for colonoscopy also includes dietary restriction, bowel preparation, and sedation in an inpatient or same-day surgical environment. Furthermore, patients with a positive FOBT/FIT must transition from the primary care setting to gastrointestinal (GI) specialty care. This requires an interface—a transfer of role/responsibility between primary care and specialist.25 Similar challenges are encountered for follow-up of cervical screening abnormalities, as specialist referral for diagnostic colposcopy is often required. In contrast, the initial follow-up for breast cancer screening abnormalities often entails additional imaging, which may be completed in the same setting and often on the same day.

The nature of the populations served may also affect the timeliness of follow-up care.2630 At Parkland-UTSW, a safety-net setting, GI clinics must handle all symptomatic uninsured patients who walk in through the emergency department, which may overwhelm the capacity of GI clinics to perform timely colonoscopy follow-up.

Health care organizations’ capacity, integration of services, and outreach may also influence the timeliness of follow-up. During the time interval studied, the colorectal center with the most favorable follow-up (KP) began implementing benchmarks and pathways designed to promote colonoscopy follow-up. These include a goal of 14 days for follow-up of a positive FIT with dedicated staff to directly schedule diagnostic colonoscopies by phone, email, and/or standard mail (KP-southern region), and a goal of 80 % diagnostic colonoscopy performance within 1 month of a positive FIT (KP-northern region).31 The provision of monthly reports on time to follow-up for GI physicians and clinical leaders at KP likely also contributes to higher follow-up rates.

Federal policies and regulations that encourage timely cancer screening may affect screening follow-up well beyond individual patient and health system factors. Such policies exist for breast and cervical cancer screening, but are non-existent for colorectal cancer screening, for which we found the greatest variation in timely follow-up. Since its reauthorization in 1999, the national Mammography Quality Standards Act (MQSA; 42 U.S.C. 263b), which regulates breast cancer screening quality and the use of the Breast Imaging-Reporting and Data System (BI-RADS),32 has promoted timely follow-up of screening abnormalities. Importantly, the MQSA requires that health care facilities provide patients with written results of their mammograms in language that is easy to understand, and also requires reporting of breast cancer screening results to primary care providers. These regulations may partly explain the smaller variation in follow-up rates across centers and clinics for breast cancer screening compared to colorectal and cervical cancer screening.

Prior to enactment of the MQSA, the Breast and Cervical Cancer Mortality Prevention Act of 1990 (Public Law 101-354) established the Centers for Disease Control and Prevention's National Breast and Cervical Cancer Early Detection Program, which provides screening and diagnostic follow-up to low-income and under- or uninsured women, set targets for timely follow-up of screening abnormalities (80 % complete diagnostic evaluation within 90 days), and ties funding to meeting those targets.33 The act also funded public education designed to increase public awareness of the benefits of early detection, which may contribute to timely follow-up by patients with screening abnormalities.

Although colorectal cancer screening is not regulated by federal policies, it is encouraged through national clinical guidelines and the National Committee for Quality Assurance’s Healthcare Effectiveness Data and Information Set (HEDIS) for health plan quality reporting.34 However, HEDIS cancer screening-related quality measures address the provision of screening services rather than timely follow-up of abnormalities.

While the diversity of settings represented within PROSPR provides rich opportunities for comparative studies, some limitations warrant mention. First, cross-organ comparisons may be hampered by the different care settings represented across organ sites, with some settings affected to a greater degree by potential out-of-system care.35 Second, the data available through each center differ in source and quality. The two statewide registries provide strength in their inclusion of more heterogeneous populations than may be seen among the insured who are cared for within integrated health systems or primary care networks, but have more limited access to detailed clinical information. However, even when electronic health record data are available, challenges remain in creating comparable data across centers given the differences among systems and the lack of structured data for covariates of interest in studying screening processes of care (e.g., risk factors).36,37 Finally, PROSPR’s common data elements capture care that was delivered but do not include information on scheduled care.

Our study underscores the potential for using PROSPR data to gain insight into factors affecting timely follow-up of screening abnormalities and for understanding whether timeliness of follow-up results in different clinical outcomes. Gaining insight into best screening processes of care within PROSPR will require multi-level investigation of patient, provider, clinic, and health system factors, and entails assembling health system information not readily available through electronic health records alone.38,39 Clinical care pathways regarding communication of test results is one such area where the consortium is actively working to document current practices. These data will enable evaluation of the role of communication at care interfaces in the timely follow-up of screening abnormalities. This should advance systems approaches to improving the quality of cancer care, which are already gaining traction.40 They also allow investigation into whether differences in time to follow-up result in meaningful differences in disease outcomes; such data can inform evidence-based metrics for follow-up guidelines. Other features of the health system and the legal and regulatory environments in which care is provided must also be considered, including the extent to which clinics participating in PROSPR have received National Council on Quality Assurance recognition as a patient-centered medical home.41 Through focused comparative effectiveness studies, the variation documented in rates of screening abnormalities and follow-up within PROSPR may help guide the development and implementation of health care delivery interventions and policies with the potential to improve population health.