Introduction

Globally bladder cancer is the tenth most common cause of cancer related death in men [1]. The majority of bladder cancers are urothelial carcinomas. Accurate pre-operative staging is essential to guide management of bladder cancer [2]. Approximately 75% of bladder tumours are non-muscle invasive cancers (NMIBC) which are commonly diagnosed and treated with local therapy by transurethral resection of bladder tumour (TURBT) and/or intravesical therapies [3, 4]. Muscle-invasive bladder cancer (MIBC) is typically treated by radical cystectomy or radiation therapy [5].

In the UK national timed-cancer pathway targets mandate that for at least 85% of patients, cancer treatment commences within 62 days of referral [6]. The updated 2015 national cancer strategy proposed a 28-day target in which to confirm or rule out a cancer diagnosis following urgent referral with a suspicion of cancer [7]. Current diagnostic pathways for bladder cancer rely on cystoscopy and CT or MRI followed by TURBT [8]. TURBT is regarded both as a form of definitive treatment and local staging for NMIBC, however studies have shown that TURBT can underestimate T-categorisation in up to 40% of patients [9, 10]. Patients with MIBC who are under-staged are at significantly higher risk for disease progression and worse outcomes, with an approximately 30% higher five-year cancer specific mortality [10].

Data from Getting It Right First Time (GIRFT), an NHS (National Health Service) programme designed to improve quality of care in urology, showed that in 2013–2016 patients with MIBC were waiting on average of 144 days from referral to cystectomy [11]. One of the contributory factors to this delay is the wait for imaging investigations. Although bladder MRI is the cornerstone for local staging and is recommended by international guidelines, our experience is that MRI is generally underutilised in the UK due to pressures to meet timed-cancer pathway targets. Scheduling and performing bladder MRI between cystoscopy and TURBT can be challenging with MRI often performed after intervention. Preliminary data from an ongoing randomised trial (BladderPath) comparing risk-stratified (5-point Likert scale) image-directed care with TURBT for newly diagnosed bladder cancer suggest that it is feasible to direct MIBC patients to multiparametric MRI for staging instead of TURBT, thereby reducing time to treatment [12].

The VI-RADS (Vesical Imaging Reporting and Data System) scoring system has helped standardise MRI acquisition and reporting of bladder cancer, with several studies demonstrating good diagnostic performance and reproducibility for assessment of MIBC [13, 14]. However, interpretation of MRI after TURBT can be challenging due to the presence of inflammation and fibrosis within the bladder wall or extravesical fat. VI-RADS is not intended to be used post-TURBT and few studies have investigated the diagnostic performance of MRI in this setting. El-Assmy et al. demonstrated that DWI (diffusion weighted imaging) can reliably differentiate post-TURBT inflammatory change from tumour with results similar to those of conventional cystoscopy [15]. van der Pol et al. found that multiparametric MRI showed high sensitivity and specificity for detection of MIBC, however the inclusion of only patients who underwent cystectomy is likely to have led to a selection bias [16]. Lim et al. showed that MRI with textural analysis can help in the local staging of bladder cancer after TURBT, however this technique is yet to enter routine clinical practice [17]. No further studies have been performed to date. We therefore aimed to assess the diagnostic performance of MRI for detection of bladder cancer following TURBT using histopathology as a reference standard.

Methods

Between August 2016 and January 2018, consecutive patients who underwent MRI post cystoscopy for bladder cancer at our institution and meeting study eligibility criteria were identified. The local institutional ethics committee approved this retrospective study, with the need to obtain informed consent waived (NRES Committee East of England, UK, reference A093248). The inclusion criteria were (a) biopsy-proven bladder cancer, (b) bladder MRI including T1-weighted images (T1WI), T2-weighted images (T2WI) and DWI within 6 weeks of TURBT performed at our institution and (c) either cystectomy or follow-up cystoscopy within 90 days. 62 patients met the initial inclusion criteria, with 21 patients excluded due to concurrent treatment with intravesical BCG, systemic chemotherapy or radiotherapy (n = 14), absence of muscle in the biopsy specimen (n = 3), poor quality T2WI/DWI owing to motion artefact or hip prosthesis (n = 3), or debris obscuring the bladder wall at cystoscopy (n = 1), giving a final eligible study cohort of 41 patients (Fig. 1).

Fig.1
figure 1

Flow diagram showing patient enrolment

MRI protocol

Patients underwent bladder MRI using a 1.5 T MR450 (GE Healthcare, Waukesha, WI, USA) with a 16-channel phased array body coil. Unless contraindicated, intravenous injection of hyoscine butylbromide (Buscopan, 20 mg/ml, Boehringer, Germany) was administered prior to imaging to reduce peristaltic movement. Conventional T1-weighted spin-echo images (449 ms repetition time (TR), 9 ms echo time (TE); 352 × 224 matrix; 24 × 24 cm field-of-view (FOV); 5 mm slice thickness; 2.5 mm intersection gap; one signal acquired) in the axial plane. High-resolution T2 weighted turbo spin-echo images (3700 ms TR, 86 ms TE; 320 × 288 matrix; 24 × 24 cm field-of-view (FOV); 3 mm slice thickness; 1 mm intersection gap; two signals were acquired. Images were obtained in three orthogonal planes. Diffusion-weighted imaging (DWI) was performed using a spin-echo echo-planar imaging pulse sequence (5000 ms TR, 78 ms TE; 128 × 128 matrix; 27 × 27 cm field-of-view (FOV); 4.5 mm slice thickness; 0 mm intersection gap with b-values of 0 and 1000 s/mm2. Apparent diffusion coefficient (ADC) maps were calculated automatically.

Image analysis

MR images were retrospectively evaluated by two subspecialist uroradiologists with 12 years (TB) and 6 years (NS) of experience in reading bladder MRI. Both readers reviewed MRI studies independently and were blinded to clinical data and pathology reports. Each reader independently assessed T1WI and T2WI together and T1WI, T2WI and DWI combined and assigned the perceived likelihood of tumour using a Likert scale: 1 = definitely absent, 2 = possibly absent, 3 = indeterminate, 4 = possibly present and 5 = definitely present. For each lesion size, location and T-categorisation was evaluated using the 8th edition of the American Joint Committee on Cancer (AJCC) staging manual [18]. In brief, readers assessed T1WI to look for haematoma, T2WI to look for a continuous low signal intensity line in the bladder wall that represents an intact muscularis propria and DWI (high b-value images) for tumour which is hyperintense with corresponding hypointensity on ADC maps. To minimise learning bias the second reading session was performed after a wash-out interval of 4 weeks. Readers only had access to specific sequences when reviewing each set of images. Differences in opinion were resolved by consensus, with the most experienced reader’s opinion considered as definitive.

Histopathologic analysis

All biopsies and surgical specimens were analysed and graded according to existing guidelines [19] by an experienced uropathologist and reviewed by a second uropathologist at a multidisciplinary team meeting. Tumour cell type, and T-category were also obtained. In patients with multiple tumours the tumour with the highest pathological T-category was recorded.

Data analysis

Demographic characteristics, MRI data and pathologic data were described with summary statistics. Sensitivity, specificity, positive predictive value (PPV), and negative predictive (NPV) value for detection of tumour using the Likert scale were calculated for both readers, using a 2 × 2 contingency table. For the purpose of data analysis, Likert score 3 was considered as positive for tumour as this reflects how this group of patients would be managed in clinical practice. Accuracy of MRI for T-categorisation was determined for for NMIBC detectable at MRI (< pT2, excluding pTis) and MIBC (≥ pT2). Data was analysed with the statistical package MedCalc (Version 20.305). The receiver operating characteristic (ROC) analysis was performed. Inter-reader agreement was analysed by means of weighted κ values with quadratic weighting (κ = 0.00–0.20, poor; 0.21–0.40, fair; 0.41–0.60, moderate; 0.61–0.80, good; and 0.81–1.00, excellent agreement).

Results

41 patients were included in the study, 35 men (85.4%) and 6 women (14.6%) with median age of 74 years (IQR: 68–79 years). The median interval between TURBT and MRI was 26 days (IQR: 22–30 days). The median interval between MRI and follow-up cystoscopy or cystectomy was 16 days (IQR: 6–32 days).

Histopathology

All patients had biopsy-proven bladder cancer at presentation. 40 patients had urothelial carcinoma and one patient had squamous cell carcinoma. At initial diagnosis all patients were classified as high-grade according to the WHO 2004 classification [20]. The histopathological reference was TURBT (n = 28) or cystectomy (n = 12). One patient underwent follow-up cystoscopy without biopsy followed by close clinical surveillance for ≥ 12 months to establish a benign diagnosis. In this study cohort, 58.5% (n = 24) had no residual tumour, 22.0% (n = 9) had pTis disease, 4.9% (n = 2) had pTa disease, 4.9% (n = 2) had pT1 disease, 4.9% (n = 2) had pT2 disease, 4.9% (n = 2) had pT3 disease and no patients had pT4 disease.

Diagnostic performance of MRI for tumour detection

Using T1WI and T2WI, consensus read identified 19 patients with one tumour and 22 patients with no tumour. Using T1WI, T2WI and DWI, consensus read identified one patient with two tumours, 15 patients with one tumour and 25 patients with no tumour. The sensitivity, specificity, PPV and NPV for detection of tumour using T1WI, T2WI were 50.0%, 54.6%, 21.1%, and 81.8%, respectively and compared to T1WI, T2WI and DWI at 100%, 76.5%, 50.0% and 100% respectively (Table 1). For both readers, using all three sequences (T1WI, T2WI and DWI) resulted in higher sensitivities, specificities, PPVs and NPVs compared to using T1WI and T2WI alone (Supplementary data Table 1). The area under the curve (AUC) value of ROC analysis was 0.587 for T1WI and T2WI and 0.890 for T1WI, T2WI and DWI (Fig. 2). Inter-reader agreement using T1WI and T2WI alone was 0.54 (95% CI 0.23–0.84) and improved to 0.70 (95% CI 0.48–0.91) using all three sequences.

Table 1 Diagnostic performance of MRI for tumour detection with T1WI and T2WI together and T1WI, T2WI and DWI combined
Fig. 2
figure 2

ROC curves for detection of bladder cancer following TURBT. AUC value of ROC analysis was 0.587 for T1WI and T2WI and 0.890 for T1WI, T2WI and DWI

Overestimation of tumour was more common than underestimation (Table 2). The inclusion of DWI improved overall accuracy of MRI from 53.7% to 81.0%. Accuracy of MRI (with DWI) for detection of no tumour was 78.9% (26/34), 100% (4/4) for NMIBC and 100% (4/4) for MIBC. Case examples are shown in Figs. 3, 4 and 5.

Table 2 Diagnostic performance of MRI for tumour detection for each pathological T-category with T1WI and T2WI together and T1WI, T2WI and DWI combined
Fig. 3
figure 3

Axial T1WI (A), T2WI (B), DWI (C) and ADC map (D) in a 68-year-old male with urothelial carcinoma who underwent MRI after TURBT shows a 5 mm lesion at the anterior bladder wall to the left of the midline close to site of known multifocal tumour seen at TURBT (pT1G3). This lesion was missed on the anatomical sequences (A, B) but the focal hyperintense area at DWI (arrow) and corresponding restricted diffusion on ADC maps (C, D) allowed this to be called with the combined read. Cystectomy confirmed tumour in this location, pTis disease

Fig. 4
figure 4

Axial T1WI (A), T2WI (B), DWI (C) and ADC map (D) in a 75-year-old male with urothelial carcinoma who underwent MRI after TURBT shows thickening and perivesical stranding at the left posterolateral bladder wall post-TURBT. Based on the anatomical sequences this finding could be interpreted as T3b disease (A, B), however there was no diffusion restriction to suggest residual/recurrence disease (C, D), the staging was therefore organ-confined disease. Cystectomy showed no residual disease.

Fig. 5
figure 5

Axial T1WI (A), T2WI (B), DWI (C) and ADC map (D) in a 67-year-old male with urothelial carcinoma who underwent MRI after TURBT shows a 9 mm lesion at the right ureteric orifice post-TURBT (G3pTa). Based on the anatomical sequences this finding could be interpreted as residual disease (A, B). Although this was hyperintense at DWI (arrow) there was no diffusion restriction (C, D) as ADC was not low, and therefore this was interpreted as negative for residual/recurrence disease. Serial cystoscopic biopsies showed no evidence of malignancy

Diagnostic performance of MRI for T-categorisation

For NMIBC detectable at MRI (< pT2, excluding pTis) accuracy of T-categorisation improved from 50% (2/4) to 100% (4/4) with DWI. For MIBC (≥ pT2) accuracy of T-categorisation improved from 50% (2/4) to 100% (4/4) with DWI.

Discussion

In this study we assessed the diagnostic performance of MRI for tumour detection and T-categorisation. We found that MRI had high sensitivity, relatively high specificity and high NPV for tumour detection following TURBT. MRI showed high accuracy for patients with no residual tumour. However, overall PPV was only moderate, highlighting the limitations of MRI assessment in this setting and importance of close follow-up cystoscopic surveillance. Overestimation of tumour was more common than underestimation.

In our cohort MRI was performed at a median time of 26 days after TURBT. This was due to time constraints imposed by cancer treatment referral pathways and may compromise staging assessment. Despite the availability of non-invasive imaging and the risk of complications from multiple TURBTs, current guidelines do not specifically address whether and when to use MRI in this setting due to limited data.

To date, there have been limited studies investigating the diagnostic performance of bladder MRI post cystoscopic intervention. Van der Pol et al. reported high sensitivity and specificity for detection of MIBC in 45 patients who underwent TURBT followed by radical cystectomy [16]. More recently, Pecoraro et al. demonstrated the feasibility of a novel scoring system for response assessment at follow-up MRI in 10 patients with MIBC who underwent TURBT and neoadjuvant chemotherapy followed by radical cystectomy [21]. Hafeez et al. showed that quantitative DWI can be helpful for response assessment post radiotherapy in patients with MIBC [22]. We are not aware of any further studies which have investigated the diagnostic performance of MRI after cystoscopic intervention in a representative cohort with NMIBC and MIBC who underwent TURBT, cystectomy or long-term cystoscopic surveillance. Our results suggest that DWI can better distinguish residual tumour from acute inflammatory changes which are commonly encountered after TURBT, where the presence of mural oedema may cause high signal intensity on both T2 and DWI (‘T2-shine through’ effect), but can be mitigated by assessing the ADC map. In the longer term chronic inflammatory changes and fibrosis can also hinder assessment, but also typically do not show marked restricted diffusion. DWI is considered the dominant sequence for estimation of muscle invasion in VI-RADS [13]. Indeed, our findings are consistent with previous studies in the intervention naïve setting where DWI outperforms T2WI for tumour detection [23, 24]. Inter-reader agreement for tumour detection improved from fair (κ = 0.54) to moderate (κ = 0.70) when DWI was included. This was comparable to inter-reader agreement for VI-RADS shown by Barchetti et al. in the intervention-naïve bladder (κ = 0.73), which employed readers of similar experience [25].

Our study has some limitations including its retrospective nature. The sample size is relatively small due to the strict inclusion criteria employed, which was necessary to ensure our cohort benefited from more definitive follow-up histopathological assessment, however, this is comparable to a prior study [26]. The reference standard was TURBT or cystectomy. Despite this mixed reference standard, it was important to include patients with a representative spectrum of disease stages in the study, as this better reflects clinical practice where treatment choice depends on tumour stage and depth of invasion. Finally, we did not utilise dynamic contrast enhanced (DCE) imaging in this study. Although DCE is an integral component of multiparametric bladder MRI with VI-RADS, our study cohort pre-dated publication of VI-RADS, with our protocol at the time based on earlier work which suggested that addition of one functional sequence such as DWI was sufficient [23]. Furthermore VI-RADS was designed for treatment-naïve patients. A study by Wang et al. found that DWI was superior to DCE for differentiating recurrent bladder tumour from inflammation or fibrosis after TURBT [27]. Future studies in a larger population are warranted to investigate factors that affect accuracy of MRI after cystoscopic intervention with the goal of developing reliable and validated criteria for interpretation of MRI in this setting.

Conclusion

Non-contrast MRI with DWI showed high sensitivity and relatively high specificity for detection of residual tumour after TURBT. Inter-reader agreement improved from fair to moderate with the addition of DWI. Our results highlight the usefulness of MRI in the post-TURBT setting to accurately detect residual disease and inform further management pathways.