Introduction

Ovarian malignancies in children and adolescents are relatively rare, with an incidence of 3 per 100,000 compared with 56 cases per 100,000 at the age of 65 to 69 years [1,2,3]. Despite this low incidence, ovarian tumours constitute the most common gynaecological malignancy in children and adolescents. Paediatric ovarian masses encompass a variety of benign and malignant tumours, including rare types such as sex cord-stromal tumours [4,5,6]. Both this heterogeneity and the importance of fertility preservation in this age group make the diagnostic assessment of these masses challenging.

While malignant ovarian neoplasms may need a more aggressive surgical approach, benign masses can either be safely monitored or undergo simple resection allowing for a fertility- and ovary-sparing approach [7]. Being able to discriminate between benign and malignant masses of the ovary is therefore of considerable clinical importance in the initial surgical management [4, 8]. Ultrasound is the first imaging modality in the diagnostic assessment of ovarian masses at any age. Clinically useful rules have been established by the International Ovarian Tumor Analysis (IOTA) group to differentiate between benign and malignant masses. Nevertheless, in about one-fifth of the cases, the nature of the ovarian mass remains undefined [9].

In case of sonographically indeterminate ovarian masses, magnetic resonance (MR) imaging can provide additional information, e.g. on the different components of the mass, tumour rupture and peritoneal depositions. Figures 1 and 2 show examples of an immature teratoma grade I (treated as a benign tumour with local resection and follow-up) and a malignant yolk sac tumour. Functional imaging techniques like diffusion-weighted imaging (DWI) and dynamic contrast-enhanced (DCE) imaging could be of additional value [10]. DCE enables qualitative, quantitative or semi-quantitative evaluation of tumour vascularity, thereby providing information about the nature of the mass. This investigation is based on enhancement patterns, expressed as time-intensity curves (TICs), of which three different types are acknowledged. Type I displays a gradual, continuous rise in signal intensity; type II shows a moderate rise in signal intensity followed by a plateau; and type III is characterised as early washout [11, 12]. In adults, several studies have evaluated the diagnostic value of MR imaging in differentiating between malignant and benign neoplasms and characterising the specific nature of ovarian masses. Based on these studies, the European Society of Urogenital Radiology (ESUR) has developed an algorithmic approach for the imaging of the sonographically indeterminate adnexal mass [7, 13,14,15,16]. However, data on the role of MR imaging in discriminating between benign and malignant ovarian masses in children is scarce. In this systematic review, we evaluate the diagnostic value of MR imaging in children and adolescents with an ovarian mass, including the value of additional MR techniques.

Fig. 1
figure 1

An example of immature teratoma grade 1 of the right ovary in a 15-year-old girl, treated as a benign tumour with local resection and follow-up. Axial T1-weighted before and after administration of gadolinium contrast (a, c), axial T1-weighted with fat-suppression (b) and sagittal T2-weighted turbo spin echo (d) show a cystic-solid mass with fatty components (arrows). Intralesional fat is diagnostic for a teratoma. The relative large amount of enhancing parts increases the risk of immature components

Fig. 2
figure 2

An example of yolk sac tumour of the right ovary in a 16-year-old girl. Sagittal T2-weighted turbo spin echo TSE (a) and T1-weighted gradient echo with fat suppression before and after administration of gadolinium contrast (b, c) show a large cystic solid mass in the lower abdomen. The enhancing parts of the lesion show relative impeded diffusion (arrow) at axial DWI (b1000 and ADC map; d,e)

Methods

Search strategy and eligibility criteria

This review is written according to the PRISMA Statement [17]. A thorough search of PubMed and EMBASE for all available literature published from 2008 to 2018 was performed. These libraries were systematically searched for original studies on the use of MR imaging in differential diagnosis of ovarian masses in both adult women and children. We classified studies into two groups. Studies were classified as ‘paediatric’, when the age of all included patients was 18 years or less. Studies performed on adult women, on the other hand, were classified as ‘adult’. The full search strategy is provided in Supplementary Table 1. Articles were included if suspected ovarian masses were evaluated with MR imaging (either 1.5 T or 3.0 T), including the evaluation of contrast enhancement, and were compared with a histopathology reference standard. Studies providing no description of MR imaging findings and studies on adult women that analysed selectively benign, borderline or malignant masses were excluded. However, similar studies as well as case reports performed on paediatric patients were included, in order to minimise the risk of missing relevant studies. Since ovarian carcinomas are very rare in children, only studies performed on adult patients that included more than 20% of malignant tumours other than carcinoma were considered relevant for this review. This particular cut-off was chosen pragmatically, since it was expected most MR studies in adult ovarian tumours focus on epithelial neoplasms, due to its prevalence of 80–90%.

All studies resulting from the literature search were assessed independently by two researchers (A.M., L.N.). Disagreements about study inclusion or exclusion were settled by consensus.

Quality assessment

The quality of the individual studies was judged using the “Standards for Reporting Diagnostic Accuracy 2015” (STARD 2015) checklist [18]. Included studies were further assessed for methodologic quality independently by two researchers (A.M., L.N.), using the Oxford Centre for Evidence-Based Medicine Levels of Evidence Classification rubric [19].

Data extraction

From the included studies, population size expressed as the number of ovarian masses analysed, mean age of the participating patients, histopathological classification of the ovarian masses and MR imaging protocol and analysis, as well as MR imaging features of the concerning ovarian masses, were scored. As for MR imaging features, information about the following parameters were extracted: size, shape, boundary, wall and septum thickness, vegetation, mass configuration, bilaterality, signal intensity of T1-weighted imaging, ascites/pelvic fluid, peritoneal implants/nodules and contrast enhancement. If available, information on b-values used in DWI and apparent diffusion coefficient (ADC) values were collected. Concerning semi-quantitative DCE, data on TICs, enhancement amplitude and time to peak were included. Lastly, data on diagnostic performance expressed as sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV) or area under the curve (AUC) for these individual parameters were extracted when provided.

Results

Search strategy and eligibility criteria

The study selection process is shown in Fig. 3. The search in PubMed and EMBASE resulted in 3015 studies, of which 536 studies turned out to be duplicates. The remaining 2479 studies were screened by title and abstract, based on which 2341 studies were excluded. Consequently, 138 articles were of potential relevance to this systematic review and their full texts were analysed. This led to the exclusion of another 104 studies. The remaining 34 studies were analysed in this review.

Fig. 3
figure 3

The flowchart summarises the search process with the number of studies included and excluded

Quality assessment

The studies in adult women were predominantly scored as Oxford Evidence level 2 (cross-sectional studies with consistently applied reference standard and blinding). Levels of evidence of the individual studies can be found in Table 1. Quality assessment of the included studies in adult women, using the STARD 2015, is provided in Supplementary Table 2.

Table 1 Characteristics of the studies included in this systematic review regarding the use of MR imaging in differential diagnosis of ovarian masses

Since most studies in children and adolescents concerned either case reports or case series, the majority of these were scored as Oxford Evidence level 4, with the exception of two studies (one cross-sectional study, one non-consecutive study) (Table 1).

Characteristics of included studies

The characteristics of the included studies (18 ‘adult’ [11, 12, 20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35] and 16 ‘paediatric’ studies [36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51]) are provided in Table 1. The mean age of patients included was 10.8 years in the paediatric and 46.9 years in the adult studies. The number of ovarian lesions analysed ranged between 1 and 74 in the paediatric studies and between 23 and 235 in the adult studies. All studies analysed the use of MR imaging in differentiating between benign and malignant tumours of the ovary, with several studies incorporating the differentiation of epithelial borderline tumours as well.

Paediatric studies

Table 2 shows MR imaging findings of the sixteen studies that were included: three cohort studies and 13 case reports. All three cohort studies analysed the diagnostic performance of MR imaging in children and adolescents with ovarian masses (or ovarian germ cell tumours specifically). The thirteen case reports describe limited data on MR characteristics.

Table 2 MR imaging descriptions of ovarian masses provided by ‘paediatric studies’

Adult studies

MR imaging

Ten studies provided a description of MR imaging features. The most often-described features (> 4 out of 10 studies) concerned size, thickness of walls and septa (when present), presence of vegetation, mass configuration, bilaterality, signal intensity on T2-weighted imaging, presence of ascites or peritoneal implants and contrast enhancement. An increased risk of malignancy was related to increased size of the lesion, increased wall thickness, presence and increased size of vegetation, mixed cystic and solid configuration, intermediate to high intensity on T2-weighted imaging, presence of contrast enhancement and of ascites or peritoneal implants. Six of the studies performed an analysis of the diagnostic performance of MR imaging [23, 27, 29,30,31, 34]. Criteria predictive of malignancy, sensitivity, specificity, PPV, NPV and accuracy, if provided, are depicted in Table 3. Sensitivity and specificity, depending on the criteria used, varied between 84.8 to 100% and 20.0 to 98.4%, respectively.

Table 3 Diagnostic performance of MR imaging in differential diagnosis of ovarian masses

DWI-MR imaging

Eight studies investigated the value of DWI-MRI in the differential diagnosis of ovarian masses [20, 21, 23,24,25,26, 31, 34]. b-values (s/mm2), regions of interest (ROI) used to calculate ADC values (× 10−3 mm2/s) and diagnostic performance are shown in Table 4. Mean ADC values for benign and malignant lesions exhibited a significant overlap, with values for benign masses varying between 1.16 and 2.03 × 10−3 mm2/s, whereas the range of ADC values reported for malignant masses was 0.76 to 1.39 × 10−3 mm2/s. Three of the included studies provided information on the diagnostic performance of DWI [23, 25, 31]. Nasr et al provided diagnostic performance of DWI solely, with sensitivity, specificity and accuracy of 100%, 75% and 87% respectively [23]. Emad-Eldin et al and Mansour et al demonstrated sensitivity, specificity and accuracy of DWI additional to MR imaging of 100% and 93.3%; 96.77% and 85%; and 98.46% and 82.3% respectively [25, 31]. The diagnostic performance of specific ADC cut-off values, if provided, is shown in Table 4.

Table 4 Diagnostic performance of DWI-MR imaging in differential diagnosis of ovarian masses

DCE-MR imaging

Nine studies investigated the value of DCE-MRI in the differential diagnosis of ovarian masses [11, 12, 22, 23, 25, 28, 30,31,32]. Data on the TICs and semi-quantitative DCE parameters are depicted in Table 5, as well as diagnostic performance of this sequence and accompanying TICs. Five of these studies divided the different ovarian masses analysed by type of TIC. Type I TICs were most frequently found in benign lesions, with 33 to 85.7% of benign masses showing type I TICs. In type III TICs, on the other hand, there appeared more characteristics of malignancy, with 57.1 to 94.3% of all malignant masses exhibiting type III TICs. Overlap between benign and malignant masses was found by Elzayat et al [30] and Mansour et al [32], with one and nine malignant masses exhibiting a type I TIC, respectively. Overlap was also demonstrated by Li et al [12], with 3 benign masses exhibiting a type III TIC. The enhancement amplitude constituted one of the semi-quantitative parameters and was expressed in various ways, including maximum relative enhancement percentage (MRE%), maximum absolute enhancement (SImax), maximum relative enhancement (SIrel) and signal intensity at 60 s after enhancement (SI60). Malignant masses generally showed an increased enhancement amplitude compared with benign or borderline masses, with some of the studies demonstrating a statistically significant difference between these groups. Time to peak constituted the other semi-quantitative parameter and was indicated by time of half rising (THR), Tmax and time to peak within 200 s after enhancement (TTP200). All studies analysing this parameter agreed on malignant masses exhibiting a shorter time to peak compared with benign masses, again in several of these studies with statistically significant difference. Four studies provided information on the diagnostic performance of DCE [23, 25, 30, 31]. Nasr et al [23] and Elzayat et al [30] provided diagnostic performance of solely DCE, with sensitivity, specificity and accuracy of 60% and 80%; 91% and 100%; and 77.2% and 96%, respectively. Mansour et al [25] and Emad-Eldin et al [31] demonstrated sensitivity, specificity and accuracy of DCE in addition to MR imaging of 93.3 and 94.3; 100 and 100%; and 95% and 96.9%, respectively.

Table 5 Diagnostic performance of DCE-MR imaging in differential diagnosis of ovarian masses

Discussion

Pre-operative discrimination between benign and malignant ovarian masses is of major importance, particularly in children and adolescents, where preserving fertility constitutes a highly important aspect of the therapeutic approach. Although data of MR imaging from paediatric patients were scarce, this review suggests that DWI, with ADC values measured in enhancing components, and semi-quantitative DCE might increase the diagnostic performance of MR imaging in the pre-operative differentiation between benign and malignant ovarian masses.

MR imaging characteristics associated with malignancy included larger size, thicker walls, presence of septa and/or vegetation within the mass, increased signal intensity on T2-weighted imaging, increased contrast enhancement, ascites, peritoneal implants and bilaterality. This corresponds with reports in existing literature describing masses larger than 4 cm, with solid components demonstrating contrast enhancement or cystic lesions with vegetation > 1 cm (as profuse papillary projections), wall and septum thickness of > 3 mm and areas of necrosis as suspicious [52,53,54]. Diagnostic performance of MR imaging has a fairly good sensitivity for differentiating malignant from benign masses. Regarding specificity, however, there is still room for improvement.

DWI seems to improve sensitivity and specificity of MR imaging to 93.3–100% and 85–96.8%, respectively [25, 31]. The added value of ADC is less clear. Although ADC values for malignant masses were lower compared with benign tumours, a considerable overlap was found. This can partly be explained by ADC values depending strongly on the pathologies included, the b-values used and whether ADC is calculated on both solid and cystic components of the lesion, or solely solid components. Several masses of benign origin, including mature teratomas, cystic endometriosis and fibromas, might occur as false positives. These ‘complex masses’ have a more dense composition, not as a result of increased cellularity but rather as a result of the presence of keratinoid substances, products of haemoglobin degradation and dense fibres respectively [24, 25, 31]. To date, no consensus exists on which preferred b-value should be used in DWI of ovarian masses. When solely analysing the studies that focussed on ‘complex masses’ (excluding fat-containing lesions or solely cystic masses), using b-values of > 800 s/mm2 and calculating ADC on solid components of the mass, considerably less overlap in ADC values was demonstrated [20, 24,25,26, 31]. Mean ADC values for benign masses then varied between 1.16 and 1.38 × 10−3 mm2/s and for malignant masses between 0.76 and 1.03 × 10−3 mm2/s. DWI should be performed as an additional sequence in assessing non-fatty, non-haemorrhagic ovarian masses, with ADC values only measured in enhancing components of solid lesions, preferably with the highest b-value of > 800 s/mm2 [7]. Additionally, our results suggest an ADC cut-off of 1.1 × 10−3 mm2/s might represent the best cut-off to help discriminate between benign and malignant lesions.

Another sequence that might contribute to the specificity of MR imaging is based on the process of angiogenesis, which is characteristic of and essential to nearly all malignant tumours [11, 12]. DCE MR attempts to differentiate between benign, borderline and malignant masses by attributing them to one of the three TICs as obtained by DCE. This systematic review shows type I TICs to be fairly predictive of benign origin of the ovarian mass, whereas type III TICs are predictive of malignancy. However, the assessment of enhancement patterns remains qualitative and might therefore be subject to user bias, similar to the evaluation of masses based on morphological criteria [55]. The use of semi-quantitative parameters deducted from the TIC, for example the enhancement amplitude and time to peak, might offer a solution to this subjectivity. Unfortunately, no reliable cut-off values could be extracted due to much heterogeneity of the studies regarding the semi-quantitative parameters analysed and their corresponding cut-off values as well as diagnostic performance. TIC type alone might not be sufficient in distinguishing between benign and malignant masses, since malignant lesions such as adenocarcinomas are sometimes found to be hypovascular, whereas benign masses, e.g. thecomas or sclerosing stromal tumours, might show hypervascularity [11]. Nevertheless, the diagnostic performance of the semi-quantitative parameters seems promising and DCE-MR imaging might thus form a valuable addition. We therefore support the advice of the ESUR to consider DCE-MR imaging in inhomogeneous solid masses on T2 or in complex cystic or cystic/solid masses with concern for malignancy. To deal with the aforementioned user bias and increasing extent of the diagnostic workup of ovarian masses (by incorporating DWI and DCE as well), there might be an interesting role for radiomics to play. This ‘data-driven’ approach which enables the extraction of innumerable quantitative features from tomographic images has already shown promising results in the classification of ovarian epithelial cancer, as well as in predicting several outcome measures [56, 57]. MR spectroscopy has also been reported to play a role in differentiating between borderline and malignant epithelial ovarian tumours [58]. However, epithelial tumours are rare in the paediatric population.

This systematic review faced some limitations. Data on the performance of MR imaging, combined with DWI and DCE, were largely deducted from studies performed in adult women (with no inclusion of paediatric patients), as MR imaging descriptions by paediatric studies were insufficient and no data from a purely paediatric cohort could be obtained. However, in order to minimise the risk of missing relevant studies, such studies and case reports in paediatric patients were included. The included studies showed much heterogeneity in MR imaging protocols, which made a meta-analysis impossible.

The description of the MR imaging features of the ovarian masses was very limited in the paediatric studies, which hampers the implementation for clinical use. Previously published reviews on the imaging of ovarian masses in children and adolescents were mainly based on findings in adult women [59,60,61,62]. This systematic review attempted to select studies applicable to children and adolescents, by exclusively including studies that were conducted either on paediatric patients or on adult women where at least 20% of the included patients had a malignant ovarian tumour other than carcinoma.

In conclusion, this systematic review suggests that DWI, with ADC values measured in enhancing components, and semi-quantitative DCE might further increase the diagnostic performance of MR imaging in the pre-operative differentiation between benign and malignant ovarian masses. Furthermore, our data show that an ADC cut-off of 1.1 × 10−3 mm2/s might contribute to this differentiation. Prospective age-specific studies are needed to confirm the high diagnostic performance of MR imaging in combination with DWI and DCE techniques in children and adolescents with a sonographically indeterminate ovarian mass.