Introduction

A substantial number of men receiving a prostate cancer (PCa) diagnosis will initially show advanced disease [1] or will experience cancer relapse after initial local treatment (radical prostatectomy—RP or external beam radiotherapy—EBRT). The accuracy of conventional imaging (CI) for PCa staging and restaging is poor. Both contrast-enhanced abdominopelvic computed tomography (CT) and whole-body bone scintigraphy (BS) are of limited utility in patients with low-risk primary disease or relapsed disease with only moderately elevated prostate-specific antigen (PSA) levels [24]. Although these imaging modalities are widely used in first-line imaging workup, American and European guidelines do not recommend them [5]. At initial staging, transrectal ultrasonography (TRUS) is usually the first imaging method used and its role, primarily, is to guide prostatic needle biopsy. However, in view of the increase in organ-sparing treatment options, precise evaluation of intraprostatic extension of PCa is gaining clinical importance. Magnetic resonance imaging (MRI) is widely employed to detect PCa lesions; however, detectability of this disease is limited, even when MRI is combined with MR spectroscopy (MRS) [69]. Furthermore, 18F-fluorodeoxyglucose (18F-FDG) positron emission tomography (PET) is of only minimal utility for the evaluation of organ-confined PCa [10, 11].

Restaging and follow-up of PCa is typically performed using whole-body BS, the most sensitive test for detecting skeletal metastasis, and abdominopelvic contrast CT for detection of soft tissue malignancy, predominantly pelvic nodal involvement. These and other imaging modalities including 18F-FDG PET have shown limited accuracy [1114].

Many recently published reports have highlighted the potential advantages offered by PET with new radiotracers such as 18F/11C-labeled choline and 11C-acetate in the assessment of PCa patients [1518]. The utility of these tracers is based on the increased cell proliferation in tumors and the up-regulation of choline kinase in cancer cells [14].

This systematic review and meta-analysis sets out to provide an overview of the diagnostic performance, in PCa, of 18F/11C-labeled choline PET or PET/CT in the detection of primary tumor and loco-regional and distant metastases, in comparison with histology of primary PCa, other tracers and other imaging modalities.

Materials and methods

Literature search

A computer search of the literature was performed to identify studies, in humans, that compared the diagnostic performance of radiolabeled choline PET or radiolabeled choline PET/CT with other imaging modalities or tracers used for the detection of PCa. We also included studies validating the results of radiolabeled choline PET with histology of primary PCa alone. The PubMed and ISI Web of Knowledge databases from 2000 to October 2012 were searched using the following key words “prostatic neoplasm” or “prostatic” and “neoplasm” or “prostate” and “cancer” or “prostate cancer” and “choline pet” or “prostate cancer” and “choline PET/CT”. For the Medline search, the following limits were used: species (human), article type (reviews and systematic reviews, clinical trials and randomized clinical trials, original articles, comparative studies, and multicenter studies), and language (English). The references of articles found in the literature search were also examined to identify additional reports that met the inclusion criteria. In each of these series, we looked for the following items: number of patients, mean or median age, design of the study, reference standard, sensitivity, specificity, and other diagnostic data from radiolabeled choline PET or PET/CT scans. Articles giving results of 18F/11C-labeled choline PET or PET/CT studies and containing information on primary PCa histological specimens, other imaging modalities (MRI or CT or BS) or other tracers in PCa patients and published in English were reviewed. The list of articles was supplemented through extensive cross-checking of the reference lists of all the retrieved articles.

Selection of studies

Two experienced nuclear medicine physicians (LE and ARC) independently checked the retrieved articles. Disagreements were resolved by consensus. For the qualitative analysis, reports that included data on comparisons between histology of primary tumor, radiolabeled choline PET and other imaging modalities were included if: (a) the reference standard was pathology or other common imaging modalities, and (b) the sample size was ≥10 patients. Moreover, for the meta-analysis, articles were included if the absolute numbers of true positive (TP), false negative (FN), false positive (FP), and true negative (TN) test results were available or derivable from the articles, thereby allowing us to construct 2 × 2 contingency tables. Abstracts were excluded from this analysis on the grounds that they do not provide sufficient data to evaluate methodological quality and to allow calculation of diagnostic accuracy. Reviews, clinical reports and editorial comments were also excluded.

Data extraction

Three observers (LE, ARC, and MB), using a standardized form, independently extracted relevant data on study characteristics and examination results. The observers were not blinded to the information such as the journal name, authors’ names, authors’ affiliations, and year of publication, since this has been shown to be unnecessary [19]. The reviewers (ARC and MB) evaluated relevant studies using the Quality Assessment Tool for Diagnostic Accuracy Studies (QUADAS) criteria [20]. The evaluation was based on a 14-point scale. Each item was rated “yes,” “no,” or “unclear”. Inconsistent findings between the two readers were discussed and agreed upon by consensus (LE). For each included study, information was collected concerning the basic study (author name, journal, year of publication, country of origin, and study design), patients’ demographic and clinical characteristics (mean age, number of patients), technical parameters (type of radiopharmaceutical injected), and radiolabeled choline PET or PET/CT evaluation (visual or semi-quantitative analysis by standardized uptake value—SUV).

Statistical analysis

The numbers of TP, TN, FP, and FN results were extracted or computed from each selected study based on the radiolabeled choline PET as the index test. The analysis of each report was carried out according to the site of disease, the type of comparative imaging and the tracer. For each invasive and non-invasive technique, pooled sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), likelihood ratio (LR), accuracy and diagnostic odds ratio (DOR) were calculated with 95 % confidence intervals (CIs). We also calculated summary receiver operating characteristic (sROC) curves and areas under the curve (AUCs). A random effects model was used. Between-study heterogeneity was assessed using the Chi-squared and I-squared tests. The Chi-squared test provided an estimate of the between-study variance, while the I-squared test measured the proportion of inconsistency in individual studies that cannot be explained by chance. In accordance with Higgins et al. [21], the values of 25, 50, and 75 % for heterogeneity (I-squared) were considered low, moderate and high, respectively. The AUC was calculated to measure the accuracy of radiolabeled choline PET/CT in diagnosis of PCa versus other imaging techniques or tracers. All statistical analyses were performed using Meta-Disc statistical software version 1.4 (Unit of Clinical Biostatistics, Ramòn y Cajal Hospital, Madrid, Spain [22]).

Results

Identification and characteristics of studies

The search, using Mesh terms, of the PubMed and Web of Knowledge databases from 2000 to October 2012 generated 246 and 432 results, respectively. Thirty-seven and four original articles, respectively, were in English and dealt with comparative studies. Three articles were the same and were, therefore, considered only once. Six articles were identified through examination of the references of the retrieved studies. Twenty-three articles on the use, in PCa, of radiolabeled choline PET or PET/CT in comparison with other imaging techniques or other modalities were identified based on the inclusion criteria. Table 1 shows the characteristics of the identified studies. For the meta-analysis, we assessed the diagnostic performance of 18F/11C-labeled choline or PET/CT and other tools in eight original articles (Fig. 1, according to the PRISMA standard). Application of the QUADAS criteria showed these studies to be of a high quality (n = 8; score 11–14). The selected articles yielded data on a total of 276 patients (n = 89 for MRI, n = 89 for other tracers, and n = 98 for histology of primary PCa). Table 2 details the characteristics of the studies included in the meta-analysis. The entire population studied ranged in age from 41 to 88 years. Table 3 summarizes the diagnostic accuracy of each of the selected studies.

Table 1 Basic study and patient characteristics
Fig. 1
figure 1

Flow diagram of the process used to select, according to the PRISMA standard, studies for a meta-analysis investigating the diagnostic accuracy of radiolabeled choline PET and PET/CT in prostate cancer

Table 2 Clinical characteristics of the patients included in the studies selected for the meta-analysis
Table 3 Summary of studies included for the meta-analysis

Radiolabeled choline PET vs. step section histopathology in primary tumor

Qualitative analysis

Kwee et al. [16], Farsad et al. [23], Martorana et al. [24], and Giovacchini et al. [25] compared the step-sectioned whole-mounted prostate specimen analysis with radiolabeled choline PET prior to RP. Kwee et al. [16] found a correlation between the maximum SUV (SUVmax) of malignant sextants and tumor size, assuming the SUV measured in a volume of tissue containing only malignant cells to be higher than that in a volume containing both malignant and benign cells. The authors, using an SUVmax threshold of 4.0, reported a sensitivity of 85 % and a specificity of 62 % for the detection of primary cancer. Farsad et al. [23], using six regions of interest (ROIs) and an SUVmax threshold of 2.5, found sensitivity, specificity, and accuracy values of 66, 81, and 71 %, respectively. Reske et al. [17] employing an SUVmax threshold of 2.65 obtained using 36 ROIs in prostate, reported a sensitivity of 81 %, a specificity of 87 %, and an accuracy of 84 %. Moreover, Scher et al. [26] performed a patient-based analysis of predominantly qualitative images analysis and provided a sensitivity of 86.5 %, a specificity of 61.9 %, and an accuracy of 77.6 % for radiolabeled choline PET and PET/CT in the detection of primary malignancy. Furthermore, these authors suggested that a cut-off value of 3.3, above which the SUVmax is considered to be malignant, would have yielded the best compromise between sensitivity and specificity, at 70.3 % and 57.1 %, respectively. Finally, in 2008, Giovacchini et al. [25] also argued that choline SUVmax had only moderate diagnostic accuracy in the assessment of histological findings.

Quantitative analysis (meta-analysis)

The sensitivity and specificity of radiolabeled choline PET/CT in the detection of primary PCa as compared to histology specimens ranged from 64 to 87 % and from 45.4 to 84.2 %, respectively, reaching a pooled sensitivity of 72 % (66.9–76.6 %) and a pooled specificity of 61.6 % (55.1–67.7 %). Likelihood ratio χ 2 test statistics were 19.49 (p < 0.001) and 36.70 (p < 0.001), respectively, for the sensitivity and specificity while the I 2 index values were 89.7 and 94.6 %, respectively; therefore, a high heterogeneity was shown. The SROC analysis reported an AUC value of 0.754 (p = 0.08).

Radiolabeled choline PET vs. MRI and TRUS in primary tumor

Qualitative analysis

Four reports from the literature compared radiolabeled choline PET and MRI in primary tumor detection [2730], while one study focused on the difference in accuracy between MRI and radiolabeled choline PET/CT in the detection of local recurrence [31]. Rinnab et al. [32] compared the accuracy of 11C-labeled choline PET with TRUS, reporting that PET was more sensitive than TRUS for extracapsular extension, seminal vescicle involvement, and T4 tumors, but both methods understaged microscopic extension of the disease. Yamaguchi et al. [27] demonstrated the superiority of 11C-labeled choline PET with localization by SUVmax to predict laterality among primary lesions when compared with MRS using localization by the ratio of choline + creatinine to citrate (Cho + Cr/Ci ratio). These results led the authors to speculate that there is an essential difference in the significance of choline measurement between radiolabeled choline PET and MRS. Testa et al. [29] and Watanabe et al. [28] concluded that MRI had the highest sensitivity, while PET/CT showed good specificity; therefore, according to Van den Berg et al. [30], the association of MRI and SUVmax on radiolabeled choline PET/CT should be of benefit in primary tumor detection (sensitivity = 80.5 vs. 84.7 % for MRI alone and MRI plus SUVmax ≥2.7, respectively). On the other hand, Panebianco et al. [31] underlined the advantages of MRI over radiolabeled choline PET in the detection of local recurrence after surgical prostatectomy.

Quantitative analysis (meta-analysis)

Table 4 reports the results of the meta-analysis performed to compare MRI and radiolabeled choline PET or PET/CT in primary tumor detection. As illustrated, radiolabeled choline PET/CT showed a pooled sensitivity of 62.6 % (54.0–70.6 %) and a pooled specificity of 76.3 % (65.4–85.1 %), whereas MRI demonstrated a pooled sensitivity of 59.7 % (51.1–67.9 %) and a pooled specificity of 76.1 % (65.9–84.6 %). Heterogeneity was found both for MRI and radiolabeled choline PET studies, which was confirmed by the likelihood ratio χ 2 test and the I 2 index. There was no conclusive evidence of a cut-off effect for either radiolabeled choline PET or MRI according to Spearman correlation coefficients (ρ < 0.4), although an abnormal negative LR value was reported for radiolabeled choline PET (see Table 4). The AUCs were 0.743 (p = 0.24) and 0.737 (p = 0.05) for radiolabeled choline PET and MRI, respectively.

Table 4 Pooled diagnostic accuracies of 18F/11C-labeled choline PET–PET/CT and MRI in primary tumor detection

Radiolabeled choline PET vs. other tracers

Qualitative analysis

Langsteger et al. [33] and Beheshti et al. [34] studied 42 and 38 patients with PCa using both 18F-fluoride and 18F-labeled choline PET/CT for the detection of bone metastases. On lesion-based analysis, Beheshti et al. [34] reported sensitivity, specificity, and accuracy values of 74, 99 and 85 % for 18F-labeled choline PET/CT vs 81, 93, and 86 % for 18F-fluoride PET/CT, while Langsteger et al. [33], on site-based analysis, reported sensitivity, specificity, and accuracy values of 90, 96, and 95 % for 18F-labeled choline PET/CT and 87, 94, and 93 % for 18F-fluoride PET.

On the other hand, three articles compared the diagnostic performance of radiolabeled choline PET with that of FDG PET in detecting PCa recurrence [3537]. In all cases, 18F-FDG did not seem indicated for the diagnosis of PCa, unless a poorly differentiated Gleason score group is considered. The different behaviors of the tracers indicate that choline is more sensitive in the detection of PCa relapse, while 18F-FDG seems better able to discriminate the proliferative character of the process.

Watanabe et al. [28] compared the accuracy of radiolabeled choline PET/CT vs 18F-FDG PET/CT in primary PCa, reporting sensitivity, specificity, and accuracy values of 73, 59, and 67 % vs. 31, 88, and 53 %, respectively.

Quantitative analysis (meta-analysis)

The pooled diagnostic performance results of the radiolabeled choline PET and 18F-FDG PET for all sites of disease in only two included studies are presented in Table 5. In all sites of disease, the sensitivity and specificity of radiolabeled choline PET in PCa patients ranged from 60.6 to 91.7 % and from 75.0 to 100 %, reaching a pooled sensitivity of 65.1 % (53.8–75.2 %) and a pooled specificity of 83.3 % (35.9–99.6 %), while the sensitivity and specificity of 18F-FDG PET in PCa patients ranged from 31.0 to 91.7 % and from 75.0 to 100 %, reaching a pooled sensitivity of 39.8 % (29.2–51.1 %) and a pooled specificity of 83.3 % (35.9–99.6 %). There was heterogeneity in only few cases, confirmed by both the likelihood ratio χ2 test and the I 2 index. No SROC curve was computed for radiolabeled choline and 18F-FDG PET due to the small number of studies considered.

Table 5 Pooled diagnostic accuracies of 18F/11C-labeled choline PET–PET/CT and 18F-FDG PET–PET/CT in the detection of PCa metastases

Radiolabeled choline PET vs bone scan

Qualitative analysis

Bone metastasis is a common complication of PCa; about 65–75 % of men with advanced PCa demonstrated bone localizations of the disease [38]. CT, MRI, and BS have been widely used in restaging of the disease, but have shown low sensitivity especially for the detection of bone metastases [39, 40]. However, BS, due to its wide availability, low cost and considerable associated clinical experience, remains the initial study and the standard for the detection of bone lesions. Luboldt et al. [41] demonstrated that diffusion-weighted imaging appeared just as effective as 11C-labeled choline PET/CT in the detection of bone metastases in patients with PCa. Fuccio et al. [42] and Picchio et al. [43] evaluated the clinical utility of 11C-labeled choline PET/CT in patients with bone metastases, in comparison with a single lesion on BS and with multiple lesions on BS, respectively. In both studies, 11C-labeled choline PET/CT showed better sensitivity than BS in the detection of bone lesions but as suggested by Beheshti et al. [44], the comparison of an advanced tomographic modality like PET/CT with the planar only anterior and posterior views of BS may not provide reliable data on the basis of which to reach a conclusion, in particular for the vertebral spine, pelvic skeleton, and ribs. Moreover, PET/CT is able to detect early marrow-based metastases that constitute an important data for therapy monitoring.

Discussion

In most of the cited reports [16, 17, 24, 25], the authors concluded that radiolabeled choline PET, being associated with low accuracy, should not be used as a first-line technique for the initial diagnosis of primary PCa. The present meta-analysis gave a pooled sensitivity, a pooled specificity, and a DOR of 72, 61.6, and 4.84 % for radiolabeled choline PET in comparison with histological specimens, supporting the conclusions of the above-mentioned researchers.

In accordance with Watanabe et al. [28], MRI is recommended as a primary tool in the diagnosis of PCa, while radiolabeled choline PET imaging may serve in the detection of distant metastases when suspected. Moreover, as suggested by Yamaguchi et al. [27], when PCa is highly suspected due to a high serum PSA level but the results of biopsy are negative, obtaining information regarding localization of the main primary lesion by 11C-labeled choline PET may increase the success rate of biopsy and avoid unnecessary repeated biopsies, in turn reducing the discomfort for patients.

In the literature, radiolabeled choline PET has been compared with other tracers, such as 18F-FDG and fluoride with different end points [33, 36, 37, 45]. Radiolabeled choline PET is a convenient and non-invasive one-step procedure for the staging and restaging of PCa disease because in the initial assessment of high-risk PCa patients it can allow the early detection of bone marrow metastases [45], while in the restaging setting, radiolabeled choline PET can provide more important prognostic and therapeutic information than 18F-FDG PET, especially when the Gleason score and PSA value are high (for Gleason scores of 8–10, the overall accuracy was 50 vs. 65 %, while for PSA values >10 ng/mL, the overall accuracy was 33 vs. 100 %, respectively, for 18F-FDG and radiolabeled choline PET [37]).

According to McCarthy et al. [46], PET/CT with radiolabeled choline is able to differentiate malignant nodal involvement from reactive change, but this latter point should be better defined, given that a lack of sensitivity was reported in the staging setting by Evangelista et al. [47]. The issue of possible FP findings is of critical importance because increased 18F/11C-labeled choline uptake is not specific for carcinogenesis. Histological signs of inflammation have been found in lymph nodes with pathological 18F/11C-labeled uptake that was erroneously attributed to recurrent disease [48, 49]. False positive findings may also occur in the prostatectomy bed, although FN results are the greatest concern in this anatomical district [35].

Radiolabeled choline PET/CT is able to identify multiple bone sites of relapse and in many cases to identify extra-osseous lesions, too, even in the presence of low PSA values [43]; it therefore, emerges as an important tool in the restaging of PCa patients, as suggested by Beheshti et al. [44]. Almost all sclerotic lesions with Hounsfield Unit levels >825 are fluorocholine-negative, as reported by the same group [45], but under hormone therapy this pattern raises the possibility that these lesions may no longer be viable. A recent report by Giovacchini et al. [50] reported that PET/CT with 11C-labeled choline is able to detect PCa recurrence in about 10 % of patients who had increasing PSA after RP and no evidence of disease on CI.

In the present study, the difference in diagnostic performance between 18F- and 11C-labeled choline PET in the different clinical scenarios was not assessed due to the low number of relevant reports. In our opinion, Carbon-11 may, in accordance with its physiological elimination, be more useful than 18F-labeled choline PET for the identification of primary PCa and recurrent disease in the prostatic fossae, as reported by Richter et al. [37]. Conversely, Fluorine-18 is more accurate than 11C-labeled choline PET for the identification of lymph node involvement, in accordance with its small bowel accumulation. Both Carbon-11 and Fluorine-18 can be used for the identification of bone metastases. Prospective comparative studies across the tracers should be conducted.

The present study presents some limitations. First, the small number of studies included in the meta-analysis and the heterogeneity between the studies may have influenced the strength of the pooled results in the head-to-head comparison, as also suggested by the wide 95 % CIs. Second, publication bias was not assessed due to the small number of included studies. Third, the inclusion of English only studies in the systematic review could be considered a limitation of the research strategy. Finally, we searched only two data sources for articles, namely PubMed and ISI Web of Knowledge, and the lack of consultation other databases (i.e., SCOPUS or EMBASE) represents a limitation of the study.

Implications for practice. Radiolabeled choline PET/CT does not seem to be indicated for the detection of primary PCa due to its moderate sensitivity and specificity, unless an unexplained increase in PSA is found without any clear evidence of the disease. The advantage of a single scan for the detection of all sites of disease (lymph node and distant metastases) at initial staging, especially in high-risk patients and at restaging, should be considered, particularly when bone lesions are suspected, and a systemic treatment or targeted therapy are planned.

Implications for research. In summary, (1) 11C-labeled choline PET can be useful for guiding a biopsy in cases of PSA increase and no evidence of primary PCa tumor on CI (TRUS or MRI); (2) the diagnostic accuracy of 11C-labeled choline PET/CT in PCa is more accurate than that of conventional BS and 18F-FDG PET or PET/CT, and (3) radiolabeled choline PET/CT may be a “one-stop diagnostic procedure” for PCa both at initial staging and at restaging. In our opinion, a well-structured randomized prospective clinical trial comparing the prognostic value and cost-effectiveness of radiolabeled choline PET/CT in comparison with other imaging modalities should be planned.