Abstract

Background. In the past few years, the immune system and tumor immune microenvironment are becoming increasingly popular as more work has been accomplished in this field. However, nomograms based on immune-related characteristics for prognosis prediction of cervical cancer have not been fully explored to our knowledge. We constructed a novel immune score-based nomogram to predict patients with high risk and poor prognosis. Materials and Methods. 198 patients with cervical cancer from The Cancer Genome Atlas (TCGA) database were included in our study. Immune scores were generated with Estimation of STromal and Immune cells in MAlignant Tumor tissues using Expression data (ESTIMATE) algorithm, and clinic-pathological characteristics were also included for subsequent analysis. Cox proportional hazards regression models were performed for univariate and multivariate analyses to screen the significant factors, and a prognostic nomogram was built. Bootstrap resampling analysis was used for internal validation. The calibration curve and concordance index (C-index) were used to assess the predictive performance of the nomogram. Results. Patients were split into three subgroups based on immune scores. We found that patients with high immune scores conferred significantly better overall survival (OS) compared with those with medium and low immune scores (hazard ratio (HR), 0.305; 95% confidence interval (CI), 0.108-0.869). A nomogram with a C-index of 0.720 had a favorable performance for predicting survival rate for clinical use by combining immune scores with other clinical features. The calibration curves at 3 and 5 years suggested a good consistency between the predicted OS and the actual OS probability. Conclusions. Our work highlights the potential clinical application significance of immune score-based nomogram in predicting the OS of cervical cancer patients.

1. Introduction

Cervical cancer is frequently lethal and is the most common female malignancy worldwide, representing 9.8% of all female cancers [1]. Cervical cancer is currently curable if detected and treated early. However, patients with advanced/recurrent cervical cancer carry a poor prognosis, which poses a severe threat to women health and life [2]. Given the high morbidity and poor survival rates related to cervical cancer, reliable prognostic tools are urgently needed to better identify populations at high risk and guide clinical treatments.

Numerous factors are associated with the prognosis of patients with cervical cancer, including tumor stage, depth of invasion, and lymph node (LN) status. The routine prognostic assessments of cervical cancer patients are currently based on the International Federation of Gynaecology and Obstetrics (FIGO) staging system [3]. However, this system is based on physical examination and cannot precisely predict the postoperative prognosis of patients. And clinical outcome differs even among patients with the same tumor stage [4]. Therefore, it is urgent to explore novel and efficient strategies to improve prognosis in patients with cervical cancer. Recently, accumulating evidence has demonstrated that the host immunological features are closely associated with tumor development and patients’ prognosis [57].

Several studies have been performed on the association between tumor microenvironment (TME) and prognosis [811]. TME includes not only tumor cells, but also the surrounding immune cells and stromal cells. Tumor-infiltrating immune cells have been reported to play critical roles in the development of cervical cancer [12]. Besides, Yoshihara et al. calculated immune and stromal scores based on gene expression profiles and used these scores to infer the patterns of different infiltrating cells [13]. Furthermore, accumulating evidence has highlighted the potential clinical value of immune scores in the prognostic evaluation of patients with cervical cancer [1416]. Therefore, a more practicable, effective predictive model using the immune scoring method compared to adopted staging systems should be introduced to assist clinicians to identify patients at higher risk of poor prognosis. And prognostic models based on immune scores have been implemented to predict survival rates in different tumor types [1720]. However, as far as we know, nomograms based on immune scores for prognosis prediction of cervical cancer have not been well established.

In this study, we assessed immune scores of cervical cancer samples from TCGA database utilizing the ESTIMATE algorithm. We tried to discover the association between immune scores and the disease prognosis, and an immune score-based nomogram was constructed to predict the survival rate for individual patients.

2. Materials and Methods

2.1. Data Collection

The gene expression data of cervical cancer were obtained from TCGA database (https://cancergenome.nih.gov/). Clinic-pathological information and prognostic outcomes regarding TCGA cohort were retrieved from the cBioPortal website client (http://www.cbioportal.org), including unique patient IDs, age, race, tumor grade, tumor stage, clinical pathological type, survival status, and survival time [21]. Only subjects meeting the following inclusion criteria were included in this study: (a) pathologically diagnosed with cervical cancer, (b) with complete clinical pathological grading and staging information, and (c) with both follow-up information and expression data available at the same time. Samples with documented chemotherapy or targeted therapies were excluded to reduce possible confounding bias. And this study fully complies with TCGA publication guidelines and policies (https://cancergenome.nih.gov/publications/publicationguidelines).

2.2. Calculation of Immune Scores

ESTIMATE is a newly designed algorithm by Yoshihara et al. that can be used to calculate immune and stromal scores based on gene expression signatures [13]. These scores could reflect the patterns of different infiltrating cells in tumor samples. Specifically, by performing the single-sample gene set enrichment analysis (ssGSEA), gene expression values were then normalized and reordered. The enrichment score of each gene was calculated by the Empirical Cumulative Distribution Functions of the genes in the signature and the remaining genes. Immune and stromal scores were then obtained by comparing the difference between the cumulative distribution functions based on the absolute expression. Importantly, the ESTIMATE algorithm could be used extensively in almost all human solid cancers, including breast cancer, prostate carcinoma, colon cancer, and cervical cancer [14, 2224]. Thus, this algorithm is a powerful method to evaluate the cellular heterogeneity of TME. In the present study, the immune scores were calculated and 95% CI was inferred from transcriptomic profiles of a cervical cancer cohort from TCGA database by ESTIMATE function of the R software package.

2.3. Data Preprocessing

Duplicate samples refer to those with the same serial sequencing number (sample ID) assigned. Sample data were firstly sorted by sample ID in an Excel spreadsheet (®Microsoft) to identify duplicates. Then, duplicate samples, as well as paracancerous samples, noncancerous samples, and samples without survival information were automatically removed using gdcRNA tools (version 4.6.3) [25]. We then matched the clinic-pathological variables and immune score information dataset based on sample IDs using the R “merge” function. As a result, a total of 198 cases were selected for further analysis. Each immune score corresponds to a different patient. The details of the cohort collections included and excluded at each stage of this study are listed in Figure 1.

2.4. Correlations between Immune Scores and Prognosis

OS was used as primary endpoint. It was defined as the time from inclusion to death from any cause. By applying the X-tile software (Yale University, version 3.6.1), the cutoff points of immune scores were determined [26]. Specifically, applying the X-tile plot in bioinformatics software, the best cutoff points of immune scores were determined according to the relationship between the immune score and OS. To assess significant differences between immune score subgroups and clinical pathological factors, the categorical variables were statistically analyzed using Pearson’s chi-square test or Fisher’s exact test, as appropriate (when any categorical data presented a cases). And continuous variables were analyzed by the analysis of variance (ANOVA) for repeated measurements (one-way ANOVA or Kruskal-Wallis test). To further explore the relationship between immune score subgroups and prognosis, survival curves were generated using Kaplan-Meier estimates, and differences between immune score subgroups were assessed using the log-rank test.

2.5. Nomogram Construction and Validation

Single-factor logistic regression analysis was initially used to determine the relationship of baseline characteristics (Table 1) to the OS of patients. Then, to identify independent survival-related predictors, factors with statistical significance in single factor analysis were subjected to multifactor Cox regression analysis. After excluding the influence of stage and age, the adjusted hazard ratios for prognostic factors and the corresponding 95% CI were estimated. Subsequently, a nomogram model was established depending on the results of multivariate analysis. Internal validation was performed using 1000 bootstrap resamples to test the reliability of the nomogram. The predictive accuracy of the model was quantified by using the concordance index (C-index) [27]. The C-index uses values from 0.5 to 1, with 1.0 indicating perfect discriminative accuracy and 0.5 indicating lack of discrimination (no better than a coin flip). The calibration of the nomogram, which measures how far predictions are from observed outcomes, was assessed via calibration plots for 3- and 5-year survival rates of cervical cancer patients. All statistical significance tests were two-sided, and was considered as the threshold of statistical significance. All descriptive statistics and tests were performed using the statistical software R version 3.6.0 (R Development Core Team 2011).

3. Results

3.1. Patients’ Characteristics

After excluding no detailed data, a total of 198 study patients were enrolled in the final analysis. The detailed clinical characteristics were presented in Table 1. The patients ranged in age from 20 to 88 years (mean age, 48 years; SD, 13.77), and a significant proportion (35.86%) was older than age 50 years. Out of the 198 cervical cancer patients, 146 (77.66%) patients were Asian, 123 (65.43%) were in stage I, and 158 (84.04%) were cervical squamous cell carcinoma patients. The immune scores for 198 cervical cancer samples were downloaded from the ESTIMATE website. The immune scores of patients ranged from -1645.6 to 3002.1, with a median score of 302.42. Based on the best cutoff values generated by X-tile plots for immune scores (374.3 and 1051.6), patients were subsequently assigned to low, intermediate, and high immune score groups (X-tile plot was shown in Figure 2). In general, according to immune scores, patients were distributed as follows: 97 (48.9%) patients were in the low immune score subgroup, 60 (30.3%) patients were in the intermediate immune score subgroup, and 41 (20.7%) patients were in the high immune score subgroup. For all patients from the moment of initial diagnosis, the median OS time of the patients in this study was 33.68 months (range 0-210.51 months).

Table 1 showed the clinic-pathological characteristics of patient subgroups according to immune scores. In the low immune score subgroup, 69.1% of patients were less than 50 years of age. And in the high immune score subgroup, 56.1% of patients were less than 50 years of age. As for the disease stage, the results demonstrated that the percentage of those low immune score subgroup patients that were in stage IV was higher compared to the high score subgroup patients. Beyond this, most of the patients with high or intermediate immune scores were cervical squamous cell carcinoma cases.

3.2. Results of Univariate and Multivariate Analyses

The unadjusted and adjusted associations between clinical pathological features and OS rates were demonstrated in Table 2. As shown in Figure 3 and Table 2, there were substantial differences in terms of OS among patients with age of 70-80, high immune scores, and stage IV (unadjusted hazard ratio (HR) 3.680, 95% CI 1.145-11.828, and ; HR 0.360, 95% CI 0.137-0.946, and ; HR 6.944, 95% CI 2.924-16.488, and , respectively), while different races, grades, and pathological types indicated no statistical significance compared with OS.

All factors that displayed prognostic significance in the univariate analysis were included in the multivariate analysis. Table 3 showed the results of multivariate logistic regression analysis on the determinants of OS. From the results, we can summarize that patients with high immune scores exhibited a better prognosis than those with low or intermediate immune scores (HR and 95% CI: 0.305 and 0.108-0.869, respectively). And, with one exception, we interestingly found that only the age of 70-80 years subgroup was strongly linked with a worse OS (HR: 5.722; 95% CI: 1.666-19.647). When compared to the patients in stages I, II, and III, patients in stage IV clearly had a worse OS (HRs and 95% CIs of stages II, III, and IV were 1.418 [0.584-3.445], 1.249 [0.357-4.364], and 5.898 [2.404-14.473], respectively). Furthermore, as for other clinical characteristics, significant associations were not found ().

3.3. Prognostic Nomogram for OS

As shown in Figure 4, the prognostic nomogram combined all the important independent factors from multivariate analysis for the OS. A C-index of 1 indicates perfect prediction ability, and a C-index of 0.5 indicates a random guess. In the present study, the C-index of the established nomogram for predicting OS was 0.720 (95% CI, 0.622-0.818), which showed that the model had a good predictive ability. The calibration curve of 3-year and 5-year survival probabilities exhibited excellent concordance between actual observations and the nomogram predictions (Figures 5(a) and 5(b)).

4. Discussion

In this study, based on gene expression data and clinical information downloaded from the public databases, we were in a position to examine the prognostic risk factors in cervical cancer patients. The ESTIMATE algorithm has been confirmed to be an efficient method in large and independent datasets. According to this algorithm, the immune scores were obtained for each sample of cervical cancer from TCGA database. The samples were subsequently assigned to low, medium, and high immune score groups based on X-tile plots. Then, after consideration of potential confounders, our results revealed that high immune scores were surely associated with better OS of cervical cancer cases. Additionally, a nomogram combining immune scores with clinical factors was built up to efficiently predict the OS of cervical cancer patients, aiming at improving patients’ prognosis.

The FIGO staging system is the most commonly used clinical staging system to estimate the prognosis for cervical cancer. However, this system has some limitations. Firstly, it is mainly based on the results of physical examinations and fails to incorporate other prognostic factors, including the host immune responses, pathological parameters, and lymph node metastasis [28]. Secondly, the prognostic outcome may significantly vary among patients with the same FIGO stage [29]. Since recently, there is growing evidence that immune-related characteristics could serve as prognostic indicators and guide a personalized treatment in the future. Several studies have identified a series of immune-related genes, which could serve as promising biomarkers for the prognostic prediction of cervical cancer [14, 16, 30]. These findings suggest that immune-related components play critical roles in the prognosis evaluation of patients with cervical cancer. However, most of these studies have not integrated clinic-pathological factors, and few have been widely popularized in routine clinical practice. Nomograms could integrate multiple clinical factors and give an individualized risk assessment for each patient. Compared with the traditionally used staging systems, nomograms have the obvious advantages and stronger predictive power [31].

The present study, to the best of our knowledge, is the first to combine the immune scores with clinical pathological characteristics to construct the nomogram for predicting the prognosis of cervical cancer patients. According to the results of our research, we found that after adjusting for potential confounding factors, high immune scores conferred apparently better OS than middle and low immune scores in patients with cervical cancer. One possible explanation could be that higher immune scores signified an increase in the extent of immune cell infiltration in the TME. And several previous studies have demonstrated that the immune microenvironment of cancer is an important prognostic factor [3234]. Indeed, the TME induced and augmented the systemic antitumor immunity to effectively eradicate the tumor. In addition, novel immune-metabolic targets, such as GLUT1, which were used to overcome therapeutic resistance, played an independent prognostic value in cervical cancer [35]. And a study demonstrated that complete and durable regression of metastatic cervical cancer can occur after a single infusion of HPV-TILs [36]. Therefore, we speculate that immune score may not just be treated as a prognostic predictor, but also provide additional support for the investigation of immune-based treatments for this disease. To identify patients who were at high risk for poor prognosis is quite important because of the great health benefits that have been brought about by immunotherapy.

It should be noted that patients with higher immune scores seemed to be with cervical squamous cell carcinoma. It implied that patients with cervical squamous cell carcinoma may benefit from immunotherapy and achieve a better prognosis compared with those with cervical nonsquamous cell carcinoma, including endocervical adenocarcinoma and endocervical mucinous adenocarcinoma. Also, our results indicated that different pathological types were not related to the prognosis of cervical cancer. Actually, the roles of pathological types in cervical cancer so far remain controversial. Some studies in the literature found that histopathological types were of limited prognostic value or were changed merely in some selected variables or only within some subsets [37, 38]. A study based on the SEER data found that the survival differences between squamous cell carcinoma and adenocarcinoma did not exist [37]. And another study collected 17 histological subtype data of nonsquamous cell carcinoma of the uterine cervix from the Cancer Registry of Norway. This study found that histological subtypes lacked the statistical and clinical meaning except for small cell carcinoma, which was regarded as the only histologic subset of independent importance for prognosis [38]. Thus, our results were consistent with those of previous studies. However, a study conducted by Vinh-Hung et al. demonstrated that the histological type was an important independent prognostic factor in cervical cancer [10]. They found that small cell carcinoma and adenocarcinoma were associated with poorer survival. The inconsistent results of these studies may be due to diverse sample sources, different control groups, or by chance. All these results made this issue complicated and controversial. Further research is therefore needed to confirm our findings. Furthermore, for 70 to 80-year-old patients, their prognosis tended to be worse (HR: 5.722; 95% CI: 1.666-19.647), when compared to patients older than 80 years or younger than 70 years. The probable reason for this result may be due to the limited sample size in other age subsets.

We acknowledge that this study presents several limitations. First, it was a retrospective observational study based on publicly available databases, and it was difficult to cover the data from different races and geographical regions, so a prospective, multicenter, randomized clinical trial is needed to validate our findings. Second, many factors may affect the prognosis of cervical cancer patients, and additional research including more variables should be carried to improve the nomogram. Finally, because of the limited datasets including gene expression files available for calculating immune scores, our model needs to be further validated using independent data.

5. Conclusion

In summary, our research indicated that patients with high immune scores are significantly related to better OS. Moreover, we developed a novel nomogram based on immune scores and used it to calculate 3-year and 5-year survival rates, which may serve as a prognosis stratification tool for facilitating clinical decision-making to make a more reasonable follow-up plan.

Data Availability

All clinical pathological data of cervical cancer patients can be obtained from TCGA database and cBioPortal database. All immune score data can be downloaded from the ESTIMATE database (https://bioinformatics.mdanderson.org/estimate/).

Conflicts of Interest

The authors declare that they have no conflicts of interest to disclose.

Authors’ Contributions

Xingling Qi and Yipeng Fu contributed equally to this work.

Acknowledgments

We are grateful for the support from the National Natural Science Foundation of China (NSFC, No. 81873124).

Supplementary Materials

Supplementary Table 1: the detailed clinical data for each patient. Supplementary Table 2: the immune scores of each patient based on the ESTIMATE algorithm. Supplementary Table 3: the main clinical characteristics and follow-up information of patients with cervical cancer. Supplementary Table 4: results of univariate Cox regression analyses. Supplementary Table 5: results of multivariate Cox regression analyses. (Supplementary Materials)