Introduction

Anaplastic thyroid carcinoma (ATC) is an orphan disease and one of the most aggressive cancers due to its rapid progression with limited mean survival of 3–6 months (Neff et al. 2008). Distant metastases that are often present at the time of initial diagnosis in ATC do not only result in a very dismal prognosis but also present a major challenge in decision making for the optimal treatment regime (Maso et al. 2017; Maniakas et al. 2020). At this stage most commonly utilized therapies include cytotoxic chemotherapy (ChT) with or without radiation therapy (RT) (Bible et al. 2021; Haddad et al. 2018; Filetti et al. 2019). More recently immunotherapeutic approaches and targeted therapies have been proposed for the treatment of advanced and metastatic ATC if targetable mutations are detected (Bible et al. 2021; Haddad et al. 2018; Filetti et al. 2019; Subbiah et al. 2018).

Current guidelines (National Comprehensive Cancer Network [NCCN], American Thyroid Association [ATA]) recommend to consider thyroidectomy also in ATC stage IVC if the primary tumor is considered resectable and R0/R1 margins are achievable (Bible et al. 2021; Haddad et al. 2018). It is important to consider local and distant complications when planning therapy at this stage of the disease and to evaluate individual susceptibility toward a chosen therapy within a multidisciplinary expert team. Surgical therapy to the primary tumor ranging from debulking surgery to R0 thyroidectomy could possibly prevent patients from developing life-threatening aspirations, dysphagia, dyspnea, bleedings or superior vena cava syndrome (Haigh et al. 2001). Some of these complications may, however, be prevented by less invasive interventions, e.g., tracheostomy, percutaneous endoscopic gastrostomy (PEG) or venous stenting. Furthermore, sufficient radiation therapy to the primary site or systemic therapies with platin-based agents or targeted therapies to actionable mutations such as BRAF V600E, or NTRK and RET gene fusions may also provide sufficient local control rates (Filetti et al. 2019). Thus, it is important to analyze outcomes and prognostic factors among patients with metastatic ATC to determine the rationale for an invasive intervention.

In this study we aim to investigate the role of different surgical procedures to the primary tumor in patients with ATC stage IVC on their survival. Hereby we compared various surgery types among each other by performing a systematic review of literature, pooled analysis derived from individualized patient data and SEER-database analysis.

Patients and methods

Systematic review of literature and pooled analysis

A systematic review of literature was primarily carried out on 30 th October 2020 and repeated on 15 August 2021 using PubMed/MEDLINE (National Center for Biotechnology Information, Bethesda, MD, USA) and Cochrane databases to identify relevant publications according to PRISMA-protocol. A full list of search terms is provided in Fig. 1. Articles published in English, in the timeframe from 1 January 2000 to 1 August 2021 and identified by the mentioned terms were included into the preliminary analysis. Further eligibility assessment incorporated abstract screening for article and neoplasm type, as well as duplicates. Studies with conflicts of interests, e.g. publications of our facility were excluded. Reviews, meta-analyses, case reports, experimental preclinical data, drug trials, guidelines or consortia and studies without stage IVC ATC were excluded. Remaining articles were analyzed in full-text and excluded, if data or statistical analyses on surgery as prognostic factor for OS/PFS were missing. Furthermore, only studies with exact stage IVC percentage data were included. Publications with individualized patients’ data on age, sex, TNM/UICC stage, treatment specifications and OS were included into pooled analysis. The complete review process was based on the PRISMA guidelines and is depicted in Fig. 1.

Fig. 1
figure 1

PRISMA-flowchart for the systematic review of literature: surgery in stage IVC ATC patients

A cohort of patients with metastatic ATC from our facility has been described previously and pooled with the newly identified data to form a representative cohort (Augustin et al. 2021). Inclusion and censorship criteria have been reported in that study.

SEER-based analysis

Based on the promising results of the pooled analysis we used Surveillance, Epidemiology, and End Results (SEER) database to further verify our null hypothesis. Data on all patients diagnosed with ATC employing histopathological codes of International Classification of Disease for Oncology (3rd edition; ICD-O-3; code 8021/3 [Carcinoma, anaplastic, NOS] and site [Thyroid]) from 2000 to 2016 were analyzed. Only patients with metastatic ATC were included into our study. Patients with aberrant stages as M1/IVC were excluded from the analysis. Staging was based on the SEER histological stage A (1973–2015) [Distant] and American Joint Committee on Cancer (AJCC) 3rd–7th editions [M1/IVC]. Following information was subsequently extracted from the SEER database: sex, age, surgery record and type according to “Rx Surg Prim Site (1998+)”, RT codes and sequences, ChT records, cause-specific deaths codes, vital status and survival in months. Data acquisition date was 2 August 2021. Surgery records “isthmectomy only”, “resection of less than one lobe” and “local surgical excision” were considered as debulking surgery. Patients without specified type of thyroidectomy “thyroidectomy NOS”, “surgery NOS” or “unknown if surgery performed; death certificate ONLY” were excluded from the analysis.

Statistical analysis

Statistical analysis was conducted using SPSS statistics 25 (IBM, Chicago, IL, USA). Univariate analysis of variables comprised log-rank test. Significant variables were subsequently analyzed multivariately in the Cox regression. Significance level was defined for all analyses at α = 0.05. In our single-center data patients were censored if lost to follow-up, overall survival (OS) was defined as the time from initial diagnosis to death, sufficient RT dose was defined as a total RT dose of  ≥ 30 Gy due to a considerable number of patients receiving a total dose of < 30 Gy. In these cases, RT was mostly discontinued given the premature patients death or local complications.

Results

Systematic review of literature and pooled analysis

In total, 214 publications were yielded by our combination of search terms in the PubMed MEDLINE database. Cochrane search did not reveal any relevant studies. Potentially relevant publications accounted for 101 studies and were identified by abstract screening for eligibility, as shown in Fig. 1. Full-text assessment of these articles was based on the presence of statistical analyses specific for our hypothesis, namely, surgery as prognostic factor for improved overall or progression-free survival (OS, DSS and PFS, respectively). Thus, 36 retrospective studies were fully included into our systematic review (Table 1). A total of 12,725 patients were evaluated by the mentioned studies. Stage IVC accounted for 41.1% or approximately 5226 of these patients. Resection margins were reported at least partially by 16 (44.4%) of the studies. Twenty-seven (75%) publications showed surgery or certain resection status to be favorable for a longer survival, decreased relative risk or an improved local control (LC) in the univariate analysis. Two studies claimed their univariate analysis to be significant at α-levels of p < 0.2 or did not provide additional information on the significance level other than p < 0.1. In multivariate analysis, surgery or certain resection status were identified as an independent prognostic factor in 19 (52.7%) publications. Two publications did not provide p values for their multivariate analyses, but the 95% CI did not cross HR value of 1.0.

Table 1 Systematic review of literature—surgery in ATC

In total, 10 studies (Takahashi et al. 2019; Aslan et al. 2014; Crevoisier et al. 2004; Stavas et al. 2014; Ito et al. 2012; Lim et al. 2012; So et al. 2017; Tennvall et al. 2002; Troch et al. 2010; Busnardo et al. 2000) provided individualized patient data that were eligible for a pooled analysis with our single center cohort described previously (Augustin et al. 2021). This new cohort included 123 patients with stage IVC ATC and a median age of 71 (39–94) years (Table 2). Treatment records were available for RT, ChT and surgery in 61.8, 76.4 and 31.7% of the cases, respectively. Treatment sequences with RT/ChT were reported for 96.7% of the population. In 39.8% of the cases, concurrent chemoradiotherapy (CRT) was administered. Sufficient RT was used in 33 (26.8%) cases. Surgical intervention included debulking, sub- or near total thyroidectomy, total thyroidectomy or was not specified. Information about resection status was available for 35 (28.4%) of these patients and multimodal treatment was applied in 21 (17.1%) of the cases. OS rate was 27.1% and 7.9% at 6 and 12 months, respectively. In the univariate analysis surgery (p < 0.001), administration of sufficient RT ≥ 30 Gy (p < 0.001), ChT (p < 0.001) and multimodal treatment (p = 0.014) resulted in improved OS (Fig. 2A–D). A specific age group or the age < 65 years did not correlate with an improved OS, but there was a tendency for patients < 65 years towards a better survival rate at 6 months: 19.3 vs. 27.1%. In the multivariate analysis, only surgery (1.997 [1.162–3.433], p = 0.012) and sufficient RT ≥ 30 Gy (1.877 [1.232–2.843], p = 0.012) were identified as independent predictors for OS (Table 3).

Table 2 Patients characteristics from the pooled analysis: whole cohort and only operated patients
Fig. 2
figure 2

AD. Univariate analysis of prognostic factors for patients from the pooled cohort

Table 3 Uni- and multivariate analyses of the pooled cohort

To identify the most favorable surgical approach, we selected or identified patients with ATC stage IVC who underwent different surgical procedures, and analyzed possible predictors for an improved OS. A sub-cohort of 39 patients was investigated (Table 2). Median age was 68 (39–83) years, 84.6% of the patients received RT ≥ 30 Gy, 39.8%—with concurrent ChT, and multimodal therapy was administered to 53.8%. Surgical types analyzed included total thyroidectomy in 28.2% of the cases, sub- or near total thyroidectomy, lobectomy/hemi-thyroidectomy and debulking in 10.3, 5.1 and 12.8% of the cases, respectively. Surgical data of 43.6% of the operated patients was, unfortunately, not available. Resection margins other than R2 were reported for 41% of the patients. In the univariate analysis only, total thyroidectomy showed a tendency towards an improved OS, when compared to other surgery types with survival rates at 9 months of 45 and 30%, respectively (p = 0.058) (Fig. 3A, B). Margin status, therapy regimen, age or multimodality were not significantly correlating with OS in the operated sub-cohort.

Fig. 3
figure 3

AB. Univariate analysis of prognostic factors within the operated cohort from the pooled analysis

Given the insufficiency of the individual surgical data gathered from the pooled analysis we decided to use SEER data base to identify best surgical approach. Therefore, we have analyzed a cohort of 617 ATC stage IVC patients (Table 4). Median age was 69 (33–97) years, ChT and RT were employed in 61.4 and 52% of patients, respectively. A total of 187 (30.3%) patients underwent surgery, i.e., either lobectomy and/or isthmectomy, removal of less than one lobe, sub- or near-total thyroidectomy and total thyroidectomy in 22.4, 18.2, 11.8 and 47.6% of cases, respectively. Multimodal treatment was administered to a total of 11.7% of patients and > 75% of all patients had an ATC-related death. On the univariate analysis surgery (p < 0.001), administration of ChT (p < 0.001) or RT(p < 0.001), multimodal treatment (p < 0.001) and age group (p < 0.001) correlated with an improved OS and DSS. Age ≥ 65 years (OS: p < 0.001, DSS: p = 0.008) correlated with a worsened OS and DSS (Suppl. Figures 1A–F, 2A–F). On the multivariate analysis surgery (OS–HR 1.934, 95% CI 1.538–2.427, p < 0.001; DSS–HR 1.803, 95% CI 1.275–2.550, p < 0.001), RT (OS–HR 1.873, 95% CI 1.558–2.247, p < 0.001; DSS–HR 1.611, 95% CI 1.191–2.178, p = 0.002), ChT (HR 1.727, 95% CI 1.412–2.114, p < 0.001; DSS–HR 1.572, 95% CI 1.197–2.064, p = 0.001) were independent predictors for an improved OS and DSS (Table 5). Age ≥ 65 years (HR 0.795, 95% CI 0.665–0.950, p = 0.012) was also identified as an independent predictor for a higher overall mortality. Administration of best supportive care only correlated inversely with DSS on the univariate analysis (p < 0.001), but it was not an independent predictor for a worse DSS.

Table 4 Patient characteristics from SEER database (any treatment left, without best supportive care mid, operated right)
Table 5 Multivariate analysis of prognostic factors for OS and DSS of the SEER cohorts

Furthermore, we eliminated 172 patients from our analysis, who did not receive any cancer-directed treatment, to verify predictors for an improved OS/DSS. On the univariate analysis surgery (p < 0.001), RT (p < 0.001), ChT (p < 0.001), multimodal treatment (p < 0.001), age ≥ 65 years (p < 0.001) and combinations of therapies (not trimodal) vs. RT alone (p < 0.001) or surgery alone (p < 0.001), correlated with an improved OS (Suppl. Figure 3A–E). Multivariately, surgery (HR 1.769, 95% CI 1.289–2.427, p < 0.001) administration of RT (HR 1.729, 95% CI 1.311–2.280, p < 0.001), ChT (HR 1.638, 95% CI 1.273–2.106, p < 0.001) and age <65 years (HR 0.789, 95% CI 0.644–0.967, p = 0.022) were independent predictors for an improved OS (Table 5). Univariate analyses for DSS showed similar correlations: surgery (p < 0.001), ChT (p < 0.001), RT (p = 0.002), multimodal treatment (p < 0.001), ChT without RT or without surgery vs. surgery and RT ± ChT (p < 0.001), combined therapies (not only trimodal) vs. RT only (p < 0.001) or surgery only (p = 0.008) (Fig. 4A–E). On the multivariate analysis surgery (HR 1.913, 95% CI 1.286–2.845, p = 0.001) administration of RT (HR 1.490, 95% CI 1.058–2.099, p = 0.022) and ChT (HR 1.579, 95% CI 1.203–2.072, p < 0.001) were independent predictors for an improved DSS (Table 5).

Fig. 4
figure 4

AE. DSS of patients from SEER database without best supportive care only regimen

To identify specific surgical interventions, that correlate with best OS/DSS, we subsequently analyzed a separate cohort of patients, who all underwent surgery and/or any other cancer-directed treatment. On the univariate analysis, total thyroidectomy (p = 0.031), administration of ChT (p = 0.007), RT (p < 0.001), combination of surgery and RT ± ChT (p < 0.001) and multimodal treatment (p < 0.001) correlated with an improved DSS (Fig. 5A–I). Debulking surgery inversely correlated with the DSS (p < 0.001) (Fig. 5D). On the multivariate analysis, debulking surgery was an independent predictor for a worse outcome (HR 0.535, 95% CI 0.332–0.862, p = 0.010), whereas RT administration correlated with a longer DSS (HR 2.316, 95% CI 1.362–3.939, p = 0.002) (Table 6). Total, sub-total and near-total thyroidectomy showed significantly longer DSS than other thyroid surgeries (p = 0.043 and p = 0.031) (Fig. 5A, B). On the other hand, if debulking patients were eliminated from the analysis, there was no significant difference in the DSS, when comparing total and less than total thyroidectomy (OS–p = 0.115; DSS–p = 0.463, Fig. 5C, Suppl. Figure 4C). Similarly, total thyroidectomy (p = 0.005), administration of ChT (p < 0.001), RT (p < 0.001), as well as multimodality (p < 0.001) and age < 65 years (p = 0.002) correlated with a longer OS (Suppl. Figure 4A–I). However, on the multivariate analysis only administration of RT (HR 2.349, 95% CI 1.469–3.757, p < 0.001) and debulking (HR 1.564, 95% CI 1.024–2.389, p = 0.038) were independent predictors for OS (Table 6).

Fig. 5
figure 5

AG. DSS of operated patients from SEER database

Table 6 Multivariate analysis of prognostic factors for OS and DSS within the operated SEER cohort

Discussion

In the current study, we have evaluated the impact of different thyroid surgical strategies on OS and DSS in 3 different cohorts of patients with ATC stage IVC and found it to be associated with a significantly improved outcome. Furthermore, we showed that administration of RT, ChT and multimodal therapy, as well as younger age correlate with an improved OS or DSS.

The role of surgical treatment in metastatic ATC remains controversial. There are several treatment approaches at this stage, which may include surgery to the primary tumor and neck dissection depending on the specific guideline (Bible et al. 2021; Haddad et al. 2018; Filetti et al. 2019). Oncological surgical approaches in ATC vary from total thyroidectomy, subtotal or near total thyroidectomy to debulking. On the basis of tumor size and/or infiltration of adjacent structures, as well as extent of metastatic disease, i.e., T-/N-/M-stage, different strategies have to be considered within a multidisciplinary board. Removal of one of the lobes is less invasive, but yet only plausible in patients with intrathyroidal or incidental ATC, whereas total thyroidectomy is recommended by most of the guidelines (Bible et al. 2021; Haddad et al. 2018; Smallridge et al. 2012). If feasible and the tumor classified as resectable, R0/R1 resection has to be goal (Haddad et al. 2018), which, however, is rarely achievable given the infiltrative growth pattern (Filetti et al. 2019). Based on our systematic review, several studies have reported a negative resection margin to be associated with a survival benefit (Aslan et al. 2014; Glaser et al. 2016; Roche et al. 2010; Liu et al. 2016; Passler et al. 1999). However, our pooled analysis on the resection status did not show any correlation with survival, which may be attributed to a small and heterogeneous sample size of only 35 patients with accurate data on the resection margins. Since ATC is characterized by an aggressive growth, it often infiltrates neighboring structures, so that an option of an ultra-radical resection including laryngectomy, resection of the infrahyoid muscles, trachea or esophagus arises. Thus, Sugitani et al. (Sugitani et al. 2014) found ultra-radical surgery to offer a benefit for survival in patients with an ATC IVB. Conversely, Goffredo et al. (Goffredo et al. 2015) evaluated a retrospective cohort of 335 operated ATC patients and did not find a survival benefit for aggressive resections in stages IVB and IVC. In that study a missing potential benefit from surgery was attributed to the morbidities and operative risks of radical resections. In our systematic review we did not find any of the studies evaluating exclusively ATC IVC patients or comparing radical resections with limited thyroid surgeries. However, some large-scale studies (Glaser et al. 2016; Sugitani et al. 2012; Gui et al. 2020; Ridder et al. 2020; Haymart et al. 2013; Kebebew et al. 2005; Huang et al. 2019; Pezzi et al. 2017) have reported their cohorts to contain up to 56.1% stage IVC patients and all of these studies found surgery to be associated with an improved OS (Gui et al. 2020). This is similar to our findings, that surgical treatment to the primary tumor also in an advanced stage ATC is an appropriate treatment option and can bear a significant survival benefit in selected patients. In addition, we have shown total thyroidectomy as a specified surgical approach to correlate with an improved OS/DSS. In that analysis we found no difference between TT and less than TT surgeries excluding debulking, i.e., compared subtotal-, near total thyroidectomies or lobectomies. This is in line with a study of Venkatesh et al. who reported less invasive thyroidectomies to be non-inferior to TT in terms of OS (Venkatesh et al. 1990). In our pooled analysis, we were only able to see a tendency regimen toward an improved OS for TT (p = 0.058).

Removal of the gross tumor in the head and neck region is of crucial importance for adjuvant RT or ChT, since it facilitates a beneficial local control by these therapies, as it was shown for ATC by some authors (Glaser et al. 2016; Fan et al. 2020). However, the extent of primary tumor-directed surgery needs to take into consideration the extent of primary tumor spread. The risk–benefit-ratio needs to be thoroughly evaluated between surgery to the primary tumor which may enhance therapeutic outcome and the risk of surgery-induced morbidity and delay of systemic therapy. The main goal of surgery to the primary tumor in systemically metastasized ATC is to avoid potential complications from the locally destructive tumor growth leading to obstructions of airway and hence respiratory insufficiency, as well as compression and/or infiltration of carotid vessels, as these are common death causes reported to date (Kitamura et al. 1999). Nilsson et al. investigated debulking surgery to the primary tumor in ATC and found it to improve patients’ outcome within multimodality approach (Nilsson et al. 1998). Debulking as palliative surgery in ATC IVC is, however, generally not recommended, as there is no sufficient evidence for a patient’s benefit within the multimodality approach, where urgently necessary systemic treatment is of highest priority(Bible et al. 2021; Haddad et al. 2018; Filetti et al. 2019; Sugitani et al. 2018). In our analyses debulking surgery was an independent predictor for a higher overall and disease-specific mortality in ATC IVC patients. This may be caused by the unfavorable constellation of peri-operative morbidity and insufficient response towards RT or systemic therapies in terms of local control. Moreover, debulking surgery is only performed in cases, where R0/R1 resection is not achievable in locally aggressive advanced disease, which is a prognostically unfavorable constellation by itself. In addition, debulking surgery postpones the start of RT and/or systemic therapy and can lead to intervention-related complications with a negative impact on the outcome.

In general, there are several specific complications after an oncologic thyroid surgery: permanent and transient uni- or bilateral recurrent laryngeal nerve palsy, injuries to the superior laryngeal nerve, tracheomalacia, hypoparathyroidism, and fistulae (Rosato et al. 2004; Oertli and Udelsman 2007). The incidence of nerve injury, the most common specific complication, has been sufficiently decreased by the utilization of an intraoperative neurological monitoring, as suggested by some authors (Bai and Chen 2018; Zheng et al. 2013). Other complications can effectively be managed with either conservative or additional invasive approaches (Orloff et al. 2018; Campisi et al. 2013; Lee et al. 2016a; Spitzweg et al. 2017). These potential risks in the course of surgery, especially debulking, seem to outweigh the profit from this intervention because of the heterogeneous volume of the remaining tumor burden. Thus, limited (not ultra-radical) thyroidectomy, but not debulking, may be offered in selected patients with ATC IVC, since there is a promising evidence of potential profit, whereas complications can appropriately be avoided or managed. Furthermore, data suggest that at least < R2 resection has to be achieved to facilitate further therapeutic response.

In ATC, surgery is recommended to be followed by adjuvant therapy, consisting of local RT with or without simultaneous or sequential ChT, to reduce the risk of local recurrence and thus improve overall survival. The recommended radiation doses range between 20 and 75 Gy, depending on the therapeutic goal (Filetti et al. 2019; Pezzi et al. 2017). For palliative radiation, doses between 20 and 30 Gy are usually administered; for a curative therapeutic goal, doses of ≥ 40 Gy are used (Liu et al. 2016; Sugitani et al. 2018; Sun et al. 2015; Wendler et al. 2016). However, there is still disagreement about the level of doses administered to patients with a curative therapy goal. In addition to studies recommending ≥ 40 Gy for ATC patients, however, many indicate effective irradiation only at doses of ≥ 60 Gy. According to a study by Fan et al. for example, irradiation doses of ≥ 60 Gy improve overall survival (p = 0.004), as well as local control (p < 0.001) and additionally prolong median overall survival (10.6 months vs. 3.6 months)(Fan et al. 2020). Similar results were obtained in the study by Glaser et al. which indicates an effective radiation dose for a favorable outcome at ≥ 59.4 Gy (Glaser et al. 2016). A more aggressive therapy regimen with higher radiation doses not only shows a benefit in stage IVA, but also in selected patients in inoperable stage IVB or stage IVC. Higher doses have a positive effect on local control and reduce the risk of local recurrence and thus improve overall survival rates (Pezzi et al. 2017). The evaluations of Fan et al. also suggest that higher radiation doses do not necessarily mean higher toxicity and that grade 4 toxicities did not occur more frequently than with lower radiation doses (Fan et al. 2020). In our pooled analysis from the systematic review, sufficient radiation doses beyond 30 Gy correlated uni- and multivariately with a beneficial OS (Table 3, Fig. 2C). Analyses obtained from the SEER database also show RT to be an independent predictor for a beneficial OS and DSS (Suppl. Figures 1C, 2B); however, exact dosage remains unknown due to limited SEER data. Furthermore, our previous study showed that RT in advanced ATC may also offer a durable local control and can be considered safe for patients (Augustin et al. 2021).

The application of ChT, usually in combination with adjuvant RT, is still controversial. Systemic therapy is the main treatment regimen for metastatic patients, but has only a low response rate and usually leads to significant side effects with a corresponding loss of quality of life (Filetti et al. 2019). There are studies that show that ChT in ATC does not bring a survival benefit and only increases therapy-associated toxicities (Huang et al. 2019; Sun et al. 2015; Corrigan et al. 2019). Other studies, however, found a survival benefit that can be achieved by simultaneously or sequentially administered ChT (Wendler et al. 2016; Käsmann et al. 2016). Recommended ChT regimens include either single-agent therapy with paclitaxel or doxorubicin, or a combination of agents, such as carboplatin/paclitaxel and docetaxel/doxorubicin (Haddad et al. 2018; Filetti et al. 2019; Ain KB et al. (CATCHIT) Group* 2000; ; ; Sosa et al. 2014; Shimaoka et al. 1985). In our analysis, administration of ChT corresponded with beneficial OS and DSS rates in univariate analysis and also in multivariate analysis in both SEER cohorts (Suppl. Figures 1B, C, 3C, F, 4F; Fig. 4C; 5F; Tables 5 and 6), but also univariately in the pooled cohort (Fig. 2B).

Combination of all three therapeutic approaches in the course of a multimodality therapy approach shows an advantage in the vast majority of patients with regard to overall survival and progression-free survival(Haymart et al. 2013; Pezzi et al. 2017; Fan et al. 2020; Corrigan et al. 2019; Rao et al. 2017). In stage IVA and resectable stage IVB, this approach is already an established standard of care (Haddad et al. 2018; Filetti et al. 2019; Smallridge et al. 2012; Sun et al. 2015). However, some studies are extending multimodal therapy to patients in stage IVB and stage IVC. Tian et al. showed CRT in ATC patients with metastatic disease to correlate with a longer 1-year OS (HR 0.65, p < 0.001) (Tian et al. 2020). Depending on the physical condition of the patients and their personal expectations, a more intensive therapy regime should, therefore, be considered. The decision on the individual therapy approach should be made within the framework of an interdisciplinary expert team of oncologists, radiotherapists, endocrinologists, pathologists and surgeons. In our analyses, multimodal therapy in ATC IVC was associated with a prolonged OS and DSS in all of our cohorts on the univariate analysis (p value: < 0.001–0.014, Figs. 25, Suppl. Figures 1–4). On the multivariate analysis, it, however, did not reach any significance (Fig. 5G, Suppl. Figure 4G).

In the course of our investigation, we also evaluated the impact of older age ≥ 65 years in metastatic ATC on the OS and DSS. We found it to be an independent predictor for a higher overall mortality in both SEER databases, but not for the disease-specific mortality (Table 5, Suppl. Figures 1F, 2E, 3E, 4I; Fig. 4E). This is in line with the findings of other authors that stated age ≥ 65 and ≥ 70 years, respectively, to be an independent predictor for a shortened OS (Glaser et al. 2016; Sugitani et al. 2012; Pezzi et al. 2017). Such a difference is most likely caused by the lower performance score and lower susceptibility for intensive treatment regimens in older patients.

In addition, some authors reported a significant increase in survival of ATC patients within the last two decades due to a significantly improved patient-management (Maniakas et al. 2020). Prasongsook et al. have also shown a difference between treatment outcome in ATC patients, yet not for the multimodal approach in the metastasized ATC (Prasongsook et al. 2017). In none of our analysis, however, we were able to find any difference in the patient outcome depending on the year of their diagnosis. Conversely, we have only investigated an advanced stage ATC. These findings, however, were generated from the data of an era prior to Food and Drug Administration approvals (FDA, USA) of any of the available TKIs or immunotherapies for ATC. A combination of dabrafenib and trametinib has been approved by FDA for an advanced, BRAF V600E/MEK positive ATC, as it showed acceptable toxicity with an overall response rate (ORR) of 69% in May 2018 (Subbiah et al. 2018; Highlights of prescribing information xxxx). Furthermore, first results of the phase-II ATLEP trial on the combination of Lenvatinib and pembrolizumab in the metastasized ATC/PDTC showed a promising outcome with acceptable toxicity(Dierks et al. 2021). In total, not only may these therapies provide an improved survival, distant and local control rates, but it may also facilitate a reevaluation of the role of surgery in ATC stage IVC, especially when used in a neo-adjuvant setting.

Our study has several limitations, such as the retrospective nature and, hence, the risk of including hidden selection biases. In addition, SEER data on ChT and RT have been reported to have limitations in terms of sensitivity, biases and variables within the treatment sequence (Noone et al. 2016). The SEER data have, however, still a significant positive-predictive value, but have to be used with caution. In our analysis, we partially investigated patients with any tumor-directed treatment and did not evaluate any treatment-related sequences, to minimize possible biases between treated and untreated patients.

In conclusion, we were able to show that surgery to the primary tumor—thyroidectomy in any form, but not debulking—was an important factor bearing an OS and a DSS benefit for ATC patients with distant metastases from SEER database. Furthermore, sufficient RT ( ≥ 30 Gy), administration of ChT or combined multimodal treatment and young age < 65 years had a significant influence on the OS and DSS from both pooled and SEER-based analysis.