Introduction

Pelvic reconstructive surgery using the transvaginal Uphold™ Lite mesh kit to suspend the apical vaginal segment has been shown to effectively ameliorate pelvic organ prolapse (POP) symptoms and restore anatomical outcomes at short term [1,2,3,4,5,6,7,8]. Due to a lack of “sufficient evidence to assure that the probable benefits of the devices outweigh their probable risks”, the FDA decided that vaginal mesh products should not be distributed in the US (FDA 2019) [9]. As a consequence, many mesh manufacturers discontinued manufacturing and sales of the products worldwide. However, there is a continued need for investigating safety and effectiveness of transvaginal mesh devices since many patients who have already undergone the procedures are facing unknown long-term outcomes.

It is well established that centralization of complex or high-risk procedures to high-volume centers may optimize outcomes, increase patient safety, decrease mortality and also increase cost-effectiveness [10,11,12,13,14,15,16,17,18]. In the field of pelvic reconstructive surgery, there is, however, scarce evidence to support this claim. As a consequence, it is not known how long-term outcomes and morbidity may differ between the settings [2, 4, 8]. Increased knowledge on the differences in outcomes between single- and multi-center use of surgical innovations in urogynecology may guide and improve the clinical introduction of new surgical procedures in the future. The aim of this study was to evaluate if high-volume center outcomes are superior to the results of regular use of Uphold mesh kit for apical prolapse at long term and how this relates to long-term morbidity of the procedure.

Materials and methods

For the purpose of the study, we combined data derived from two separate studies designed as a single- and a multi-center study [2, 4, 8]. These studies aimed to evaluate various clinical aspects of the Uphold mesh kit. All patients included in this study had symptomatic and quantified apical segment (uterine or vaginal vault) prolapse ≥ stage 2, with or without anterior vaginal wall prolapse, according to the pelvic organ prolapse quantification (POP-Q) system [19]. Anatomical outcomes were assessed using the POP-Q system during a gynecological examination with the patient in a lithotomy position at which time mesh exposure or other healing defects were recorded. POP-Q stage 0 or 1 in the apical compartment was considered an optimal anatomical outcome and was the primary outcome measure for the analysis.

Using a standardized procedure with a capturing device, a monofilament, macroporous and uncoated polypropylene mesh (Uphold™ Lite) was placed to suspend the apical vaginal segment [2, 4]. All patients at the single center were operated by two experienced urogynecological surgeons (50 and 65 patients, respectively) at the Department of Obstetrics and Gynecology, Danderyd Hospital, Stockholm, Sweden.

The multicenter study included a total of 207 patients operated by 26 surgeons at senior consultant level throughout 24 centers in Sweden, Finland, Denmark and Norway [2, 8]. Data have been previously published and described in detail [2, 8]. The 24 centers were located as 11 centers at Sweden, 4 centers at Finland, 5 centers at Norway and 4 centers at Denmark. The rate of surgery was from 1 to 13 patients for each surgeon (mean 8 patients/surgeon) and 1–24 patients for each center (mean 8.6 patients/center).

The study protocol was near identical for both studies and neither study was blinded [2, 4, 8]. Exclusion criteria for both studies included: current or previously treated pelvic organ cancer, cervical elongation, severe rheumatic disease, insulin-treated diabetes mellitus, connective tissue disorders, current systemic steroid treatment, and urinary incontinence. Follow-up after surgery was performed after 1 and 5 years in the multicenter study and 2 and 5 years in the single-center study. There were no restrictions on weight, parity, menopausal status, or previous surgery although urinary incontinence was not an exclusion criterion at the single center. In both studies, patients were included by the responsible urogynecologist if a patient fulfilled the inclusion and no exclusion criteria.

Subjective disease-specific pelvic floor outcomes were assessed by the Pelvic Floor Distress Inventory 20 (PFDI-20) questionnaire which includes 3 scales of 20 questions: Urinary Distress Inventory-6 (UDI-6), Pelvic Organ Prolapse Distress Inventory-6 (POPDI-6), and Colorectal-Anal Distress inventory-8 (CRADI-8) [20]. The Pelvic Floor Impact Questionnaire—short form (PFIQ-7) was used to assess effects on quality of life [20]. The PFIQ-7 includes the Urinary Impact Questionnaire (UIQ-7), the Colorectal-Anal Impact Questionnaire (CRAIQ-7) and the Pelvic Organ Prolapse Impact Questionnaire (POPIQ-7) [20]. To measure the impact on patients’ sexual function, Pelvic Organ Prolapse/Urinary Incontinence Sexual Questionnaire (PISQ-12) was used which includes 12 questions concerning behavioral/emotional, physical and partner related sexual domains [21]. Additional surgical procedures, POP recurrence, as well as, interventions for mesh-related complications following the primary operation was registered up to 5 years after surgery.

A Visual Analogue Scale (VAS) was used to assess pelvic pain at baseline and follow-up visits [22]. The 11-point scale ranges from 0 to 10 where 0 indicates no pain and 10 indicates maximal severe pain. In a sub-analysis of pain, we used the Numeric Rating Scale (NRS). The NRS for pain is classified into four categories; 0 indicates no pain, 1–3 mild pain, 4–6 moderate pain, whereas 7–10 indicate severe pain [22].

Statistical analysis

Statistical analysis was performed using the predictive analysis software (IBM@SPSS© Statistics, Version 25, Inc, Chicago, IL, USA, 2017). We used independent sample t tests to compare independent variables. The statistical difference between nominal data was tested by Chi-square test of independence. Repeated measures ANOVA was used to test total values and to compare continued variables and over time of follow-up between the single and the multicenter. The General Linear Model of analysis was used to compare changes over time and centers when analyzing the NRS scale for pain. Inference for population proportion analysis was used to statistically compare between proportions in the two different populations, i.e. the single and multicenter. All missing data were considered as missing without imputation of data.

Ethical approval

The studies were approved by the Stockholm Regional Ethical Review Board and ethic review committees in the respective countries as appropriate.

The single-center and the multicenter studies were registered at www.clinicaltrials.gov: NCT03077490 and NCT01823055 respectively.

Results

Five years after surgery, 101/115 (88%) and 164/207 (79%) patients answered the patient-reported outcome questionnaires at the single and multicenter, respectively. 74/115 patients (64.3%) at the single-center and 139/207 patients (67.1%) at the multicenter trial were available for clinical examination and POP-Q evaluation.

Anatomical outcomes and POP-Q staging at 5 years after surgery are described in Table 1. An optimal anatomical apical segment outcome (POP-Q C stage 0–1) was achieved in 97.3% (72/74 patients) in the single-center cohort as compared to 97.5% (116/119) in the multicenter cohort (p = 0.938). If considering patients re-operated by any POP surgical procedure during follow-up as a failure, i.e. an optimal outcome (POP-Q stage 0–1 in the apical segment) and no secondary procedures the success rate was 95% (70/74) in the single-center and 83.3% (116/139) in the multicenter cohort (p < 0.001). When considering the anterior and posterior vaginal compartments separately, occurrence of anterior vaginal wall prolapse (POP-Q Ba ≥ 0) was 5.4% (4/74) in the single-center cohort vs. 19.8% (26/131) in the multicenter cohort (p < 0.001). For posterior vaginal wall prolapse (POP-Q Bp ≥ 0), the corresponding figures were 2.7% (2/74) vs. 26% (34/131) (p < 0.001). Other individual POP-Q items were largely comparable between the cohorts. There were no cases of mesh erosion (0/74) in the single-center study as compared to 3/139 (2.2%) in the multicenter study (p = 0.034).

Table 1 5-year evaluation of anatomical outcomes following the Uphold procedure

Total number of surgical interventions for pelvic floor insufficiency or mesh-related complications was 7/115 patients (6.1%) vs. 31/202 patients (15.3%) in the single- and multicenter cohorts, respectively, during the 5-years follow-up (p = 0.001). Four patients were re-operated because of postoperative pain (4/164, 2.4%) at the multicenter vs. none at the single center (p = 0.021). At the time of surgery, the mesh was removed in all four cases and in one case, a hysterectomy was performed.

There were no significant differences in demographic characteristics between the single and multicenter (Table 2). There were no significant differences between the cohorts in subjective outcomes, i.e. patient-reported symptoms as measured by the PFIQ-7, PFDI-20, PISQ-12 (Table 1). Although the UDI-6 domain score was higher in the single center (23.6 ± 19.61 vs. 17.2 ± 18.28, p = 0.01), indicating greater symptom severity of urinary incontinence, it did not significantly affect the total PFDI-20 score. The PISQ physical domain was significantly higher at the single center (17.7 ± 2.72 vs. 14.1 ± 3.48, p < 0.001), indicating higher sexual activity, whereas the PISQ-partner domain was lower at the single center (7.2 ± 2.51 vs. 10.09 ± 2.79, p < 0.001) indicating lower partner sexual activity. Neither of these differences affected the total PISQ-12 score.

Table 2 5-year data comparing subjective outcomes in a single vs. multicenter study

In an analysis of changes over time, we performed pairwise comparisons of changes in reported pain outcomes and patient-reported pelvic floor questionnaires (PFIQ-7 and PFDI-20) if individual patient data were available at the two follow-up times (Table 3). There were no significant differences noticed when comparing changes in patient-reported outcomes between the cohorts. There was an insufficient number of pairwise observations with completed PISQ-12 scores to allow for a valid comparison between the cohorts over time.

Table 3 Pairwise comparison of changes over time

Table 4 shows pain-related outcomes following surgery. The previously reported short-term improvements in patient reported pain was sustained at the 5-years follow-up after surgery in both cohorts (p < 0.001). Furthermore, there were no significant differences in estimated pain scores (VAS) in-between the cohorts during follow-up (independent-sample t test). At the single center, 1/81 patients (1.2%) had VAS 7/10, i.e. severe pain, as compared to 3/131 (2.3%) in the multicenter study (p = 0.277) based on the NRS scale.

Table 4 Analysis of pain before and after surgery

Discussion

In this 5-years analysis which combined data from two separate cohort studies using the UpholdLite mesh device for POP reconstructive surgery, we found that single-center high-surgery volumes resulted in less reoperations for POP recurrence and fewer mesh-related complications requiring surgical intervention. There were, however, no significant differences between the cohorts with regard to patient-reported outcomes. A number of studies have shown that the UpholdLite procedure for apical prolapse provide satisfactory subjective and objective outcomes and that high surgical volumes result in short-term decreased complication rates following apical mesh augmented surgery [1,2,3,4,5,6,7,8]. However, long-term efficacy and safety data have been missing.

Overall, we found that anatomical outcomes were sustainable over time with only minor, and non-significant, differences when comparing short and long-term follow-up both with regard to anatomical and patient-reported outcomes. When comparing the single- to the multi-center cohort 5 years after surgery, we also found no significant differences in our primary outcome (restoration of apical support) or in overall subjective scores with regard to condition specific symptoms, quality of life and sexual function. There was, however, a significant difference with regard to the occurrence of reoperations of both mesh-related complications and secondary pelvic reconstructive surgery, to the advantage of single-center use. The overall risk ratio for any complication in the multi-center compared to the single-center setting was 1.5:1 in patients undergoing the Uphold procedure as first-time prolapse surgery and 4:1 in patients undergoing surgery for recurrence in a previously operated compartment. These results suggest that a high surgical volume is beneficial for patients also at long term and that the risk for mesh-related complications requiring removal or revision of the mesh decreases with increased surgical experience. There is no universally accepted definition of high surgical volume and the number considered as high-volume certainly varies between different procedures. There is, however, a clear link between volume and patient morbidity, as well as, mortality in many fields. In concurrence with our results, a study by De Tayrac et al. showed a negative association between an increasing number of pelvic organ prolapse mesh procedures performed and the rate of complications [23]. Thus, emerging data suggest that the assumed relationship between volume and morbidity is valid also for urogynecological mesh surgery.

Subjective outcomes overall were improved also at long term when compared to baseline and largely sustained compared to the short-term outcomes. This was true also for perceived pain. Given recent years of widespread attention and highlighting of pain complications related to the use of vaginal mesh, it was somewhat surprising that levels of experienced pelvic pain in both the single- and multi-center cohort 5 years after surgery was low in general and still significantly lower than at baseline. We nonetheless recognize that long-term severe pain complications did occur and afflicted one and three patients in the single- and multi-center study, respectively. The mechanisms for mesh-related pain after biomaterial augmented surgery, although poorly understood, may be caused by a wide range of factors including different modes and routes of surgery, surgical dissection prior to placing the mesh, bleeding, infection, and pre-existing pain. The management of patients who experience persistent pain after mesh surgery remains clinically challenging and deserves further attention given our lack of knowledge on long-term consequences of mesh procedures, although many previously used products are no longer commercially available worldwide.

Three cases of mesh exposure were as reported in the multicenter cohort at the 5-years follow up, whereas none was reported in the single-center study. None of the three cases of mesh exposure reported persistent pain and were treated conservatively with topical estrogen without additional surgical interventions. Although suggested by our data, and supported by other studies [24,25,26,27], there was an insufficient number of cases to determine if high-volume single-center use was beneficial also with regard to postoperative mesh exposure at long term. Regardless, our long-term clinical data on more than 200 patients suggest that mesh exposure seems to be a relatively minor problem at long-term following this specific procedure and can be handled conservatively.

Despite early reports of risks and benefits associated with the use of vaginal mesh kits use of these products spread rapidly worldwide. In retrospect, one may argue that early centralization to specialized high-volume centers rather than widespread dissemination of mesh procedures could have prevented some complications and individual suffering. It would be inherently difficult to randomize patients between high-volume centers and clinics with lesser volumes due to geographic and social considerations. We must, therefore, rely on analyses from observational studies, such as the present one, to obtain information on the long-term effects of surgical volumes. Classification of the procedure was homogenous since all operations used an identical mesh kit performed in a standardized manner. There is an inevitable loss to follow-up when performing long-term clinical cohort studies. We believe that number of patients lost to follow-up in the present cohorts were not unusual with only minor differences between the single- and multi-center cohort suggesting a limited selection bias. A weakness of the study is that neither cohort was large enough to perform sub-analyses on possible predictors of both positive and negative outcomes with adequate statistical strength and precision. Factors such as pre-existing mood disorders, functional pain syndromes, smoking, obesity and previous pelvic surgery, may influence surgical outcomes but need further studies to understand the mechanisms involved [28, 29].

In summation, our data suggest that despite the high objective and subjective long-term effectiveness of the procedure in both regular use and at a high-volume center, centralizing the use of mesh kit procedures had clear clinical benefits. A reduction of secondary interventions by more than half when a standardized capturing-device guided transvaginal mesh was used at a high-volume center is an important advantage for patients and health care system alike. In countries where products similar to the one used in the present study remain in use, one may, therefore, advocate that mesh kit procedures should be centralized to high-volume center to minimize complications and secondary interventions.