Introduction

Endoscopic combined intrarenal surgery (ECIRS) is a combination of endoscopic surgery with percutaneous nephroscopic access and retrograde ureteroscopic access, and is now recognised as one of the standard endoscopic treatment options for renal and ureteral stone1,2,3,4. Achieving stone free (SF) status rates are 65.3–81.9%2,5,6 and perioperative complication rates are 7.3–38.6%2,7,8 indicating higher efficacy and a safer procedure than percutaneous nephrolithotripsy (PCNL) alone1,4,9,10,11. In the last decade, ECIRS has become a less invasive procedure, and mini-ECIRS has emerged with a smaller percutaneous access tract and nephroscopy, demonstrating the advantages of less postoperative pain and potentially less bleeding than conventional ECIRS12.

However, ECIRS requires at least two experienced endoscopic surgeons and is generally challenging to perform safely because of the learning curve13, which is reported to cause perioperative complications, including fever, sepsis, and other organ injuries2,7,8. There is a clear demand for accurate prediction of ECIRS outcomes based on preoperative factors, allowing the allocation of reasonable and feasible treatment strategies. However, there is scarce literature reporting on and investigating mini-ECIRS, leading to a lack of clinical evidence for ECIRS.

To address some clinical issues in ECIRS, we created the largest cohort of ECIRS data from three high-volume centres in Japan and developed three prediction models of mini-ECIRS outcomes based on clinical parameters, followed by internal validation. To the best of our knowledge, this is the first study to develop prediction models of mini-ECIRS outcome including machine learning model utilizing the largest cohort of the database. Those our proposed prediction model will lead to a safer and more efficient treatment strategy with mini-ECIRS and will be beneficial for surgeons and patients who undergo mini-ECIRS.

Materials and methods

The clinical information of consecutive patients who underwent conventional or mini-ECIRS for urinary stone disease between 2015 and 2021 at three high-volume centres in Japan (Yokosuka Kyosai Hospital, Ohguchi East General Hospital, and Hara Genitourinary Hospital) was collected and pooled for analysis. The inclusion criterion was single-session mini-ECIRS for the treatment of renal and/or ureteral stones. There were no restrictions on patient age, Eastern Cooperative Oncology Group Performance Status (ECOG-PS), and position setup during mini-ECIRS. Both modified Valdivia and prone positions were included, as previous work has shown ECIRS in both positions with comparable results14. The exclusion criteria were as follows: preoperatively intended staged procedures, no available postoperative CT, and no available preoperative clinical information. Patients with urinary tract abnormalities, including horseshoe kidney, ileal conduit, neobladder, severe ureteral stricture, and ureteropelvic junction (UPJ) obstruction that did not allow ureteroscopy (URS) to be used during mini-ECIRS were also excluded. The study protocol (IRB number 20-90 at Yokosuka Kyosai Hospital, 202201 at Ohguchi East General Hospital, and 2021-05-06 at Hara Genitourinary Hospital) waiving the requirement for written informed consent was approved by the institutional ethics committee of each hospital. Informed consent was obtained from participants as an opt-out on each hospital website. The study was conducted in accordance with the principles of the Declaration of Helsinki and all local regulations.

Non-contrast CT imaging was utilised to evaluate surgical outcomes at 1 month postoperatively, following the same protocol in the three institutions. SF was defined by CT imaging 1 month postoperatively as completely no residual stone or residual stone fragments ≤ 2 mm. Residual stone fragments > 2 mm was defined as non-SF.

The ECIRS technique was similar to previously reported methods in modified Valdivia or prone position15. All enrolled procedures were performed by 2–3 urologists, with at least one surgeon with more than 50 ECIRS procedures as a supervisor. In brief, two urologists worked simultaneously to fragment the kidney stones: one performed PCNL and the other performed URS. A ureteral access sheath (10/12 or 12/14Fr B-Flex, Rocamed, Monaco, Italy) was placed to facilitate frequent insertion of fURS (URF-V2 or P-6™; Olympus, Tokyo, Japan or Flex-X2™, Karl Storz, Tuttlingen, Germany). A 9.5Fr ureteral access sheath (Flexor, Cook Medical LLC, Bloomington, USA) was used for difficult ureters16. Percutaneous renal access was performed under ultrasound guidance with or without fURS monitoring. The choice of puncture calyx was at the discretion of the PCNL surgeon for efficient treatment. The percutaneous operating sheath (13Fr or 17.5F) was inserted using a one-step dilator. We utilized a Holmium‐YAG laser (Lumenis Pulse™ 120H; Yokne’am Illit, Israel or Dornier MedTech; Munich, Germany) through a fURS (commonly used laser settings were 0.5–1.0 J at 5–10 Hz) and a pneumatic lithotripsy (Swiss LithoClast® Master J; Electro Medical Systems, Nyon, Switzerland) through a mini-nephroscope (Olympus or Karl Storz). The stone burdens were crashed into tiny fragments and washed out through the PCNL sheath using manual or automated retrograde irrigation (UROMAT E.A.S.I.®; Karl Storz). Finally, a 6F ureteral stent tube and/or a 14F urinary nephrostomy tube were placed for 2–4 weeks and/or a few days, respectively, in cases of risk of postoperative infection or bleeding.

Three prognostic models (multiple logistic regression, classification tree analysis, and machine learning-based random forest) were used to predict SF using preoperative clinical factors17. The models were developed using age, sex, body mass index (BMI), ECOG-PS, stone laterality (left or right), number of stones, stone burden (sum of the largest stone diameters), number of involved calyces, stone position (presence or absence of R2 stone), presence of hydronephrosis, indwelling preoperative stenting, and preoperative nephrostomy. To summarise the crude relationship, a univariate logistic regression was initially performed. In multiple logistic regression, variables were selected by stepwise selection using the Akaike Information Criteria18. The area under the receiver operating characteristic curve (AUC) was used to evaluate the predictive ability. The random forest model is a supervised machine learning algorithm that grows multiple classification trees, wherein variables are evaluated for more accurate prediction using the mean decrease in the Gini coefficient value (a larger mean decrease indicates stronger predictive potential)17,19,20. Classification tree analysis enables the selection of predominant factors and determines the significance thresholds of these parameters17,21. Internal validation was performed to evaluate overfitting and optimism using the bootstrap method22, wherein 1000 bootstrap samples were repeatedly generated from an original dataset. All analyses were performed using R 4.2.3, using the rpart, randomForest, and pROC packages.

Informed consent

Informed consent was obtained from participants as an opt-out on each hospital website. The study protocol (IRB number 20-90 at Yokosuka Kyosai Hospital, 202201 at Ohguchi East General Hospital, and 2021-05-06 at Hara Genitourinary Hospital) waiving the requirement for written informed consent was approved by the institutional ethics committee of each hospital.

Results

Clinical data from 1432 ECIRS were pooled in a database registered from three institutions and 1374 of 1432 were identified as mini-ECIRSs. Finally, 996 single sessions of mini-ECIRS were analysed in this study after 378 cases were excluded, mainly due to intended preoperative staged ECIRS (Fig. 1). The overall SF rate (complete no residual stone or residual stone fragments ≤ 2 mm) was 62.3%. Only three cases were performed in the prone position and remained in the modified Valdivia position. The stone burden, number of stones, and involved calyces were significantly lower in the SF group than in the non-SF group (P < 0.001, Table 1).

Figure 1
figure 1

Flow chart of selection.

Table 1 Comparison of patient characteristics between the stone free and non-stone free groups of the mini-ECIRS.

Surgical treatment outcomes are summarised in Table 2. The most common postoperative complications (Clavien–Dindo ≥ 2) were fever, sepsis, renal vascular complications, and other organ injuries. Fever (P = 0.0021) and sepsis (P = 0.001) were more frequent in the non-SF cases than in the SF cases.

Table 2 Comparison of perioperative surgical outcomes between the stone free and non-stone free groups of the mini-ECIRS.

Univariate logistic regression analysis revealed that the significant predictive variables were the number of stones, stone burden, number of calyces involved, and stone position (P < 0.001 in all four, Table 3). The multiple logistic regression model consisted of stone burden (P < 0.001), number of involved calyces (P < 0.001), presence of nephrostomy prior to mini-ECIRS (P = 0.091), and ECOG-PS (P = 0.110). The AUC was 70.7% (Table 3 and Fig. 2A).

Table 3 Univariate and multivariate logistic regression models for predicting stone free after mini-ECIRS.
Figure 2
figure 2

Receiver operating characteristics (ROC) curve of 3 models predicting stone free outcome of mini-ECIRS (residual fragments ≤ 2 mm).

The classification tree analysis consisted of the number of calyces involved, with an AUC of 61.7% (Fig. 2B). The threshold value for the number of involved calyces was four (Fig. 3). The random forest model showed that the top predictive variable was the number of involved calyces, followed by BMI, stone burden, age, and number of stones (Table 4). The model involving all clinical variables showed an AUC of 91.9% (Fig. 2C). Internal validation using the bootstrap method showed that the AUCs of the logistic regression model, classification tree analysis, and random forest model were 70.4% (95% confidential interval (CI) [0.695–0.711]), 69.6% (95% CI [0.612–0.743]), and 85.9% (95% CI [0.840–0.876]), respectively (Table 5).

Figure 3
figure 3

Classification tree analysis to predict mini-ECIRS outcome.

Table 4 Random forest model for predicting stone free after mini-ECIRS.
Table 5 AUC of internal validation by bootstrap method of 3 models.

Discussion

The current study developed three different statistical models to predict SF in mini-ECIRS. The multiple logistic regression model consisted of four clinical parameters: stone burden, number of involved calyces, and presence of nephrostomy before mini-ECIRS, and ECOG-PS. Classification tree analysis showed that the number of calyces involved, in particular when less than four, was significantly associated with SF. Random forest analysis, a machine learning-based model, showed remarkably high accuracy of outcome prediction based on preoperative parameters in both real and internal validation datasets. To our knowledge, this multicentre cohort is the largest in the mini-ECIRS database.

A key finding of the study was that all three prediction models consistently demonstrated that the number of involved calyces was the most important predictor of mini-ECIRS outcomes. In particular, if the number of involved calyces is < 4, the outcome seems to be favourable, and a single session of mini-ECIRS should be recommended as first-line treatment. Otherwise, multiple staged sessions of the mini-ECIRS might be better. Enrolled mini-ECIRSs were mostly single tract surgery and multi-tract ECIRS might be reasonable options in case the number of involved calyces was ≥ 4. The number of calyces involved has been reported to be an independent predictor of PCNL23 and conventional ECIRS outcome6. Another study showed that the number of calyces involved was an independent predictor of postoperative complications in ECIRS8. Interestingly, those papers consistently indicated that a clinically significant cut-off causing worse outcomes were ≥ 4 involved calyces. These findings might suggest that ≥ 4 involved calyces is the limit for retrograde assistant during ECIRS, such as reposition and dusting with URS.

In addition to the number of calyces involved, stone burden was a strong predictor of mini-ECIRS in this study. Stone burden was identified as a long-term treatment outcome predictor in stone surgeries, including ECIRS24, PCNL23,25 and retrograde URS26,27. However, the current study showed that stone burden was the predominant factor in the prediction model of mini-ECIRS outcomes.

Before this investigation, we considered the presence of hydronephrosis as a possible successful predictor, due to hydronephrosis allowing us easier, safer, and more appropriate percutaneous access. However, the three statistical models currently used did not identify hydronephrosis as a successful indicator. These findings may suggest that easier development of access did not always indicate complete stone removal, wherein hydronephrosis was possibly a confounder of other variables, or the current analysis of mixed renal and ureteral stones may have caused heterogeneity in this cohort, resulting in hydronephrosis having less impact. Another interesting finding was that multivariate analysis showed that BMI was not a significant predictor of mini-ECIRS outcomes; however, random forest analysis indicated that BMI was one of the best predictors as well as number of involved calyces and stone burden. The possible reason for these discrepancies in BMI was the statistical character of the random forest analysis, which counts continuous values as more significant than categorical values19,20,21. Indeed, no previous studies have shown that high BMI contributes to worse outcomes of PCNL or ECIRS, despite some speculation that high BMI results in difficulties in setting patient positioning during surgery and decreased visibility of the pelvis when puncturing.

In other clinical areas, machine learning and deep learning techniques show outstanding performance28, with recent machine learning models demonstrating high accuracy in disease detection and prediction of surgical outcomes29,30. Our random forest model, based on machine learning techniques, showed remarkably high accuracy of more than 90% for real data and more than 85% for internal validation, which is better than the other two models, suggesting that the model is effective.

The limitations of this study are its retrospective data analysis and the inclusion of surgeries performed by multiple surgeons. However, we were of the opinion that this study was a good reflection of the real-world practice. CT value could not be assessed in this study because of different CT imaging facilities among multi-centre. External validation of prediction models is warranted to determine whether the current model is consistent and reliable.

In conclusions, the three models predicted the successful outcomes of single session mini-ECIRS using preoperative parameters from the largest mini-ECIRS database. Less than four involved calyces and a smaller stone burden implied a successful outcome. The machine learning-based model showed remarkably high accuracy and may be a promising tool for physicians and patients to obtain proper consent, avoid inefficient surgery, and decide preoperatively on the most efficient treatment strategies, including staged mini-ECIRS.