Skip to main content

Utilizing machine learning to facilitate the early diagnosis of posterior circulation stroke

Abstract

Background

Posterior Circulation Syndrome (PCS) presents a diagnostic challenge characterized by its variable and nonspecific symptoms. Timely and accurate diagnosis is crucial for improving patient outcomes. This study aims to enhance the early diagnosis of PCS by employing clinical and demographic data and machine learning. This approach targets a significant research gap in the field of stroke diagnosis and management.

Methods

We collected and analyzed data from a large national Stroke Registry spanning from January 2014 to July 2022. The dataset included 15,859 adult patients admitted with a primary diagnosis of stroke. Five machine learning models were trained: XGBoost, Random Forest, Support Vector Machine, Classification and Regression Trees, and Logistic Regression. Multiple performance metrics, such as accuracy, precision, recall, F1-score, AUC, Matthew’s correlation coefficient, log loss, and Brier score, were utilized to evaluate model performance.

Results

The XGBoost model emerged as the top performer with an AUC of 0.81, accuracy of 0.79, precision of 0.5, recall of 0.62, and F1-score of 0.55. SHAP (SHapley Additive exPlanations) analysis identified key variables associated with PCS, including Body Mass Index, Random Blood Sugar, ataxia, dysarthria, and diastolic blood pressure and body temperature. These variables played a significant role in facilitating the early diagnosis of PCS, emphasizing their diagnostic value.

Conclusion

This study pioneers the use of clinical data and machine learning models to facilitate the early diagnosis of PCS, filling a crucial gap in stroke research. Using simple clinical metrics such as BMI, RBS, ataxia, dysarthria, DBP, and body temperature will help clinicians diagnose PCS early. Despite limitations, such as data biases and regional specificity, our research contributes to advancing PCS understanding, potentially enhancing clinical decision-making and patient outcomes early in the patient’s clinical journey. Further investigations are warranted to elucidate the underlying physiological mechanisms and validate these findings in broader populations and healthcare settings.

Peer Review reports

Introduction

Posterior circulation stroke (PCS) constitutes 20% of all ischemic stroke [1] with 70,000–100,000 patients presenting with PCS in the USA annually [2]. PCS is difficult to diagnose owing to the often stuttering, progressive and/or non-lateralizing nature of the symptoms given the vast area of blood supply and non-specific symptomatology [3]. Furthermore, computed tomography (CT), is less reliable in diagnosing PCS [10]. As described by Schneider et al. [4]. , the Timely diagnosis relies upon a careful history and a high clinical index of suspicion (e.g. speed of onset, age, and vascular risk factors).

Research conducted by Mehndiratta and colleagues highlighted hypertension as the predominant risk factor associated with PCS. Interestingly, vertigo emerged as the most frequently observed clinical symptom, closely followed by ataxia [5]. Furthermore, in a comparison with anterior circulation strokes, it was observed that type 2 diabetes mellitus exhibited a stronger association with PCS, particularly in cases of pontine infarctions as opposed to non-pontine subtypes of PCS [6]. Additionally, it’s worth noting that vertebral artery hypoplasia is present in 10% of the general population in China, and it stands as an independent risk factor for PCS, alongside male sex [7].

Qatar, a prosperous peninsula situated on the northeastern border of the Arabian Peninsula, has a native Qatari population comprising only 15% of the total populace [4]. Despite its affluence, the country faces significant public health challenges, including a high prevalence of obesity, diabetes mellitus (DM), and cardiovascular disease [5]. In 2020, Qatar ranked 15th globally for obesity, affecting over 35% of its citizens. Additionally, in 2013, approximately 16% of the population received a diagnosis of diabetes mellitus [6]. Notwithstanding these concerning statistics, Qatar maintains a relatively low stroke incidence rate of 58 cases per 100,000 individuals, significantly lower than the MENA region’s rate of 250 cases per 100,000 people [7, 8]. Moreover, Qatar exhibits a comparatively low rate of stroke-related fatalities [7]. This phenomenon can be ascribed to the distinctive demographic makeup, where the expatriate working-age population constitutes the majority [4, 9]. This heterogeneous demographic and ethnic composition have a significant implication on the stroke characteristics compared to the Caucasian dominant population where most of the published stroke database publications come from [4].

The use of Machine Learning (ML) in medicine, particularly through Explainable Artificial Intelligence (XAI), is crucial for enhancing model performance, building user trust, and supporting decision-making processes, thereby potentially increasing AI’s clinical impact and adoption in healthcare [8]. By developing robust ML models for early and accurate prediction of clinical outcomes across diverse demographics. This advancement enables the customization of treatment plans to meet the unique needs of each patient, thereby potentially saving lives. Giuste et al. developed a robust ML model to predict patient-specific risk of death using features available at the time of diagnosis. Consequently, this fosters a greater acceptance and integration of AI technologies within healthcare frameworks [9].

Various scoring systems and scales have been employed to assess and predict the outcome, prognosis, and severity of PCS. A substantial body of research consistently indicates that PCS tends to carry a less favorable prognosis in comparison to anterior circulation strokes [10]. The National Institutes of Health Stroke Scale (NIHSS) serves as the most widely utilized tool for gauging stroke severity. However, it falls short when assessing PCS due to its inability to capture clinical elements specific to the posterior circulation, such as nystagmus or gait disturbances. This limitation can result in an underestimation of the severity of PCS [11, 12]. To address this gap, several alternative scoring systems have been developed to more accurately evaluate the severity of PCS, such as Adam’s Scale of Posterior Stroke (ASPOS) [12] and the posterior NIHSS [13] Additionally, research has shown that utilizing NIHSS scores 24 h post-stroke proves to be a more precise predictor of functional outcomes within 90 days following thrombectomy, as opposed to relying solely on NIHSS scores upon admission [14].

ML has been heavily utilized in the field of stroke to predict certain stroke outcomes and to help improve personalized medicine [15, 16]. ML-based assessment tools, such as the Posterior Circulation Acute Stroke Prognosis Early CT-Score (pc-ASPECTS), have been developed to enhance outcome prediction for PCS utilizing imaging-based ML. Impressively, pc-ASPECTS demonstrates superior accuracy when evaluated through Receiver Operating Characteristic (ROC) curves, particularly in forecasting outcomes for minor strokes occurring within the initial 36 h after stroke onset. Utilizing a cutoff value of 7, individuals with a pc-ASPECTS score exceeding 7 tend to exhibit more favorable outcomes [17, 18]. Similarly, pcASCO (Posterior Circulation Acute Stroke Prognosis Collateral Score) serves as another imaging-based scoring system designed to predict functional independence at day 90 and the occurrence of malignant cerebellar edema (MCE) in patients with basilar artery occlusion (BAO) stroke upon admission [19]. Furthermore, Tan and colleagues, have identified the Hyperdense Basilar Artery Sign (HDBA) on unenhanced computed tomography scans as a valuable indicator for the early diagnosis of acute PCS and the prediction of a less favorable short-term outcome [20].

While significant strides have been made in predicting stroke outcomes, it is evident that progress in the domain of PCS outcome prediction lags significantly behind that of anterior circulation stroke models. What is more concerning is the limited research dedicated to facilitating the early diagnosis of PCS, primarily due to its challenging presentation. Therefore, this study aims to address this gap by utilizing a diverse range of machine-learning models to boost the physicians’ capacity to diagnose PCS early using patients’ demographic and clinical data without relying on complex imaging data.

Methods

Our proposed methodology involves a systematic approach to utilize ML techniques for analysing data from a stroke registry. Initially, we gather comprehensive data from the registry, ensuring inclusion of all relevant variables. Following this, we conduct thorough data analysis to uncover correlations among the variables. The dataset is then split for training multiple ML models. These models undergo evaluation, with the best-performing model selected for further analysis and potential deployment. Figure 1 outlines the sequential steps of our ML prediction system.

Fig. 1
figure 1

Flow diagram for the proposed prediction system

Ethical approval

The study obtained authorization from the Institutional Research Board (IRB) at Hamad Medical Corporation in Qatar, identified by reference number MRC-01-22-594.

Study population

We gathered data from the national Qatar Stroke Registry housed at Hamad General Hospital (HGH), the sole tertiary and referral stroke center in Qatar, covering the period from January 2014 to July 2022. The dataset comprises individuals aged 18 years and above who were admitted to HGH with a primary diagnosis of stroke. Over the course of establishing the stroke registry in Qatar until July 2022, a total of 15,859 patients sought specialized stroke treatment at the hospital. This encompassed patients with diagnoses of ischemic and hemorrhagic strokes, transient ischemic attacks (TIAs), and stroke mimics. However, our study specifically concentrates on patients diagnosed with non-hemorrhagic strokes who have valid Bamford class, while excluding all other conditions.

Baseline variables

The data collected encompassed a comprehensive array of patient information, spanning demographics, hemodynamic measurements upon admission (including heart rate (HR), blood pressure (BP), and temperature), factors contributing to stroke risk (such as smoking history, pre-existing medical conditions, Body Mass Index (BMI)), neurological symptoms at presentation, and the assessment of stroke severity at admission utilizing the National Institute of Health Stroke Score (NIHSS) [21, 22]. We utilized the CDC’s 5-class definition for adult overweight and obesity [23] to categorize Body Mass Index (BMI) groups.

Regarding ethnicity, patients were categorized into five distinct groups based on their reported nationality: Qatari, Middle East and North Africa (MENA) region, South Asia region, South East Asia region (defined in accordance with the United Nations geo-scheme), and all other nationalities were grouped into an “other” category [24, 25]. Notably, the specific categorization for Qatari patients was employed to enable meaningful comparisons, recognizing the unique demographic composition of the country where a significant portion of the population consists of expatriates [26, 27]. This classification methodology has been consistently applied in previous stroke research in Qatar [15, 24, 28].

All relevant risk factors, such as pre-existing medical conditions and smoking history, were meticulously recorded during the patient’s hospitalization and cross-validated by stroke registry personnel through electronic medical records. A total of 29 variables, as outlined in Table  1, were utilized in predicting PCS.

Outcome measure

Our primary focus for outcome is diagnosing POCI (PCS) based on the Bamford classification of cerebral stroke. Bamford classification, also known as, Oxford Community Stroke Project Classification, classifies the cerebral infarction stroke into four categories prognostically and etiologically as follows; (a) Total Anterior Circulatory Infarct (TACI), (b) Lacunar Anterior Circulatory Infarct (LACI), (c) Partial Anterior Circulatory Infarct (PACI), and (d) Posterior Circulatory Infarct (POCI) [29,30,31]. In this study’s outcome variable, PCS cases were coded as ‘1’, while all other classes were assigned a ‘0’ code.

Inclusion/exclusion criteria

This study encompassed all adult patients aged 18 years or older who received a diagnosis of stroke. Out of the initial cohort of 15,859 patients, 1,657 who were diagnosed with haemorrhagic stroke were excluded. Records with missing or unstandardized outcome variables were also eliminated. Additionally, any records that lacked more than 50% of the included variables were excluded, resulting in a final dataset of 12,703 records eligible for the study, as depicted in Fig. 2.

Fig. 2
figure 2

Inclusion and exclusion process

Handling missing data and class imbalance

In instances where data value was missing, we adopted the Multiple Imputation using Chained Equations (MICE) technique to generate data imputations [32]. Within the dataset, it was observed that the Random Blood Sugar (RBS) had the highest rate of missing values at 3% and followed by Heart Rate (HR) at 0.3%. The cohort presented a PCS rate of 20.7%, leading to a concern regarding class imbalance. To address this issue, To address this issue, we integrated class weighting to counteract the imbalance [33, 34]. Specifically, we assigned class weights inversely proportional to class frequencies, granting greater weight to the minority class (patients who have PCS) to enhance their impact during the training process.

Model training and evaluation

The dataset was divided into two subsets: a training set comprising 80% of the data and a validation set containing 20%, employing a stratified random sampling method. Our models were built using the training dataset, and their performance was assessed using the validation dataset. In order to optimize model performance and efficiency, feature scaling with data normalization was conducted prior to the training of ML models. We trained a total of five machine learning models, including XGB, weight-adjusted RF, SVM, CART, and LR coupled with random under-sampling of the majority class.

We employed a range of classification metrics to evaluate the effectiveness of the models. These metrics included accuracy, precision, specificity, recall, F1-score, area under the receiver operating characteristic curve (AUC), Matthew’s correlation coefficient (MCC), log loss, and Brier score [35,36,37,38,39]. These metrics offer insights into the model’s ability to accurately classify both positive instances (patients with PCS) and negative instances (patients with non-PCS), considering the class imbalance. The model with the highest F1-score will be selected as the primary model for subsequent external and temporal validation.

Model explainability

To gain better insight into the decision-making of ML models, we utilized SHAP (SHapley Additive exPlanations). SHAP is a robust toolkit employed to elucidate the classification made by ML models [40]. It works by producing “variable importance” scores at the individual level for features, known as SHAP values. These values quantify how much each feature contributes to a specific classification result.

Table 1 Statistical characteristics of the collected stroke dataset
Table 2 Models performance evaluation metrics

Results

The cohort includes 12,703 patients. As in Tables 1 and 73% of the patients are males. The mean age of the cohort is 53.7 ± 14 years. Approximately, 21% of the patients were diagnosed with PCS consistent with the global trend. Around 44% of the cohort is from MENA region and 43% is from South Asian ethnicity.

Model evaluation

As indicated in Table 2; Fig. 3, the models exhibit varying levels of performance, with the XGBoost (XGB) model emerging as the top performer. Despite the moderate F1 score and precision, XGB achieved notable results, boasting an AUC of 0.81, an accuracy of 0.79, a precision score of 0.5, a recall rate of 0.62, and an F1-score of 0.55. The Random Forest (RF) model also demonstrated competitive performance, with an AUC of 0.83, though it recorded the lowest F1 score of 0.39, indicating relatively weaker overall classification capabilities. Conversely, Logistic Regression (LR) excelled in terms of AUC, recall, Brier score, and F1 score but exhibited lower precision, specificity, and Matthew’s Correlation Coefficient (MCC). Therefore, we have chosen XGB as the primary model for subsequent SHAP analysis, external validation, and temporal validation in diagnosing PCS.

Fig. 3
figure 3

Area Under the Receiving Operating Curve (AUC-ROC) of the trained models

SHAP analysis

The SHAP analysis yielded invaluable and pivotal insights for identifying PCS. In this context, the crucial factors, ranked by importance, are BMI, stroke severity upon presentation (measured by NIHSS), Random Blood Sugar (RBS), ataxia, dysarthria, Diastolic Blood Pressure (DBP), and body temperature at admission. Figures 4 and 5 illustrate SHAP analysis results.

Fig. 4
figure 4

SHAP mean values

Fig. 5
figure 5

SHAP violin plot- variables impact on model’s output

Discussion

We evaluated the effectiveness of five ML models and harnessed SHAP analysis to determine the pivotal factors linked to PCS. The aim is to facilitate early identification of PCS which is challenging to diagnose through ML models utilizing simple clinical data without neuroimaging. The paucity of existing literature addressing utilization of simple clinical data to classify PCS diagnosis underlines the distinctiveness and originality of our study

SHAP analysis reaffirmed the significant influence of BMI on the model’s classification performance. Typically, obesity is well-recognized as a leading risk factor for stroke and is linked with unfavorable prognoses [41, 42]. A study conducted in Qatar in 2009 similarly identified obesity as one of the major risks associated with PCS, finding that patients with a BMI > 30 faced a heightened risk of PCS compared to those with lower BMIs [43]. Likewise, our study revealed that patients with a BMI ≥ 25 are four times more likely to be at a higher risk of PCS compared to those with a BMI < 25. Remarkably, 22.7% of patients presenting with PCS had a BMI greater than 25%, in contrast to only 6.6% with a BMI below 25%. This difference was statistically significant with a p-value less than 0.05. However, it is important to acknowledge recent reviews that have questioned the robustness of BMI as an indicator of obesity. These critiques cite concerns of potential inaccuracies in body weight estimation, insufficient adjustments for comorbidities, non-linear BMI-outcome relationships, limited follow-up durations, and potential selection bias arising from the retrospective nature of much of the earlier literature [44, 45]

The National Institute of Health Stroke Scale (NIHSS) is a well-established tool for assessing stroke severity and predicting stroke outcomes [15, 16]. In our study, we observed that PCS tends to be associated with a slightly lower NIHSS scores upon admission when compared to others, as depicted in Fig. 5, with PCS averaging 3.25 on the NIHSS score, while other stroke types averaged 3.43, however this was not statistically significance. Similarly, Wen-Dan et al. and Imam et al. found that PCS to be associated with lower NIHSS at admission [46, 47]. This is probably due to its inherent bias of design towards anterior circulation resulting in higher scores assigned to anterior circulation strokes rather than PCS [48]

The third variable that exhibited a significant association with PCS and contributed to the model’s output is the Random Blood Sugar (RBS) level. The anterior and posterior circulations in the body differ anatomically and functionally from the posterior circulation. Anatomically, it is common to observe hypoplasia (underdevelopment) or aplasia (absence) in the vertebral artery. Frequently, one of these two arteries is either underdeveloped or terminates prematurely at the posterior inferior cerebral artery. These thinner, underdeveloped arteries are more susceptible to thrombotic occlusion, especially in the context of poor glycemic control. Functionally, the posterior circulation exhibits a less effective self-regulating mechanism for maintaining stable blood flow [49], and it also receives reduced influence from the sympathetic nervous system [50]. This makes the posterior circulation more susceptible to damage in diabetics. Diabetes also causes different structural changes in the anterior and posterior circulations, influenced by glycated hemoglobin [51], increasing the risk of stroke in the posterior circulation due to chronic high blood sugar [47]. This study identified a positive correlation between RBS levels and PCS. In our secondary analysis, we observed that the average RBS measured at admission for patients with PCS was significantly higher than that for those with non-PCS, with values of 10 mmol/l compared to 8.5 mmol/l, p-value < 0.05

Symptoms presented during initial patient assessment play a crucial role in aiding physicians in the diagnostic process. Our study revealed that ataxia emerges as a key indicator for the early diagnosis of PCS. Remarkably, 59.2% of PCS patients manifested ataxia, in contrast to 41% of patients in other diagnostic categories. Importantly, the odds of experiencing ataxia in cases of PCS were six times higher than in non-PCS cases, with a p-value < 0.05. This finding aligns with previous research, where the presence of ataxia has consistently been associated with PCS, regardless of the specific location of the lesion [5, 52]

Similarly, our analysis revealed that dysarthria was more strongly associated with non-PCS categories as opposed to PCS. Specifically, only 16.5% of patients diagnosed with PCS exhibited dysarthria, while a significant majority, accounting for 83.5%, was observed in non-PCS cases, p-value < 0.05. Interestingly, Wen-Dan et al. did not identify a significant distinction between PCS and anterior circulation strokes in terms of the presentation of dysarthria [46]. Dysarthria is a nonspecific symptom that occurs in stroke [53] and in many non-vascular brain disease such Parkinsons Disease [54], Traumatic brain injury [53] and Motor neuron disease [55] as well as stroke mimics [56]. It’s worth noting that our findings may be attributed to the focus of our study on PCS versus all other non-hemorrhagic stroke types rather than anterior stroke alone as in Wen-Dan et al.’s study. And given lacunar syndromes that are commonly associated with dysarthria such as clumsy-hand dysarthria are typically classified as lacunar stroke rather than POCI this maybe an additional reason for its prevalence in non-PCS stroke

Diastolic Blood Pressure (DBP) was found to be strongly associated with PCS, with patients diagnosed with PCS typically presenting higher mean DBP levels than those in other diagnostic categories. For the entire cohort, we observed a mean DBP of 88 ± 18 mmHg. Notably, patients with PCS exhibited a mean DBP of 89 mmHg, compared to 87.7 mmHg in patients from other diagnostic categories, p-value < 0.05. Age and lacunar etiology are both associated with systolic hypertension whereas in this selective relatively young population where presenting phenotypical lacunar strokes are grouped as non-PCS, we found an association with an elevated DBP [57]. Similarly, a study conducted in South London reported a noteworthy association between DBP readings, both before and after the occurrence of PCS, in comparison to anterior circulation stroke cases [58]

Past literature has examined the relationship between body temperature and acute stroke, with findings suggesting that body temperature can serve as an independent predictor of stroke outcomes, where higher temperatures are associated with poorer outcomes [59, 60]. However, there has been a very limited prior research reporting differences in body temperature readings upon admission between PCS and other non-hemorrhagic stroke categories such as in Karaszewski et al. [61, 62] and Kim eta al [63]. , . This gap in research could stem from healthcare providers primarily concentrating on distinguishing between normal and abnormal hemodynamic values, rather than identifying subtle variations within the normal range. These minor changes in hemodynamic readings, although within normal limits, may have potential correlations with various clinical conditions. In our study, the mean temperature for the entire cohort stood at 36.7 ± 0.33 °C. Our secondary analysis revealed a statistically significant difference in mean body temperature between patients with PCS and those with other stroke types. Specifically, the mean body temperature for the PCS cohort was 36.6, compared to 36.72 for the other cohort, with a p-value < 0.05. The remaining variables, although important, were found to have lesser influence on the model’s classification performance

Strength and limitations

The strength of this study lies in its creation of a forward-thinking, comprehensive, and diverse dataset, which is distinguished by its formation as a nationwide database. This approach effectively surpasses the constraints typically associated with traditional hospital-centric registries. This study employed a national stroke registry, prospectively capturing data from all patients receiving specialized stroke care across the country. However, our research encounters limitations stemming from the retrospective method of data extraction and the presence of incomplete data records. Furthermore, the use of registries can lead to potential challenges in interpretation, documentation, and coding accuracy

Firstly, our study relies on retrospective integration of clinical data, which may be subject to inherent biases and limitations associated with data collection methods. The accuracy and completeness of the electronic medical records could impact the quality of the data, potentially introducing information bias or missing relevant variables

Secondly, the study’s dataset is drawn from a specific healthcare setting in Qatar, and its generalizability to broader populations or healthcare systems may be limited. Regional variations in stroke demographics, risk factors, and healthcare practices could influence the external validity of our findings. Moreover, our study focuses on the utilization of clinical data and ML models to classify PCS. While this approach yields promising results, it does not necessarily take into consideration the underlying physiological mechanisms or causative factors driving the observed associations. Further research may be needed to elucidate the biological underpinnings of these relationships

Additionally, our study does not account for potential confounding factors that could influence the associations between the variables and PCS. Factors such as medication usage, or lifestyle factors were not included in our analysis but may play a role. Lack of imaging information may also present a significant limitation to the potential of the ML model capacity to achieve better performance

Finally, while ML models offer predictive capabilities, their interpretability may be limited, and the model’s decisions may not always align with clinical reasoning. Ensuring that the predictions generated by these models are clinically relevant and actionable is an ongoing challenge in the field of predictive analytics. Considering these limitations, our study serves as a foundational exploration of PCS classification using clinical data and ML but underscores the need for further research, prospective studies, and consideration of broader contexts and variables in future investigations

Conclusions

In conclusion, this study represents an original effort to help classify and differentiate PCS upon admission to the hospital using clinical data and ML models. The significance of this study lies in exploring a potential supportive role of ML in helping clinicians with the challenging diagnosis of PCS, based on simple clinical data without neuroimaging. Our analysis of five ML models, accompanied by SHAP analysis, has contributed to a deeper understanding of how these models can aid healthcare providers in the early detection of PCS, addressing a critical gap in stroke research

Several key findings have emerged from our investigation. BMI, stroke severity (NIHSS), RBS, ataxia, dysarthria, DBP and body temperature have been identified as important factors associated with PCS.

Data availability

The datasets generated and/or analyzed during the current study are available from the corresponding author on reasonable request and subject to appropriate ethical approvals.

References

  1. Zürcher E, Richoz B, Faouzi M, Michel P. Differences in ischemic anterior and posterior circulation strokes: a clinico-radiological and Outcome Analysis. J Stroke Cerebrovasc Dis. 2019;28(3):710–8.

    Article  PubMed  Google Scholar 

  2. Mozaffarian D, Benjamin EJ, Go AS, Arnett DK, Blaha MJ, Cushman M, et al. Heart Disease Stroke Statistics—2016 Update Circulation. 2016;133(4):e38–360.

    PubMed  Google Scholar 

  3. Burns JD, Rindler RS, Carr C, Lau H, Cervantes-Arslanian AM, Green-LaRoche DM, et al. Delay in diagnosis of basilar artery stroke. Neurocrit Care. 2016;24(2):172–9.

    Article  PubMed  Google Scholar 

  4. Schneider AM, Neuhaus AA, Hadley G, Balami JS, Harston GW, DeLuca GC, et al. Posterior circulation ischaemic stroke diagnosis and management. Clin Med (Lond). 2023;23(3):219–27.

    Article  PubMed  Google Scholar 

  5. Mehndiratta M, Pandey S, Nayak R, Alam A. Posterior circulation ischemic stroke-clinical characteristics, risk factors, and subtypes in a north Indian population: a prospective study. Neurohospitalist. 2012;2(2):46–50.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Zhu J, Li Y, Wang Y, Zhu S, Jiang Y. Higher prevalence of diabetes in Pontine Infarction than in other posterior circulation strokes. J Diabetes Res. 2022;2022:4819412.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Hu XY, Li ZX, Liu HQ, Zhang M, Wei ML, Fang S, et al. Relationship between vertebral artery hypoplasia and posterior circulation stroke in Chinese patients. Neuroradiology. 2013;55(3):291–5.

    Article  PubMed  Google Scholar 

  8. Bhadra S, Kumar CJ. Enhancing the efficacy of depression detection system using optimal feature selection from EHR. Comput Methods Biomech Biomed Engin. 2024;27(2):222–36.

    Article  PubMed  Google Scholar 

  9. Giuste FO, He L, Lais P, Shi W, Zhu Y, Hornback A, et al. Early and fair COVID-19 outcome risk assessment using robust feature selection. Sci Rep. 2023;13(1):18981.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Rarhi D, Kundu PK, Datta AK, Basu S, Ray A. A clinical comparison along with prediction of the outcome and prognosis of anterior and posterior circulation stroke patients admitted in Tertiary Care Hospital. J Assoc Physicians India. 2022;70(6):11–2.

    PubMed  Google Scholar 

  11. Sato S, Toyoda K, Uehara T, Toratani N, Yokota C, Moriwaki H, et al. Baseline NIH stroke scale score predicting outcome in anterior and posterior circulation strokes. Neurology. 2008;70(24 Pt 2):2371–7.

    Article  CAS  PubMed  Google Scholar 

  12. Wiśniewski A, Filipska K, Piec K, Jaskólski F, Ślusarz R. Introducing Adam’s Scale of Posterior Stroke (ASPOS): A Novel Validated Tool to Assess and Predict Posterior Circulation Strokes. Brain sciences [Internet]. 2021 2021/03//; 11(4):[424 p.]. http://europepmc.org/abstract/MED/33810516https://www.mdpi.com/2076-3425/11/4/424/pdf?version=1617937162https://doi.org/10.3390/brainsci11040424https://europepmc.org/articles/PMC8065750https://europepmc.org/articles/PMC8065750?pdf=render.

  13. Alemseged F, Rocco A, Arba F, Schwabova JP, Wu T, Cavicchia L, et al. Posterior National Institutes of Health Stroke Scale improves prognostic accuracy in posterior circulation stroke. Stroke. 2022;53(4):1247–55.

    Article  CAS  PubMed  Google Scholar 

  14. Kniep H, Bechstein M, Broocks G, Brekenfeld C, Flottmann F, van Horn N, et al. Early surrogates of outcome after thrombectomy in posterior circulation stroke. Eur J Neurol. 2022;29(11):3296–306.

    Article  PubMed  Google Scholar 

  15. Abujaber AA, Alkhawaldeh IM, Imam Y, Nashwan AJ, Akhtar N, Own A et al. Predicting 90-day prognosis for patients with stroke: a machine learning approach. Front Neurol. 2023;14.

  16. Abujaber AA, Albalkhi I, Imam Y, Nashwan AJ, Yaseen S, Akhtar N, et al. Predicting 90-Day prognosis in ischemic stroke patients Post Thrombolysis using machine learning. J Personalized Med. 2023;13(11):1555.

    Article  Google Scholar 

  17. Kniep HC, Elsayed S, Nawabi J, Broocks G, Meyer L, Bechstein M, et al. Imaging-based outcome prediction in posterior circulation stroke. J Neurol. 2022;269(7):3800–9.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Lin S-F, Chen C-I, Hu H-H, Bai C-H. Predicting functional outcomes of posterior circulation acute ischemic stroke in first 36 h of stroke onset. J Neurol. 2018;265(4):926–32.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Broocks G, Meyer L, Faizy TD, Elsayed S, Kniep H, Kemmling A, et al. New imaging score for outcome prediction in basilar artery occlusion stroke. Eur Radiol. 2022;32(7):4491–9.

    Article  CAS  PubMed  Google Scholar 

  20. Tan X, Guo Y. Hyperdense basilar artery sign diagnoses acute posterior circulation stroke and predicts short-term outcome. Neuroradiology. 2010;52(12):1071–8.

    Article  PubMed  Google Scholar 

  21. Purrucker J, Hametner C, Engelbrecht A, Bruckner T, Popp E, Poli S. Comparison of Stroke Recognition and Stroke Severity Scores for Stroke Detection in a single cohort. J Neurol Neurosurg Psychiatry. 2015;86(9):1021.

    Article  PubMed  Google Scholar 

  22. Brott T, Adams HP, Olinger CP, Marler JR, Barsan WG, Biller J, et al. Measurements of Acute Cerebral infarction: a clinical examination Scale. Stroke. 1989;20(7):864–70.

    Article  CAS  PubMed  Google Scholar 

  23. Center of Disease Control (CDC). Defining Adult Overweight & Obesity 2022 [.

  24. Saqqur M, Salam A, Ayyad A, Akhtar N, Ali M, Khan A, et al. The prevalence, Mortality Rate and Functional Outcome of Intracerebral Hemorrhage according to age sex and Ethnic Group in the state of Qatar. Clin Neurol Neurosurg. 2020;199:106255.

    Article  PubMed  Google Scholar 

  25. Seizing the Opportunity. Ending AIDS in the Middle East and North Africa Amman. United Nations Children’s Fund (UNICEF); 2019.

  26. Gulli G, Rutten-Jacobs L, Kalra L, Rudd A, Wolfe C, Markus H. Differences in the Distribution of Stroke Subtypes in a UK Black Stroke Population - final results from the South London ethnicity and stroke study. BMC Med. 2016;14:77.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Imam Y, Kamran S, Saqqur M, Ibrahim F, Chandra P, Perkins J, et al. Stroke in the Adult Qatari Population (Q-stroke) a hospital-based Retrospective Cohort Study. PLoS ONE. 2020;15(9):e0238865.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Imam Y, Kamran S, Akhtar N, Deleu D, Singh R, Malik R, et al. Incidence, clinical features and outcomes of Atrial Fibrillation and Stroke in Qatar. Int J Stroke. 2020;15(1):85–9.

    Article  PubMed  Google Scholar 

  29. Bamford J, Sandercock P, Dennis M, Warlow C, Jones L, McPherson K, et al. A prospective study of acute cerebrovascular disease in the community: the Oxfordshire Community Stroke Project 1981-86. 1. Methodology, demography and incident cases of first-ever stroke. J Neurol Neurosurg Psychiatry. 1988;51(11):1373–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. de Andrade JBC, Mohr JP, Timbó FB, Nepomuceno CR, Moreira J, Timbó I, et al. Oxfordshire Community Stroke Project classification: a proposed automated algorithm. Eur Stroke J. 2021;6(2):160–7.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Bamford J, Sandercock P, Dennis M, Burn J, Warlow C. Classification and natural history of clinically identifiable subtypes of cerebral infarction. Lancet. 1991;337(8756):1521–6.

    Article  CAS  PubMed  Google Scholar 

  32. Van Buuren S, Groothuis-Oudshoorn K. Mice: Multivariate imputation by chained equations in R. J Stat Softw. 2011;45:1–67.

    Article  Google Scholar 

  33. Chen T, Guestrin C, editors. Xgboost: A scalable tree boosting system2016.

  34. Breiman L. Random forests. Mach Learn. 2001;45:5–32.

    Article  Google Scholar 

  35. Dharmarathne G, Hanea A, Robinson AP. Improving the computation of Brier scores for evaluating Expert-elicited judgements. Front Appl Math Stat. 2021;7:669546.

    Article  Google Scholar 

  36. Dodge Y. The concise encyclopedia of statistics. Springer Science & Business Media; 2008.

  37. McHugh ML. Interrater reliability: the kappa statistic. Biochemia Med. 2012;22(3):276–82.

    Article  Google Scholar 

  38. Chicco D, Jurman G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics. 2020;21(1):1–13.

    Article  Google Scholar 

  39. Vujović Ž. Classification model evaluation metrics. Int J Adv Comput Sci Appl. 2021;12(6):599–606.

    Google Scholar 

  40. Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. Adv Neural Inf Process Syst. 2017;30.

  41. Abedi V, Avula V, Razavi S, Bavishi S, Chaudhary D, Shahjouei S, et al. Predicting short and long-term mortality after acute ischemic stroke using EHR. J Neurol Sci. 2021;427:117560.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Zhu E, Chen Z, Ai P, Wang J, Zhu M, Xu Z, et al. Analyzing and predicting the risk of death in stroke patients using machine learning. Front Neurol. 2023;14:1096153.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Akhtar N, Kamran S, Deleu D, D’Souza A, Miyares F, Elsotouhy A, et al. Ischaemic posterior circulation stroke in state of Qatar. Eur J Neurol. 2009;16(9):1004–9.

    Article  CAS  PubMed  Google Scholar 

  44. Oesch L, Tatlisumak T, Arnold M, Sarikaya H. Obesity paradox in stroke–myth or reality? A systematic review. PLoS ONE. 2017;12(3):e0171334.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Forlivesi S, Cappellari M, Bonetti B. Obesity paradox and stroke: a narrative review. Eating and Weight disorders - studies on Anorexia. Bulimia Obes. 2021;26(2):417–23.

    Google Scholar 

  46. Tao W-D, Liu M, Fisher M, Wang D-R, Li J, Furie KL, et al. Posterior Versus Anterior Circulation Infarct Stroke. 2012;43(8):2060–5.

    PubMed  Google Scholar 

  47. Imam Y, Chandra P, Singh R, Hakeem I, AlSirhan S, Kotob M, et al. Incidence, clinical features, and outcomes of posterior circulation ischemic stroke insights from a large multiethnic stroke database. Front Neurol. 2024;15:1302298.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Kazi SA, Siddiqui M, Majid S. Stroke outcome prediction using admission Nihss in anterior and posterior circulation stroke. J Ayub Med Coll Abbottabad. 2021;33(2):274–8.

    PubMed  Google Scholar 

  49. Haubrich C, Wendt A, Diehl RR, Klotzsch C. Dynamic autoregulation testing in the posterior cerebral artery. Stroke. 2004;35(4):848–52.

    Article  CAS  PubMed  Google Scholar 

  50. Edvinsson L, Owman C, Sjo N-O. Autonomic nerves, mast cells, and amine receptors in human brain vessels. A histochemical and pharmacological study. Brain Res. 1976;115(3):377–93.

    Article  CAS  PubMed  Google Scholar 

  51. Ichikawa H, Mukai M, Takahashi N, Katoh H, Kuriki A, Kawamura M. Dilative arterial remodeling of the brain with different effects on the anterior and posterior circulation: an MRI study. J Neurol Sci. 2009;287(1–2):236–40.

    Article  PubMed  Google Scholar 

  52. Deluca C, Moretto G, Di Matteo A, Cappellari M, Basile A, Bonifati DM, et al. Ataxia in posterior circulation stroke: clinical-MRI correlations. J Neurol Sci. 2011;300(1–2):39–46.

    Article  PubMed  Google Scholar 

  53. Mitchell C, Bowen A, Tyson S, Butterfint Z, Conroy P. Interventions for dysarthria due to stroke and other adult-acquired, non-progressive brain injury. Cochrane Database Syst Rev. 2017;1(1):Cd002088.

    PubMed  Google Scholar 

  54. Tjaden K. Speech and Swallowing in Parkinson’s Disease. Top Geriatr Rehabil. 2008;24(2):115–26.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Tomik B, Guiloff RJ. Dysarthria in amyotrophic lateral sclerosis: a review. Amyotroph Lateral Scler. 2010;11(1–2):4–15.

    Article  CAS  PubMed  Google Scholar 

  56. Popkirov S, Stone J, Buchan AM. Functional neurological disorder: a Common and Treatable Stroke Mimic. Stroke. 2020;51(5):1629–35.

    Article  PubMed  Google Scholar 

  57. Webb AJS, Werring DJ. New insights into Cerebrovascular Pathophysiology and Hypertension. Stroke. 2022;53(4):1054–64.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Cates MJ, Paton JFR, Smeeton NC, Wolfe CDA. Hypertension before and after posterior circulation infarction: analysis of data from the South London Stroke Register. J Stroke Cerebrovasc Dis. 2012;21(7):612–8.

    Article  PubMed  Google Scholar 

  59. Saxena M, Young P, Pilcher D, Bailey M, Harrison D, Bellomo R, et al. Early temperature and mortality in critically ill patients with acute neurological diseases: trauma and stroke differ from infection. Intensive Care Med. 2015;41(5):823–32.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Zhang W, Li F, Zhang C, Lei B, Deng W, Zeng H, et al. Impact of body temperature in patients with Acute Basilar artery occlusion: analysis of the BASILAR database. Front Neurol. 2022;13:907410.

    Article  PubMed  PubMed Central  Google Scholar 

  61. Karaszewski B, Thomas RGR, Dennis MS, Wardlaw JM. Temporal profile of body temperature in acute ischemic stroke: relation to stroke severity and outcome. BMC Neurol. 2012;12(1):123.

    Article  PubMed  PubMed Central  Google Scholar 

  62. Karaszewski B, Carpenter TK, Thomas RG, Armitage PA, Lymer GK, Marshall I, et al. Relationships between brain and body temperature, clinical and imaging outcomes after ischemic stroke. J Cereb Blood Flow Metab. 2013;33(7):1083–9.

    Article  PubMed  PubMed Central  Google Scholar 

  63. Kim SH, Saver JL. Initial body temperature in ischemic stroke: nonpotentiation of tissue-type plasminogen activator benefit and inverse association with severity. Stroke. 2015;46(1):132–6.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

Open Access funding is provided by the Qatar National Library.

Funding

The study was funded by the Medical Research Center at Hamad Medical Corporation (Grant: MRC-01-22-594).

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization: AAA, YI, IA.Formal analysis: AAA, IA.Data Curation, Methodology, Writing – original draft, Writing – review & editing: AAA, IA, YI, AJN, SY, NA.All authors read and approved the final manuscript.

Corresponding author

Correspondence to Abdulqadir J. Nashwan.

Ethics declarations

Ethics statement

The IRB at the Medical Research Center of Hamad Medical Corporation approved this project (MRC-01-22-594). The research was carried out in compliance with the ethical principles outlined in the Helsinki Declaration of 1964 and its subsequent modifications, as well as related ethical norms. The IRB waived the need for informed consent at the Medical Research Center of Hamad Medical Corporation due to the retrospective nature of the study.

Consent for publication

Not applicable.

Conflict of interest

The authors have no conflicts of interest to declare.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Abujaber, A.A., Imam, Y., Albalkhi, I. et al. Utilizing machine learning to facilitate the early diagnosis of posterior circulation stroke. BMC Neurol 24, 156 (2024). https://doi.org/10.1186/s12883-024-03638-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12883-024-03638-8

Keywords