Predictors of social risk for post-ischemic stroke reintegration

Cisek, Katryna K.; Nguyen, Thi Nguyet Que; Garcia-Rudolph, Alejandro; Saurí, Joan; Becerra Martinez, Helard; Hines, Andrew; Kelleher, John D.

doi:10.1038/s41598-024-60507-7

Download PDF

Article
Open access
Published: 02 May 2024

Predictors of social risk for post-ischemic stroke reintegration

Katryna K. Cisek^1,8,
Thi Nguyet Que Nguyen¹,
Alejandro Garcia-Rudolph^2,3,4,5,
Joan Saurí^2,3,4,
Helard Becerra Martinez⁶,
Andrew Hines⁶ &
…
John D. Kelleher^5,7,8

Scientific Reports volume 14, Article number: 10110 (2024) Cite this article

234 Accesses
Metrics details

Subjects

Abstract

After stroke rehabilitation, patients need to reintegrate back into their daily life, workplace and society. Reintegration involves complex processes depending on age, sex, stroke severity, cognitive, physical, as well as socioeconomic factors that impact long-term outcomes post-stroke. Moreover, post-stroke quality of life can be impacted by social risks of inadequate family, social, economic, housing and other supports needed by the patients. Social risks and barriers to successful reintegration are poorly understood yet critical for informing clinical or social interventions. Therefore, the aim of this work is to predict social risk at rehabilitation discharge using sociodemographic and clinical variables at rehabilitation admission and identify factors that contribute to this risk. A Gradient Boosting modelling methodology based on decision trees was applied to a Catalan 217-patient cohort of mostly young (mean age 52.7), male (66.4%), ischemic stroke survivors. The modelling task was to predict an individual’s social risk upon discharge from rehabilitation based on 16 different demographic, diagnostic and social risk variables (family support, social support, economic status, cohabitation and home accessibility at admission). To correct for imbalance in patient sample numbers with high and low-risk levels (prediction target), five different datasets were prepared by varying the data subsampling methodology. For each of the five datasets a prediction model was trained and the analysis involves a comparison across these models. The training and validation results indicated that the models corrected for prediction target imbalance have similarly good performance (AUC 0.831–0.843) and validation (AUC 0.881 - 0.909). Furthermore, predictor variable importance ranked social support and economic status as the most important variables with the greatest contribution to social risk prediction, however, sex and age had a lesser, but still important, contribution. Due to the complex and multifactorial nature of social risk, factors in combination, including social support and economic status, drive social risk for individuals.

Social frailty as a predictor of all-cause mortality and functional disability: a systematic review and meta-analysis

Article Open access 10 February 2024

Machine learning to predict mortality after rehabilitation among patients with severe stroke

Article Open access 18 November 2020

Machine learning predicts clinically significant health related quality of life improvement after sensorimotor rehabilitation interventions in chronic stroke

Article Open access 04 July 2022

Introduction

Following post-stroke rehabilitation, the long-term patient outcome generally encompasses reintegration into normal activities of daily living in the home, community, and workplace^1,2,3. An essential part of this process is community integration, which includes relationships with others, the ability to be independent in daily life activities (ADL), and participation in meaningful events^4,5,6. There is consistent evidence that continued positive interaction with one’s proximate social environment (e.g., family, friends and work life) exerts beneficial effects on health and well-being, increasing resilience to unexpected setbacks^7,8. Conversely, social isolation or lack of close social ties is associated with poor health and increased mortality risk^9,10. Complementary to community integration is minimizing social risk, which is a complex and multifactorial phenomenon that can vary significantly for an individual, but generally encompasses environmental, socioeconomic, as well as family and social support factors^11,12,13. For example, a patient with insufficient family support who is unable to access social support, such as home health care or a day center, is at a greater risk of poorer quality of life during reintegration, social isolation, and retreat from life (also termed fragility). These considerations emphasize the importance of quality of life, social well-being, as well as adequate support for patients with social risks during long-term reintegration^14,15.

Several studies in the literature highlight the importance of family support in the context of the social environment (also termed sociofamiliar) and the socioeconomic situation in the overall rehabilitation outcome and reintegration of patients^16,17,18,19. Although these studies target a broader patient population with physical, cognitive and sensory disturbances which include stroke patients, as well as elderly patients and their likelihood of discharge from a geriatric unit centre, nevertheless sociofamiliar factors play a significant role in the resilience of most patient populations. Ramírez-Duque et al. analyzed the clinical, functional, cognitive, sociofamiliar, and other characteristics of pluripathological patients and found that older people with cognitive and more severe functional impairment had worse sociofamiliar support than other patient groups¹⁸. In a similarly comprehensive study of the clinical, functional and social risk profiles of the elderly in a community in Lima, Peru, Varela-Pinedo et al. found that 8% of individuals lived alone, and nearly 60% had inadequate socioeconomic support and were at social risk¹⁹. In another study, Cahuana-Cuentas Milagros et al. concluded that family and socioeconomic factors have a significant impact on the levels of resilience of people with physical and sensory disabilities¹⁷. Finally, Sabartés et al. identified a deteriorated social situation as the only significant predictor of being institutionalized rather than discharged home for a cohort of hospitalized elderly patients¹⁶.

Since family, social and economic factors have been identified as having a significant impact on the quality of life of patients post-rehabilitation, the key goals of post-stroke reintegration have focused on improving patient outcomes across these factors, as well as designing personalized interventions for patients with social risk²⁰. More recently, special situations, such as the pandemic, have added additional uncertainties and strains to the recovery and reintegration process of patients^21,22,23. Therefore, it is essential for both the patients as well as clinicians to be able to forecast the level of dependence on social supports (the level of social risk) for an individual patient at admission to rehabilitation so that the necessary interventions can be put in place during rehabilitation in order to prevent setbacks after discharge. Due to the complexity of reintegration, encompassing the spatiotemporal component (long-term processes taking place in the home, community, and workplace)²⁴, multifactorial component (interdependency of psychosocial, environmental, and socioeconomic factors) as well as demographic and cultural factors (younger age, gender, geographic location)^{25,26,27,28,29}, predictive modelling of social risk is an invaluable tool in not only forecasting the level of social risk for an individual but also identifying the contributing factors to this risk. Accurate predictions of factors contributing to social risk can allow rehabilitation professionals (social workers, physical therapists, neuropsychogists, psychologists, etc.) to support persons with personalized interventions, prevent fragility, as well as help improve patients’ quality of life and support their specific clinical needs and challenges throughout the reintegration process. For this purpose, machine learning (ML) algorithms and statistical analyses have been employed in recent years to develop predictive models for stroke reintegration, such as in the case of long-term trajectories of community integration^30,31, and functional and cognitive improvement during rehabilitation^32,33. However, predictive modelling for social risk utilizing ML methodology has been a largely underexplored topic³⁴. Cisek, et al. focused on various conceptualizations of social risk during post-stroke reintegration, such as the International Classification of Functioning, Disability, and Health (ICF) framework, as well as utilizing data visualization to explore the cohort³⁵. In this work, we go beyond data exploration and understanding to predictive modeling and apply machine learning to develop interpretable predictive models that provide individualized predictions to guide personalized interventions for patients with social risk.

Table 1 EVSF questionnaire items and risk scoring metric.

Full size table

Methodology

Social risk questionnaire

Social workers conduct an interview at admission and discharge from the rehabilitation hospital following a structured questionnaire to assess social risk of patients, called “Escala de Valoracion Socio Familiar” (EVSF; eng. trans.: sociofamiliar assessment scale)³⁵. The questionnaire is based on the Gijon sociofamiliar scale³⁶ that includes five items (housing, family situation, economic situation, relationships, and social support). Accordingly, the EVSF questionnaire consists of five items also termed dimensions: cohabitation, economic status (indicating income sufficiency), home status (indicating home accessibility in case of mobility problems), family support and social support (Table 1). Each of these five items has five levels of risk that are scored from 1 to 5. A higher score for each item represents a higher risk for the social reintegration of the patient. The total score is the sum of the five-item scores and is between 5 and 25 and determines four social risk categories: (i) no social risk (5 points); (ii) mild social risk (6-9 points); (iii) important social risk (10-14 points); and (iv) severe social risk (15-25 points)³⁵. The reliability and validity of this questionnaire were evaluated by comparing the score obtained on the scale with a reference criterion of an independent, blind assessment by social work experts. It was reported to enable the detection of risk situations and social problems with good reliability and acceptable validity³⁷.

Table 2 Training set patient cohort admission data including social risk and demographics.

Full size table

Table 3 Hold-out test set cohort information including social risk and demographics.

Full size table

Training set patient cohort

Demographic, diagnostic and questionnaire data utilizing the EVSF items during the rehabilitation and reintegration of patients were recorded and collected at the Institut Guttmann (Barcelona, Spain) from 2007 to 2020. Inclusion criteria for this cohort consisted of adult patients 18–85 years of age at the time of stroke with an ischemic stroke diagnosis who were admitted within 3 weeks of the onset of symptoms, without any previous comorbidities leading to disability, and whose data was recorded within a week of admission and discharge. Exclusion criteria were any of the following: diagnosis of stroke in the context of another concomitant comorbidity (e.g., traumatic brain injury), a previous history of another disabling condition, patients with EVSF questionnaire performed more than 5 months post-injury, as well as more than 5 months stay at the rehabilitation hospital. The authors confirm that this study is compliant with the Helsinki Declaration of 1975, as revised in 2008 and it was approved by the Ethics Committee of Clinical Research of Institut Guttmann. Experimental protocols applied in this study were approved by Institut Guttmann’s Ethics Commitee. At admission participants provided written informed consent to be included in research studies addressed by the Institut Guttmann hospital.

On the basis of available demographic, diagnostic and questionnaire data at the admission of the patient to the Guttmann rehabilitation hospital, the patient cohort consisted of 217 patients and 16 variables for the modelling (Table 2). Although the Length Of Stay variable reported was the actual duration of the patients in rehabilitation from admission to discharge, this variable is estimated by clinicians at admission²⁰. Length of stay varies greatly within Spain; for an older population with mean age of 79.6 ± 7.9 years, Pérez et al reports mean 61.6 ± 45.6 days for 9 facilities in Catalonia-Spain, however, younger patients, such as the patients in this study who are 30 years younger, are reported to stay longer^{26,38, 39}. Hence, the longer length of stay (median 90 days) in this cohort is indicative of the poor functional status of this young, Spanish population (Table 2). The changes in social risk dimensions during patients stay at the hospital were previously examined in Cisek et al.; approximately a third of patients transitioned into another category by improving or worsening their social risk situation, and the majority of patients changed individual risk dimensions³⁵. Since patients can undergo a social risk transition over the course of rehabilitation, the 16 admission variables were used to predict the level of social risk at discharge from rehabilitation in a binary classification, where patients in the no social risk and mild social risk categories were considered as having negligible social risk (GREEN), whereas patients in the important and severe social risk categories were considered as having significant social risk (RED) (Fig. 1). In the 217-patient cohort, there were twice as many male patients as female patients; there was no way to control for this sex ratio in the admitted patients or any gender bias in the referral from acute treatment units. There was a similar imbalance for the social risk classification (Table 2); nearly twice as many patients had negligible social risk (GREEN) than significant social risk (RED) at discharge from the hospital.

Hold-out test set patient cohort

For validation purposes, demographic, diagnostic and data utilizing the EVSF questionnaire during the rehabilitation and reintegration of patients were recorded and collected at the Institut Guttmann (Barcelona, Spain) from 2020 through 2021 during a prospective study. The initial inclusion criteria for this cohort were the same as for the training set and consisted of adult patients 18–85 years of age at the time of stroke with an ischemic stroke diagnosis who were admitted within 3 weeks of the onset of symptoms, without any previous comorbidities leading to disability, and whose data was recorded within a week of admission and discharge. Similarly, exclusion criteria were any of the following: diagnosis of stroke in the context of another concomitant comorbidity (e.g., traumatic brain injury), a previous history of another disabling condition, patients with EVSF questionnaire performed more than 5 months post-injury, as well as more than 5 months stay at the rehabilitation hospital.

However, the difficulties caused by the Covid pandemic resulted in a reduced number of new patients being recruited for this prospective study for validation, only 25. Therefore, in addition to these 25 patients from the prospective study meeting inclusion criteria, an additional 92 patients, that were filtered out for model training due to exclusion criteria were added to the hold-out test set. The benefit of using patients with exclusion criteria for external validation is that it validates the utility and robustness of the models in a real-world clinical use case where patients at social risk may not meet inclusion criteria (specifically patients older than 85 years old, patients assessed more than 5 months post-stroke and patients with a longer length of stay at the rehabilitation hospital). Indeed, for these 92 patients, some had comorbidities and other disabling conditions in addition to the stroke diagnosis (data not shown), whereas Days Since Stroke and Length of Stay (Table 3) were significantly higher than other subjects, while other variables were similar to the rest of the cohort. Similarly, to the model training cohort, there is an imbalance in the hold-out test set dataset of the negligible (GREEN) and significant (RED) social risk patients, with twice as many GREEN than RED class patients. Table 3 shows the hold-out test set cohort information of the total 117 patients (25 from the prospective study with inclusion criteria plus 92 patients not used for training due to exclusion criteria). Data filtering according to inclusion and exclusion criteria is presented in Fig. 2a.

Machine learning analysis

The ML analysis has two goals. The first is to create a prediction model that at the point of admission to rehabilitation can accurately forecast the level of social risk that an individual will experience at discharge. The second is to understand what are the factors that drive high social risk. We approach this second goal by analyzing what predictors are important in driving the models’ predictions for the entire cohort, as well as for an individual patient. The framework is presented in Fig. 2.

To create a binary classifier of significant (RED) or negligible (GREEN) social risk level at discharge, we used Generalized Boosted Regression Models (GBM)⁴⁰ as implemented in R statistical software^41,42. In a boosted ensemble methodology, a strong prediction model is built by combining a set of (potentially weaker) component models. The component models are built through successive iterations of model building over the training dataset^43,44. At each iteration a new component model is trained, so as to pay particular attention to the errors the models already in the ensemble made on the training data, and is added to the ensemble. In contrast to other decision tree algorithms such as random forest, that generate ensembles of deeper independent trees, GBM generates sequential ensembles of shallow trees, improving performance incrementally (by reducing error in each iteration) instead of taking an average of all models⁴⁰. Although shallow trees may be weak predictive models, they are “boosted” to produce a powerful ensemble, making GBMs efficient and powerful, especially for classification problems⁴⁵.

Classification models can suffer from poor performance (poor model fit, or poor sensitivities and specificities) in the case that the target classes are imbalanced, such as the cohort in this study. We used four subsampling techniques to correct for this issue⁴⁶. These four techniques were: incorporating weights of the classes into the cost function, (i.e., giving equal weight to both classes in binary prediction) without resampling the data; randomly up sampling (with replacement) the minority class to equal the size of the majority class; randomly down sampling and dropping the majority class samples so that it equals the size of the minority class, which results in model training on a subset of the total data; and hybrid sampling using the synthetic minority oversampling technique (smote) methodology which down-samples the majority class and synthesizes new data samples in the minority class by interpolating between existing minority class data samples. To robustly model the dataset and evaluate the importance of the variables as predictors, models were trained on the original dataset (not correcting for class imbalance), using a weighted cost function (giving equal weight to both classes), and with the up sampling, down sampling and smote method described above being applied to the data (Fig. 2b)^47,48.

For each of these five experimental conditions we performed a 10-fold cross-validation process on the training data (further splitting the training data into 90% fold training set and 10% fold test set for each fold), where the subsampling techniques were applied inside each cross-validation fold on the 90% training subset of data, but not the 10% fold test set (Fig. 2a). Our motivation for using a cross-fold validation processes was first to find for each subsampling method the best hyperparameters of the GBM algorithm (discussed in more detail below), and second to create a baseline estimation of model performance that provides a comparator to contextualise the performance obtained on the hold-out test set. In terms of assessing model performance for each subsampling method, using a cross-validation methodology enables us to consider model performance across different training and validation sets and to report both a mean performance for each metric and a confidence interval across the 10 folds. To enable a more reliable estimation of model performance from cross-validation for each fold, the 10% test set, which was used to calculate performance metrics (confusion matrices and scores), was not adjusted using subsampling methods. For each subsampling method, once the hyperparameters for the GBM algorithm were fitted via cross-validation⁴⁹, a final model was trained on the entire training dataset (with the subsampling method applied to the full dataset and using the corresponding fitted hyperparameters) and assessed on the independent hold-out test set.

All models were generated using the R statistical software^41,42 using the GBM algorithm as implemented in the gbm package⁴⁶ with a k-fold (k = 10) cross-validation with 10 repetitions with resampling method, and sampling (’up’, ’down’, ’smote’) and class weighing applied using the implementation from caret library⁵⁰ (weights using the weights argument in the train function, and sampling methods using the sampling argument in the trainControl, ensuring that the subsampling step is correctly done inside of the cross-validation procedure as described in the paragraph above)⁵¹. Tuning hyperparameters ’shrinkage’ was held constant at a value of 0.1 and ’n.minobsinnode’ was held constant at a value of 10. The hyperparameter grid search explored via cross-validation included number of tress (n.trees = 50, 100, 150 and interaction depth = 1, 2, 3. The final hyperparameters used for the models were n.trees = 50, interaction.depth = 1 (except for smote where model interaction.depth = 2), shrinkage = 0.1 and n.minobsinnode = 10. The same random seeds were used for each model to ensure comparable results from the same cross-validation folds.

For each of the five experimental settings (original data, weighted cost function, up sampling, down sampling, and smote) a single “final” GBM model was trained using the best hyperparameters found for the data subsampling method via cross-validation by fitting the GBM model to all the training data. These final models were then validated by applying the independent 117 sample hold-out test set not used in the training in the predict() function to predict the social risk level for these samples (Fig. 2c). It is important to note that the subsampling techniques were only applied to the training portion of the data and the label distribution in the hold-out test set was not adjusted using these subsampling techniques. As a result, the model validation performance we report on the hold-out test set is indicative of model performance on a real data distribution. The validation on the hold-out test set gave a training-independent estimate of the real performance of the models, not only to compare the models but also to validate the models robustness on data samples outside of the training set inclusion criteria, thereby reflecting the variety of patients in real world scenarios.

For the cross-validation processes that were run for the resampling conditions (up sampling, down sampling, and smote) the resampling method was only applied to the training folds and not to the test set validation for each fold. Consequently, in all the cross-validation processes (irrespective of whether a resampling is applied to the training folds) each example in the training data is used exactly once as a test validation sample (i.e., it occurs in only one of the validation folds and only once in that validation fold). To construct the confusion matrices we recorded for each example in the training data whether the prediction returned for that example was a true-positive, false-positive, false-negative, of true-negative, and present the totals for each of these four outcome types across the 10-validation folds. We also report a range of performance metrics calculated across the 10-validation folds, including Accuracy, Recall, F1 score, Precision, and AUC (Recall and sensitivity are the same measure. We report both here as it is standard to report recall alongside precision and sensitivity alongside specificity)^52,53. Some of the metrics, such as accuracy, focus on overall performance, others—such as F1, Sensitivity/Recall, Precision—prioritise performance on the minority/positive class (RED, significant social risk), and others—Specificity—prioritise performance on the majority/negative (GREEN, negligible social risk) class. Metrics Accuracy, Precision, Recall and F1 were calculated directly from the values reported in the confusion matrices, whereas metrics AUC, Sensitivity, Specificity and Balanced Accuracy were calculated by averaging over the metrics results obtained in each of the 10-validation folds.

Prediction interpretation analysis

In addition to identifying the most important predictor variables for the models, it is also important to discern which variables contribute to the prediction of a particular class for an individual patient; i.e., it is crucial for clinicians to know which variables drive the social risk prediction outcome for that patient (Fig. 2d). To this effect, we applied a local interpretability analysis of a predictive model that was proposed by authors in the publication “A Unified Approach to Interpreting Model Predictions” called SHAP (SHapley Additive exPlanations)⁵⁴, a model-agnostic approach based on Lloyd Shapley ideas for interpreting predictions. Unlike other expandability methodologies such as Local Interpretable Model-agnostic Explanations (LIME), SHAP is based on a strong theoretical basis, provides a full interpretation of a prediction, rather than an explainability prediction model. Moreover, it allows for contrastive explanations; instead of comparing a prediction to the average prediction of the entire dataset, it can be compared to a subset, such as a target class, or even to a single data point.

In brief, Shapley values are calculated on the prediction of a data point using the marginal contribution of a variable to a given model, i.e., a Shapley value is the average marginal contribution of that variable across all possible sets of variables (predictors)⁵⁴. Therefore, variables with a high positive Shapley value are contributing more to the final prediction in contrast to negative values. In other words, Shapley values explain the distribution of the prediction results (classification) among the predictors. In the case that exact Shapley values are calculated for a single data point, they should add up to the difference between the prediction for that observation and the average predictions across the entire training set. Since SHAP calculates the average impact of adding a variable to the model by accounting for all possible subsets of the other variables, the computation time of exact Shapley values grows exponentially with the number of variables in the model. To improve computational efficiency, approximate Shapley value calculations consider the root-to-leaf paths in the trees that contain the target variable, and all the subsets within these paths. For this work we opted to use approximate Shapley values calculated using the Monte Carlo simulation approach described in⁵⁵. The approximated Shapley values (nsim = 50) considering all the data points in the training dataset were calculated for each of the models using package fastshap⁵⁶, indicating the contribution of each of the predictors to the negligible risk prediction (GREEN class) and the significant risk prediction (RED class).

Results

Confusion matrices as well as standard classification model metrics including Accuracy, Recall, F1 score, Precision, were generated and the Area Under the Receiver Operator Characteristic Curve (ROC-AUC) as the evaluation metric for the best performing models (Tables 4 and 5). Corresponding ROC-AUC curves are presented in Fig. 3. Tables 4 and 5 present for each of the five experimental conditions the confusion matrices and a set of performance metrics calculated from the test sets across the validation folds during the 10-fold cross-validation process run on the training data.

Table 4 Confusion matrices for all models calculated across the 10-fold cross-validation process run on training data (top-left true-pos., top-right false-pos., bottom-left false neg., bottom-left true-neg.).

Full size table

Table 5 Model training performance statistics (bold font indicates the highest score for a metric).

Full size table

Model performance

Looking at the performance metrics in Table 5 all the models have good mean performance across the validation sets in the cross-fold validation, accuracy in the range of 0.811–0.880, and AUC in the range 0.827–0.843, with the smote sampling model outperforming all other models followed by the up sampling model. For each model we calculated the 95% confidence interval around its mean accuracy using the accuracies obtained by the model across the folds as the population. These confidence intervals overlap (i.e., the lower end of the CI range for the best model is lower than the high end of the CI range for the weakest model) which suggests that at the 95% confidence level there is no statistical difference between the accuracies of the models. However, if we consider measures such as F1, balanced accuracy, sensitivity, specificity, we do see differences between the models. This is because these measures explicitly weigh for class distribution and/or performance on the minority class (in this instance significant social risk).

What is of importance to clinicians is to identify patients with significant social risk (RED class). Sensitivity (also known as recall) is the critical performance metric here because it measures out of all the patients with significant social risk (positive class) how many of these did the model predict as having significant social risk. On this metric, the original model performs much worse than the other models, and the weighted method model has the best performance. In fact, the original model has the lowest sensitivity and the highest specificity of all models, suggesting that the original model is over predicting the majority class. This difference in model performance on the significant social risk class is also evident in the confusion matrices in Table 4. The model trained on the original data, with no adjustment for class imbalance, either in terms of cost function class weighting or resampling, only correctly identified 37 out of the 62 individuals who had significant social risk, by comparison the other models correctly identified 53 or 52 of these cases. From among these other four approaches, the smote method has comparable recall/sensitivity with the others and has better precision (very few false positives) resulting in the best overall F1. The smote method also results in the best overall balanced accuracy.

Table 6 Confusion matrix for hold-out test set validation (top-left true-pos., top-right false-pos., bottom-left false neg., bottom-left true-neg.).

Full size table

Table 7 Model validation performance metrics for hold-out test set (bold font indicates the highest score for a metric).

Full size table

Model validation

Confusion matrices as well as standard classification metrics including AUC, Accuracy, Sensitivity and Specificity, were generated for the validation on the independent hold-out 117-patient dataset and are presented in Tables 6 and 7.

On the hold-out test set the model trained using the original data distribution obtains the highest overall accuracy, 0.8632. However, the overlap of the confidence intervals of all the models indicates that none of the accuracy scores are significantly different at the 95% confidence level. Comparing these accuracy scores with the ones obtained through the cross-validation process the accuracy of the original model on the test set is higher than the accuracy obtained in the cross-validation data (although this difference is not statistically significant at the 95% level). By comparison, the accuracies of the other models all drop between the cross-validation and hold-out test set. These drops are in the range of 0.06–0.09 (again none of these drops are statistically significant at the 95% level).

Regarding AUC all the models have similar performance, in the range of 0.881–0.909. Interestingly all the models obtained a higher AUC score on the test set than the models trained in the corresponding setting during cross-validation, these increase range between 0.05 and 0.07. Although hold-out test set AUC values were generally slightly higher than those obtained during cross-validation, the test set and cross-validation accuracies are very comparable indicating that all models had a similarly stable and robust performance in both settings. As in the cross-validation setting the identification of individuals with significant social risk is of primary performance, and consequently, the performance of the models in terms of sensitivity/recall is of particular concern. On the hold-out test set, sensitivity was higher than specificity for all models (except the model trained on the original data) suggesting that applying a class weighting or resampling method does produce models that are more sensitive to the significant social risk class. The model trained using the weighted cost function method obtains the joint highest score for sensitivity on the test set (0.9394), and the highest score for sensitivity in the cross-validation setting (0.8548), and the highest AUC on the test set (0.909). However, the precision of these weighted models is low. Indeed, the model trained on the original data distribution has the highest accuracy, specificity, precision, F1, and balanced accuracy on the test set. This suggests that there is a trade-off between sensitivity and specificity and our results do not indicate a clear winning method on this task.

Overall, our results do indicate that it is possible to train a model that can accurately predict social risk at discharge based on the information available at admission to rehabilitation. The mean accuracy of all the models based on cross-validation and the test set results is 0.8148 (min 0.7265, max 0.8802), the mean F1 is 0.7208 (min 0.6596, max 0.8), the mean sensitivity is 0.8331 (min 0.5968, max 0.9394), and the mean specificity is 0.8078 (min 0.6429, max 0.9161). Given this, in the following sections, we analyze these models to better understand what variables they use in making the predictions, so as to gain insight into the drivers of social risk.

Understanding the drivers and predictors of social risk

Beyond knowing which individuals are at social risk it is also useful to know what the general drivers of social risk are and what are the factors that are driving social risk in a particular individual. One way of understanding the relative importance of variables in terms of their contribution to social risk is to analyse the importance of the variables to a model’s output. This analysis can be done at two levels (Fig.2d). First, we can analyse the importance of a variable to a prediction model’s performance across the entire dataset. This type of analysis can inform our understanding of what variables are most important in general to social risk at the population level. The second analysis we can do is to analyse what variables are most important in determining the social risk for a particular individual. This analysis can support the design of a personalised programme of interventions to reduce the social risk for that individual. In this section, we present both of these types of analyses using the models developed and validated in the above sections. First, we present general variable importance, where predictors are ranked for each model, followed by model explainability, where we explain how particular variables drive social risk for an individual.

Variable importance

In addition to accurately identifying which patients are at social risk, it is also useful to understand what the primary factors are that drive social risk, as this informs the design of appropriate interventions to reduce the social risk. For this analysis for each of the 5 models that were run on the hold-out test set we calculated the relative influence of each predictor variable, i.e., whether that variable was selected to split on during the tree-building process, and how much the squared error (over all trees) improved (decreased) as a result. The intuition being that the more a variable contributes to the outputs of an accurate model the more important the variable is in terms of the phenomenon the model predicts. This variable importance analysis also acts as a sense check of model performance (i.e., if the model relies on variables that domain knowledge would indicate are not important this would suggest that the model may be overfitting to the specifics of the data sample used to create the model). Figure 4 presents the results of the variable importance analysis for each of our 5 models. This analysis revealed that although variable importance of all of the 16 predictors varied for each of the models, nevertheless, four predictors consistently retained their high importance value and rank order: Family Support and Economic Status, as well as with lesser importance but same ranking, Cohabitation and Days Since Stroke. Interestingly, the Sex variable did not have any notable importance for most of the models, which may reflect the imbalanced ratio of men to women.

Model explainability

In binary classification tasks, predictions may fall near the classification threshold, therefore, the idea is that by using SHAP for establishing the contribution of each variable to either the GREEN or the RED classes separately, rather than as one population, we may elucidate the drivers of social risk for predictions near the threshold (50%) more clearly. In other words, for a given individual we can calculate the Shapley values for the variables under the assumption that the prediction of the model would be GREEN classification, and then calculate a separate set of Shapley values for the variables under the assumption that the prediction of the model would be a RED classification. In Figs. 5 and 6, we present the variables contributing to social risk for the 5 models of two random individuals from the training dataset. Figure 5 presents the results of a patient predicted with a probability of \({\sim }\) 90% across all models to have negligible risk (GREEN) whereas Fig. 6, presents the results of a patient predicted with a probability of \({\sim }\) 60% across all models to have significant risk (RED). For the first patient predicted to have negligible social risk with very strong probability, both the approximate SHAP results in the GREEN and RED plots mainly had positive predictor contributions to the prediction, meaning that these variables contribute to negligible social risk, and should these same variables change, they would contribute to significant social risk prediction. In the original model the variables Family Support and Economic Status made the largest contributions to the model’s predictions in both the GREEN and RED classification scenarios. This was also true for all other models (except the down-sampling model). In the down-sampling model, Social Support was the second highest contributing variable after Family Support. Except for the smote model, Sex and other demographic variables, which were similar to the variable importance rankings, were not notably predictive of social risk for this patient. Overall, this suggests that Family Support and Economic Status were the main drivers of the (negligible) social risk for this patient, and that Social Support also contributed to this outcome for this patient.

For the second patient predicted to have significant social risk with albeit close to the classification threshold at \({\sim }\) 60%, both the approximate SHAP result in the GREEN and RED plots mainly had opposite predictor trends (i.e., the positive Shapley values drove the negligible risk prediction Fig. 6a, whereas the negative Shapley values did not contribute to the significant social risk prediction (Fig. 6b). For all the models, Family Support, Economic Status and Cohabitation had the greatest contributions to negligible risk (Fig. 6a, and the predictor Days Since Stroke also had an important contribution. However, the prediction for this patient was that of significant social risk (RED class), where depending on the model, mainly clinical variables contributed to this final prediction (Fig. 6b: for original and weighted method models the greatest predictors were Days Since Stroke and motor FIM and for up sampling and down sampling models the predictors were total FIM and motor FIM and for the smote method model the predictors were NIHSS and Age @ Stroke. Similarly to the other patient SHAP result (Fig. 5), except for the smote method model, demographic variables such as Sex had a negligible contribution to the class prediction.

Discussion

Successful rehabilitation includes identifying patients at risk of poor reintegration trajectories and minimizing patient fragility by reducing social risk^{4,6, 14}. From the clinical perspective, this entails adequately supporting patients in the areas that contribute to the individual’s overall social risk^11,12,13. A few observational studies have explored socioeconomic, environmental and demographic factors predictive of the discharge destination after rehabilitation (i.e., home, hospital, care home), and found that several factors, such as cohabitation with a caregiver, family support, and marital status, were influential in discharge planning and destination^{16,17,18,19, 57,58,59,60,61,62}. Although these studies identify risk factors for patient populations, they do not forecast the level of social risk nor the specific variables contributing to that risk for an individual patient, despite a great need for such a predictive tool⁶³. To the best of our knowledge, there are no publications predicting the level of an individual patient’s social risk based on an ad hoc questionnaire routinely integrated into the clinical practice for assessing social factors (EVSF).

To this effect, this study applied a GBM ML methodology to a 217-patient cohort of mostly young, male, ischemic stroke survivors who were evaluated for their functional independence (FIM assessment) as well as social risk (EVSF questionnaire), to forecast social risk upon discharge from the rehabilitation hospital. Due to the target class imbalance (approximately twice as many negligible social risk patients (GREEN) than significant risk patients (RED)), several binary classifiers were built, including an original model (not correcting for class imbalance), as well as a weighted model and three other models utilizing subsampling methodologies to balance classes. The performance of the models was accurate as well as very comparable on the basis of AUC, even for the original model, however, because the prediction of patients with significant risk is essential to clinicians, model sensitivity was one of the most crucial metrics (Table 5). Therefore, depending on the characteristics of the cohort used as the training data for social risk classification, the performance of the resulting models with and without subsampling techniques may vary and should be validated on independent hold-out test sets to assess their performance.

The population level variable importance from the predictive models, as well as individual predictor contribution to GREEN and RED class prediction using SHAP methodology, both mainly indicated Family Support and Economic Status, rather than demographic variables such as Sex, Educational Level or Civil Status, contributed to social risk prediction. Interestingly, our previous work utilizing visualizations to explore this data³⁵ found that Family Support was an important dimension for identifying patient risk at discharge, however, Economic Status did not emerge as an important variable; this predictor was only detected using ML (suggesting that the importance of this variable arises from its interaction with other variables). However, other predictor variables varied in rank and contribution to the predictions, which is likely due to the complex and multifactorial nature of social risk, where a combination, rather than a single risk factor may be increasing social risk for individuals. Moreover, in order to discern which variables contributed to social risk for a particular individual rather than the whole cohort, we utilized explainability metrics using approximate Shapley values in order to assign contribution values of each predictor variable to the overall prediction for two random patients: first strongly predicted to have negligible risk (\({\sim }\) 90% probability GREEN) and the second predicted to have significant risk (\({\sim }\) 60% probability RED). Interestingly, for the first patient (Fig. 5), adequate Family Support and Economic Status contribute to that patient’s high probability of negligible social risk and should these factors change, they would mainly contribute to the patient’s increased social risk (Fig. 5b), rather than demographic or clinical factors, such as the patient’s age or functional status. In contrast, for the second patient (Fig. 6), whose prediction was closer to the classification threshold, the significant social risk prediction was driven by the patient’s clinical outcomes, mainly functional status (motor FIM and total FIM), rather than sociofamiliar factors (Fig. 6b). The personalized intervention for this patient may include a longer length of stay at the hospital to improve clinical outcomes or a professional carer at home. This further supports the use of a predictive tool for personalized forecasting an individual patient’s social risk.

Strengths and limitations

Individualized intervention strategies for patients at social risk are necessary for successful rehabilitation for many individuals and planning an optimal length of rehabilitation^64,65. Indeed, optimizing the length of stay to the needs of the individual patient not only offers opportunity for minimizing social risk after discharge, but also the right amount of time for implementing a personalized intervention during rehabilitation^20,38. In the clinical setting, key challenges to developing individualized interventions include first identifying individual patients at social risk and identifying the specific factors that contribute to this risk (variable predictors)³⁵. The machine learning analysis in this work overcomes these two challenges by stratifying patients by their social risk, rank ordering the factors of social risk by the general importance to the prediction of risk, and finally, for specific individuals at risk we identify the most important predictors of risk for each of these individuals. This third contribution of identifying the specific factors of risk for an individual is, we believe, a particularly noteworthy novelty of this work. The practical application of the results can in turn optimize the decision-making process during reintegration, by specifically tailoring intervention strategies to the patient, to minimize their social risk, improve their outcomes and quality of life during reintegration. Another strength of the study design focusing on the period between admission and discharge from the rehabilitation hospital is minimization of confounder effects. Post-discharge, stroke survivors and carers are faced with immense emotional, health and social related challenges, such as fragility (patient falling into depression and retreating from social life) or a recurrent stroke and these additional factors can contribute to the improvement or deterioration of social risk dimensions^66,67. Furthermore, the GBM and SHAP methodologies implemented in this work are flexible, accurate and reliable and for small to medium datasets have relatively low complexities, enabling deployment with minimal resources and computational time. It is feasible that this analysis can be implemented as an app for clinicians.

The main limitation of this study stems from the size of the dataset^68,68,70. In rehabilitation hospitals, there is no way to control for either the number of stroke patients admitted in a given time period or the gender ratio (this cohort has twice as many male as female patients) because patients are referred from acute treatment units. Moreover, the Covid pandemic resulted in a reduced number of new subjects for the hold-out test set validation. However, identifying adequate sample sizes for predictive modeling is not a trivial task, because large training datasets do not automatically guarantee strong predictive models. The consensus between various guidelines for developing robust predictive models is applying a hold-out test set to validate models. Hold-out validation is regarded to be one of the most definitive and reliable strategies of evaluating the accuracy and overall performance of prediction models^{47,71, 72}. It is a robust safeguard against overfitting, because it tests the models against new (unseen) data samples to give a realistic model performance evaluation⁷³, such as the hold-out test set validation in this study. Nevertheless, additional external validations on different cohorts (subjects with hemorrhagic stroke, comorbidities, from other countries, etc.) would evaluate the generalizability and robustness of these models to other populations and geographical regions.

Conclusions

There is a lack of studies, especially including social aspects in young patients after stroke, despite the growing incidence of stroke in young populations^{26,65, 74}. In this study, for the training cohort of patients of the Catalonia region of Spain, consisting of mostly male, young ischemic stroke patients, despite the prevalence of individuals in negligible social risk class upon discharge from the hospital, ML modeling of this data revealed that predictors contributing to significant social risk were primarily Family Support and Economic Status, as well as Cohabitation and Days Since Stroke, with lesser contribution of other predictors, such as the FIM and specifically no notable contribution from the Sex of the patient. Model validation on an additional patient dataset that was not part of model training reflects the actual usage of the models by clinicians for patients that may not meet inclusion criteria and confirmed the utility of the models in the real-world clinical scenario. However, due to the target class imbalance, as well as an imbalance in the numbers of men and women in the cohort, leaves room for future studies with a larger and more balanced cohort. This highlights that social risk is a complex and multifactorial phenomenon that can vary significantly for an individual over the course of stroke rehabilitation and reintegration and so it is important to understand what can drive these variations and also to be able to predict this variation so that appropriate support measures can be put in place in a timely manner.

Data availability

The models developed and/or analysed during the current study are available in the Gitlab repository, https://gitlab.com/precise4q-tud.

Abbreviations

ADL:: Activities of daily living
EVSF:: Escala de Valoracion Socio Familiar
GBM:: Generalized boosted regression models
FIM:: Functional independence measure
ICF:: International classification of functioning, disability, and health
LIME:: Local interpretable model-agnostic explanations
ML:: Machine learning
NIHSS:: National institutes of health stroke scale
ROC-AUC:: Area under the receiver operator characteristic curve
SHAP:: Shapley additive explanations

References

Jellema, S. et al. What environmental factors influence resumption of valued activities post stroke: A systematic review of qualitative and quantitative findings. Clin. Rehab. 31(7), 936–947. https://doi.org/10.1177/0269215516671013 (2016).
Article Google Scholar
Trigg, R., Wood, V. A. & Hewer, R. L. Social reintegration after stroke: The first stages in the development of the Subjective Index of Physical and Social Outcome (SIPSO). Clin. Rehab. 13(4), 341–353. https://doi.org/10.1191/026921599676390259 (1999).
Article CAS Google Scholar
Wood-Dauphinee, S. Opzoomer, M. Williams, J. Marchand, B. & Spitzer, W. Assessment of global function: The reintegration to normal living index. Arch. Phys. Med. Rehab. 69(8), 583–590. http://europepmc.org/abstract/MED/3408328 (1988).
Parvaneh, S. & Cocks, E. Framework for describing community integration for people with acquired brain injury. Aust. Occup. Ther. J. 59(2), 131–137. https://doi.org/10.1111/j.1440-1630.2012.01001.x (2012).
Article PubMed Google Scholar
Tough, H. & Siegrist, J. & Fekete, C.,. Social relationships, mental health and wellbeing in physical disability: A systematic review. BMC Public Health. 17(1), 1. https://doi.org/10.1186/s12889-017-4308-6 (2017).
Article Google Scholar
Walsh, M. E., Galvin, R., Loughnane, C., Macey, C. & Horgan, N. F. Factors associated with community reintegration in the first year after stroke: A qualitative meta-synthesis. Disabil. Rehabil. 37(18), 1599–1608. https://doi.org/10.3109/09638288.2014.974834 (2014).
Article PubMed Google Scholar
Berkman, LF. Kawachi, I. & Glymour, MM. editors. Social Epidemiology (Oxford University Press, 2014). https://doi.org/10.1093/med/9780195377903.001.0001.
Wood-Dauphinee, S. & Williams, J. I. Reintegration to normal living as a proxy to quality of life. J. Chronic Dis. 40(6), 491–499. https://doi.org/10.1016/0021-9681(87)90005-1 (1987).
Article CAS PubMed Google Scholar
Holt-Lunstad, J., Smith, T. B. & Layton, J. B. Social relationships and mortality risk: A meta-analytic review. PLoS Med. 7(7), e1000316. https://doi.org/10.1371/journal.pmed.1000316 (2010).
Article PubMed PubMed Central Google Scholar
Marcheschi, E., Koch, L. V., Pessah-Rasmussen, H. & Elf, M. Home setting after stroke, facilitators and barriers: A systematic literature review. Health Soc. Care Community. 26(4), e451–e459. https://doi.org/10.1111/hsc.12518 (2017).
Article PubMed Google Scholar
Elloker, T., Rhoda, A., Arowoiya, A. & Lawal, I. U. Factors predicting community participation in patients living with stroke, in the Western Cape. S. Afr. Disab. Rehab. 41(22), 2640–2647. https://doi.org/10.1080/09638288.2018.1473509 (2018).
Article Google Scholar
Teoh, V., Sims, J. & Milgrom, J. Psychosocial Predictors of Quality of Life in a Sample of Community-Dwelling Stroke Survivors: A Longitudinal Study. Top. Stroke Rehabil. 16(2), 157–166. https://doi.org/10.1310/tsr1602-157 (2009).
Article PubMed Google Scholar
White, J. et al. Predictors of health-related quality of life in community-dwelling stroke survivors: A cohort study. Fam. Pract. 33(4), 382–387. https://doi.org/10.1093/fampra/cmw011 (2016).
Article PubMed Google Scholar
Zawawi, N. S. M., Aziz, N. A., Fisher, R., Ahmad, K. & Walker, M. F. The unmet needs of stroke survivors and stroke caregivers: A systematic narrative review. J. Stroke Cerebrovasc. Dis. 29(8), 104875. https://doi.org/10.1016/j.jstrokecerebrovasdis.2020.104875 (2020).
Article CAS PubMed Google Scholar
Donkor, E. S. Stroke in the 21st Century: A snapshot of the burden, epidemiology, and quality of life. Stroke Res. Treatm. 2018, 1–10. https://doi.org/10.1155/2018/3238165 (2018).
Article Google Scholar
Sabartés, O. et al. Factores predictivos de retorno al domicilio en pacientes ancianos hospitalizados. Anales De Medicina Interna. 16, 1 (1999).
Google Scholar
Cahuana-Cuentas, M., Gallegos, W. L. A., Rivera-Calcina, R. & Canaza, K. D. C. Influencia de la familia sobre la resiliencia en personas con discapacidad física y sensorial de Arequipa, Perú. Revista chilena de neuro-psiquiatría. 57(2), 118–128. https://doi.org/10.4067/s0717-92272019000200118 (2019).
Article Google Scholar
Ramírez-Duque, N. et al. Características, clínicas, funcionales, mentales y sociales de pacientes pluripatológicos: Estudio prospectivo durante un año en Atención Primaria. Revista Clínica Española 208(1), 4–11. https://doi.org/10.1157/13115000 (2008).
Article PubMed Google Scholar
Varela-Pinedo, L., Chávez-Jimeno, H., Tello-Rodriguez, T., Ortiz-Saavedra, P., Gálvez-Cano, M., Casas-Vasquez, P. et al. Perfil clínico, funcional y sociofamiliar del adulto mayor de la comunidad en un distrito de Lima, Perú. Revista Peruana de Medicina Experimental y Salud Publica. 32(4):709. https://doi.org/10.17843/rpmesp.2015.324.1762 (2015).
García-Rudolph, A. et al. Predicting length of stay in patients admitted to stroke rehabilitation with severe and moderate levels of functional impairments. Medicine 99(43), e22423. https://doi.org/10.1097/md.0000000000022423 (2020).
Article PubMed PubMed Central Google Scholar
García-Rudolph, A., Cegarra, B., Saurí, J., Kelleher, J. D., Cisek, K., Frey, D. et al. Intersection of resilience and COVID-19: Structural topic modelling and word embeddings from reddit titles (2023).
García-Rudolph, A. et al. The impact of coronavirus disease 2019 on emotional and behavioral stress of informal family caregivers of individuals with stroke or traumatic brain injury at chronic phase living in a Mediterranean setting. Brain Behav. 12(1), 1. https://doi.org/10.1002/brb3.2440 (2021).
Article CAS Google Scholar
García-Rudolph, A. et al. The impact of COVID-19 on home, social, and productivity integration of people with chronic traumatic brain injury or stroke living in the community. Medicine 101(8), e28695. https://doi.org/10.1097/md.0000000000028695 (2022).
Article PubMed PubMed Central Google Scholar
Shavelle, R. M., Brooks, J. C., Strauss, D. J. & Turner-Stokes, L. Life Expectancy after Stroke Based On Age, Sex, and Rankin Grade of Disability: A Synthesis. J. Stroke Cerebrovasc. Dis. 28(12), 104450. https://doi.org/10.1016/j.jstrokecerebrovasdis.2019.104450 (2019).
Article PubMed Google Scholar
GBD 2016 Lifetime Risk of Stroke Collaborators, Feigin, V. L., Nguyen, G., Cercy, K., Johnson, C. O., Alam, T., et al. Global, regional, and country-specific lifetime risks of stroke, 1990 and 2016. N. Engl. J. Med. 379(25), 2429–2437. https://europepmc.org/articles/PMC6247346 (2018).
Amaya Pascasio, L. et al. Stroke in young adults in Spain: Epidemiology and risk factors by age. J. Pers. Med. 13(5), 768. https://doi.org/10.3390/jpm13050768 (2023).
Article PubMed PubMed Central Google Scholar
Von Elm, E. et al. Strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. BMJ. 335(7624), 806–808. https://doi.org/10.1136/bmj.39335.541782.ad (2007).
Article Google Scholar
Maaijwee, N. A. M. M., Rutten-Jacobs, L. C. A., Schaapsmeerders, P., Dijk, E. J. & Leeuw, F. E. Ischaemic stroke in young adults: Risk factors and long-term consequences. Nat. Rev. Neurol. 10(6), 315–325. https://doi.org/10.1038/nrneurol.2014.72 (2014).
Article PubMed Google Scholar
Stack, C. A. & Cole, J. W. Ischemic stroke in young adults. Curr. Opin. Cardiol. 33(6), 594–604. https://doi.org/10.1097/hco.0000000000000564 (2018).
Article PubMed Google Scholar
García-Rudolph, A. et al. Long-term trajectories of community integration: Identification, characterization, and prediction using inpatient rehabilitation variables. Top. Stroke Rehab. 1, 1–13. https://doi.org/10.1080/10749357.2023.2188756 (2023).
Article Google Scholar
Matos, I. et al. Investigating predictors of community integration in individuals after stroke in a residential setting: A longitutinal study. PLoS ONE 15(5), e0233015. https://doi.org/10.1371/journal.pone.0233015 (2020).
Article CAS PubMed PubMed Central Google Scholar
García-Rudolph, A. et al. Long-term trajectories of motor functional independence after ischemic stroke in young adults: Identification and characterization using inpatient baseline assessments. NeuroRehabilitation 50(4), 453–465. https://doi.org/10.3233/nre-210293 (2022).
Article PubMed Google Scholar
Martinez, H. B., Cisek, K., Garcia-Rudolph, A., Kelleher, J. D. & Hines, A. Understanding and Predicting Cognitive Improvement of Young Adults in Ischemic Stroke Rehabilitation Therapy. Front. Neurol. 13, 1. https://doi.org/10.3389/fneur.2022.886477 (2022).
Article Google Scholar
Current Topics in Technology-Enabled Stroke Rehabilitation and Reintegration: A Scoping Review and Content Analysis IEEE Transactions on Neural Systems and Rehabilitation Engineering 313341–3352 https://doi.org/10.1109/TNSRE.2023.3304758 (2023).
Cisek, K., Nguyen, T. N. Q., García-Rudolph, A., Saurí, J., & Kelleher, J. D. Understanding social risk variation across reintegration of post-ischemic stroke patients. In: Cerebral Ischemia, pp. 201–220 (Exon Publications, 2021). https://doi.org/10.36255/exonpublications.cerebralischemia.2021.reintegration.
Amarilla-Donoso, F. J. et al. Quality of life after hip fracture: A 12-month prospective study. PeerJ 8, e9215. https://doi.org/10.7717/peerj.9215 (2020).
Article PubMed PubMed Central Google Scholar
García González, J. et al. An evaluation of the feasibility and validity of a scale of social assessment of the elderly. Atencion Primaria. 23(7), 434–440 (1999).
PubMed Google Scholar
Pellico-López, A. et al. Cost of stay and characteristics of patients with stroke and delayed discharge for non-clinical reasons. Sci. Rep. 12(1), 1. https://doi.org/10.1038/s41598-022-14502-5 (2022).
Article CAS Google Scholar
Pérez, L. M. et al. Rehabilitation profiles of older adult stroke survivors admitted to intermediate care units: A multi-centre study. PLoS ONE 11(11), e0166304. https://doi.org/10.1371/journal.pone.0166304 (2016).
Article CAS PubMed PubMed Central Google Scholar
Friedman, J. H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 29(5), 1189–1232. http://www.jstor.org/stable/2699986 (2001).
Manning, C. D. & Schütze, H. Foundations of Statistical Natural Language Processing (MIT Press, Cambridge, MA, USA, 1999).
Google Scholar
Team RC. R: A Language and environment for statistical computing. Vienna, Austria: CRAN; https://www.R-project.org/ (2021).
Kim, C. & Park, T. Predicting determinants of lifelong learning intention using gradient boosting machine (GBM) with grid search. Sustainability. 14(9), 5256. https://doi.org/10.3390/su14095256 (2022).
Article Google Scholar
Natekin, A. & Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. 7, 1. https://doi.org/10.3389/fnbot.2013.00021 (2013).
Article Google Scholar
Friedman, J. H. Stochastic gradient boosting. Comput. Stat. Data Anal. 38(4), 367–378. https://doi.org/10.1016/s0167-9473(01)00065-2 (2002).
Article MathSciNet Google Scholar
Ridgeway, G. Generalized Boosted Models: A guide to the gbm package. CRAN; R package version 1.1. https://CRAN.R-project.org/package=gbm (2007).
Hastie, T. Tibshirani, R. & Friedman, J. The elements of statistical learning (Springer, New York). https://doi.org/10.1007/978-0-387-84858-7 (2009).
Lakshmanan, V., Robinson, S., & Munn, M. Machine learning design patterns. O’Reilly Media, Inc. (2020).
Schratz, P., Muenchow, J., Iturritxa, E., Richter, J. & Brenning, A. Hyperparameter tuning and performance assessment of statistical and machine-learning algorithms using spatial data. Ecol. Model. 406, 109–120. https://doi.org/10.1016/j.ecolmodel.2019.06.002 (2019).
Article Google Scholar
Kuhn, M. Building predictive models in R using the caret package. J. Stat. Softw. 28(5), 1. https://doi.org/10.18637/jss.v028.i05 (2008).
Kuhn, M., & Johnson, K. Applied predictive modeling (Springer, New York, 2013). https://doi.org/10.1007/978-1-4614-6849-3.
Powers, D. Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63. https://doi.org/10.9735/2229-3981 (2011).
Article MathSciNet Google Scholar
Tharwat, A. Classification assessment methods. Appl. Comput. Inf. 17(1), 168–192. https://doi.org/10.1016/j.aci.2018.08.003 (2020).
Article Google Scholar
Lundberg, S. M., & Lee, S. I. A unified approach to interpreting model predictions. In: Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S. et al., editors. Advances in Neural Information Processing Systems. vol. 30, pp. 1–10 (Curran Associates, Inc., 2017). https://proceedings.neurips.cc/paper_files/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf.
Štrumbelj, E. & Kononenko, I. Explaining prediction models and individual predictions with feature contributions. Knowl. Inf. Syst. 41(3), 647–665. https://doi.org/10.1007/s10115-013-0679-x (2013).
Article Google Scholar
Greenwell, B. Package ‘fastshap’. CRAN; R package version 0.0.7. https://CRAN.R-project.org/package=fastshap (2020).
Agarwal, V., McRae, M. P., Bhardwaj, A. & Teasell, R. W. A model to aid in the prediction of discharge location for stroke rehabilitation patients. Arch. Phys. Med. Rehabil. 84(11), 1703–1709. https://doi.org/10.1053/s0003-9993(03)00362-9 (2003).
Article PubMed Google Scholar
Everink, I. H. J., van Haastregt, J. C. M., van Hoof, S. J. M., Schols, J. M. G. A. & Kempen, G. I. J. M. Factors influencing home discharge after inpatient rehabilitation of older patients: A systematic review. BMC Geriatr. 16(1), 1. https://doi.org/10.1186/s12877-016-0187-4 (2016).
Article Google Scholar
Nguyen, V. Q. C. et al. Factors associated with discharge to home versus discharge to institutional care after inpatient stroke rehabilitation. Arch. Phys. Med. Rehabil. 96(7), 1297–1303. https://doi.org/10.1016/j.apmr.2015.03.007 (2015).
Article PubMed Google Scholar
Pereira, S. et al. Discharge destination of individuals with severe stroke undergoing rehabilitation: A predictive model. Disabil. Rehabil. 36(9), 727–731. https://doi.org/10.3109/09638288.2014.902510 (2014).
Article PubMed Google Scholar
Pohl, P. S., Billinger, S. A., Lentz, A. & Gajewski, B. The role of patient demographics and clinical presentation in predicting discharge placement after inpatient stroke rehabilitation: Analysis of a large, US data base. Disab. Rehab. 35(12), 990–994. https://doi.org/10.3109/09638288.2012.717587 (2012).
Article Google Scholar
Wee, J. Y., Wong, H. & Palepu, A. Validation of the Berg balance scale as a predictor of length of stay and discharge destination in stroke rehabilitation. Arch. Phys. Med. Rehabil. 84(5), 731–735. https://doi.org/10.1016/s0003-9993(02)04940-7 (2003).
Article PubMed Google Scholar
Wasserman, A., Thiessen, M. & Pooyania, S. Factors associated with community versus personal care home discharges after inpatient stroke rehabilitation: The need for a pre-admission predictive model. Top. Stroke Rehabil. 27(3), 173–180. https://doi.org/10.1080/10749357.2019.1682369 (2019).
Article PubMed Google Scholar
Reeves, M. J. et al. Improving transitions in acute stroke patients discharged to home: the Michigan stroke transitions trial (MISTT) protocol. BMC Neurol. 17(1), 1. https://doi.org/10.1186/s12883-017-0895-1 (2017).
Article Google Scholar
Lai, W. et al. Clinical and psychosocial predictors of exceeding target length of stay during inpatient stroke rehabilitation. Top. Stroke Rehabil. 24(7), 510–516. https://doi.org/10.1080/10749357.2017.1325589 (2017).
Article PubMed Google Scholar
Ezekiel, L. et al. Factors associated with participation in life situations for adults with stroke: A systematic review. Arch. Phys. Med. Rehabil. 100(5), 945–955. https://doi.org/10.1016/j.apmr.2018.06.017 (2019).
Article PubMed Google Scholar
Wood, J. P., Connelly, D. M. & Maly, M. R. ‘Getting back to real living’: A qualitative study of the process of community reintegration after stroke. Clin. Rehabil. 24(11), 1045–1056. https://doi.org/10.1177/0269215510375901 (2010).
Article PubMed Google Scholar
Button, K. S. et al. Power failure: Why small sample size undermines the reliability of neuroscience. Nat. Rev. Neurosci. 14(5), 365–376. https://doi.org/10.1038/nrn3475 (2013).
Article CAS PubMed Google Scholar
Jimenez-Mesa, C. et al. A non-parametric statistical inference framework for Deep Learning in current neuroimaging. Inf. Fusion. 91, 598–611. https://doi.org/10.1016/j.inffus.2022.11.007 (2023).
Article Google Scholar
Varoquaux, G. Cross-validation failure: Small sample sizes lead to large error bars. Neuroimage 180, 68–77. https://doi.org/10.1016/j.neuroimage.2017.06.061 (2018).
Article PubMed Google Scholar
James, G., Witten, D., Hastie, T., & Tibshirani, R. An Introduction to Statistical Learning (Springer, New York, 2013). https://doi.org/10.1007/978-1-4614-7138-7.
Refaeilzadeh, P. Tang, L. & Liu, H. In: Cross-Validation, pp. 532–538 (Springer, US, 2009). https://doi.org/10.1007/978-0-387-39940-9_565.
Hawkins, D. M. The Problem of Overfitting. J. Chem. Inf. Comput. Sci. 44(1), 1–12. https://doi.org/10.1021/ci0342472 (2003).
Article CAS Google Scholar
Boot, E. et al. Ischaemic stroke in young adults: A global perspective. J. Neurol. Neurosurg. Psychiatry. 91(4), 411–417. https://doi.org/10.1136/jnnp-2019-322424 (2020).
Article PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank the PRECISE4Q consortium and the Reintegration phase project working group, RES-Q+ consortium as well as the STRATIF-AI consortium.

Funding

This research was supported by the PRECISE4Q project, funded through the European Union’s Horizon 2020 research and innovation program under grant agreement No. 777107, the RES-Q+ Horizon Europe RIA, grant No. 101057603, the STRATIF-AI Horizon Europe RIA, grant No. 101080875 and the ADAPT Research Centre for AI-Driven Digital Content Technology, which is funded by Science Foundation Ireland through the SFI Research Centres Programme and is co-funded under the European Regional Development Fund (ERDF) through Grant 13/RC/2106_P2.

Author information

Authors and Affiliations

AIDHM, Artificial Intelligence in Digital Health and Medicine, Technological University Dublin, Dublin, Ireland
Katryna K. Cisek & Thi Nguyet Que Nguyen
Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Bellaterra, Spain
Alejandro Garcia-Rudolph & Joan Saurí
Fundació Institute d’Investigació en Ciències de la Salut Germans Trias i Pujol, Badalona, Spain
Alejandro Garcia-Rudolph & Joan Saurí
Institut Guttmann Hospital de Neurorehabilitacio, Badalona, Spain
Alejandro Garcia-Rudolph & Joan Saurí
STRATIF-AI, Continuous stratification for improved prevention, treatment, and rehabilitation of stroke patients using digital twins and AI, Horizon Europe Project, Linköping, Sweden
Alejandro Garcia-Rudolph & John D. Kelleher
School of Computer Science, University College Dublin, Dublin, Ireland
Helard Becerra Martinez & Andrew Hines
ADAPT Research Centre, School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland
John D. Kelleher
RESQ+, Comprehensive solutions of healthcare improvement based on the global Registry of Stroke Care Quality, Horizon Europe Project, Brno, Czech Republic
Katryna K. Cisek & John D. Kelleher

Authors

Katryna K. Cisek
View author publications
You can also search for this author in PubMed Google Scholar
Thi Nguyet Que Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Garcia-Rudolph
View author publications
You can also search for this author in PubMed Google Scholar
Joan Saurí
View author publications
You can also search for this author in PubMed Google Scholar
Helard Becerra Martinez
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Hines
View author publications
You can also search for this author in PubMed Google Scholar
John D. Kelleher
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

KKC and JDK designed the study. TNQN and KKC integrated and filtered the data; KKC implemented and conducted the ML analysis, JDK aided in the ML analysis, and HBM aided in SHAP analysis. TNQN and KKC co-designed and created the figures. JS and AG contributed the data, clinical understanding and implications. All authors contributed to writing the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Katryna K. Cisek.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cisek, K.K., Nguyen, T.N.Q., Garcia-Rudolph, A. et al. Predictors of social risk for post-ischemic stroke reintegration. Sci Rep 14, 10110 (2024). https://doi.org/10.1038/s41598-024-60507-7

Download citation

Received: 22 July 2023
Accepted: 24 April 2024
Published: 02 May 2024
DOI: https://doi.org/10.1038/s41598-024-60507-7

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Social frailty as a predictor of all-cause mortality and functional disability: a systematic review and meta-analysis

Machine learning to predict mortality after rehabilitation among patients with severe stroke

Machine learning predicts clinically significant health related quality of life improvement after sensorimotor rehabilitation interventions in chronic stroke

Introduction

Methodology

Social risk questionnaire

Training set patient cohort

Hold-out test set patient cohort

Machine learning analysis

Prediction interpretation analysis

Results

Model performance

Model validation

Understanding the drivers and predictors of social risk

Variable importance

Model explainability

Discussion

Strengths and limitations

Conclusions

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Comments

Search

Quick links