Umeå University's logo

umu.sePublications
Change search
Link to record
Permanent link

Direct link
Alternative names
Publications (2 of 2) Show all publications
Otieno, J. A., Häggström, J., Darehed, D. & Eriksson, M. (2024). Developing machine learning models to predict multi-class functional outcomes and death three months after stroke in Sweden. PLOS ONE, 19(5), Article ID e0303287.
Open this publication in new window or tab >>Developing machine learning models to predict multi-class functional outcomes and death three months after stroke in Sweden
2024 (English)In: PLOS ONE, E-ISSN 1932-6203, Vol. 19, no 5, article id e0303287Article in journal (Refereed) Published
Abstract [en]

Globally, stroke is the third-leading cause of mortality and disability combined, and one of the costliest diseases in society. More accurate predictions of stroke outcomes can guide healthcare organizations in allocating appropriate resources to improve care and reduce both the economic and social burden of the disease. We aim to develop and evaluate the performance and explainability of three supervised machine learning models and the traditional multinomial logistic regression (mLR) in predicting functional dependence and death three months after stroke, using routinely-collected data. This prognostic study included adult patients, registered in the Swedish Stroke Registry (Riksstroke) from 2015 to 2020. Riksstroke contains information on stroke care and outcomes among patients treated in hospitals in Sweden. Prognostic factors (features) included demographic characteristics, pre-stroke functional status, cardiovascular risk factors, medications, acute care, stroke type, and severity. The outcome was measured using the modified Rankin Scale at three months after stroke (a scale of 0-2 indicates independent, 3-5 dependent, and 6 dead). Outcome prediction models included support vector machines, artificial neural networks (ANN), eXtreme Gradient Boosting (XGBoost), and mLR. The models were trained and evaluated on 75% and 25% of the dataset, respectively. Model predictions were explained using SHAP values. The study included 102,135 patients (85.8% ischemic stroke, 53.3% male, mean age 75.8 years, and median NIHSS of 3). All models demonstrated similar overall accuracy (69%-70%). The ANN and XGBoost models performed significantly better than the mLR in classifying dependence with F1-scores of 0.603 (95% CI; 0.594-0.611) and 0.577 (95% CI; 0.568-0.586), versus 0.544 (95% CI; 0.545-0.563) for the mLR model. The factors that contributed most to the predictions were expectedly similar in the models, based on clinical knowledge. Our ANN and XGBoost models showed a modest improvement in prediction performance and explainability compared to mLR using routinely-collected data. Their improved ability to predict functional dependence may be of particular importance for the planning and organization of acute stroke care and rehabilitation.

Place, publisher, year, edition, pages
Public Library of Science (PLoS), 2024
National Category
Probability Theory and Statistics Cardiology and Cardiovascular Disease
Research subject
Statistics
Identifiers
urn:nbn:se:umu:diva-224459 (URN)10.1371/journal.pone.0303287 (DOI)001245059300043 ()38739586 (PubMedID)2-s2.0-85192913786 (Scopus ID)
Available from: 2024-05-17 Created: 2024-05-17 Last updated: 2025-04-24Bibliographically approved
Wang, W., Otieno, J. A., Eriksson, M., Wolfe, C. D., Curcin, V. & Bray, B. D. (2023). Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden.. BMJ Open, 13(11), Article ID e069811.
Open this publication in new window or tab >>Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden.
Show others...
2023 (English)In: BMJ Open, E-ISSN 2044-6055, Vol. 13, no 11, article id e069811Article in journal (Refereed) Published
Abstract [en]

OBJECTIVES: We aimed to develop and externally validate a generalisable risk prediction model for 30-day stroke mortality suitable for supporting quality improvement analytics in stroke care using large nationwide stroke registers in the UK and Sweden.

DESIGN: Registry-based cohort study.

SETTING: Stroke registries including the Sentinel Stroke National Audit Programme (SSNAP) in England, Wales and Northern Ireland (2013-2019) and the national Swedish stroke register (Riksstroke 2015-2020).

PARTICIPANTS AND METHODS: Data from SSNAP were used for developing and temporally validating the model, and data from Riksstroke were used for external validation. Models were developed with the variables available in both registries using logistic regression (LR), LR with elastic net and interaction terms and eXtreme Gradient Boosting (XGBoost). Performances were evaluated with discrimination, calibration and decision curves.

OUTCOME MEASURES: The primary outcome was all-cause 30-day in-hospital mortality after stroke.

RESULTS: In total, 488 497 patients who had a stroke with 12.4% 30-day in-hospital mortality were used for developing and temporally validating the model in the UK. A total of 128 360 patients who had a stroke with 10.8% 30-day in-hospital mortality and 13.1% all mortality were used for external validation in Sweden. In the SSNAP temporal validation set, the final XGBoost model achieved the highest area under the receiver operating characteristic curve (AUC) (0.852 (95% CI 0.848 to 0.855)) and was well calibrated. The performances on the external validation in Riksstroke were as good and achieved AUC at 0.861 (95% CI 0.858 to 0.865) for in-hospital mortality. For Riksstroke, the models slightly overestimated the risk for in-hospital mortality, while they were better calibrated at the risk for all mortality.

CONCLUSION: The risk prediction model was accurate and externally validated using high quality registry data. This is potentially suitable to be deployed as part of quality improvement analytics in stroke care to enable the fair comparison of stroke mortality outcomes across hospitals and health systems across countries.

Place, publisher, year, edition, pages
BMJ Publishing Group Ltd, 2023
Keywords
neurology, quality in health care, statistics & research methods, stroke
National Category
Probability Theory and Statistics Cardiology and Cardiovascular Disease Public Health, Global Health and Social Medicine Neurology
Research subject
Statistics; Neurology
Identifiers
urn:nbn:se:umu:diva-216802 (URN)10.1136/bmjopen-2022-069811 (DOI)001107379900057 ()37968001 (PubMedID)2-s2.0-85177074929 (Scopus ID)
Available from: 2023-11-16 Created: 2023-11-16 Last updated: 2025-04-24Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0009-0006-1030-4237

Search in DiVA

Show all publications