Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden.
Department of Population Health Sciences, King's College London, London, UK.ORCID iD: 0000-0002-1879-7332
Umeå University, Faculty of Social Sciences, Umeå School of Business and Economics (USBE), Statistics.ORCID iD: 0009-0006-1030-4237
Umeå University, Faculty of Social Sciences, Umeå School of Business and Economics (USBE), Statistics.ORCID iD: 0000-0003-3298-1555
Department of Population Health Sciences, King's College London, London, UK.
Show others and affiliations
2023 (English)In: BMJ Open, E-ISSN 2044-6055, Vol. 13, no 11, article id e069811Article in journal (Refereed) Published
Abstract [en]

OBJECTIVES: We aimed to develop and externally validate a generalisable risk prediction model for 30-day stroke mortality suitable for supporting quality improvement analytics in stroke care using large nationwide stroke registers in the UK and Sweden.

DESIGN: Registry-based cohort study.

SETTING: Stroke registries including the Sentinel Stroke National Audit Programme (SSNAP) in England, Wales and Northern Ireland (2013-2019) and the national Swedish stroke register (Riksstroke 2015-2020).

PARTICIPANTS AND METHODS: Data from SSNAP were used for developing and temporally validating the model, and data from Riksstroke were used for external validation. Models were developed with the variables available in both registries using logistic regression (LR), LR with elastic net and interaction terms and eXtreme Gradient Boosting (XGBoost). Performances were evaluated with discrimination, calibration and decision curves.

OUTCOME MEASURES: The primary outcome was all-cause 30-day in-hospital mortality after stroke.

RESULTS: In total, 488 497 patients who had a stroke with 12.4% 30-day in-hospital mortality were used for developing and temporally validating the model in the UK. A total of 128 360 patients who had a stroke with 10.8% 30-day in-hospital mortality and 13.1% all mortality were used for external validation in Sweden. In the SSNAP temporal validation set, the final XGBoost model achieved the highest area under the receiver operating characteristic curve (AUC) (0.852 (95% CI 0.848 to 0.855)) and was well calibrated. The performances on the external validation in Riksstroke were as good and achieved AUC at 0.861 (95% CI 0.858 to 0.865) for in-hospital mortality. For Riksstroke, the models slightly overestimated the risk for in-hospital mortality, while they were better calibrated at the risk for all mortality.

CONCLUSION: The risk prediction model was accurate and externally validated using high quality registry data. This is potentially suitable to be deployed as part of quality improvement analytics in stroke care to enable the fair comparison of stroke mortality outcomes across hospitals and health systems across countries.

Place, publisher, year, edition, pages
BMJ Publishing Group Ltd, 2023. Vol. 13, no 11, article id e069811
Keywords [en]
neurology, quality in health care, statistics & research methods, stroke
National Category
Probability Theory and Statistics Cardiology and Cardiovascular Disease Public Health, Global Health and Social Medicine Neurology
Research subject
Statistics; Neurology
Identifiers
URN: urn:nbn:se:umu:diva-216802DOI: 10.1136/bmjopen-2022-069811ISI: 001107379900057PubMedID: 37968001Scopus ID: 2-s2.0-85177074929OAI: oai:DiVA.org:umu-216802DiVA, id: diva2:1812563
Available from: 2023-11-16 Created: 2023-11-16 Last updated: 2026-02-23Bibliographically approved
In thesis
1. Machine learning for predicting diverse stroke outcomes: binary, multi-class, and time-to-event
Open this publication in new window or tab >>Machine learning for predicting diverse stroke outcomes: binary, multi-class, and time-to-event
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Maskininlärning för prediktion av olika utfall efter stroke : binära, ordinala och tid-till-händelse
Abstract [en]

This thesis aimed to improve the clinical prediction of short- and long-term outcomes after stroke by developing, evaluating, and comparing classical statistical and modern machine learning models. Using large, high-quality national stroke registries, specifically the Swedish national stroke register (Riksstroke) and the Sentinel Stroke National Audit Programme (SSNAP), the thesis investigates whether advanced machine learning models offer added value in predicting clinical outcomes compared to traditional models and importantly in which contexts such improvements occur and are clinically meaningful. It addresses several key gaps in the literature, including the lack of external validation for many prediction models across different health systems, the limited research on predicting multi-class functional outcomes, few comprehensive simulation-based evaluations of prediction models under different realistic data conditions, and the need for multiple-horizon evaluations of competing risk prediction models to support fair model selection.

The first paper of the thesis evaluated models for predicting shortterm mortality after stroke using large national registries from two European countries (Riksstroke and SSNAP). The results showed that machine learning models offered only modest performance gains over well-specified logistic regression models, demonstrating that traditional approaches remain competitive, especially when predictors are limited and the dataset is structured.

The second paper performed multi-class prediction of functional outcomes three months after stroke, a clinically important, yet methodologically challenging outcome. All models demonstrated similar overall accuracy. However, machine learning, particularly neural networks and gradient-boosting models, indicated clearer advantages over multinomial logistic regression in distinguishing the functional dependence category. Using explainability approaches such as SHapley Additive exPlanations, the study demonstrated that complex models can still provide interpretable insights into the contribution of risk factors in predictions.

The third paper comprehensively evaluated the classical Cox proportional hazards model and machine learning models for predicting time-to-event outcomes using both simulation and real-world registry data. The Cox regression model performed better when its assumptions were satisfied or when the violations of the assumptions were minimal, while  tree-based models demonstrated better performance in the presence of non-linearity, misspecification, or large number of noise variables.

The final paper compared multiple modeling frameworks for predicting competing risks at multiple evaluation time points (horizons). The results showed that the performance of the models depended on the dataset and the evaluation time point, and no model consistently performed the best. Tree-based and deep-learning models achieved better discrimination when events were common, while pseudo-observation-based and Fine-Gray models showed better calibration, especially at longer horizons.

In summary, the thesis demonstrated that model choice should be guided not by popularity but by data structure, clinical context, and evaluation using different metrics and at multiple time horizons. Traditional and machine learning models each have strengths and rigorous validation, calibration assessment, and explainability are crucial for trustworthy clinical prediction.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2026. p. 32
Series
Statistical studies, ISSN 1100-8989 ; 61
Keywords
Predictive modeling, Machine learning, Survival analysis, Stroke, Prediktiv modellering, Maskininlärning, Överlevnadsanalys, Stroke
National Category
Probability Theory and Statistics Cardiology and Cardiovascular Disease
Research subject
Statistics
Identifiers
urn:nbn:se:umu:diva-250171 (URN)978-91-8070-890-6 (ISBN)978-91-8070-891-3 (ISBN)
Public defence
2026-03-20, HUM.D.210 - Hummelhonung, Humanisthuset, 09:30 (English)
Opponent
Supervisors
Available from: 2026-02-27 Created: 2026-02-23 Last updated: 2026-02-24Bibliographically approved

Open Access in DiVA

fulltext(894 kB)208 downloads
File information
File name FULLTEXT01.pdfFile size 894 kBChecksum SHA-512
0e84cdd8a2cf1beb5f648fbf764a51c650b5cbae1e735c2885ec6801b57b4be6c70599b1028d2a18652c787ee89bcaa97d8b7171844fd3cc5d10c41bbe392b16
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Otieno, Josline A.Eriksson, Marie

Search in DiVA

By author/editor
Wang, WenjuanOtieno, Josline A.Eriksson, Marie
By organisation
Statistics
In the same journal
BMJ Open
Probability Theory and StatisticsCardiology and Cardiovascular DiseasePublic Health, Global Health and Social MedicineNeurology

Search outside of DiVA

GoogleGoogle Scholar
Total: 209 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 494 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf