Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A deep learning approach for overall survival prediction in lung cancer with missing values
Research Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy.
Research Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy.
Operative Research Unit of Radiation Oncology, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, Italy.
Umeå University, Faculty of Medicine, Department of Diagnostics and Intervention. Umeå University, Faculty of Medicine, Department of Radiation Sciences, Radiation Physics. Research Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy.ORCID iD: 0000-0003-2621-072X
2024 (English)In: Computer Methods and Programs in Biomedicine, ISSN 0169-2607, E-ISSN 1872-7565, Vol. 254, article id 108308Article in journal (Refereed) Published
Abstract [en]

Background and Objective: In the field of lung cancer research, particularly in the analysis of overall survival (OS), artificial intelligence (AI) serves crucial roles with specific aims. Given the prevalent issue of missing data in the medical domain, our primary objective is to develop an AI model capable of dynamically handling this missing data. Additionally, we aim to leverage all accessible data, effectively analyzing both uncensored patients who have experienced the event of interest and censored patients who have not, by embedding a specialized technique within our AI model, not commonly utilized in other AI tasks. Through the realization of these objectives, our model aims to provide precise OS predictions for non-small cell lung cancer (NSCLC) patients, thus overcoming these significant challenges.

Methods: We present a novel approach to survival analysis with missing values in the context of NSCLC, which exploits the strengths of the transformer architecture to account only for available features without requiring any imputation strategy. More specifically, this model tailors the transformer architecture to tabular data by adapting its feature embedding and masked self-attention to mask missing data and fully exploit the available ones. By making use of ad-hoc designed losses for OS, it is able to account for both censored and uncensored patients, as well as changes in risks over time.

Results: We compared our method with state-of-the-art models for survival analysis coupled with different imputation strategies. We evaluated the results obtained over a period of 6 years using different time granularities obtaining a Ct-index, a time-dependent variant of the C-index, of 71.97, 77.58 and 80.72 for time units of 1 month, 1 year and 2 years, respectively, outperforming all state-of-the-art methods regardless of the imputation method used.

Conclusions: The results show that our model not only outperforms the state-of-the-art's performance but also simplifies the analysis in the presence of missing data, by effectively eliminating the need to identify the most appropriate imputation strategy for predicting OS in NSCLC patients.

Place, publisher, year, edition, pages
Elsevier, 2024. Vol. 254, article id 108308
Keywords [en]
Missing data, Oncology, Precision medicine, Survival analysis
National Category
Cancer and Oncology
Identifiers
URN: urn:nbn:se:umu:diva-227835DOI: 10.1016/j.cmpb.2024.108308ISI: 001266463400001Scopus ID: 2-s2.0-85197293687OAI: oai:DiVA.org:umu-227835DiVA, id: diva2:1883637
Funder
Swedish National Infrastructure for Computing (SNIC)Swedish Research Council, 2022-06725Swedish Research Council, 2018-05973Available from: 2024-07-11 Created: 2024-07-11 Last updated: 2025-04-24Bibliographically approved

Open Access in DiVA

fulltext(827 kB)137 downloads
File information
File name FULLTEXT01.pdfFile size 827 kBChecksum SHA-512
4f5569d1a316075710dc96228b632a247dd9ca732dd08a80a8ec6d24d2fd2fdaba5c141e3025f5c4769260e5468d95348dffc0ee4b534145f0412eb0909ed27b
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Soda, Paolo

Search in DiVA

By author/editor
Soda, Paolo
By organisation
Department of Diagnostics and InterventionRadiation Physics
In the same journal
Computer Methods and Programs in Biomedicine
Cancer and Oncology

Search outside of DiVA

GoogleGoogle Scholar
Total: 138 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 412 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf