Umeå universitets logga

umu.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Early detection of squamous cell carcinoma of the oral tongue using multidimensional plasma protein analysis and interpretable machine learning
Umeå universitet, Medicinska fakulteten, Institutionen för medicinsk biovetenskap, Patologi.ORCID-id: 0000-0002-6574-3628
Umeå universitet, Medicinska fakulteten, Institutionen för medicinsk biovetenskap, Patologi. Umeå university. (Professor Karin Nylander)ORCID-id: 0000-0003-2166-6242
Umeå universitet, Medicinska fakulteten, Institutionen för medicinsk biovetenskap, Patologi.
Research Centre for Applied Molecular Oncology, Masaryk Memorial Cancer Institute, Brno, Czech Republic.
Visa övriga samt affilieringar
2023 (Engelska)Ingår i: Journal of Oral Pathology & Medicine, ISSN 0904-2512, E-ISSN 1600-0714, Vol. 52, nr 7, s. 637-643Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Background: Interpretable machine learning (ML) for early detection of cancer has the potential to improve risk assessment and early intervention.

Methods: Data from 261 proteins related to inflammation and/or tumor processes in 123 blood samples collected from healthy persons, but of whom a sub-group later developed squamous cell carcinoma of the oral tongue (SCCOT), were analyzed. Samples from people who developed SCCOT within less than 5 years were classified as tumor-to-be and all other samples as tumor-free. The optimal ML algorithm for feature selection was identified and feature importance computed by the SHapley Additive exPlanations (SHAP) method. Five popular ML algorithms (AdaBoost, Artificial neural networks [ANNs], Decision Tree [DT], eXtreme Gradient Boosting [XGBoost], and Support Vector Machine [SVM]) were applied to establish prediction models, and decisions of the optimal models were interpreted by SHAP.

Results: Using the 22 selected features, the SVM prediction model showed the best performance (sensitivity = 0.867, specificity = 0.859, balanced accuracy = 0.863, area under the receiver operating characteristic curve [ROC-AUC] = 0.924). SHAP analysis revealed that the 22 features rendered varying person-specific impacts on model decision and the top three contributors to prediction were Interleukin 10 (IL10), TNF Receptor Associated Factor 2 (TRAF2), and Kallikrein Related Peptidase 12 (KLK12).

Conclusion: Using multidimensional plasma protein analysis and interpretable ML, we outline a systematic approach for early detection of SCCOT before the appearance of clinical signs.

Ort, förlag, år, upplaga, sidor
John Wiley & Sons, 2023. Vol. 52, nr 7, s. 637-643
Nyckelord [en]
machine learning, interpretable model, SHAP, SCCOT, PLASMA PROTEIN
Nationell ämneskategori
Cancer och onkologi
Forskningsämne
genetik
Identifikatorer
URN: urn:nbn:se:umu:diva-208270DOI: 10.1111/jop.13461ISI: 001026127400001PubMedID: 37428440Scopus ID: 2-s2.0-85164698201OAI: oai:DiVA.org:umu-208270DiVA, id: diva2:1757086
Forskningsfinansiär
Cancerfonden, 20 0754 PjF 01HRegion VästerbottenUmeå universitet
Anmärkning

Originally included in thesis in manuscript form. 

Tillgänglig från: 2023-05-15 Skapad: 2023-05-15 Senast uppdaterad: 2025-04-24Bibliografiskt granskad
Ingår i avhandling
1. Clinical investigation and application of Artificial Intelligence in diagnosis and prognosis of squamous cell carcinoma of the head and neck
Öppna denna publikation i ny flik eller fönster >>Clinical investigation and application of Artificial Intelligence in diagnosis and prognosis of squamous cell carcinoma of the head and neck
2023 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

Background: In Sweden around 1400 people are affected by head and neck cancer each year, and around 400 of these tumours are located in the mobile tongue (SCCOT). A major problem with these tumours is the high degree of relapse. In order to broaden our understanding of the group of squamous cell carcinoma of the head and neck (SCCHN) tumours we evaluated and compared the outcomes of panendoscopy with biopsy, ultrasonography with fine needle aspiration cytology (US-FNAC), and preoperative positron emission tomography/computed tomography (PET/CT) data from the same patients. As patients with SCCHN frequently have distant metastasis and locoregional recurrences, machine learning (ML) techniques were used to create classification models that accurately predict the likelihood of an early recurrence.

Materials and methods: From patients suspected of having head and neck cancer between 2014–2016 results from PET/CT, panendoscopy with biopsy and US-FNAC were compared. Clinical, genomic, transcriptomic, and proteomic markers identifying recurrence risk were investigated. In blood samples taken from healthy individuals, data from proteins relevant to inflammation and/or tumor processes were evaluated. The SHapley Additive Explanations (SHAP) approach was used to determine the best ML algorithm for feature selection. AdaBoost, Artificial neural networks (ANNs), Decision Tree (DT), eXtreme Gradient Boosting (XGBoost), and Support Vector Machine (SVM) were used to create prediction models. Clinical data from patients were analyzed using statistical and ML techniques.

Results: The concordance between results from PET/CT and panendoscopy with biopsy was 91.3%, and somewhat lower, 89.1%, for PET/CT and US-FNAC. The top contributors to classification with the ML approach were five mRNAs (PLAUR, DKK1, AXIN2, ANG, VEGFA), and 10 proteins (RAD50, 4E-BP1, MYH11, MAP2K1, BECN1, NF2, RAB25, ERRFI1, KDR, SERPINE1), using the extreme gradient boosting (XGBoost) method. The SHAP approach was used for feature selection. Using data from analysis of proteins in blood and interpretable ML showed that the Support Vector Machine (SVM) had the best performance with a balanced accuracy of 0.863, and a ROC-AUC of 0.924. The top three contributors to the SVM prediction model's performance were IL10, TNF Receptor Associated Factor 2 (TRAF2), and Kallikrein Related Peptidase 12 (KLK12). Recurrence was correlated with diabetes (p = 0.003), radiographic neck metastasis (p = 0.010), and T stage (p = 0.0012). A ML model got an accuracy rate of 71.2%. In the SCCOT group, diabetics predominated over non-diabetics, and also had lower recurrence rates and better survival (p = 0.012).

Conclusion: Results show that the combination PET/CT is useful in diagnosis of SCCHN. It further emphasizes the use of ML to identify transcriptomic and proteomic factors that are significant in predicting risk of recurrence in patients with SCCHN. It provides a methodical strategy for early diagnosis of SCCOT before onset of clinical symptoms using multidimensional plasma protein profiling and interpretable ML. A model for predicting recurrence of SCCOT is provided by ML utilizing clinical data. As SCCOT patients with co-existing diabetes showed a better prognosis than non-diabetics, results suggest that individuals with SCCOT, regardless of diabetes status, may benefit from therapeutic management of glucose levels.

Ort, förlag, år, upplaga, sidor
Umeå: Umeå Universitet, 2023. s. 47
Serie
Umeå University medical dissertations, ISSN 0346-6612 ; 2227
Nyckelord
SCCHN, SCCOT, Recurrence, ML, PET/CT, mRNA, transcriptomic, proteomic, Diabetes, AI
Nationell ämneskategori
Cancer och onkologi
Forskningsämne
oto-rhino-laryngologi
Identifikatorer
urn:nbn:se:umu:diva-208272 (URN)9789180700146 (ISBN)9789180700153 (ISBN)
Disputation
2023-06-15, Betula, Byggnad 6M, Umeå universitet, Umeå, 09:00 (Engelska)
Opponent
Handledare
Forskningsfinansiär
Cancerfonden, 20 0754 PjF 01H
Tillgänglig från: 2023-05-25 Skapad: 2023-05-15 Senast uppdaterad: 2023-05-16Bibliografiskt granskad

Open Access i DiVA

fulltext(1607 kB)154 nedladdningar
Filinformation
Filnamn FULLTEXT03.pdfFilstorlek 1607 kBChecksumma SHA-512
55f63d6e0fb51990a3872d6b691d02f9982946bda999adcf8fdb1b39bab22d99c1ddc5d8dcc15220b7d458cb042e9f87a7beb66a9fa42634c588a5d15ec0bbcf
Typ fulltextMimetyp application/pdf

Övriga länkar

Förlagets fulltextPubMedScopus

Person

Gu, XiaolianSalehi, Amir M.Wang, LixiaoSgaramella, NicolaNylander, Karin

Sök vidare i DiVA

Av författaren/redaktören
Gu, XiaolianSalehi, Amir M.Wang, LixiaoSgaramella, NicolaNylander, Karin
Av organisationen
Patologi
I samma tidskrift
Journal of Oral Pathology & Medicine
Cancer och onkologi

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 169 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

doi
pubmed
urn-nbn

Altmetricpoäng

doi
pubmed
urn-nbn
Totalt: 389 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf