Umeå University's logo

umu.sePublikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Chemometric strategies for supervised multi-model analysis
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Kemiska institutionen. (Johan Trygg - Chemometrics Lab)ORCID-id: 0000-0002-1898-4453
2025 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)Alternativ tittel
Kemometriska strategier för guidad multi-modellanalys (svensk)
Abstract [en]

Understanding biological processes is inherently complex. The cellular machinery andbiochemical pathways present significant challenges in scientific research. Advances indata collection, such as high-content imaging and omics technologies, have enableddeeper insights, but extracting meaningful conclusions from these complicateddatasets remains a challenge. In this thesis, the focus has been on developingchemometric strategies and supervised modelling approaches to improve datainterpretation, aiming to aid scientists in drawing conclusions from their data.In Paper I, we show that cell imaging data, combined with chemometric tools, caneffectively characterize treatment effects, leading to the development of a metric calledEquivalence (Eq.) scores. This work raised two main questions: Are fluorescent labelsnecessary for meaningful characterization? Can living cells, imaged over time, providedeeper insights? In Paper III, we address these questions by investigating anapproach based on label-free live-cell imaging data where we extended the Eq. scoresto time series data. We demonstrate that time-dependent analysis reveals both earlyand late cellular responses and improves the prediction of drug mechanisms.In Paper II, we address challenges arising when Orthogonal Projections to LatentStructures-Discriminant Analysis (OPLS-DA) models are used to analyse severalclasses, such as subtypes of diseases or different treatments. We introduce OPLSHierarchicalDiscriminant Analysis (OPLS-HDA), a method that integrateshierarchical clustering analysis (HCA) with two-class OPLS-DA models to create anOPLS-based decision tree. We demonstrated that OPLS-HDA is a strong classifiercompared to eight other established methods while maintaining interpretability.Additionally, we provide Python scripts that are integrated with SIMCA®, offering auser-friendly interface for broader accessibility.Extracting reliable insights from complex data requires intentional and structuredapproaches. This work highlights the benefits of modular and interpretable modellingsolutions, ensuring that results are both understandable and trustworthy. By breakingdown complex analytical challenges and building tools that enhance interpretability,this work contributes to the broader goal of accelerating data-driven discoveries in lifesciences.

sted, utgiver, år, opplag, sider
Umeå: Umeå University, 2025. , s. 58
Emneord [en]
Label-free live-cell imaging, Morphological profiling, Multi-class classification
HSV kategori
Identifikatorer
URN: urn:nbn:se:umu:diva-236640ISBN: 978-91-8070-642-1 (tryckt)ISBN: 978-91-8070-643-8 (digital)OAI: oai:DiVA.org:umu-236640DiVA, id: diva2:1945672
Disputas
2025-04-16, Stora Hörsalen (KBE303), KBC-huset, Linnaeus väg 6, Umeå, 09:00 (engelsk)
Opponent
Veileder
Forskningsfinansiär
eSSENCE - An eScience CollaborationTilgjengelig fra: 2025-03-26 Laget: 2025-03-19 Sist oppdatert: 2025-03-21bibliografisk kontrollert
Delarbeid
1. A scalable, data analytics workflow for image-based morphological profiles
Åpne denne publikasjonen i ny fane eller vindu >>A scalable, data analytics workflow for image-based morphological profiles
Vise andre…
2024 (engelsk)Inngår i: Chemometrics and Intelligent Laboratory Systems, ISSN 0169-7439, E-ISSN 1873-3239, Vol. 254, artikkel-id 105232Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Cell Painting is an established community-based microscopy-assay platform that provides high-throughput, high-content data for biological readouts. In November 2022, the JUMP-Cell Painting Consortium released the largest publicly available Cell Painting dataset with CellProfiler features, comprising more than 2 billion cell images. This dataset is designed for predicting the activity and toxicity of 115k drug compounds, with the aim to make cell images as computable as genomes and transcriptomes. In this context, our paper introduces a scalable and computationally efficient data analytics workflow created to meet the needs of researchers. This data-driven workflow facilitates the comparison of drug treatment effects through significant and biologically relevant insights. The workflow consists of two parts: first, the Equivalence score (Eq. score), a straightforward yet sophisticated metric highlighting relevant deviations from negative controls based on cell image morphology; second, the scalability of the workflow, by utilizing the Eq. scores on a large scale to predict and classify the subtle morphological changes in cell image profiles. By doing so, we show classification improvements compared to using the raw CellProfiler features on the CPJUMP1-pilot dataset on three types of perturbations. We hope that our workflow's contributions will enhance drug screening efficiency and streamline the drug development process. As this process is resource-intensive, every incremental improvement is valuable. Through our collective efforts in advancing the understanding of high-throughput image-based data, we aim to reduce both the time and cost of developing new, life-saving treatments.

sted, utgiver, år, opplag, sider
Elsevier, 2024
Emneord
Cell Painting, Chemometrics, Computational Workflow, Drug discovery, High-throughput Screening, Morphological Profiling, Quantitative Image Analysis
HSV kategori
Identifikatorer
urn:nbn:se:umu:diva-230015 (URN)10.1016/j.chemolab.2024.105232 (DOI)001320783800001 ()2-s2.0-85204373412 (Scopus ID)
Forskningsfinansiär
eSSENCE - An eScience Collaboration
Tilgjengelig fra: 2024-10-02 Laget: 2024-10-02 Sist oppdatert: 2025-04-24bibliografisk kontrollert
2. OPLS-based multiclass classification and data-driven interclass relationship discovery
Åpne denne publikasjonen i ny fane eller vindu >>OPLS-based multiclass classification and data-driven interclass relationship discovery
2025 (engelsk)Inngår i: Journal of Chemical Information and Modeling, ISSN 1549-9596, E-ISSN 1549-960X, Vol. 65, nr 4, s. 1762-1770Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Multiclass data sets and large-scale studies are increasingly common in omics sciences, drug discovery, and clinical research due to advancements in analytical platforms. Efficiently handling these data sets and discerning subtle differences across multiple classes remains a significant challenge. In metabolomics, two-class orthogonal projection to latent structures discriminant analysis (OPLS-DA) models are widely used due to their strong discrimination capabilities and ability to provide interpretable information on class differences. However, these models face challenges in multiclass settings. A common solution is to transform the multiclass comparison into multiple two-class comparisons, which, while more effective than a global multiclass OPLS-DA model, unfortunately results in a manual, time-consuming model-building process with complicated interpretation. Here, we introduce an extension of OPLS-DA for data-driven multiclass classification: orthogonal partial least squares-hierarchical discriminant analysis (OPLS-HDA). OPLS-HDA integrates hierarchical cluster analysis (HCA) with the OPLS-DA framework to create a decision tree, addressing multiclass classification challenges and providing intuitive visualization of interclass relationships. To avoid overfitting and ensure reliable predictions, we use cross-validation during model building. Benchmark results show that OPLS-HDA performs competitively across diverse data sets compared to eight established methods. This method represents a significant advancement, offering a powerful tool to dissect complex multiclass data sets. With its versatility, interpretability, and ease of use, OPLS-HDA is an efficient approach to multiclass data analysis applicable across various fields.

sted, utgiver, år, opplag, sider
American Chemical Society (ACS), 2025
Emneord
Cluster Analysis, Discriminant Analysis, Humans, Least-Squares Analysis, Metabolomics
HSV kategori
Identifikatorer
urn:nbn:se:umu:diva-236203 (URN)10.1021/acs.jcim.4c01799 (DOI)001412188800001 ()39899705 (PubMedID)2-s2.0-85216849215 (Scopus ID)
Tilgjengelig fra: 2025-03-13 Laget: 2025-03-13 Sist oppdatert: 2025-03-19bibliografisk kontrollert
3. Label-free live-cell imaging improves mechanism of action classification and reveals temporal drug responses through supervised time-dependent modelling
Åpne denne publikasjonen i ny fane eller vindu >>Label-free live-cell imaging improves mechanism of action classification and reveals temporal drug responses through supervised time-dependent modelling
Vise andre…
(engelsk)Manuskript (preprint) (Annet vitenskapelig)
HSV kategori
Identifikatorer
urn:nbn:se:umu:diva-236639 (URN)
Tilgjengelig fra: 2025-03-19 Laget: 2025-03-19 Sist oppdatert: 2025-03-19bibliografisk kontrollert

Open Access i DiVA

fulltext(7236 kB)282 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 7236 kBChecksum SHA-512
79abaf1e25b5400f6d711e0e9bd7aad8ce1c1b83de85f14e1188c7e68c0457fb4e7d08a7a40bf389fcd52de6942f8fb52600f8ab709c109368f3dc436675c5b9
Type fulltextMimetype application/pdf
spikblad(166 kB)52 nedlastinger
Filinformasjon
Fil SPIKBLAD01.pdfFilstørrelse 166 kBChecksum SHA-512
340be562aca7baa8094503a8955dff66eabf71ac1a26030378763787842adee979cf532229e9451ca3e467f5237af49593009a39f0c8b036fdc5d9bdb5777b06
Type spikbladMimetype application/pdf

Person

Forsgren, Edvin

Søk i DiVA

Av forfatter/redaktør
Forsgren, Edvin
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 282 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

isbn
urn-nbn

Altmetric

isbn
urn-nbn
Totalt: 2105 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf