Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A new pipeline for the normalization and pooling of metabolomics data
Show others and affiliations
2021 (English)In: Metabolites, E-ISSN 2218-1989, Vol. 11, no 9, article id 631Article in journal (Refereed) Published
Abstract [en]

Pooling metabolomics data across studies is often desirable to increase the statistical power of the analysis. However, this can raise methodological challenges as several preanalytical and analytical factors could introduce differences in measured concentrations and variability between datasets. Specifically, different studies may use variable sample types (e.g., serum versus plasma) collected, treated, and stored according to different protocols, and assayed in different laboratories using different instruments. To address these issues, a new pipeline was developed to normalize and pool metabolomics data through a set of sequential steps: (i) exclusions of the least informative observations and metabolites and removal of outliers; imputation of missing data; (ii) identification of the main sources of variability through principal component partial R-square (PC-PR2) analysis; (iii) application of linear mixed models to remove unwanted variability, including samples’ originating study and batch, and preserve biological variations while accounting for potential differences in the residual variances across studies. This pipeline was applied to targeted metabolomics data acquired using Biocrates AbsoluteIDQ kits in eight case-control studies nested within the European Prospective Investigation into Cancer and Nutrition (EPIC) cohort. Comprehensive examination of metabolomics measurements indicated that the pipeline improved the comparability of data across the studies. Our pipeline can be adapted to normalize other molecular data, including biomarkers as well as proteomics data, and could be used for pooling molecular datasets, for example in international consortia, to limit biases introduced by inter-study variability. This versatility of the pipeline makes our work of potential interest to molecular epidemiologists.

Place, publisher, year, edition, pages
MDPI, 2021. Vol. 11, no 9, article id 631
Keywords [en]
Cancer epidemiology, Metabolites, Metabolomics, Normalization, Pooling, Technical variability
National Category
Cancer and Oncology Bioinformatics (Computational Biology)
Research subject
Oncology
Identifiers
URN: urn:nbn:se:umu:diva-188135DOI: 10.3390/metabo11090631ISI: 000701760400001PubMedID: 34564446Scopus ID: 2-s2.0-85115861814OAI: oai:DiVA.org:umu-188135DiVA, id: diva2:1600717
Note

(This article belongs to the Special Issue Metabolomics Meets Epidemiology).

Available from: 2021-10-05 Created: 2021-10-05 Last updated: 2024-09-04Bibliographically approved

Open Access in DiVA

fulltext(3571 kB)211 downloads
File information
File name FULLTEXT01.pdfFile size 3571 kBChecksum SHA-512
eb127ca5330526ca57d73f1330af803ffcba391db8e5a01410e011b0bc3e4a21fbdd79b734557e7793b45bc2d795bb39ac82f7258dbfcbaf7d9cee9a45fc7521
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Vidman, LindaRentoft, Matilda

Search in DiVA

By author/editor
Vidman, LindaRentoft, Matilda
By organisation
Oncology
In the same journal
Metabolites
Cancer and OncologyBioinformatics (Computational Biology)

Search outside of DiVA

GoogleGoogle Scholar
Total: 212 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 344 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf