Umeå universitets logga

umu.sePublikationer
Ändra sökning
Länk till posten
Permanent länk

Direktlänk
Vidman, Linda
Publikationer (10 of 19) Visa alla publikationer
Bovinder Ylitalo, E., Vidman, L., Harlid, S. & van Guelpen, B. (2025). mRNA extracted from frozen buffy coat samples stored long term in tubes with no RNA preservative shows promise for downstream sequencing analyses. PLOS ONE, 20(3), Article ID e0318834.
Öppna denna publikation i ny flik eller fönster >>mRNA extracted from frozen buffy coat samples stored long term in tubes with no RNA preservative shows promise for downstream sequencing analyses
2025 (Engelska)Ingår i: PLOS ONE, E-ISSN 1932-6203, Vol. 20, nr 3, artikel-id e0318834Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Transcriptomics is an important OMICs method that is often unavailable in biobank research. Frozen blood samples are routinely collected and stored in medical biobanks, but transcriptional studies have been limited due to technical difficulties of extracting high-quality RNA from blood frozen in standard tubes (without RNA preservatives). We aimed to determine whether biobanked buffy coat samples stored at -80°C for up to 23 years could be successfully used for mRNA sequencing. We used a CryoXtract CXT 350 to remove frozen sample cores, which were immersed in RNA preservative during thawing prior to RNA extraction. RNA sequencing was then performed on extractions from pooled samples as well as from 23 buffy coat samples from prospective colorectal cancer cases and 23 matched controls included in the population-based, prospective Northern Sweden Health and Disease Study (NSHDS). For all samples, two library preparation methods were used (Illumina TruSeq Stranded mRNA poly-A selection and Illumina Stranded Total RNA with Ribo-Zero Globin). RNA yields of over 1 µg were obtained from the majority of NSHDS samples (mean = 2.57 µg), and over 92% of samples had RIN values of ≥ 6, indicating suitability for downstream analyses. In conclusion, we developed a method for successfully extracting and sequencing high-quality mRNA from frozen buffy coat samples stored long term in tubes with no RNA preservative.

Ort, förlag, år, upplaga, sidor
Public Library of Science (PLoS), 2025
Nationell ämneskategori
Biokemi Molekylärbiologi
Identifikatorer
urn:nbn:se:umu:diva-237153 (URN)10.1371/journal.pone.0318834 (DOI)001449696700013 ()40106499 (PubMedID)2-s2.0-105000259333 (Scopus ID)
Forskningsfinansiär
Cancerfonden, 20 1154 PjFCancerforskningsfonden i NorrlandKnut och Alice Wallenbergs StiftelseRegion Västerbotten, VLL-833291
Tillgänglig från: 2025-04-15 Skapad: 2025-04-15 Senast uppdaterad: 2025-04-15Bibliografiskt granskad
Bodén, S., Zheng, R., Ribbenstedt, A., Landberg, R., Harlid, S., Vidman, L., . . . Brunius, C. (2024). Dietary patterns, untargeted metabolite profiles and their association with colorectal cancer risk. Scientific Reports, 14(1), Article ID 2244.
Öppna denna publikation i ny flik eller fönster >>Dietary patterns, untargeted metabolite profiles and their association with colorectal cancer risk
Visa övriga...
2024 (Engelska)Ingår i: Scientific Reports, E-ISSN 2045-2322, Vol. 14, nr 1, artikel-id 2244Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

We investigated data-driven and hypothesis-driven dietary patterns and their association to plasma metabolite profiles and subsequent colorectal cancer (CRC) risk in 680 CRC cases and individually matched controls. Dietary patterns were identified from combined exploratory/confirmatory factor analysis. We assessed association to LC–MS metabolic profiles by random forest regression and to CRC risk by multivariable conditional logistic regression. Principal component analysis was used on metabolite features selected to reflect dietary exposures. Component scores were associated to CRC risk and dietary exposures using partial Spearman correlation. We identified 12 data-driven dietary patterns, of which a breakfast food pattern showed an inverse association with CRC risk (OR per standard deviation increase 0.89, 95% CI 0.80–1.00, p = 0.04). This pattern was also inversely associated with risk of distal colon cancer (0.75, 0.61–0.96, p = 0.01) and was more pronounced in women (0.69, 0.49–0.96, p = 0.03). Associations between meat, fast-food, fruit soup/rice patterns and CRC risk were modified by tumor location in women. Alcohol as well as fruit and vegetables associated with metabolite profiles (Q2 0.22 and 0.26, respectively). One metabolite reflecting alcohol intake associated with increased CRC risk, whereas three metabolites reflecting fiber, wholegrain, and fruit and vegetables associated with decreased CRC risk.

Ort, förlag, år, upplaga, sidor
Springer Nature, 2024
Nationell ämneskategori
Näringslära och dietkunskap Cancer och onkologi
Identifikatorer
urn:nbn:se:umu:diva-220475 (URN)10.1038/s41598-023-50567-6 (DOI)001152222400046 ()38278865 (PubMedID)2-s2.0-85183347182 (Scopus ID)
Forskningsfinansiär
CancerfondenVetenskapsrådetRegion VästerbottenIngabritt och Arne Lundbergs Forskningsstiftelse
Tillgänglig från: 2024-02-16 Skapad: 2024-02-16 Senast uppdaterad: 2025-03-19Bibliografiskt granskad
Rothwell, J. A., Bešević, J., Dimou, N., Breeur, M., Murphy, N., Jenab, M., . . . Gunter, M. J. (2023). Circulating amino acid levels and colorectal cancer risk in the European Prospective Investigation into Cancer and Nutrition and UK Biobank cohorts. BMC Medicine, 21(1), Article ID 80.
Öppna denna publikation i ny flik eller fönster >>Circulating amino acid levels and colorectal cancer risk in the European Prospective Investigation into Cancer and Nutrition and UK Biobank cohorts
Visa övriga...
2023 (Engelska)Ingår i: BMC Medicine, E-ISSN 1741-7015, Vol. 21, nr 1, artikel-id 80Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Background: Amino acid metabolism is dysregulated in colorectal cancer patients; however, it is not clear whether pre-diagnostic levels of amino acids are associated with subsequent risk of colorectal cancer. We investigated circulating levels of amino acids in relation to colorectal cancer risk in the European Prospective Investigation into Cancer and Nutrition (EPIC) and UK Biobank cohorts.

Methods: Concentrations of 13-21 amino acids were determined in baseline fasting plasma or serum samples in 654 incident colorectal cancer cases and 654 matched controls in EPIC. Amino acids associated with colorectal cancer risk following adjustment for the false discovery rate (FDR) were then tested for associations in the UK Biobank, for which measurements of 9 amino acids were available in 111,323 participants, of which 1221 were incident colorectal cancer cases.

Results: Histidine levels were inversely associated with colorectal cancer risk in EPIC (odds ratio [OR] 0.80 per standard deviation [SD], 95% confidence interval [CI] 0.69–0.92, FDR P-value=0.03) and in UK Biobank (HR 0.93 per SD, 95% CI 0.87–0.99, P-value=0.03). Glutamine levels were borderline inversely associated with colorectal cancer risk in EPIC (OR 0.85 per SD, 95% CI 0.75–0.97, FDR P-value=0.08) and similarly in UK Biobank (HR 0.95, 95% CI 0.89–1.01, P=0.09) In both cohorts, associations changed only minimally when cases diagnosed within 2 or 5 years of follow-up were excluded.

Conclusions: Higher circulating levels of histidine were associated with a lower risk of colorectal cancer in two large prospective cohorts. Further research to ascertain the role of histidine metabolism and potentially that of glutamine in colorectal cancer development is warranted.

Ort, förlag, år, upplaga, sidor
BioMed Central (BMC), 2023
Nyckelord
Amino acids, Colorectal cancer, Glutamine, Histidine
Nationell ämneskategori
Cancer och onkologi
Identifikatorer
urn:nbn:se:umu:diva-205643 (URN)10.1186/s12916-023-02739-4 (DOI)000940903600001 ()36855092 (PubMedID)2-s2.0-85149153333 (Scopus ID)
Tillgänglig från: 2023-03-14 Skapad: 2023-03-14 Senast uppdaterad: 2023-09-05Bibliografiskt granskad
Vidman, L., Zheng, R., Bodén, S., Ribbenstedt, A., Gunter, M. J., Palmqvist, R., . . . van Guelpen, B. (2023). Untargeted plasma metabolomics and risk of colorectal cancer: an analysis nested within a large-scale prospective cohort. Cancer & Metabolism, 11(1), Article ID 17.
Öppna denna publikation i ny flik eller fönster >>Untargeted plasma metabolomics and risk of colorectal cancer: an analysis nested within a large-scale prospective cohort
Visa övriga...
2023 (Engelska)Ingår i: Cancer & Metabolism, E-ISSN 2049-3002, Vol. 11, nr 1, artikel-id 17Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Background: Colorectal cancer (CRC) is a leading cause of cancer-related death worldwide, but if discovered at an early stage, the survival rate is high. The aim of this study was to identify novel markers predictive of future CRC risk using untargeted metabolomics.

Methods: This study included prospectively collected plasma samples from 902 CRC cases and 902 matched cancer-free control participants from the population-based Northern Sweden Health and Disease Study (NSHDS), which were obtained up to 26 years prior to CRC diagnosis. Using reverse-phase liquid chromatography-mass spectrometry (LC-MS), data comprising 5015 metabolic features were obtained. Conditional logistic regression was applied to identify potentially important metabolic features associated with CRC risk. In addition, we investigated if previously reported metabolite biomarkers of CRC risk could be validated in this study population.

Results: In the univariable analysis, seven metabolic features were associated with CRC risk (using a false discovery rate cutoff of 0.25). Two of these could be annotated, one as pyroglutamic acid (odds ratio per one standard deviation increase = 0.79, 95% confidence interval, 0.70-0.89) and another as hydroxytigecycline (odds ratio per one standard deviation increase = 0.77, 95% confidence interval, 0.67-0.89). Associations with CRC risk were also found for six previously reported metabolic biomarkers of prevalent and/or incident CRC: sebacic acid (inverse association) and L-tryptophan, 3-hydroxybutyric acid, 9,12,13-TriHOME, valine, and 13-OxoODE (positive associations).

Conclusions: These findings suggest that although the circulating metabolome may provide new etiological insights into the underlying causes of CRC development, its potential application for the identification of individuals at higher risk of developing CRC is limited.

Ort, förlag, år, upplaga, sidor
BioMed Central (BMC), 2023
Nyckelord
Untargeted metabolomics, Colorectal cancer, Early detection
Nationell ämneskategori
Cancer och onkologi
Identifikatorer
urn:nbn:se:umu:diva-218144 (URN)10.1186/s40170-023-00319-x (DOI)001088049400001 ()37849011 (PubMedID)
Forskningsfinansiär
Cancerfonden, CAN 2017/581Vetenskapsrådet, 2017-01737Region Västerbotten, VLL-833291Region Västerbotten, VLL-841671Knut och Alice Wallenbergs StiftelseCancerforskningsfonden i NorrlandUmeå universitetIngabritt och Arne Lundbergs Forskningsstiftelse
Tillgänglig från: 2023-12-19 Skapad: 2023-12-19 Senast uppdaterad: 2024-01-05Bibliografiskt granskad
Breeur, M., Ferrari, P., Dossus, L., Jenab, M., Johansson, M., Rinaldi, S., . . . Viallon, V. (2022). Pan-cancer analysis of pre-diagnostic blood metabolite concentrations in the European Prospective Investigation into Cancer and Nutrition. BMC Medicine, 20(1), Article ID 351.
Öppna denna publikation i ny flik eller fönster >>Pan-cancer analysis of pre-diagnostic blood metabolite concentrations in the European Prospective Investigation into Cancer and Nutrition
Visa övriga...
2022 (Engelska)Ingår i: BMC Medicine, E-ISSN 1741-7015, Vol. 20, nr 1, artikel-id 351Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Background: Epidemiological studies of associations between metabolites and cancer risk have typically focused on specific cancer types separately. Here, we designed a multivariate pan-cancer analysis to identify metabolites potentially associated with multiple cancer types, while also allowing the investigation of cancer type-specific associations.

Methods: We analysed targeted metabolomics data available for 5828 matched case-control pairs from cancer-specific case-control studies on breast, colorectal, endometrial, gallbladder, kidney, localized and advanced prostate cancer, and hepatocellular carcinoma nested within the European Prospective Investigation into Cancer and Nutrition (EPIC) cohort. From pre-diagnostic blood levels of an initial set of 117 metabolites, 33 cluster representatives of strongly correlated metabolites and 17 single metabolites were derived by hierarchical clustering. The mutually adjusted associations of the resulting 50 metabolites with cancer risk were examined in penalized conditional logistic regression models adjusted for body mass index, using the data-shared lasso penalty.

Results: Out of the 50 studied metabolites, (i) six were inversely associated with the risk of most cancer types: glutamine, butyrylcarnitine, lysophosphatidylcholine a C18:2, and three clusters of phosphatidylcholines (PCs); (ii) three were positively associated with most cancer types: proline, decanoylcarnitine, and one cluster of PCs; and (iii) 10 were specifically associated with particular cancer types, including histidine that was inversely associated with colorectal cancer risk and one cluster of sphingomyelins that was inversely associated with risk of hepatocellular carcinoma and positively with endometrial cancer risk.

Conclusions: These results could provide novel insights for the identification of pathways for cancer development, in particular those shared across different cancer types.

Ort, förlag, år, upplaga, sidor
BioMed Central (BMC), 2022
Nyckelord
Breast, Cancer, Colorectal, Endometrial, EPIC, Kidney, Lasso, Liver, Metabolomics, Prostate
Nationell ämneskategori
Cancer och onkologi
Identifikatorer
urn:nbn:se:umu:diva-200663 (URN)10.1186/s12916-022-02553-4 (DOI)000869850300002 ()36258205 (PubMedID)2-s2.0-85140184323 (Scopus ID)
Forskningsfinansiär
EU, FP7, Sjunde ramprogrammet, 2014/1193EU, FP7, Sjunde ramprogrammet, 313010EU, FP7, Sjunde ramprogrammet, C19335/A21351CancerfondenVetenskapsrådet
Tillgänglig från: 2022-11-07 Skapad: 2022-11-07 Senast uppdaterad: 2023-09-05Bibliografiskt granskad
Viallon, V., His, M., Rinaldi, S., Breeur, M., Gicquiau, A., Hemon, B., . . . Ferrari, P. (2021). A new pipeline for the normalization and pooling of metabolomics data. Metabolites, 11(9), Article ID 631.
Öppna denna publikation i ny flik eller fönster >>A new pipeline for the normalization and pooling of metabolomics data
Visa övriga...
2021 (Engelska)Ingår i: Metabolites, E-ISSN 2218-1989, Vol. 11, nr 9, artikel-id 631Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Pooling metabolomics data across studies is often desirable to increase the statistical power of the analysis. However, this can raise methodological challenges as several preanalytical and analytical factors could introduce differences in measured concentrations and variability between datasets. Specifically, different studies may use variable sample types (e.g., serum versus plasma) collected, treated, and stored according to different protocols, and assayed in different laboratories using different instruments. To address these issues, a new pipeline was developed to normalize and pool metabolomics data through a set of sequential steps: (i) exclusions of the least informative observations and metabolites and removal of outliers; imputation of missing data; (ii) identification of the main sources of variability through principal component partial R-square (PC-PR2) analysis; (iii) application of linear mixed models to remove unwanted variability, including samples’ originating study and batch, and preserve biological variations while accounting for potential differences in the residual variances across studies. This pipeline was applied to targeted metabolomics data acquired using Biocrates AbsoluteIDQ kits in eight case-control studies nested within the European Prospective Investigation into Cancer and Nutrition (EPIC) cohort. Comprehensive examination of metabolomics measurements indicated that the pipeline improved the comparability of data across the studies. Our pipeline can be adapted to normalize other molecular data, including biomarkers as well as proteomics data, and could be used for pooling molecular datasets, for example in international consortia, to limit biases introduced by inter-study variability. This versatility of the pipeline makes our work of potential interest to molecular epidemiologists.

Ort, förlag, år, upplaga, sidor
MDPI, 2021
Nyckelord
Cancer epidemiology, Metabolites, Metabolomics, Normalization, Pooling, Technical variability
Nationell ämneskategori
Cancer och onkologi Bioinformatik (beräkningsbiologi)
Forskningsämne
onkologi
Identifikatorer
urn:nbn:se:umu:diva-188135 (URN)10.3390/metabo11090631 (DOI)000701760400001 ()34564446 (PubMedID)2-s2.0-85115861814 (Scopus ID)
Anmärkning

(This article belongs to the Special Issue Metabolomics Meets Epidemiology).

Tillgänglig från: 2021-10-05 Skapad: 2021-10-05 Senast uppdaterad: 2024-09-04Bibliografiskt granskad
Källberg, D., Vidman, L. & Rydén, P. (2021). Comparison of Methods for Feature Selection in Clustering of High-Dimensional RNA-Sequencing Data to Identify Cancer Subtypes. Frontiers in Genetics, 12, Article ID 632620.
Öppna denna publikation i ny flik eller fönster >>Comparison of Methods for Feature Selection in Clustering of High-Dimensional RNA-Sequencing Data to Identify Cancer Subtypes
2021 (Engelska)Ingår i: Frontiers in Genetics, E-ISSN 1664-8021, Vol. 12, artikel-id 632620Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Cancer subtype identification is important to facilitate cancer diagnosis and select effective treatments. Clustering of cancer patients based on high-dimensional RNA-sequencing data can be used to detect novel subtypes, but only a subset of the features (e.g., genes) contains information related to the cancer subtype. Therefore, it is reasonable to assume that the clustering should be based on a set of carefully selected features rather than all features. Several feature selection methods have been proposed, but how and when to use these methods are still poorly understood. Thirteen feature selection methods were evaluated on four human cancer data sets, all with known subtypes (gold standards), which were only used for evaluation. The methods were characterized by considering mean expression and standard deviation (SD) of the selected genes, the overlap with other methods and their clustering performance, obtained comparing the clustering result with the gold standard using the adjusted Rand index (ARI). The results were compared to a supervised approach as a positive control and two negative controls in which either a random selection of genes or all genes were included. For all data sets, the best feature selection approach outperformed the negative control and for two data sets the gain was substantial with ARI increasing from (−0.01, 0.39) to (0.66, 0.72), respectively. No feature selection method completely outperformed the others but using the dip-rest statistic to select 1000 genes was overall a good choice. The commonly used approach, where genes with the highest SDs are selected, did not perform well in our study.

Ort, förlag, år, upplaga, sidor
Frontiers Media S.A., 2021
Nyckelord
cancer subtypes, feature selection, gene selection, high-dimensional, RNA-seq
Nationell ämneskategori
Sannolikhetsteori och statistik Bioinformatik och beräkningsbiologi
Identifikatorer
urn:nbn:se:umu:diva-181727 (URN)10.3389/fgene.2021.632620 (DOI)000626903100001 ()2-s2.0-85102373666 (Scopus ID)
Tillgänglig från: 2021-03-24 Skapad: 2021-03-24 Senast uppdaterad: 2025-02-05Bibliografiskt granskad
Renman, D., Gylling, B., Vidman, L., Bodén, S., Strigård, K., Palmqvist, R., . . . van Guelpen, B. (2021). Density of CD3+ and CD8+ cells in the microenvironment of colorectal cancer according to pre-diagnostic physical activity. Cancer Epidemiology, Biomarkers and Prevention, 30(12), 2317-2326
Öppna denna publikation i ny flik eller fönster >>Density of CD3+ and CD8+ cells in the microenvironment of colorectal cancer according to pre-diagnostic physical activity
Visa övriga...
2021 (Engelska)Ingår i: Cancer Epidemiology, Biomarkers and Prevention, ISSN 1055-9965, E-ISSN 1538-7755, Vol. 30, nr 12, s. 2317-2326Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Introduction: Physical activity is associated not only with a decreased risk of developing colorectal cancer but also with improved survival. One putative mechanism is the infiltration of immune cells in the tumor microenvironment. Experimental findings suggest that physical activity may mobilize immune cells to the tumor. We hypothesized that higher levels of physical activity prior to colorectal cancer diagnosis are associated with higher densities of tumor-infiltrating T-lymphocytes in colorectal cancer patients.

Method: The study setting was a northern Swedish population-based cohort, including 109792 participants with prospectively collected health- and lifestyle-related data. For 592 participants who later developed colorectal cancer, archival tumor tissue samples were used to assess the density of CD3+ and CD8+ cytotoxic T-cells by immunohistochemistry. Odds ratios for associations between self-reported, pre-diagnostic recreational physical activity and immune-cell infiltration were estimated by ordinal logistic regression.

Results: Recreational physical activity >3 times per week was associated with a higher density of CD8+ T-cells in the tumor front and center compared to participants reporting no recreational physical activity. Odds ratios were 2.77 (95% CI 1.21-6.35) and 2.85 (95% CI 1.28-6.33) for the tumor front and center, respectively, after adjustment for sex, age at diagnosis, and tumor stage. The risk estimates were consistent after additional adjustment for several potential confounders. For CD3 no clear associations were found.

Conclusion: Physical activity may promote the infiltration of CD8+ immune cells in the tumor microenvironment of colorectal cancer.

Impact: The study provides some evidence on how physical activity may alter the prognosis in colorectal cancer.

Ort, förlag, år, upplaga, sidor
American Association for Cancer Research (AACR), 2021
Nationell ämneskategori
Cancer och onkologi
Identifikatorer
urn:nbn:se:umu:diva-188712 (URN)10.1158/1055-9965.EPI-21-0508 (DOI)000728256100001 ()34607838 (PubMedID)2-s2.0-85121664424 (Scopus ID)
Forskningsfinansiär
Cancerfonden, 2017/581Region Västerbotten, RV‐939032Visare Norr, 929704Cancerforskningsfonden i Norrland
Tillgänglig från: 2021-10-19 Skapad: 2021-10-19 Senast uppdaterad: 2022-05-12Bibliografiskt granskad
His, M., Viallon, V., Dossus, L., Schmidt, J. A., Travis, R. C., Gunter, M. J., . . . Rinaldi, S. (2021). Lifestyle correlates of eight breast cancer-related metabolites: a cross-sectional study within the EPIC cohort. BMC Medicine, 19(1), Article ID 312.
Öppna denna publikation i ny flik eller fönster >>Lifestyle correlates of eight breast cancer-related metabolites: a cross-sectional study within the EPIC cohort
Visa övriga...
2021 (Engelska)Ingår i: BMC Medicine, E-ISSN 1741-7015, Vol. 19, nr 1, artikel-id 312Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Background: Metabolomics is a promising molecular tool for identifying novel etiological pathways leading to cancer. In an earlier prospective study among pre- and postmenopausal women not using exogenous hormones, we observed a higher risk of breast cancer associated with higher blood concentrations of one metabolite (acetylcarnitine) and a lower risk associated with higher blood concentrations of seven others (arginine, asparagine, phosphatidylcholines (PCs) aa C36:3, ae C34:2, ae C36:2, ae C36:3, and ae C38:2).

Methods: To identify determinants of these breast cancer-related metabolites, we conducted a cross-sectional analysis to identify their lifestyle and anthropometric correlates in 2358 women, who were previously included as controls in case-control studies nested within the European Prospective Investigation into Cancer and Nutrition cohort and not using exogenous hormones at blood collection. Associations of each metabolite concentration with 42 variables were assessed using linear regression models in a discovery set of 1572 participants. Significant associations were evaluated in a validation set (n = 786).

Results: For the metabolites previously associated with a lower risk of breast cancer, concentrations of PCs ae C34:2, C36:2, C36:3, and C38:2 were negatively associated with adiposity and positively associated with total and saturated fat intakes. PC ae C36:2 was also negatively associated with alcohol consumption and positively associated with two scores reflecting adherence to a healthy lifestyle. Asparagine concentration was negatively associated with adiposity. Arginine and PC aa C36:3 concentrations were not associated to any of the factors examined. For the metabolite previously associated with a higher risk of breast cancer, acetylcarnitine, a positive association with age was observed.

Conclusions: These associations may indicate possible mechanisms underlying associations between lifestyle and anthropometric factors, and risk of breast cancer. Further research is needed to identify potential non-lifestyle correlates of the metabolites investigated.

Ort, förlag, år, upplaga, sidor
BioMed Central, 2021
Nyckelord
Anthropometry, Breast cancer, Cross-sectional, Lifestyle, Metabolites
Nationell ämneskategori
Cancer och onkologi Folkhälsovetenskap, global hälsa och socialmedicin
Identifikatorer
urn:nbn:se:umu:diva-190814 (URN)10.1186/s12916-021-02183-2 (DOI)000728537600001 ()34886862 (PubMedID)2-s2.0-85121052328 (Scopus ID)
Tillgänglig från: 2021-12-29 Skapad: 2021-12-29 Senast uppdaterad: 2025-02-20Bibliografiskt granskad
Vidman, L. (2020). cancer subtype identification using cluster analysis on high-dimensional omics data. (Doctoral dissertation). Umeå: Umeå universitet
Öppna denna publikation i ny flik eller fönster >>cancer subtype identification using cluster analysis on high-dimensional omics data
2020 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

Identification and prediction of cancer subtypes are important parts in the development towards personalized medicine. By tailoring treatments, it is possible to decrease unnecessary suffering and reduce costs. Since the introduction of next generation sequencing techniques, the amount of data available for medical research has increased rapidly. The high dimensional omics data produced by various techniques requires statistical methods to transform data into information and knowledge.

All papers in this thesis are related to distinguishing of disease subtypes in patients with cancer using omics data. The high dimension and the complexity of sequencing data from tumor samples makes it necessary to pre—process the data.  We carry out comparisons of feature selection methods and clustering methods used for identification of cancer subtypes. In addition, we evaluate the effect that certain characteristics of the data have on the ability to identify cancer subtypes. The results show that no method outperforms the others in all cases and the relative ranking of methods is very dependent on the data. We also show that the benefit of receiving a more homogeneous data by analyzing genders separately can outweigh the possible drawbacks caused by smaller sample sizes. One of the major challenges when dealing with omics data from tumor samples is that the patients are generally a very heterogeneous group. Factors that lead to heterogeneity include age, gender, ethnicity and stage of disease. How big the effect size is for each of these factors might affect the ability to identify the subgroups of interest.

In omics data, the feature space is often large and how many of the features that are informative for the factors of interest will also affect the complexity of the problem. We present a novel clustering approach that can identify different clusters in different subsets of the feature space, which is applied on methylation data to create new potential biomarkers. It is shown that by combining clinical data with methylation data for patients with clear cell renal carcinoma, it is possible to improve the currently used prediction model for disease progression.  

Using unsupervised clustering techniques, we identify three molecular subtypes of prostate cancer bone metastases based on gene expression profiles. The robustness of the identified subtypes is confirmed by applying several clustering algorithms with very similar results.

 

Ort, förlag, år, upplaga, sidor
Umeå: Umeå universitet, 2020. s. 22
Serie
Research report in mathematical statistics, ISSN 1653-0829 ; 70/20
Nyckelord
cluster analysis, cancer, classification
Nationell ämneskategori
Sannolikhetsteori och statistik
Identifikatorer
urn:nbn:se:umu:diva-167275 (URN)978-91-7855-172-9 (ISBN)978-91-7855-173-6 (ISBN)
Disputation
2020-02-07, N460, Naturvetarhuset, Umeå, 09:15 (Engelska)
Opponent
Handledare
Tillgänglig från: 2020-01-17 Skapad: 2020-01-14 Senast uppdaterad: 2021-10-19Bibliografiskt granskad
Organisationer

Sök vidare i DiVA

Visa alla publikationer