Novel variable influence on projection (VIP) methods in OPLS, O2PLS, and OnPLS models for single- and multi-block variable selection: VIPOPLS, VIPO2PLS, and MB-VIOP methods
Umeå University, Faculty of Science and Technology, Department of Chemistry. ORCID iD: 0000-0001-8776-8626
2017 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Multivariate and multiblock data analysis involves useful methodologies for analyzing large data sets in chemistry, biology, psychology, economics, sensory science, and industrial processes; among these methodologies, partial least squares (PLS) and orthogonal projections to latent structures (OPLS®) have become popular. Owing to increasingly computerized instrumentation, a data set can consist of thousands of input variables that contain latent information valuable for research and industrial purposes. When a large number of data sets (blocks) are analyzed simultaneously, the number of variables and of underlying connections between them grows considerably; at this point, reducing the number of variables while keeping high interpretability becomes a much-needed strategy.

The main direction of research in this thesis is the development of a variable selection method, based on variable influence on projection (VIP), in order to improve the model interpretability of OnPLS models in multiblock data analysis. This new method is called multiblock variable influence on orthogonal projections (MB-VIOP), and its novelty lies in the fact that it is the first multiblock variable selection method for OnPLS models.

Several milestones needed to be reached in order to successfully create MB-VIOP. The first milestone was the development of a single-block variable selection method able to handle orthogonal latent variables in OPLS models, i.e. VIP for OPLS (denoted as VIPOPLS or OPLS-VIP in Paper I), which proved to increase the interpretability of PLS and OPLS models and was afterwards successfully extended to multivariate time series analysis (MTSA) aimed at process control (Paper II). The second milestone was to develop the first multiblock VIP approach for enhancement of O2PLS® models, i.e. VIPO2PLS for two-block multivariate data analysis (Paper III). Finally, the third milestone and main goal of this thesis was the development of the MB-VIOP algorithm for the improvement of OnPLS model interpretability when analyzing a large number of data sets simultaneously (Paper IV).

The results of this thesis, and of its enclosed papers, showed that the VIPOPLS, VIPO2PLS, and MB-VIOP methods successfully assess the most relevant variables for model interpretation in PLS, OPLS, O2PLS, and OnPLS models. In addition, predictability, robustness, dimensionality reduction, and other variable selection purposes can potentially be improved or achieved by using these methods.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2017, p. 103
Keywords [en]
Variable influence on projection, VIP, MB-VIOP, orthogonal projections to latent structures, OPLS, O2PLS, OnPLS, variable selection, variable importance in multiblock regression
National Category
Chemistry
Research subject
computer science
Identifiers
URN: urn:nbn:se:umu:diva-130579
ISBN: 978-91-7601-620-6 (print)
OAI: oai:DiVA.org:umu-130579
DiVA, id: diva2:1068132
Public defence
2017-02-15, KB.E3.01, KBC-huset, Umeå campus, Umeå, 13:00 (English)
Available from: 2017-01-25 Created: 2017-01-24 Last updated: 2018-06-09. Bibliographically approved
List of papers
1. Multiblock variable influence on orthogonal projections (MB-VIOP) for enhanced interpretation of total, global, local and unique variations in OnPLS models
(English) Manuscript (preprint) (Other academic)
Keywords
multiblock variable selection, OnPLS, VIP, MB-VIOP, variable importance in multiblock regression
National Category
Chemistry
Research subject
computer science
Identifiers
urn:nbn:se:umu:diva-130578 (URN)
Available from: 2017-01-24 Created: 2017-01-24 Last updated: 2018-06-09
2. A new approach for variable influence on projection (VIP) in O2PLS models
2017 (English). In: Chemometrics and Intelligent Laboratory Systems, ISSN 0169-7439, E-ISSN 1873-3239, Vol. 160, p. 110-124. Article in journal (Refereed). Published
Abstract [en]

A novel variable influence on projection approach for O2PLS® models, named VIPO2PLS, is presented in this paper. VIPO2PLS is a model-based method for judging the importance of variables. Its cornerstone is the 2-way formalism of the O2PLS models, i.e. the use of both predictive and orthogonal normalized loadings of the two modelled data matrices, together with a new weighting system based on the sum of squares of both data blocks (X, Y). The VIPO2PLS algorithm has been tested on one synthetic data set and two real cases, and the outcomes have been compared to the PLS-VIP, VIPOPLS, and i-PLS methods. The purpose is to achieve a sharper and enhanced interpretation of O2PLS models by using the new VIPO2PLS method for assessing the importance of both X- and Y-variables.

Place, publisher, year, edition, pages
Elsevier, 2017
Keywords
Multi-block variable selection, O2PLS, VIP, Variable importance, Model interpretation, Multivariate calibration
National Category
Chemistry; Computer and Information Sciences
Identifiers
urn:nbn:se:umu:diva-128916 (URN)
10.1016/j.chemolab.2016.11.005 (DOI)
000392684100013 ()
Available from: 2016-12-19 Created: 2016-12-19 Last updated: 2018-06-09. Bibliographically approved
3. Variable influence on projection (VIP) for OPLS models and its applicability in multivariate time series analysis
2015 (English). In: Chemometrics and Intelligent Laboratory Systems, ISSN 0169-7439, E-ISSN 1873-3239, Vol. 146, p. 297-304. Article in journal (Refereed). Published
Abstract [en]

Recently, a new parameter to infer variable importance in orthogonal projections to latent structures (OPLS) was presented. Called OPLS-VIP (variable influence on projection), this parameter is here applied in multivariate time series analysis to achieve an improved diagnosis of process dynamics. To this end, OPLS-VIP has been tested on three real-world industrial data sets: the first corresponds to a pulp manufacturing process using a continuous digester, the second involves data from an industrial heater that experienced problems, and the third contains measurements of the chemical oxygen demand in the effluent of a newsprint mill. The outcomes obtained using OPLS-VIP are benchmarked against classical PLS-VIP results. It is demonstrated that OPLS-VIP provides a better diagnosis and understanding of the time series behavior than PLS-VIP.

Place, publisher, year, edition, pages
Elsevier, 2015
Keywords
VIP, Variable influence on projection, Multivariate time series analysis, OPLS, Variable selection, Process monitoring
National Category
Chemistry
Identifiers
urn:nbn:se:umu:diva-106759 (URN)
10.1016/j.chemolab.2015.05.001 (DOI)
000360595100031 ()
Available from: 2015-08-07 Created: 2015-08-07 Last updated: 2018-06-07. Bibliographically approved
4. Variable influence on projection (VIP) for orthogonal projections to latent structures (OPLS)
2014 (English). In: Journal of Chemometrics, ISSN 0886-9383, E-ISSN 1099-128X, Vol. 28, no. 8, p. 623-632. Article in journal (Refereed). Published
Abstract [en]

A new approach for variable influence on projection (VIP) is described, which takes full advantage of the orthogonal projections to latent structures (OPLS) model formalism for enhanced model interpretability. This means that it will include not only the predictive components in OPLS but also the orthogonal components. Four variants of variable influence on projection (VIP) adapted to OPLS have been developed, tested and compared using three different data sets, one synthetic with known properties and two real-world cases.

Keywords
chemometrics, variable influence on projection, VIP, OPLS, variable selection, PLS
National Category
Analytical Chemistry
Research subject
computer science; analytical chemistry; statistics
Identifiers
urn:nbn:se:umu:diva-90733 (URN)
10.1002/cem.2627 (DOI)
000340504100007 ()
Project
Innovative Multivariate Model Based Approaches For Industry.
Funder
Swedish Research Council (Vetenskapsrådet), 2011-604
Note

Additional supporting information may be found in the online version of this article at the publisher’s web site.

Available from: 2014-06-30 Created: 2014-06-30 Last updated: 2018-06-07. Bibliographically approved

Author/editor
Galindo-Prieto, Beatriz