Megavariate analysis of hierarchical QSAR data
Umeå University, Faculty of Science and Technology, Department of Chemistry.
2002 (English). In: Journal of Computer-Aided Molecular Design, Vol. 16, no. 10, pp. 711-726. Article in journal (Refereed), Published
Abstract [en]

Multivariate PCA and PLS models involving many variables are often difficult to interpret, because plots and lists of loadings, coefficients, VIPs, etc., rapidly become cluttered and hard to survey. There may then be a strong temptation to eliminate variables to obtain a smaller data set. Such a reduction of variables, however, often removes information and makes the modelling efforts less reliable. Model interpretation may be misleading and predictive power may deteriorate.

A better alternative is usually to partition the variables into blocks of logically related variables and apply hierarchical data analysis. Such blocked data may be analyzed by PCA and PLS. This modelling forms the base level of the hierarchical modelling set-up. On the base level, in-depth information is extracted for the different blocks. The score vectors formed on the base level, here called 'super variables', may be linked together in new matrices on the top level. On the top level, superficial relationships between the X- and the Y-data are investigated.
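
The two-level set-up described above can be sketched as follows. This is a minimal illustration only, not the authors' implementation: it uses plain Python and a NIPALS iteration to extract one principal component per block, and then runs a second PCA on the resulting score vectors. The function names (`center`, `first_component`, `hierarchical_pca`), the block sizes, and the synthetic data are all assumptions made for the example. It covers only the PCA case and keeps a single component per block, and it mean-centers but does not scale the variables.

```python
import random

def center(X):
    """Column-mean-center a matrix given as a list of rows."""
    n, k = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(k)]
    return [[row[j] - means[j] for j in range(k)] for row in X]

def first_component(X, max_iter=500, tol=1e-10):
    """First principal component of a centered matrix via NIPALS.

    Returns (scores, loadings); the loadings have unit length.
    """
    n, k = len(X), len(X[0])
    # Start from the column with the largest sum of squares.
    start = max(range(k), key=lambda j: sum(row[j] ** 2 for row in X))
    t = [row[start] for row in X]
    p = [0.0] * k
    for _ in range(max_iter):
        tt = sum(v * v for v in t)
        # Loadings: project the columns of X onto the current scores.
        p = [sum(X[i][j] * t[i] for i in range(n)) / tt for j in range(k)]
        norm = sum(v * v for v in p) ** 0.5
        p = [v / norm for v in p]
        # Scores: project the rows of X onto the normalized loadings.
        t_new = [sum(X[i][j] * p[j] for j in range(k)) for i in range(n)]
        diff = sum((a - b) ** 2 for a, b in zip(t, t_new))
        t = t_new
        if diff < tol * sum(v * v for v in t):
            break
    return t, p

def hierarchical_pca(blocks):
    """Base level: one score vector per block. Top level: PCA on those scores."""
    super_vars = [first_component(center(B))[0] for B in blocks]
    # Rows are observations, columns are super variables (one per block).
    top = [list(row) for row in zip(*super_vars)]
    top_scores, top_loadings = first_component(center(top))
    return top, top_scores, top_loadings

# Synthetic stand-in data (hypothetical, NOT the data set of the paper):
# four blocks that share one latent factor plus noise.
random.seed(0)
n_obs = 10
block_sizes = [5, 8, 4, 6]  # hypothetical numbers of variables per block
latent = [random.gauss(0, 1) for _ in range(n_obs)]
blocks = [
    [[latent[i] + 0.3 * random.gauss(0, 1) for _ in range(k)] for i in range(n_obs)]
    for k in block_sizes
]

top, top_scores, top_loadings = hierarchical_pca(blocks)
print("top-level matrix:", len(top), "x", len(top[0]))
print("top-level loadings:", [round(v, 3) for v in top_loadings])
```

The top-level loadings indicate how strongly each block's super variable contributes to the overall pattern, so interpretation can begin with a handful of block-level quantities before going back to the individual variables within a block. A hierarchical PLS analysis would follow the same layout, with the top-level PCA replaced by a PLS model that relates X super variables to Y super variables.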

In this paper, the basic principles of hierarchical modelling by means of PCA and PLS are reviewed. One objective of the paper is to disseminate this concept to a broader QSAR audience. The hierarchical methods are used to analyze a set of 10 haloalkanes for which K = 30 chemical descriptors and M = 255 biological responses have been gathered. Owing to the complexity of the biological data, they are subdivided into four blocks. All the modelling steps on the base level and the top level are reported, and the final QSAR model is interpreted thoroughly.

Place, publisher, year, edition, pages
2002. Vol. 16, no 10, 711-26 p.
URN: urn:nbn:se:umu:diva-9139
OAI: diva2:148810
Available from: 2008-03-04 Created: 2008-03-04 Last updated: 2013-02-28 Bibliographically approved

By author/editor
Sjöström, Michael; Wold, Svante
By organisation
Department of Chemistry

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 26 hits
ReferencesLink to record
Permanent link

Direct link