Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A new approach to quantify and analyse tRNA sequence data
Umeå University, Faculty of Social Sciences, Centre for Demographic and Ageing Research (CEDAR).ORCID iD: 0000-0001-9188-5518
(English)Manuscript (preprint) (Other academic)
Abstract [en]

A novel quantitative multivariate approach for describing and analyse tRNA sequence data is presented. This approach is based on a multivariate chemical description of each nucleoside in the sequence. 30 theoretically calculated descriptors were used to characterize 63 nucleosides, and principal component analysis was used to extract the main variation from this multivariate description. The resulting four principal properties were interpreted as (PPa) size/bulk of the nucleoside, (PPb) polarity/hydrophobicity of the nucleoside, (PPc) electronic properties of the nucleoside and (PPd)polarity and size of the ribose moiety. These principal properties may be used to translate the tRNA letter sequence data into a quantitative chemical representation. We demonstrate the use of this quantitative description with a multivariate analysis of a set of tRNA sequences. This analysis gives models that are interpretable in terms of wich sequence positions, and nucleoside properties that discriminate the different isoacceptors. This approach is applicable on all kinds of RNA sequence data and gives information that is complementary to current sequence analysis techniques.

National Category
Other Chemistry Topics Organic Chemistry
Research subject
Organic Chemistry
Identifiers
URN: urn:nbn:se:umu:diva-142697OAI: oai:DiVA.org:umu-142697DiVA, id: diva2:1163785
Available from: 2017-12-08 Created: 2017-12-08 Last updated: 2018-06-09
In thesis
1. Deciphering sequence data: A multivariate approach
Open this publication in new window or tab >>Deciphering sequence data: A multivariate approach
1997 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

In this thesis, attention has been focused on the quantitative description of nucleic acids, proteins and peptides. The strategy was to use multivariate chemometrical methods for improving the understanding of the complex structural codes of these kinds of biological molecules. Tools have been developed that enable quantitative modelling of biological molecules, i.e. models based on data that quantitatively describes their properties. The advantage of such models is that they provide interpretations in terms of chemical characteristics for complex features such as similarity, dissimilarity and potency.

By a multivariate physical-chemical characterization of the building blocks of nucleic acids and proteins, i.e. nucleosides and amino acids, descriptive scales have been developed, so called principal properties. The scales give a description of the intrinsic properties of these building blocks. The multivariate characterization results in a multi-property matrix. A principal component analysis of the multi-property matrix gives a small number of latent variables which are considered as the principal properties of the characterized molecules.

The principal property scales may be used for a wide range of different purposes, such as detecting trends and groupings in large sequence data sets, and for analyzing quantitative relationships between structure and function. In statistical experimental design, the descriptors are well suited as design variables to select combinations of amino acids in such a way that they span a wide range of properties.

The use of these principal property descriptors is demonstrated in the quantitative modelling of relationships between structure and activity of various peptide series, DNA-promoters and in the quantitative modelling of transfer ribonucleic acid sequence data (tRNA).

Place, publisher, year, edition, pages
Umeå: Solfjädern Offset AB, 1997. p. 76
Keywords
Principal properties, amino acids, nucleotides, tRNA, DNA, multivariate data analysis, sequence analysis, QSAR, quantitative sequence activity relationships
National Category
Organic Chemistry
Identifiers
urn:nbn:se:umu:diva-142699 (URN)91-7191-337-8 (ISBN)
Public defence
1997-06-06, N320, Naturvetarhuset, 90187, Umeå, 14:00 (Swedish)
Opponent
Supervisors
Available from: 2023-02-03 Created: 2017-12-08 Last updated: 2023-02-03Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Sandberg Hiltunen, Maria

Search in DiVA

By author/editor
Sandberg Hiltunen, Maria
By organisation
Centre for Demographic and Ageing Research (CEDAR)
Other Chemistry TopicsOrganic Chemistry

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 353 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf