Recognition of Fold and Sugar Linkage for Glycosyltransferases by Multivariate Sequence Analysis
2004 (English)In: Journal of Biological Chemistry, Vol. 279, 38683-92 p.Article in journal (Refereed) Published
Glycosyltransferases (GTs) are among the largest groups of enzymes found and are usually classified on the basis of sequence comparisons into many families of varying similarity (CAZy systematics). Only two different Rossman-like folds have been detected (GT-A and GT-B) within the small number of established crystal structures. A third uncharacterized fold has been indicated with transmembrane organization (GT-C). We here use a method based on multivariate data analyses (MVDAs) of property patterns in amino acid sequences and can with high accuracy recognize the correct fold in a large data set of GTs. Likewise, a retaining or inverting enzymatic mechanism for attachment of the donor sugar could be properly revealed in the GT-A and GT-B fold group sequences by such analyses. Sequence alignments could be correlated to important variables in MVDA, and the separating amino acid positions could be mapped over the active sites. These seem to be localized to similar positions in space for the // binding motifs in the GT-B fold group structures. Analogous, active-site sequence positions were found for the GT-A fold group. Multivariate property patterns could also easily group most GTs annotated in the genomes of Escherichia coli and Synechocystis to proper fold or organization group, according to benchmarking comparisons at the MetaServer. We conclude that the sequence property patterns revealed by the multivariate analyses seem more conserved than amino acid types for these GT groups, and these patterns are also conserved in the structures. Such patterns may also potentially define substrate preferences.
Place, publisher, year, edition, pages
2004. Vol. 279, 38683-92 p.
IdentifiersURN: urn:nbn:se:umu:diva-14852DOI: 10.1074/jbc.M402925200OAI: oai:DiVA.org:umu-14852DiVA: diva2:154524