Three monophyletic superfamilies account for the majority of the known glycosyltransferases

Protein Sci. 2003 Jul;12(7):1418-31. doi: 10.1110/ps.0302103.

Abstract

Sixty-five families of glycosyltransferases (EC 2.4.x.y) have been recognized on the basis of high-sequence similarity to a founding member with experimentally demonstrated enzymatic activity. Although distant sequence relationships between some of these families have been reported, the natural history of glycosyltransferases is poorly understood. We used iterative searches of sequence databases, motif extraction, structural comparison, and analysis of completely sequenced genomes to track the origins of modern-type glycosyltransferases. We show that >75% of recognized glycosyltransferase families belong to one of only three monophyletic superfamilies of proteins, namely, (1) a recently described GPGTF/GT-B superfamily; (2) a nucleoside-diphosphosugar transferase (GT-A) superfamily, which is characterized by a DxD sequence signature and also includes nucleotidyltransferases; and (3) a GT-C superfamily of integral membrane glycosyltransferases with a modified DxD signature in the first extracellular loop. Several developmental regulators in Metazoans, including Fringe and Egghead homologs, belong to the second superfamily. Interestingly, Tout-velu/Exostosin family of developmental proteins found in all multicellular eukaryotes, contains separate domains belonging to the first and the second superfamilies, explaining multiple glycosyltransferase activities in one protein.

Publication types

  • Comparative Study

MeSH terms

  • Amino Acid Sequence
  • Conserved Sequence / genetics
  • Databases, Protein
  • Evolution, Molecular
  • Glycosyltransferases / chemistry
  • Glycosyltransferases / classification
  • Glycosyltransferases / genetics*
  • Molecular Sequence Data
  • Multigene Family*
  • Nucleotidyltransferases / genetics
  • Phylogeny
  • Protein Folding
  • Protein Structure, Secondary
  • Sequence Alignment
  • Statistics as Topic

Substances

  • Glycosyltransferases
  • Nucleotidyltransferases