Phylogenetic analysis of carbamoylphosphate synthetase genes: complex evolutionary history includes an internal duplication within a gene which can root the tree of life

Mol Biol Evol. 1996 Sep;13(7):970-7. doi: 10.1093/oxfordjournals.molbev.a025665.

Abstract

Carbamoylphosphate synthetase (CPS) catalyzes the first committed step in pyrimidine biosynthesis, arginine biosynthesis, or the urea cycle. Organisms may contain either one generalized or two specific CPS enzymes, and these enzymes may be heterodimeric (encoded by linked or unlinked genes), monomeric, or part of a multifunctional protein. In order to help elucidate the evolution of CPS, we have performed a comprehensive phylogenetic analysis using the 21 available complete CPS sequences, including a sequence from Sulfolobus solfataricus P2 which we report in this paper. This is the first report of a complete CPS gene sequence from an archaeon, and sequence analysis suggests that it encodes an enzyme similar to heterodimeric CPSII. We confirm that internal similarity within the synthetase domain of CPS is the result of an ancient gene duplication that preceded the divergence of the Bacteria, Archaea, and Eukarya, and use this internal duplication in phylogenetic tree construction to root the tree of life. Our analysis indicates with high confidence that this archaeal sequence is more closely related to those of Eukarya than to those of Bacteria. In addition to this ancient duplication which created the synthetase domain, our phylogenetic analysis reveals a complex history of further gene duplications, fusions, and other events which have played an integral part in the evolution of CPS.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Bacterial Proteins / genetics
  • Carbamoyl-Phosphate Synthase (Ammonia) / genetics*
  • Carbamoyl-Phosphate Synthase (Glutamine-Hydrolyzing) / genetics*
  • Carbon-Nitrogen Ligases*
  • Cloning, Molecular
  • Eukaryotic Cells / physiology
  • Evolution, Molecular*
  • Gram-Positive Bacteria / genetics
  • Humans
  • Ligases / genetics*
  • Models, Biological
  • Models, Genetic
  • Molecular Sequence Data
  • Multigene Family
  • Phylogeny*
  • Sulfolobus / enzymology
  • Sulfolobus / genetics

Substances

  • Bacterial Proteins
  • Ligases
  • Carbon-Nitrogen Ligases
  • carbamoyl-phosphate synthetase (N-acetylglutamate)
  • Carbamoyl-Phosphate Synthase (Ammonia)
  • Carbamoyl-Phosphate Synthase (Glutamine-Hydrolyzing)

Associated data

  • GENBANK/J01597
  • GENBANK/J05503
  • GENBANK/J05512
  • GENBANK/K01178
  • GENBANK/K02132
  • GENBANK/L08965
  • GENBANK/L31362
  • GENBANK/L32150
  • GENBANK/M11710
  • GENBANK/M12318
  • GENBANK/M12319
  • GENBANK/M12320
  • GENBANK/M12321
  • GENBANK/M12322
  • GENBANK/M12324
  • GENBANK/M12325
  • GENBANK/M27174
  • GENBANK/M59757
  • GENBANK/U04992
  • GENBANK/U04993
  • GENBANK/U05193
  • GENBANK/U11295
  • GENBANK/U18792
  • GENBANK/U33768
  • GENBANK/X13200
  • GENBANK/X14533
  • GENBANK/X55433
  • GENBANK/X73308
  • GENBANK/Z26919