Evolution of the genetic code; Evidence from serine codon use disparity in Escherichia coli

Proc Natl Acad Sci U S A. 2020 Nov 17;117(46):28572-28575. doi: 10.1073/pnas.2014567117. Epub 2020 Nov 9.

Abstract

Among the 20 amino acids, three of them, leucine (Leu), arginine (Arg), and serine (Ser), are each encoded by six different codons. In comparison, each of the other 17 amino acids is encoded by 4, 3, 2, or 1 codon. Peculiarly, the Ser codons are split into two disparate codon boxes that differ by at least two base substitutions, in contrast to Leu and Arg, whose codons are mutually exchangeable by a single-base substitution. We propose that these two different sets of Ser codons emerged independently during evolution. In this hypothesis, at the time of the origin of life there were only seven primordial amino acids: valine (coded by GUX [X = U, C, A or G]), alanine (coded by GCX), aspartic acid (coded by GAY [Y = U or C]), glutamic acid (coded by GAZ [Z = A or G]), glycine (coded by GGX), Ser (coded by AGY), and Arg (coded by CGX and AGZ). All of these codons were derived from GGX for glycine by single-base substitutions. Later in evolution, another class of Ser codons, UCX, was derived from the alanine codons, GCX, and is thus distinctly different from the primordial Ser codons, AGY. From the analysis of the Escherichia coli genome, we find extensive disparities in the usage of these two classes of Ser codons: some genes use only AGY for Ser, whereas others use only UCX, pointing to distinct differences in their origins, consistent with our hypothesis.
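The two quantitative ideas in the abstract, the substitution distance between codon boxes and the per-gene usage of AGY versus UCX, can be illustrated with a short Python sketch. This is not the authors' analysis pipeline: the function names (`min_box_distance`, `serine_codon_usage`) and the toy sequence are hypothetical, and the sketch assumes an in-frame coding sequence given as a plain DNA or RNA string.

```python
from itertools import product

# Codon boxes as named in the abstract (RNA alphabet, standard genetic code).
SER_AGY = {"AGU", "AGC"}
SER_UCX = {"UCU", "UCC", "UCA", "UCG"}
ARG_CGX = {"CGU", "CGC", "CGA", "CGG"}
ARG_AGZ = {"AGA", "AGG"}
LEU_CUX = {"CUU", "CUC", "CUA", "CUG"}
LEU_UUZ = {"UUA", "UUG"}


def hamming(a, b):
    """Number of positions at which two equal-length codons differ."""
    return sum(x != y for x, y in zip(a, b))


def min_box_distance(box_a, box_b):
    """Fewest base substitutions needed to get from any codon in box_a
    to any codon in box_b."""
    return min(hamming(a, b) for a, b in product(box_a, box_b))


def serine_codon_usage(cds):
    """Count AGY and UCX serine codons in an in-frame coding sequence.

    Assumption: `cds` starts at codon position 1; trailing bases that do
    not fill a complete codon are ignored.
    """
    rna = cds.upper().replace("T", "U")
    usable = len(rna) - len(rna) % 3
    counts = {"AGY": 0, "UCX": 0}
    for i in range(0, usable, 3):
        codon = rna[i:i + 3]
        if codon in SER_AGY:
            counts["AGY"] += 1
        elif codon in SER_UCX:
            counts["UCX"] += 1
    return counts


if __name__ == "__main__":
    print("Ser AGY vs UCX:", min_box_distance(SER_AGY, SER_UCX))
    print("Arg CGX vs AGZ:", min_box_distance(ARG_CGX, ARG_AGZ))
    print("Leu CUX vs UUZ:", min_box_distance(LEU_CUX, LEU_UUZ))
    # Toy sequence, not a real E. coli gene: ATG AGC TCT AGT TAA
    print(serine_codon_usage("ATGAGCTCTAGTTAA"))
```

Applying `serine_codon_usage` to every annotated coding sequence in a genome and comparing the two counts per gene would give a rough view of the kind of usage disparity the abstract describes; how the authors actually defined and tested that disparity is not specified here.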

Keywords: LUCA; evolution; primitive amino acids; serine codons.

MeSH terms

  • Codon Usage*
  • Escherichia coli / genetics*
  • Evolution, Molecular*
  • Serine / genetics*

Substances

  • Serine