The Standard Genetic Code can Evolve from a Two-Letter GC Code Without Information Loss or Costly Reassignments

Alejandro Frank; Tom Froese

doi:10.1007/s11084-018-9559-4

The Standard Genetic Code can Evolve from a Two-Letter GC Code Without Information Loss or Costly Reassignments

Orig Life Evol Biosph. 2018 Jun;48(2):259-272. doi: 10.1007/s11084-018-9559-4. Epub 2018 Jun 29.

Authors

Alejandro Frank^{1

2

3}, Tom Froese^{4

5}

Affiliations

¹ Institute for Nuclear Sciences (ICN), National Autonomous University of Mexico (UNAM), Mexico City, Mexico.
² Center for the Sciences of Complexity (C3), National Autonomous University of Mexico (UNAM), Mexico City, Mexico.
³ El Colegio Nacional, Mexico City, Mexico.
⁴ Center for the Sciences of Complexity (C3), National Autonomous University of Mexico (UNAM), Mexico City, Mexico. t.froese@gmail.com.
⁵ Institute for Applied Mathematics and Systems Research (IIMAS), National Autonomous University of Mexico (UNAM), Mexico City, Mexico. t.froese@gmail.com.

PMID: 29959584
DOI: 10.1007/s11084-018-9559-4

Abstract

It is widely agreed that the standard genetic code must have been preceded by a simpler code that encoded fewer amino acids. How this simpler code could have expanded into the standard genetic code is not well understood because most changes to the code are costly. Taking inspiration from the recently synthesized six-letter code, we propose a novel hypothesis: the initial genetic code consisted of only two letters, G and C, and then expanded the number of available codons via the introduction of an additional pair of letters, A and U. Various lines of evidence, including the relative prebiotic abundance of the earliest assigned amino acids, the balance of their hydrophobicity, and the higher GC content in genome coding regions, indicate that the original two nucleotides were indeed G and C. This process of code expansion probably started with the third base, continued with the second base, and ended up as the standard genetic code when the second pair of letters was introduced into the first base. The proposed process is consistent with the available empirical evidence, and it uniquely avoids the problem of costly code changes by positing instead that the code expanded its capacity via the creation of new codons with extra letters.

Keywords: Code evolution; Code expansion; Origins of genetic code; Origins of life.

MeSH terms

Codon / analysis
Evolution, Molecular*
Genetic Code / genetics*
Models, Genetic
Nucleotides / analysis
Origin of Life*

Substances

Codon
Nucleotides

Grants and funding

IA104717/Dirección General de Asuntos del Personal Académico, Universidad Nacional Autónoma de México