Origin, Evolution and Stability of Overlapping Genes in Viruses: A Systematic Review

Genes (Basel). 2021 May 26;12(6):809. doi: 10.3390/genes12060809.

Abstract

During their long evolutionary history viruses generated many proteins de novo by a mechanism called "overprinting". Overprinting is a process in which critical nucleotide substitutions in a pre-existing gene can induce the expression of a novel protein by translation of an alternative open reading frame (ORF). Overlapping genes represent an intriguing example of adaptive conflict, because they simultaneously encode two proteins whose freedom to change is constrained by each other. However, overlapping genes are also a source of genetic novelties, as the constraints under which alternative ORFs evolve can give rise to proteins with unusual sequence properties, most importantly the potential for novel functions. Starting with the discovery of overlapping genes in phages infecting Escherichia coli, this review covers a range of studies dealing with detection of overlapping genes in small eukaryotic viruses (genomic length below 30 kb) and recognition of their critical role in the evolution of pathogenicity. Origin of overlapping genes, what factors favor their birth and retention, and how they manage their inherent adaptive conflict are extensively reviewed. Special attention is paid to the assembly of overlapping genes into ad hoc databases, suitable for future studies, and to the development of statistical methods for exploring viral genome sequences in search of undiscovered overlaps.

Keywords: asymmetric evolution; codon usage; de novo protein creation; modular evolution; multivariate statistics; negative selection: phylogenetic distribution; positive selection; prediction methods; sequence-composition features; symmetric evolution; virus evolution.

Publication types

  • Systematic Review

MeSH terms

  • Genes, Viral
  • Mutation Rate*
  • Phylogeny
  • Selection, Genetic
  • Viruses / classification
  • Viruses / genetics*