Looking into the genome of Thermosynechococcus elongatus (thermophilic cyanobacteria) with codon selection and usage perspective

Int J Bioinform Res Appl. 2015;11(2):130-41. doi: 10.1504/IJBRA.2015.068088.

Abstract

Genome analysis of thermophilic cyanobacterium, Thermosynechococcus elongatus BP-1 revealed factors ruling choices of codons in this organism. Multiple parameters like Nc, GC3s, RSCU, Codon Adaptation Index (CAI), optimal and rare codons, codon-pair context and amino acid usage were analysed and compositional constraint was identified as major factor. Wide range of Nc values for the same GC3 content suggested the role of translational selection. Mutational bias is suggested at synonymous position. Among optimal codons for translation, most were GC-ending. Seven codons (AGA, AGG, AUA, UAA, UAG, UCA and UGA) were found to have least occurrence in the entire genome and except stop codons all were A-ending (exception AGG). Most widely used codon-pair in the genome are G-ending or C-ending and A-ending or U-ending codons make pair with G-ending or C-ending codons. Amino acids which are largely distributed in T. elongatus tend to use G-ending or C-ending codons most frequently. Findings showed cumulative role of translational selection, translational accuracy and gene expression levels with mutational bias as key player in codon selection pattern of this organism.

Keywords: Thermosynechococcus elongatus; amino acids; bioinformatics; codon usage; codons; codon–pair context; gene expression levels; genome analysis; mutational bias; thermophile; thermophilic cyanobacteria; translational accuracy; translational selection.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids / genetics*
  • Base Sequence
  • Chromosome Mapping / methods*
  • Codon / genetics*
  • Cyanobacteria / genetics*
  • Genome, Bacterial / genetics*
  • Molecular Sequence Data

Substances

  • Amino Acids
  • Codon