Amazing symmetrical clustering in chloroplast genomes

BMC Bioinformatics. 2020 Mar 11;21(Suppl 2):83. doi: 10.1186/s12859-020-3350-z.

Abstract

Background: A seven-cluster pattern, claimed to be universal in bacterial genomes, was reported previously. Keeping in mind the most popular theory of chloroplast origin, we checked whether a similar pattern is observed in chloroplast genomes.

Results: Surprisingly, an eight-cluster structure was found for chloroplasts. The pattern observed for chloroplasts differs rather significantly from the bacterial one, and from that observed for cyanobacteria. The structure is obtained by clustering fragments of equal length isolated within a genome, where each fragment is converted into a triplet frequency dictionary built from non-overlapping triplets that tile the reading frame with no gaps. The resulting points in 63-dimensional space were clustered using the elastic map technique. The eighth cluster found in chloroplasts comprises the genome fragments that bear tRNA genes and exhibit excessively high GC-content in comparison to the entire genome.
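The fragment-to-dictionary step described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the fragment length, and the step between fragments are hypothetical choices, since the abstract does not state them. The sketch returns all 64 triplet frequencies; because they sum to one, one coordinate is redundant, which is consistent with the 63-dimensional space mentioned above. The elastic map clustering itself is not shown.

```python
from itertools import product

# All 64 triplets over the nucleotide alphabet, in a fixed order.
TRIPLETS = ["".join(p) for p in product("ACGT", repeat=3)]


def triplet_frequencies(fragment):
    """Frequency dictionary of non-overlapping triplets.

    Triplets are read in frame starting at position 0, with no gaps
    between them. Triplets containing symbols other than A, C, G, T
    are skipped (an assumption of this sketch).
    """
    counts = dict.fromkeys(TRIPLETS, 0)
    total = 0
    for i in range(0, len(fragment) - 2, 3):
        triplet = fragment[i:i + 3].upper()
        if triplet in counts:
            counts[triplet] += 1
            total += 1
    if total == 0:
        return {t: 0.0 for t in TRIPLETS}
    return {t: c / total for t, c in counts.items()}


def fragment_vectors(genome, length=603, step=11):
    """Cut equal-length fragments along the genome and convert each one
    into a vector of triplet frequencies.

    `length` and `step` are illustrative defaults, not values taken
    from the abstract.
    """
    vectors = []
    for start in range(0, len(genome) - length + 1, step):
        freqs = triplet_frequencies(genome[start:start + length])
        vectors.append((start, [freqs[t] for t in TRIPLETS]))
    return vectors


def gc_content(seq):
    """Fraction of G and C symbols in a sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0
```

Each vector returned by `fragment_vectors` is one point to be clustered, and `gc_content` can be used to compare the fragments of a given cluster with the genome as a whole.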

Conclusion: Chloroplasts exhibit a very specific type of symmetry in the distribution of coding and non-coding genome fragments in the space of triplet frequencies: mirror symmetry. Cyanobacteria may have both mirror symmetry and the rotational symmetry typical of other bacteria.

Keywords: Clustering; Order; Triplet.

MeSH terms

  • Base Composition
  • Chloroplasts / genetics*
  • Cluster Analysis
  • Cyanobacteria / genetics
  • Genome, Chloroplast*
  • RNA, Transfer / genetics

Substances

  • RNA, Transfer