Towards the Well-Tempered Chloroplast DNA Sequences

Plants (Basel). 2021 Jul 2;10(7):1360. doi: 10.3390/plants10071360.

Abstract

With the development of next-generation sequencing technology and bioinformatics tools, the process of assembling DNA sequences has become cheaper and easier, especially in the case of much shorter organelle genomes. The number of available DNA sequences of complete chloroplast genomes in public genetic databases is constantly increasing and the data are widely used in plant phylogenetic and biotechnological research. In this work, we investigated possible inconsistencies in the stored form of publicly available chloroplast genome sequence data. The impact of these inconsistencies on the results of the phylogenetic analysis was investigated and the bioinformatic solution to identify and correct inconsistencies was implemented. The whole procedure was demonstrated using five plant families (Apiaceae, Asteraceae, Campanulaceae, Lamiaceae and Rosaceae) as examples.

Keywords: chloroplast genome; cyclic shift; genome assembly; inversion; standardization.