Comparative Genomic Analysis Uncovers the Chloroplast Genome Variation and Phylogenetic Relationships of Camellia Species

Biomolecules. 2022 Oct 13;12(10):1474. doi: 10.3390/biom12101474.

Abstract

Camellia is the largest genus in the family Theaceae. Due to phenotypic diversity, frequent hybridization, and polyploidization, an understanding of the phylogenetic relationships between Camellia species remains challenging. Comparative chloroplast (cp) genomics provides an informative resource for phylogenetic analyses of Camellia. In this study, 12 chloroplast genome sequences from nine Camellia species were determined using Illumina sequencing technology via de novo assembly. The cp genome sizes ranged from 156,545 to 157,021 bp and were organized into quadripartite regions with the typical angiosperm cp genomes. Each genome harbored 87 protein-coding, 37 transfer RNA, and 8 ribosomal RNA genes in the same order and orientation. Differences in long and short sequence repeats, SNPs, and InDels were detected across the 12 cp genomes. Combining with the complete cp sequences of seven other species in the genus Camellia, a total of nine intergenic sequence divergent hotspots and 14 protein-coding genes with high sequence polymorphism were identified. These hotspots, especially the InDel (~400 bp) located in atpH-atpI region, had sufficient potential to be used as barcode markers for further phylogenetic analysis and species identification. Principal component and phylogenetic analysis suggested that regional constraints, rather than functional constraints, strongly affected the sequence evolution of the cp genomes in this study. These cp genomes could facilitate the development of new molecular markers, accurate species identification, and investigations of the phylogenomic relationships of the genus Camellia.

Keywords: Camellia; chloroplast genome; genetic relationship; marker; phylogenetic analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Camellia* / genetics
  • DNA, Intergenic
  • Genome, Chloroplast* / genetics
  • Genomics
  • Phylogeny

Substances

  • DNA, Intergenic

Grants and funding

This research was funded by The National Key R&D Program of China (2018YFD1000603-2) and the National Science Foundation of China (31770719).