Complete chloroplast genome of the Malus baccata var. gracilis provides insights into the evolution and phylogeny of Malus species

Funct Integr Genomics. 2024 Jan 18;24(1):13. doi: 10.1007/s10142-024-01291-5.

Abstract

Malus baccata (L.) var. gracilis (Rehd.) has high ornamental value and breeding significance. Comparative chloroplast genome analysis was applied to facilitate genetic breeding for desired traits and resistance and to provide insight into the phylogeny of the genus Malus. Using whole-genome sequencing data, a quadripartite chloroplast genome with a length of 159,992 bp and a total GC content of 36.56% was assembled. The M. baccata var. gracilis chloroplast genome consists of a large single-copy (LSC) region (88,100 bp), a small single-copy (SSC) region (19,186 bp), and two inverted repeat (IR) regions, IRa and IRb (26,353 bp each). This chloroplast genome contains 112 annotated genes, including 79 protein-coding genes (nine multicopy), 29 tRNA genes (eight multicopy), and four rRNA genes (all multicopy). Relative synonymous codon usage (RSCU) analysis identified 32 high-frequency codons, and codon usage was biased towards codons ending in A/U. Interspecific sequence comparison and boundary analysis revealed significant sequence variation in the LSC region, as well as generally similar expansion and contraction of the SSC and IR regions across the 10 analyzed Malus species. In the phylogenetic analysis of chloroplast genome sequences, M. baccata var. gracilis and Malus hupehensis clustered on the same branch. The chloroplast genomes of Malus species provide an important foundation for species identification, genetic diversity analysis, and Malus chloroplast genetic engineering. Additionally, the results can facilitate the use of pendant traits to improve apple tree shape.
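The relative synonymous codon usage mentioned above can be illustrated with a minimal sketch. It is not the authors' pipeline. It assumes the standard genetic code, the usual definition of RSCU (observed codon count divided by the mean count across that amino acid's synonymous codons), and the common convention that a "high-frequency" codon has RSCU > 1. The function names, the choice to exclude stop codons, and the toy sequence are hypothetical and serve only as illustration.

```python
from collections import Counter

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def rscu(cds_sequences):
    """Return {codon: RSCU} pooled over in-frame coding sequences.

    Assumption: stop codons are excluded, and codons containing
    ambiguous bases are skipped.
    """
    counts = Counter()
    for seq in cds_sequences:
        seq = seq.upper().replace("U", "T")
        # Walk the sequence codon by codon, ignoring a trailing partial codon.
        for i in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[i:i + 3]
            if codon in CODON_TABLE:
                counts[codon] += 1

    # Group codons into synonymous families by amino acid.
    families = {}
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families.setdefault(aa, []).append(codon)

    values = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from the input; RSCU undefined
        for c in family:
            # Observed count relative to the count expected under equal usage.
            values[c] = counts[c] * len(family) / total
    return values


def high_frequency_codons(values, threshold=1.0):
    """Codons used more often than expected under equal synonymous usage."""
    return sorted(c for c, v in values.items() if v > threshold)


if __name__ == "__main__":
    toy_cds = ["ATGGCTGCTGCTGCATAA"]  # hypothetical toy sequence, not real data
    print(high_frequency_codons(rscu(toy_cds)))
```

Applied to the pooled protein-coding sequences of a chloroplast genome, the same calculation gives per-codon RSCU values whose third-position composition can then be inspected for A/U bias.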

Keywords: Bioinformatics; Chloroplast genome; Comparative genomics analysis; Malus baccata var. gracilis; Phylogeny.

MeSH terms

  • Codon / genetics
  • Genome, Chloroplast*
  • Malus*
  • Phylogeny
  • Plant Breeding

Substances

  • Codon