The Complete Chloroplast Genome of Catha edulis: A Comparative Analysis of Genome Features with Related Species

Int J Mol Sci. 2018 Feb 9;19(2):525. doi: 10.3390/ijms19020525.

Abstract

Qat (Catha edulis, Celastraceae) is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp) genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA) genes, 8 ribosomal RNA (rRNA) genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae.

Keywords: Catha edulis; chloroplast (cp) genome; next generation sequencing; phylogeny; repeat sequence.

MeSH terms

  • Catha / classification
  • Catha / genetics*
  • Genome, Chloroplast*
  • Open Reading Frames
  • Phylogeny
  • RNA, Ribosomal / genetics
  • RNA, Transfer / genetics
  • Repetitive Sequences, Nucleic Acid

Substances

  • RNA, Ribosomal
  • RNA, Transfer