The reference genome of camellia chekiangoleosa provides insights into camellia evolution and tea oil biosynthesis

Hortic Res. 2022 Jan 18:9:uhab083. doi: 10.1093/hr/uhab083. Online ahead of print.

Abstract

Camellia oil extracted from Camellia seeds is rich in unsaturated fatty acids (UFAs) and secondary metabolites beneficial to human health. However, no oil-tea tree genome has yet been published, which is a major obstacle to investigating the heredity improvement of oil-tea trees. Here, using both Illumina and PicBio sequencing technologies, we present the first chromosome-level genome sequence of the oil-tea tree species Camellia chekiangoleosa Hu. (CCH). The assembled genome consists of 15 pseudochromosomes with a genome size of 2.73 Gb and a scaffold N50 of 185.30 Mb. At least 2.16 Gb of the genome assembly consists of repetitive sequences, and the rest involves a high-confidence set of 64 608 protein-coding gene models. Comparative genomic analysis revealed that the CCH genome underwent a whole-genome duplication (WGD) event shared across the Camellia genus at ~57.48 MYA and a γ-WGT event shared across all core eudicot plants at ~120 MYA. Gene family clustering revealed that the genes involved in terpenoid biosynthesis have undergone rapid expansion. Furthermore, we determined the expression patterns of oleic acid accumulation- and terpenoid biosynthesis-associated genes in six tissues. We found that these genes tend to be highly expressed in leaves, pericarp tissues, roots, and seeds. The first chromosome-level genome of oil-tea trees will provide valuable resources for determining Camellia evolution and utilizing the germplasm of this taxon.