A chromosome-scale genome assembly and annotation of the spring orchid (Cymbidium goeringii)

Mol Ecol Resour. 2022 Apr;22(3):1168-1177. doi: 10.1111/1755-0998.13537. Epub 2021 Nov 5.

Abstract

Cymbidium goeringii, commonly known as the spring orchid, has long been favoured for horticultural purposes in Asian countries. It is a popular orchid with much demand for improvement and development for its valuable varieties. Until now, its reference genome has not been published despite its popularity and conservation efforts. Here, we report the de novo assembly of the C. goeringii genome, which is the largest among the orchids published to date, using a strategy that combines short- and long-read sequencing and chromosome conformation capture (Hi-C) information. The total length of all scaffolds is 3.99 Gb, with an N50 scaffold size of 178.2 Mb. A total of 29,556 protein-coding genes were annotated and 3.55 Gb (88.87% of genome) repetitive sequences were identified. We constructed pseudomolecular chromosomes using Hi-C, incorporating 89.4% of the scaffolds in 20 chromosomes. We identified 220 expanded and 106 contracted genes families in C. goeringii after divergence from its close relative. We also identified new gene families, resistance gene analogues and changes within the MADS-box genes, which control a diverse set of developmental processes during orchid evolution. Our high quality chromosomal-level assembly of C. goeringii can provide a platform for elucidating the genomic evolution of orchids, mining functional genes for agronomic traits and for developing molecular markers for accelerated breeding as well as accelerating conservation efforts.

Keywords: Hi-C; MADS-box; PacBio; Spring orchid; chromosome level assembly; genomics.

MeSH terms

  • Chromosomes
  • Genome
  • Humans
  • Molecular Sequence Annotation
  • Orchidaceae* / genetics
  • Plant Breeding*