Chromosome-scale de novo genome assembly and annotation of three representative Casuarina species: C. equisetifolia, C. glauca, and C. cunninghamiana

Plant J. 2023 Jun;114(6):1490-1505. doi: 10.1111/tpj.16201. Epub 2023 Apr 6.

Abstract

Australian pine (Casuarina spp.) is extensively planted in tropical and subtropical regions for wood production, shelterbelts, environmental protection, and ecological restoration due to their superior biological characteristics, such as rapid growth, wind and salt tolerance, and nitrogen fixation. To analyze the genomic diversity of Casuarina, we sequenced the genomes and constructed de novo genome assemblies of the three most widely planted Casuarina species: C. equisetifolia, C. glauca, and C. cunninghamiana. We generated chromosome-scale genome sequences using both Pacific Biosciences (PacBio) Sequel sequencing and chromosome conformation capture technology (Hi-C). The total genome sizes for C. equisetifolia, C. glauca, and C. cunninghamiana are 268 942 579 bp, 296 631 783 bp, and 293 483 606 bp, respectively, of which 25.91, 27.15, and 27.74% were annotated as repetitive sequences. We annotated 23 162, 24 673, and 24 674 protein-coding genes in C. equisetifolia, C. glauca, and C. cunninghamiana, respectively. We then collected branchlets from male and female individuals for whole-genome bisulfite sequencing (BS-seq) to explore the epigenetic regulation of sex determination in these three species. Transcriptome sequencing (RNA-seq) revealed differential expression of phytohormone-related genes between male and female plants. In summary, we generated three chromosome-level genome assemblies and comprehensive DNA methylation and transcriptome datasets from both male and female material for three Casuarina species, providing a basis for the comprehensive investigation of genomic diversity and functional gene discovery of Casuarina in the future.

Keywords: Casuarina; PacBio single-molecular long-read technology (SMRT) sequencing; chromosome conformation capture technology; whole-genome assembly; whole-genome bisulfite sequencing.

MeSH terms

  • Australia
  • Base Sequence
  • Chromosomes*
  • Epigenesis, Genetic*
  • Molecular Sequence Annotation
  • Repetitive Sequences, Nucleic Acid