Genome sequence of Hydrangea macrophylla and its application in analysis of the double flower phenotype

DNA Res. 2021 Jan 19;28(1):dsaa026. doi: 10.1093/dnares/dsaa026.

Abstract

Owing to its high ornamental value, the double flower phenotype of hydrangea (Hydrangea macrophylla) is one of its most important traits. In this study, genome sequence information was obtained to explore effective DNA markers and the causative genes for double flower production in hydrangea. Single-molecule real-time sequencing data followed by a Hi-C analysis were employed. Two haplotype-phased sequences were obtained from the heterozygous genome of hydrangea. One assembly consisted of 3,779 scaffolds (2.256 Gb in length and N50 of 1.5 Mb), the other also contained 3,779 scaffolds (2.227 Gb in length, and N50 of 1.4 Mb). A total of 36,930 genes were predicted in the sequences, of which 32,205 and 32,222 were found in each haplotype. A pair of 18 pseudomolecules was constructed along with a high-density single-nucleotide polymorphism (SNP) genetic linkage map. Using the genome sequence data, and two F2 populations, the SNPs linked to double flower loci (djo and dsu) were discovered. DNA markers linked to djo and dsu were developed, and these could distinguish the recessive double flower allele for each locus, respectively. The LEAFY gene is a very likely candidate as the causative gene for dsu, since frameshift was specifically observed in the double flower accession with dsu.

Keywords: de novo genome sequencing; DNA marker; double flower; hydrangea.

MeSH terms

  • Chromosome Mapping
  • Flowers / physiology*
  • Genetic Linkage
  • Genome, Plant*
  • High-Throughput Nucleotide Sequencing
  • Hydrangea / genetics*
  • Hydrangea / physiology
  • Phenotype*
  • Sequence Analysis, DNA