Single molecule real-time sequencing data sets of Hypericum perforatum L. plantlets and cell suspension cultures

Sci Data. 2024 Jan 6;11(1):42. doi: 10.1038/s41597-023-02878-6.

Abstract

Hypericum is a large genus that includes more than 500 species of pharmacological, ecological and conservation value. Although recent advances in sequencing technologies have been used extensively to generate and assemble the genomes of many organisms, annotated whole-genome sequence data are not yet publicly available for any Hypericum species. The bioavailability of secondary metabolites varies among tissues, so data derived from different cultures will be valuable for comparative studies. Here, we report for the first time single-molecule real-time (SMRT) sequencing data sets of Hypericum perforatum L. plantlets and cell suspension cultures. Sequencing of cell suspension cultures yielded more than 33,000 high-quality transcripts from 20 Gb of raw data, while sequencing of plantlets yielded more than 55,000 high-quality transcripts from 35 Gb of raw data. These data sets are a valuable resource for comparative transcriptomic analysis and will help to elucidate the as yet unknown biosynthetic pathways of high medicinal value in the genus Hypericum.

Publication types

  • Dataset

MeSH terms

  • Cell Culture Techniques
  • Gene Expression Profiling
  • Hypericum* / genetics
  • Transcriptome