Full-length transcriptome analysis of Adiantum flabellulatum gametophyte

PeerJ. 2022 Mar 9:10:e13079. doi: 10.7717/peerj.13079. eCollection 2022.

Abstract

Ferns are important components of plant communities on earth, but their genomes are generally very large, with many redundant genes, making whole genome sequencing of ferns prohibitively expensive and time-consuming. This means there is a significant lack of fern reference genomes, making molecular biology research difficult. The gametophytes of ferns can survive independently, are responsible for sexual reproduction and the feeding of young sporophytes, and play an important role in the alternation of generations. For this study, we selected Adiantum flabellulatum as it has both ornamental and medicinal value and is also an indicator plant of acidic soil. The full-length transcriptome sequencing of its gametophytes was carried out using PacBio three-generation sequencing technology. A total of 354,228 transcripts were obtained, and 231,705 coding sequences (CDSs) were predicted, including 5,749 transcription factors (TFs), 2,214 transcription regulators (TRs) and 4,950 protein kinases (PKs). The transcripts annotated by non-redundant protein sequence database (NR), Kyoto encyclopedia of genes and genomes (KEGG), eukaryotic ortholog groups (KOG), Swissprot, protein family (Pfma), nucleotide sequence database (NT) and gene ontology (GO) were 251,501, 197,474, 193,630, 194,639, 195,956, 113,069 and 197,883, respectively. In addition, 138,995 simple sequence repeats (SSRs) and 111,793 long non-coding RNAs (lncRNAs) were obtained. We selected nine chlorophyll synthase genes for qRT-PCR, and the results showed that the full-length transcript sequences and the annotation information were reliable. This study can provide a reference gene set for subsequent gene expression quantification.

Keywords: Ferns; Functional annotation; Reference gene set; Three-generation sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adiantum* / genetics
  • Gene Expression Profiling / methods
  • Germ Cells, Plant / metabolism
  • Molecular Sequence Annotation
  • Transcriptome* / genetics

Grants and funding

This work was supported by the Hainan Provincial Natural Science Foundation of China (No.319MS017), the National Natural Science Foundation of China (No.31660229), the Hainan University Scientific Research Startup Fund Project (No.kyqd1620), and the National innovation and entrepreneurship training program for college students of Hainan University. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript