De novo assembly and annotation of the whole transcriptome of Muraenesox cinereus

Mar Genomics. 2022 Feb:61:100910. doi: 10.1016/j.margen.2021.100910. Epub 2021 Oct 18.

Abstract

In this study, we generated the transcriptome of Muraenesox cinereus from four combined tissues (muscle, sexual, liver and heart) using high-seq sequencing technology. De novo assembly was performed using Trinity software and a total of 62,125,296 high-quality clean reads were assembled and clustered into 75,862 unigene with an N50 of 2034 nt. After annotation, 43,157 unigenes had significant hit in Nr database. And then, 24,510 unigenes were annotated into three GO categories: biological processes, cellular components, and molecular functions. Moreover, 33,032 unigenes were mapped 25 different clusters of eukaryotic proteins. Furthermore, we predicted the structure of all unigenes using HMMER and MISA software, respectively. The result showed that a total of 33,183 nucleotide sequences of coding regions (direction of the sequences is 5'- > 3') were confirmed to the protein database and a total of 29,487 simple sequence repeats (SSR) were identified. The whole transcriptome is an important foundation for future genomic research on the M. cinereus, it could provide comprehensively understanding and further characterizations of transcriptomes of non-model organisms.

Keywords: Annotation; De novo assembly; High-Seq sequencing technology; Muraenesox cinereus; Transcriptome.

MeSH terms

  • Genomics
  • Microsatellite Repeats*
  • Molecular Sequence Annotation
  • Open Reading Frames
  • Transcriptome*