The Single-molecule long-read sequencing of Scylla paramamosain

Sci Rep. 2019 Aug 27;9(1):12401. doi: 10.1038/s41598-019-48824-8.

Abstract

Scylla paramamosain is an important aquaculture crab with great economic and nutritional value. To the best of our knowledge, few full-length crab transcriptomes are available. In this study, a library comprising 12 tissues (gill, hepatopancreas, muscle, cerebral ganglion, eyestalk, thoracic ganglia, intestine, heart, testis, ovary, sperm reservoir, and hemocyte) was constructed and sequenced using Pacific Biosciences single-molecule real-time (SMRT) long-read sequencing technology. A total of 284,803 full-length non-chimeric reads were obtained, from which 79,005 high-quality unique transcripts were derived after error correction, sequence clustering, and redundancy removal. Of these, 52,544 transcripts were annotated against protein databases (NCBI non-redundant, Swiss-Prot, KOG, and KEGG). A total of 23,644 long non-coding RNAs (lncRNAs) and 131,561 simple sequence repeats (SSRs) were identified. Isoforms of many genes were also identified. Our study provides a rich set of full-length cDNA sequences for S. paramamosain, which will greatly facilitate research on this species.
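
The abstract does not say how the SSRs were identified. As a rough illustration of what SSR detection involves, the following is a minimal sketch that scans a transcript sequence for perfect tandem repeats with a regular expression. The function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions chosen for illustration, not the tool or parameters used in the study.

```python
import re

# Minimum number of tandem copies required per motif length.
# These thresholds are illustrative assumptions, not the study's settings.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, copies) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # One motif of unit_len bases followed by at least min_rep - 1 more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "ATAT"),
            # so each repeat is reported once, under its shortest motif.
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            copies = len(match.group(0)) // unit_len
            hits.append((match.start(), motif, copies))
    return sorted(hits)


if __name__ == "__main__":
    # Toy sequence: an (AT)7 repeat and a (CAG)5 repeat with short flanks.
    example = "GGC" + "AT" * 7 + "CCT" + "CAG" * 5 + "T"
    for start, motif, copies in find_ssrs(example):
        print(start, motif, copies)
```

Start positions are 0-based. Only perfect repeats are reported; compound or interrupted SSRs, which dedicated tools typically handle, are outside the scope of this sketch.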

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Brachyura / genetics*
  • Databases, Genetic
  • Female
  • Gene Library
  • Male
  • Microsatellite Repeats / genetics
  • Open Reading Frames / genetics*
  • Protein Isoforms / genetics
  • Protein Isoforms / metabolism
  • RNA Splicing
  • RNA, Long Noncoding / genetics
  • RNA, Long Noncoding / metabolism
  • Shellfish Proteins / metabolism
  • Transcriptome

Substances

  • Protein Isoforms
  • RNA, Long Noncoding
  • Shellfish Proteins