Full-length transcript sequencing accelerates the transcriptome research of Gymnocypris namensis, an iconic fish of the Tibetan Plateau

Sci Rep. 2020 Jun 15;10(1):9668. doi: 10.1038/s41598-020-66582-w.

Abstract

Gymnocypris namensis, the only commercial fish in Namtso Lake of Tibet in China, is rated as nearly threatened species in the Red List of China's Vertebrates. As one of the highest-altitude schizothorax fish in China, G. namensis has strong adaptability to the plateau harsh environment. Although being an indigenous economic fish with high value in research, the biological characterization, genetic diversity, and plateau adaptability of G. namensis are still unclear. Here, we used Pacific Biosciences single molecular real time long read sequencing technology to generate full-length transcripts of G. namensis. Sequences clustering analysis and error correction with Illumina-produced short reads to obtain 319,044 polished isoforms. After removing redundant reads, 125,396 non-redundant isoforms were obtained. Among all transcripts, 103,286 were annotated to public databases. Natural selection has acted on 42 genes for G. namensis, which were enriched on the functions of mismatch repair and Glutathione metabolism. Total 89,736 open reading frames, 95,947 microsatellites, and 21,360 long non-coding RNAs were identified across all transcripts. This is the first study of transcriptome in G. namensis by using PacBio Iso-seq. The acquisition of full-length transcript isoforms might accelerate the transcriptome research of G. namensis and provide basis for further research.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Conservation of Natural Resources
  • Cyprinidae / genetics*
  • Fish Proteins / genetics*
  • Gene Expression Profiling / veterinary*
  • Gene Expression Regulation
  • Microsatellite Repeats
  • Molecular Sequence Annotation
  • Open Reading Frames
  • RNA, Long Noncoding / genetics
  • Selection, Genetic
  • Sequence Analysis, RNA / veterinary
  • Single Molecule Imaging / veterinary*
  • Tibet

Substances

  • Fish Proteins
  • RNA, Long Noncoding