Functional marker detection and analysis on a comprehensive transcriptome of large yellow croaker by next generation sequencing

PLoS One. 2015 Apr 24;10(4):e0124432. doi: 10.1371/journal.pone.0124432. eCollection 2015.

Abstract

Large yellow croaker (Larimichthys crocea) is an important economic fish in China and Eastern Asia. Because of the exhaustive fishing and overdense aquaculture, the wild population and the mariculture of the species are facing serious challenges on germplasm degeneration and susceptibility to infectious disease agents. However, a comprehensive transcriptome from multi-tissues of the species has not been reported and functional molecular markers have not yet been detected and analyzed. In this work, we applied RNA-seq with the Illumina Hiseq2000 platform for a multi-tissue sample of large yellow croaker and assembled the transcriptome into 88,103 transcripts. Of them, 52,782 transcripts have been successfully annotated by nt/nr, InterPro, GO and KEGG database. Comparing with public fish proteins, we have found that 34,576 protein coding transcripts are shared in large yellow croaker with zebrafish, medaka, pufferfish, and stickleback. For functional markers, we have discovered 1,276 polymorphic SSRs and 261, 000 SNPs. The functional impact analysis of SNPs showed that the majority (~75%) of small variants cause synonymous mutations in proteins, followed by variations in 3' UTR region. The functional enrichment analysis illuminated that transcripts involved in DNA bindings, enzyme activities, and signal pathways prominently exhibit less single-nucleotide variants but genes for the constituent of the muscular tissue, the cytoskeleton, and the immunity system contain more frequent SNP mutations, which may reflect the structural and functional selections of the translated proteins. This is the first work for the high-throughput detection and analysis of functional polymorphic SSR and SNP markers in a comprehensive transcriptome of large yellow croaker. Our study provides valuable transcript sequence and functional marker resources for the quantitative trait locus (QTL) identification and molecular selection of the species in the research community.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Fish Proteins / genetics
  • Fishes / genetics
  • Gene Expression Profiling
  • Gene Ontology
  • Genetic Markers
  • High-Throughput Nucleotide Sequencing
  • INDEL Mutation
  • Molecular Sequence Annotation
  • Open Reading Frames
  • Perciformes / genetics*
  • Polymorphism, Single Nucleotide
  • Sequence Analysis, RNA
  • Species Specificity
  • Transcriptome

Substances

  • Fish Proteins
  • Genetic Markers

Grants and funding

This work was supported by grants from National “863” Project of China (2012AA10A403), National Natural Science Foundation of China (U1205122), Natural Science Foundation of Fujian Province (2014J05040), and the Foundation for Innovation Research Team of Jimer University (2010A02).