Whole genome sequencing of Chinese clearhead icefish, Protosalanx hyalocranius

Gigascience. 2017 Apr 1;6(4):1-6. doi: 10.1093/gigascience/giw012.

Abstract

Chinese clearhead icefish, Protosalanx hyalocranius , is a representative icefish species with economic importance and special appearance. Due to its great economic value in China, the fish was introduced into Lake Dianchi and several other lakes from the Lake Taihu half a century ago. Similar to the Sinocyclocheilus cavefish, the clearhead icefish has certain cavefish-like traits, such as transparent body and nearly scaleless skin. Here, we provide the whole genome sequence of this surface-dwelling fish and generated a draft genome assembly, aiming at exploring molecular mechanisms for the biological interests. A total of 252.1 Gb of raw reads were sequenced. Subsequently, a novel draft genome assembly was generated, with the scaffold N50 reaching 1.163 Mb. The genome completeness was estimated to be 98.39 % by using the CEGMA evaluation. Finally, we annotated 19 884 protein-coding genes and observed that repeat sequences account for 24.43 % of the genome assembly. We report the first draft genome of the Chinese clearhead icefish. The genome assembly will provide a solid foundation for further molecular breeding and germplasm resource protection in Chinese clearhead icefish, as well as other icefishes. It is also a valuable genetic resource for revealing the molecular mechanisms for the cavefish-like characters.

Keywords: Clearhead icefish; Gene prediction; Genome assembly; Protosalanx hyalocranius; Repetitive sequences; Whole genome sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology / methods
  • Genome*
  • Genomics* / methods
  • High-Throughput Nucleotide Sequencing
  • Molecular Sequence Annotation
  • Osmeriformes / classification
  • Osmeriformes / genetics*
  • Phenotype
  • Phylogeny
  • Sequence Analysis, DNA