Long-read sequencing and de novo genome assembly of marine medaka (Oryzias melastigma)

BMC Genomics. 2020 Sep 16;21(1):640. doi: 10.1186/s12864-020-07042-7.

Abstract

Background: Marine medaka (Oryzias melastigma) is considered as an important ecotoxicological indicator to study the biochemical, physiological and molecular responses of marine organisms towards increasing amount of pollutants in marine and estuarine waters.

Results: In this study, we reported a high-quality and accurate de novo genome assembly of marine medaka through the integration of single-molecule sequencing, Illumina paired-end sequencing, and 10X Genomics linked-reads. The 844.17 Mb assembly is estimated to cover more than 98% of the genome and is more continuous with fewer gaps and errors than the previous genome assembly. Comparison of O. melastigma with closely related species showed significant expansion of gene families associated with DNA repair and ATP-binding cassette (ABC) transporter pathways. We identified 274 genes that appear to be under significant positive selection and are involved in DNA repair, cellular transportation processes, conservation and stability of the genome. The positive selection of genes and the considerable expansion in gene numbers, especially related to stimulus responses provide strong supports for adaptations of O. melastigma under varying environmental stresses.

Conclusions: The highly contiguous marine medaka genome and comparative genomic analyses will increase our understanding of the underlying mechanisms related to its extraordinary adaptation capability, leading towards acceleration in the ongoing and future investigations in marine ecotoxicology.

Keywords: Marine ecotoxicology; Pacific biosciences SMRT sequencing; Transposable elements; de novo genome assembly.

MeSH terms

  • Animals
  • Contig Mapping
  • Evolution, Molecular
  • Genome*
  • Multigene Family
  • Oryzias / genetics*
  • Selection, Genetic
  • Sequence Alignment
  • Whole Genome Sequencing