De novo transcriptomes of 14 gammarid individuals for proteogenomic analysis of seven taxonomic groups

Sci Data. 2019 Sep 27;6(1):184. doi: 10.1038/s41597-019-0192-5.

Abstract

Gammarids are amphipods distributed worldwide in fresh and marine waters. They play an important role in aquatic ecosystems and are well-established sentinel species in ecotoxicology. In this study, we sequenced the transcriptomes of one male and one female individual from each of seven taxonomic groups belonging to the two genera Gammarus and Echinogammarus: Gammarus fossarum A, G. fossarum B, G. fossarum C, Gammarus wautieri, Gammarus pulex, Echinogammarus berilloni, and Echinogammarus marinus. These taxa were chosen to explore the molecular diversity of transcribed genes in genotyped individuals from these groups. Transcriptomes were assembled de novo and annotated. High-quality assembly was confirmed by BUSCO comparison against the Arthropod dataset. The 14 RNA-Seq-derived protein sequence databases proposed here will be a significant resource for proteogenomics studies of these ecotoxicologically relevant non-model organisms. These transcriptomes represent reliable reference sequences for whole-transcriptome and proteome studies on other gammarids, for primer design to clone specific genes or monitor their expression, and for analyses of molecular differences between gammarid species.

Publication types

  • Dataset
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amphipoda / genetics*
  • Animals
  • Databases, Protein
  • Female
  • France
  • Fresh Water
  • Male
  • Proteogenomics*
  • RNA-Seq
  • Transcriptome*