The Utility of Genome Skimming for Phylogenomic Analyses as Demonstrated for Glycerid Relationships (Annelida, Glyceridae)

Genome Biol Evol. 2015 Nov 19;7(12):3443-62. doi: 10.1093/gbe/evv224.

Abstract

Glyceridae (Annelida) are a group of venomous annelids distributed worldwide from intertidal to abyssal depths. To trace the evolutionary history and complexity of glycerid venom cocktails, a solid backbone phylogeny of this group is essential. We therefore aimed to reconstruct the phylogenetic relationships of these annelids using Illumina sequencing technology. We constructed whole-genome shotgun libraries for 19 glycerid specimens and 1 outgroup species (Glycinde armigera). The chosen target genes comprise 13 mitochondrial proteins, 2 ribosomal mitochondrial genes, and 4 nuclear loci (18SrRNA, 28SrRNA, ITS1, and ITS2). Based on partitioned maximum likelihood as well as Bayesian analyses of the resulting supermatrix, we were finally able to resolve a robust glycerid phylogeny and identified three clades comprising the majority of taxa. Furthermore, we detected group II introns inside the cox1 gene of two analyzed glycerid specimens, with two different insertions in one of these species. Moreover, we generated reduced data sets comprising 10 million, 4 million, and 1 million reads from the original data sets to test the influence of the sequencing depth on assembling complete mitochondrial genomes from low coverage genome data. We estimated the coverage of mitochondrial genome sequences in each data set size by mapping the filtered Illumina reads against the respective mitochondrial contigs. By comparing the contig coverage calculated in all data set sizes, we got a hint for the scalability of our genome skimming approach. This allows estimating more precisely the number of reads that are at least necessary to reconstruct complete mitochondrial genomes in Glyceridae and probably non-model organisms in general.

Keywords: Glyceridae; group II introns; mitogenomics; sequencing coverage; venomous annelids; whole-genome shotgun sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Annelida / classification
  • Annelida / genetics*
  • Evolution, Molecular*
  • Genome, Helminth*
  • Genome, Mitochondrial*
  • Phylogeny*
  • RNA, Ribosomal / genetics

Substances

  • RNA, Ribosomal

Associated data

  • GENBANK/KT989318
  • GENBANK/KT989319
  • GENBANK/KT989320
  • GENBANK/KT989321
  • GENBANK/KT989322
  • GENBANK/KT989323
  • GENBANK/KT989324
  • GENBANK/KT989325
  • GENBANK/KT989326
  • GENBANK/KT989327
  • GENBANK/KT989328
  • GENBANK/KT989329
  • GENBANK/KT989330
  • GENBANK/KT989331
  • GENBANK/KT989332
  • GENBANK/KT989333
  • GENBANK/KT989334
  • GENBANK/KT989335
  • GENBANK/KT989336
  • GENBANK/KT989337
  • GENBANK/KT989338
  • GENBANK/KT989339
  • GENBANK/KT989340
  • GENBANK/KT989341
  • GENBANK/KT989342
  • GENBANK/KT989343
  • GENBANK/KT989344
  • GENBANK/KT989345
  • GENBANK/KT989346
  • GENBANK/KT989347
  • GENBANK/KT989348
  • GENBANK/KT989349
  • GENBANK/KT989350
  • GENBANK/KT989351
  • SRA/SRX1410234
  • SRA/SRX1410454
  • SRA/SRX1410455
  • SRA/SRX1410466
  • SRA/SRX1410480
  • SRA/SRX1410576
  • SRA/SRX1410590
  • SRA/SRX1410591
  • SRA/SRX1410629
  • SRA/SRX1410631
  • SRA/SRX1410633
  • SRA/SRX1410635
  • SRA/SRX1410637
  • SRA/SRX1410642
  • SRA/SRX1410643
  • SRA/SRX1410679
  • SRA/SRX1410680
  • SRA/SRX1410687
  • SRA/SRX1410770
  • SRA/SRX1410771