Whole-genome comparison of endogenous retrovirus segregation across wild and domestic host species populations

Proc Natl Acad Sci U S A. 2018 Oct 23;115(43):11012-11017. doi: 10.1073/pnas.1815056115. Epub 2018 Oct 8.

Abstract

Although recent advances in sequencing and computational analyses have facilitated use of endogenous retroviruses (ERVs) for deciphering coevolution among retroviruses and their hosts, sampling effects from different host populations present major challenges. Here we utilize available whole-genome data from wild and domesticated European rabbit (Oryctolagus cuniculus sp.) populations, sequenced as DNA pools by paired-end Illumina technology, for identifying segregating reference as well as nonreference ERV loci, to reveal their variation along the host phylogeny and domestication history. To produce new viruses, retroviruses must insert a proviral DNA copy into the host nuclear DNA. Occasional proviral insertions into the host germline have been passed down through generations as inherited ERVs during millions of years. These ERVs represent retroviruses that were active at the time of infection and thus present a remarkable record of historical virus-host associations. To examine segregating ERVs in host populations, we apply a reference library search strategy for anchoring ERV-associated short-sequence read pairs from pooled whole-genome sequences to reference genome assembly positions. We show that most ERVs segregate along host phylogeny but also uncover radiation of some ERVs, identified as segregating loci among wild and domestic rabbits. The study targets pertinent issues regarding genome sampling when examining virus-host evolution from the genomic ERV record and offers improved scope regarding common strategies for single-nucleotide variant analyses in host population comparative genomics.

Keywords: comparative genomics; endogenous retrovirus; evolution; host population; segregation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Animals, Domestic / virology*
  • Comparative Genomic Hybridization / methods
  • DNA / genetics
  • Endogenous Retroviruses / genetics*
  • Genome, Viral / genetics*
  • Genome-Wide Association Study / methods
  • Genomics / methods
  • Host Specificity / genetics*
  • Phylogeny
  • Polymorphism, Single Nucleotide / genetics
  • Rabbits

Substances

  • DNA