Inferring the Clonal Structure of Viral Populations from Time Series Sequencing

PLoS Comput Biol. 2015 Nov 16;11(11):e1004344. doi: 10.1371/journal.pcbi.1004344. eCollection 2015 Nov.

Abstract

RNA virus populations will undergo processes of mutation and selection resulting in a mixed population of viral particles. High throughput sequencing of a viral population subsequently contains a mixed signal of the underlying clones. We would like to identify the underlying evolutionary structures. We utilize two sources of information to attempt this; within segment linkage information, and mutation prevalence. We demonstrate that clone haplotypes, their prevalence, and maximum parsimony reticulate evolutionary structures can be identified, although the solutions may not be unique, even for complete sets of information. This is applied to a chain of influenza infection, where we infer evolutionary structures, including reassortment, and demonstrate some of the difficulties of interpretation that arise from deep sequencing due to artifacts such as template switching during PCR amplification.

MeSH terms

  • Algorithms
  • Computational Biology
  • Evolution, Molecular*
  • High-Throughput Nucleotide Sequencing
  • Models, Genetic
  • Mutation / genetics
  • Phylogeny
  • Polymerase Chain Reaction
  • RNA Viruses / classification*
  • RNA Viruses / genetics*
  • RNA, Viral / genetics*
  • Sequence Analysis, RNA / methods*

Substances

  • RNA, Viral