Detecting selected haplotype blocks in evolve and resequence experiments

Mol Ecol Resour. 2021 Jan;21(1):93-109. doi: 10.1111/1755-0998.13244. Epub 2020 Sep 6.

Abstract

Shifting from the analysis of single nucleotide polymorphisms to the reconstruction of selected haplotypes greatly facilitates the interpretation of evolve and resequence (E&R) experiments. Merging highly correlated hitchhiker SNPs into haplotype blocks reduces thousands of candidates to few selected regions. Current methods of haplotype reconstruction from Pool-seq data need a variety of data-specific parameters that are typically defined ad hoc and require haplotype sequences for validation. Here, we introduce haplovalidate, a tool which detects selected haplotypes in Pool-seq time series data without the need for sequenced haplotypes. Haplovalidate makes data-driven choices of two key parameters for the clustering procedure, the minimum correlation between SNPs constituting a cluster and the window size. Applying haplovalidate to simulated E&R data reliably detects selected haplotype blocks with low false discovery rates. Importantly, our analyses identified a restriction of the haplotype block-based approach to describe the genomic architecture of adaptation. We detected a substantial fraction of haplotypes containing multiple selection targets. These blocks were considered as one region of selection and therefore led to underestimation of the number of selection targets. We demonstrate that the separate analysis of earlier time points can significantly increase the separation of selection targets into individual haplotype blocks. We conclude that the analysis of selected haplotype blocks has great potential for the characterization of the adaptive architecture with E&R experiments.

Keywords: data-driven parameter choices; evolve and resequence; experimental evolution; haplotype reconstruction; replicated time series data; selection; sequencing of pooled individuals.

MeSH terms

  • Genome
  • Genomics*
  • Haplotypes*
  • Linkage Disequilibrium
  • Models, Genetic*
  • Polymorphism, Single Nucleotide*