Genome scans for selection and introgression based on k-nearest neighbour techniques

Mol Ecol Resour. 2020 Nov;20(6):1597-1609. doi: 10.1111/1755-0998.13221. Epub 2020 Jul 20.

Abstract

In recent years, genome-scan methods have been extensively used to detect local signatures of selection and introgression. Most of these methods are either designed for one or the other case, which may impair the study of combined cases. Here, we introduce a series of versatile genome-scan methods applicable for both cases, the detection of selection and introgression. The proposed approaches are based on nonparametric k-nearest neighbour (kNN) techniques, while incorporating pairwise Fixation Index (FST ) and pairwise nucleotide differences (dxy ) as features. We benchmark our methods using a wide range of simulation scenarios, with varying parameters, such as recombination rates, population background histories, selection strengths, the proportion of introgression and the time of gene flow. We find that kNN-based methods perform remarkably well compared with the state-of-the-art. Finally, we demonstrate how to perform kNN-based genome scans on real-world genomic data using the population genomics R-package popgenome.

Keywords: adaptation; genome scans; introgression; k-nearest neighbours.

MeSH terms

  • Computer Simulation*
  • Gene Flow
  • Genetics, Population
  • Genome*
  • Genomics*
  • Metagenomics
  • Models, Genetic*
  • Polymorphism, Single Nucleotide
  • Selection, Genetic