Deciphering signatures of natural selection via deep learning

Brief Bioinform. 2022 Sep 20;23(5):bbac354. doi: 10.1093/bib/bbac354.

Abstract

Identifying genomic regions influenced by natural selection provides fundamental insights into the genetic basis of local adaptation. However, it remains challenging to detect loci under complex spatially varying selection. We propose a deep learning-based framework, DeepGenomeScan, that detects signatures of spatially varying selection. In identifying loci underlying quantitative traits subject to complex spatial patterns of selection, DeepGenomeScan outperformed genome scans based on principal component analysis and redundancy analysis. Notably, DeepGenomeScan increased statistical power by up to 47.25% under nonlinear environmental selection patterns. Applied to a European human genetic dataset, DeepGenomeScan identified some well-known genes under selection and a substantial number of clinically important genes that SPA, iHS, Fst and Bayenv did not identify on the same dataset.
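
For orientation, the kind of principal component analysis-based genome scan named above as a comparison method can be sketched roughly as follows. This is only a toy illustration under stated assumptions: it is neither the DeepGenomeScan method nor the baseline implementation evaluated in the paper. The function name `pca_genome_scan`, the choice of statistic (sum of squared SNP-to-PC correlations) and the simulated genotypes are all hypothetical.

```python
import numpy as np


def pca_genome_scan(genotypes, n_components=2):
    """Toy PCA-based scan (hypothetical helper, not the paper's code).

    genotypes: array of shape (individuals, SNPs) with 0/1/2 allele counts.
    Returns one score per SNP: the sum of squared correlations between
    that SNP and the top principal components. Larger scores mark SNPs
    that follow the main axes of genetic structure unusually closely.
    """
    g = np.asarray(genotypes, dtype=float)
    # Centre and scale each SNP column; guard against monomorphic SNPs.
    centred = g - g.mean(axis=0)
    sd = centred.std(axis=0)
    sd[sd == 0] = 1.0
    z = centred / sd
    # Left singular vectors are the (unit-norm) PC scores of individuals.
    u, _, _ = np.linalg.svd(z, full_matrices=False)
    scores = u[:, :n_components]
    # Each standardized SNP column has norm sqrt(n), so this is a correlation.
    corr = z.T @ scores / np.sqrt(g.shape[0])
    return (corr ** 2).sum(axis=1)


# Assumed toy input: 100 individuals, 50 SNPs drawn at frequency 0.5.
rng = np.random.default_rng(1)
toy_genotypes = rng.binomial(2, 0.5, size=(100, 50))
toy_scores = pca_genome_scan(toy_genotypes, n_components=2)
candidate_snps = np.argsort(toy_scores)[::-1][:5]  # five highest-scoring SNPs
```

In practice the per-SNP scores would be converted to p-values and corrected for multiple testing before any locus is reported. A scan of this type captures only linear structure, which is the limitation that the abstract's comparison under nonlinear selection patterns speaks to.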

Keywords: deep learning; genome scan; genome-wide association studies; signatures of natural selection.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Deep Learning*
  • Genome
  • Genomics
  • Humans
  • Polymorphism, Single Nucleotide
  • Selection, Genetic

Associated data

  • Dryad/10.5061/dryad.1s7v5