The landscape of tolerated genetic variation in humans and primates

Science. 2023 Jun 2;380(6648):eabn8153. doi: 10.1126/science.abn8197. Epub 2023 Jun 2.

Abstract

Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole-genome sequencing data for 809 individuals from 233 primate species and identified 4.3 million common protein-altering variants with orthologs in humans. We show that these variants can be inferred to have nondeleterious effects in humans based on their presence at high allele frequencies in other primate populations. We use this resource to classify 6% of all possible human protein-altering variants as likely benign and impute the pathogenicity of the remaining 94% of variants with deep learning, achieving state-of-the-art accuracy for diagnosing pathogenic variants in patients with genetic diseases.

Publication types

  • Dataset

MeSH terms

  • Animals
  • Base Sequence
  • Gene Frequency
  • Genetic Variation*
  • Humans
  • Primates* / genetics
  • Whole Genome Sequencing