Whole-exome sequencing provides insights into monogenic disease prevalence in Northwest Russia

Mol Genet Genomic Med. 2019 Nov;7(11):e964. doi: 10.1002/mgg3.964. Epub 2019 Sep 3.

Abstract

Background: Allele frequency data from large exome and genome aggregation projects such as the Genome Aggregation Database (gnomAD) are of ultimate importance to the interpretation of medical resequencing data. However, allele frequencies might significantly differ in poorly studied populations that are underrepresented in large-scale projects, such as the Russian population.

Methods: In this work, we leveraged our access to a large dataset of 694 exome samples to analyze genetic variation in the Northwest Russia. We compared the spectrum of genetic variants to the dbSNP build 151, and made estimates of ClinVar-based autosomal recessive (AR) disease allele prevalence as compared to gnomAD r. 2.1.

Results: An estimated 9.3% of discovered variants were not present in dbSNP. We report statistically significant overrepresentation of pathogenic variants for several Mendelian disorders, including phenylketonuria (PAH, rs5030858), Wilson's disease (ATP7B, rs76151636), factor VII deficiency (F7, rs36209567), kyphoscoliosis type of Ehlers-Danlos syndrome (FKBP14, rs542489955), and several other recessive pathologies. We also make primary estimates of monogenic disease incidence in the population, with retinal dystrophy, cystic fibrosis, and phenylketonuria being the most frequent AR pathologies.

Conclusion: Our observations demonstrate the utility of population-specific allele frequency data to the diagnosis of monogenic disorders using high-throughput technologies.

Keywords: Mendelian disease; Russia; allele frequency; whole-exome sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers / analysis*
  • DNA Mutational Analysis
  • Exome Sequencing / methods*
  • Genetic Diseases, Inborn / epidemiology*
  • Genetic Diseases, Inborn / genetics*
  • Genetic Testing / methods*
  • Genetic Variation*
  • Hepatolenticular Degeneration / epidemiology
  • Hepatolenticular Degeneration / genetics
  • Humans
  • Prevalence
  • Prognosis
  • Russia / epidemiology

Substances

  • Biomarkers