Properties of human genes guided by their enrichment in rare and common variants

Hum Mutat. 2018 Mar;39(3):365-370. doi: 10.1002/humu.23377. Epub 2017 Dec 21.

Abstract

We analyzed 563,099 common (minor allele frequency, MAF≥0.01) and rare (MAF < 0.01) genetic variants annotated in ExAC and UniProt and 26,884 disease-causing variants from ClinVar and UniProt occurring in the coding region of 17,975 human protein-coding genes. Three novel sets of genes were identified: those enriched in rare variants (n = 32 genes), in common variants (n = 282 genes), and in disease-causing variants (n = 800 genes). Genes enriched in rare variants have far greater similarities in terms of biological and network properties to genes enriched in disease-causing variants, than to genes enriched in common variants. However, in half of the genes enriched in rare variants (AOC2, MAMDC4, ANKHD1, CDC42BPB, SPAG5, TRRAP, TANC2, IQCH, USP54, SRRM2, DOPEY2, and PITPNM1), no disease-causing variants have been identified in major, publicly available databases. Thus, genetic variants in these genes are strong candidates for disease and their identification, as part of sequencing studies, should prompt further in vitro analyses.

Keywords: genetic variants; human disease; protein coding genes; protein network.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Disease / genetics
  • Genes*
  • Genes, Essential
  • Genetic Variation*
  • Humans
  • Mutation / genetics