Structural bioinformatics survey on disease-inducing missense mutations

J Bioinform Comput Biol. 2021 Jun;19(3):2150008. doi: 10.1142/S0219720021500086. Epub 2021 Apr 22.

Abstract

Understanding the molecular mechanisms that correlate pathologies with missense mutations is of critical importance for disease risk estimations and for devising personalized therapies. Thus, we have performed a bioinformatic survey of ClinVar, a database of human genomic variations, to find signals that can account for missense mutation pathogenicity. Arginine resulted as the most frequently replaced amino acid both in benign and pathogenic mutations. By adding the structural dimension to this investigation to increase its resolution, we found that arginine mutations occurring at the protein-DNA interface increase pathogenicity 6.5 times with respect to benign variants. Glycine is the second amino acid among all the pathological missense mutations. Necessarily replaced by larger amino acids, glycine substitutions perturb the structural stability of proteins and, therefore, their functions, being mostly located in buried protein moieties. Arginine and glycine appear as representative of missense mutations causing respective changes in interaction processes and protein structural features, the two main molecular mechanisms of genome-induced pathologies.

Keywords: Human mutations; arginine substitutions; benign missense variants; glycine substitutions; pathological missense variants.

MeSH terms

  • Computational Biology*
  • Humans
  • Mutation
  • Mutation, Missense*
  • Proteins

Substances

  • Proteins