Mutational survivorship bias: The case of PNKP

PLoS One. 2020 Dec 17;15(12):e0237682. doi: 10.1371/journal.pone.0237682. eCollection 2020.

Abstract

The molecular function of a protein relies on its structure. Understanding how variants alter structure and function in multidomain proteins is key to elucidate the generation of a pathological phenotype. However, one may fall into the logical bias of assessing protein damage only based on the variants that are visible (survivorship bias), which can lead to partial conclusions. This is the case of PNKP, an important nuclear and mitochondrial DNA repair enzyme with both kinase and phosphatase function. Most variants in PNKP are confined to the kinase domain, leading to a pathological spectrum of three apparently distinct clinical entities. Since proteins and domains may have a different tolerability to variation, we evaluated whether variants in PNKP are under survivorship bias. Here, we provide the evidence that supports a higher tolerance in the kinase domain even when all variants reported are deleterious. Instead, the phosphatase domain is less tolerant due to its lower variant rates, a higher degree of sequence conservation, lower dN/dS ratios, and the presence of more disease-propensity hotspots. Together, our results support previous experimental evidence that demonstrated that the phosphatase domain is functionally more necessary and relevant for DNA repair, especially in the context of the development of the central nervous system. Finally, we propose the term "Wald's domain" for future studies analyzing the possible survivorship bias in multidomain proteins.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Cell Nucleus / genetics
  • DNA Damage / genetics
  • DNA Repair / genetics
  • DNA Repair Enzymes / genetics*
  • DNA-Binding Proteins / genetics
  • Humans
  • Mitochondria / genetics
  • Mutation / genetics*
  • Phosphoric Monoester Hydrolases / genetics
  • Phosphotransferases (Alcohol Group Acceptor) / genetics*
  • Survivorship

Substances

  • DNA-Binding Proteins
  • PNKP protein, human
  • Phosphotransferases (Alcohol Group Acceptor)
  • Phosphoric Monoester Hydrolases
  • DNA Repair Enzymes

Associated data

  • figshare/10.6084/m9.figshare.13138817.v1

Grants and funding

Our project was founded by the University of Costa Rica, Vice-rectory of Research (Project 111-B8–372). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Alejandro Leal and Andrés Arguedas received a salary from University of Costa Rica.