Genome-wide prediction of pathogenic gain- and loss-of-function variants from ensemble learning of a diverse feature set

Genome Med. 2023 Nov 30;15(1):103. doi: 10.1186/s13073-023-01261-9.

Abstract

Gain-of-function (GOF) variants give rise to increased/novel protein functions whereas loss-of-function (LOF) variants lead to diminished protein function. Experimental approaches for identifying GOF and LOF are generally slow and costly, whilst available computational methods have not been optimized to discriminate between GOF and LOF variants. We have developed LoGoFunc, a machine learning method for predicting pathogenic GOF, pathogenic LOF, and neutral genetic variants, trained on a broad range of gene-, protein-, and variant-level features describing diverse biological characteristics. LoGoFunc outperforms other tools trained solely to predict pathogenicity for identifying pathogenic GOF and LOF variants and is available at https://itanlab.shinyapps.io/goflof/ .

Keywords: Gain-of-function; Genomic medicine; Loss-of-function; Machine learning; Natural language processing; Pathogenicity prediction; Phenome-wide association studies; Precision medicine; Protein function; Variant functional impact.

MeSH terms

  • Genome*
  • Humans
  • Machine Learning
  • Proteins*

Substances

  • Proteins