Prediction of hemophilia A severity using a small-input machine-learning framework

NPJ Syst Biol Appl. 2021 May 25;7(1):22. doi: 10.1038/s41540-021-00183-9.

Abstract

Hemophilia A is a relatively rare hereditary coagulation disorder caused by a defective F8 gene resulting in a dysfunctional Factor VIII protein (FVIII). This condition impairs the coagulation cascade, and if left untreated, it causes permanent joint damage and poses a risk of fatal intracranial hemorrhage in case of traumatic events. To develop prophylactic therapies with longer half-lives and that do not trigger the development of inhibitory antibodies, it is essential to have a deep understanding of the structure of the FVIII protein. In this study, we explored alternative ways of representing the FVIII protein structure and designed a machine-learning framework to improve the understanding of the relationship between the protein structure and the disease severity. We verified a close agreement between in silico, in vitro and clinical data. Finally, we predicted the severity of all possible mutations in the FVIII structure - including those not yet reported in the medical literature. We identified several hotspots in the FVIII structure where mutations are likely to induce detrimental effects to its activity. The combination of protein structure analysis and machine learning is a powerful approach to predict and understand the effects of mutations on the disease outcome.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Hemophilia A* / diagnosis
  • Hemophilia A* / genetics
  • Humans
  • Machine Learning
  • Mutation