Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis

Erik Hartman; Aaron M Scott; Christofer Karlsson; Tirthankar Mohanty; Suvi T Vaara; Adam Linder; Lars Malmström; Johan Malmström

doi:10.1038/s41467-023-41146-4

Nat Commun. 2023 Sep 2;14(1):5359. doi: 10.1038/s41467-023-41146-4.

Authors

Erik Hartman^#¹, Aaron M Scott^#², Christofer Karlsson², Tirthankar Mohanty², Suvi T Vaara³, Adam Linder², Lars Malmström², Johan Malmström⁴

Affiliations

¹ Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden. erik.hartman@hotmail.com.
² Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden.
³ Department of Perioperative and Intensive Care, University of Helsinki and Helsinki University Hospital, Helsinki, Finland.
⁴ Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden. johan.malmstrom@med.lu.se.

^# Contributed equally.

Abstract

The incorporation of machine learning methods into proteomics workflows improves the identification of disease-relevant biomarkers and biological pathways. However, machine learning models, such as deep neural networks, typically suffer from a lack of interpretability. Here, we present a deep learning approach that combines biological pathway analysis and biomarker identification to increase the interpretability of proteomics experiments. Our approach integrates a priori knowledge of the relationships between proteins, biological pathways, and biological processes into sparse neural networks to create biologically informed neural networks. We employ these networks to differentiate between clinical subphenotypes of septic acute kidney injury and COVID-19, as well as acute respiratory distress syndrome of different aetiologies. To gain biological insight into these complex syndromes, we use feature attribution methods to introspect the networks and identify the proteins and pathways important for distinguishing between subtypes. The algorithms are implemented in a freely available open-source Python package (https://github.com/InfectionMedicineProteomics/BINN).
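
The idea of encoding protein-pathway relationships as sparsity in a network layer can be illustrated with a minimal sketch. This is not the API of the BINN package; the protein and pathway names, the annotation table, and the helper functions `build_mask` and `masked_forward` are hypothetical and chosen only for illustration. The sketch assumes that a connection between an input protein and a pathway node exists only where the annotation says so, which is implemented here by multiplying the weights with a 0/1 mask.

```python
import math
import random

# Hypothetical protein-to-pathway annotations (illustrative, not from the paper).
PROTEIN_TO_PATHWAYS = {
    "P1": ["pathway_A"],
    "P2": ["pathway_A", "pathway_B"],
    "P3": ["pathway_B"],
}


def build_mask(proteins, pathways, annotations):
    """Return a pathways x proteins matrix; 1.0 marks an annotated link."""
    return [
        [1.0 if pathway in annotations.get(protein, []) else 0.0
         for protein in proteins]
        for pathway in pathways
    ]


def masked_forward(x, weights, mask, bias):
    """One sparse layer: each weight is multiplied by its mask entry,
    so unannotated protein-pathway pairs contribute nothing."""
    out = []
    for w_row, m_row, b in zip(weights, mask, bias):
        z = sum(w * m * xi for w, m, xi in zip(w_row, m_row, x)) + b
        out.append(math.tanh(z))
    return out


proteins = ["P1", "P2", "P3"]
pathways = ["pathway_A", "pathway_B"]
mask = build_mask(proteins, pathways, PROTEIN_TO_PATHWAYS)

# Random weights stand in for trained parameters (assumption).
rng = random.Random(0)
weights = [[rng.uniform(-1, 1) for _ in proteins] for _ in pathways]
bias = [0.0 for _ in pathways]

# Made-up, already normalized protein abundances for one sample.
abundances = [0.5, -1.2, 2.0]
activations = masked_forward(abundances, weights, mask, bias)
print(dict(zip(pathways, activations)))
```

Because each hidden node corresponds to a named pathway and only receives input from its annotated proteins, an importance score assigned to that node by a feature attribution method can be read as the importance of the pathway itself. In this sketch, changing the abundance of P3 leaves the activation of pathway_A unchanged, since the two are not linked in the annotation table.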

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Acute Kidney Injury*
Algorithms
COVID-19*
Humans
Neural Networks, Computer
Proteomics