StellarPath: Hierarchical-vertical multi-omics classifier synergizes stable markers and interpretable similarity networks for patient profiling

PLoS Comput Biol. 2024 Apr 12;20(4):e1012022. doi: 10.1371/journal.pcbi.1012022. eCollection 2024 Apr.

Abstract

The Patient Similarity Network paradigm implies modeling the similarity between patients based on specific data. The similarity can summarize patients' relationships from high-dimensional data, such as biological omics. The end PSN can undergo un/supervised learning tasks while being strongly interpretable, tailored for precision medicine, and ready to be analyzed with graph-theory methods. However, these benefits are not guaranteed and depend on the granularity of the summarized data, the clarity of the similarity measure, the complexity of the network's topology, and the implemented methods for analysis. To date, no patient classifier fully leverages the paradigm's inherent benefits. PSNs remain complex, unexploited, and meaningless. We present StellarPath, a hierarchical-vertical patient classifier that leverages pathway analysis and patient similarity concepts to find meaningful features for both classes and individuals. StellarPath processes omics data, hierarchically integrates them into pathways, and uses a novel similarity to measure how patients' pathway activity is alike. It selects biologically relevant molecules, pathways, and networks, considering molecule stability and topology. A graph convolutional neural network then predicts unknown patients based on known cases. StellarPath excels in classification performances and computational resources across sixteen datasets. It demonstrates proficiency in inferring the class of new patients described in external independent studies, following its initial training and testing phases on a local dataset. It advances the PSN paradigm and provides new markers, insights, and tools for in-depth patient profiling.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Biomarkers / metabolism
  • Computational Biology* / methods
  • Gene Expression Profiling / methods
  • Genomics / methods
  • Humans
  • Multiomics
  • Neural Networks, Computer
  • Precision Medicine* / methods
  • Proteomics / methods

Substances

  • Biomarkers

Grants and funding

LG salary has been funded by Business Finland [6478/31/2019] and the Academy of Finland [339763]. TM was supported by Business Finland [6478/31/2019] and the Academy of Finland [339763]. AM received salary from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie agreement [101034307] (NEURO-INNOVATION: Research and innovation for brain health throughout life). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.