Inductive database to support iterative data mining: Application to biomarker analysis on patient data in the Fight-HF project

J Biomed Inform. 2022 Nov:135:104212. doi: 10.1016/j.jbi.2022.104212. Epub 2022 Sep 28.

Abstract

Machine learning is now an essential part of any biomedical study but its integration into real effective Learning Health Systems, including the whole process of Knowledge Discovery from Data (KDD), is not yet realised. We propose an original extension of the KDD process model that involves an inductive database. We designed for the first time a generic model of Inductive Clinical DataBase (ICDB) aimed at hosting both patient data and learned models. We report experiments conducted on patient data in the frame of a project dedicated to fight heart failure. The results show how the ICDB approach allows to identify biomarker combinations, specific and predictive of heart fibrosis phenotype, that put forward hypotheses relative to underlying mechanisms. Two main scenarios were considered, a local-to-global KDD scenario and a trans-cohort alignment scenario. This promising proof of concept enables us to draw the contours of a next-generation Knowledge Discovery Environment (KDE).

Keywords: Biomarkers; Data mining; Heart Failure; Inductive database; Knowledge Discovery from Data (KDD).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining*
  • Databases, Factual
  • Knowledge Discovery*