Identification of Potential Biomarkers for Group I Pulmonary Hypertension Based on Machine Learning and Bioinformatics Analysis

Int J Mol Sci. 2023 Apr 28;24(9):8050. doi: 10.3390/ijms24098050.

Abstract

A number of processes and pathways have been reported in the development of Group I pulmonary hypertension (Group I PAH); however, novel biomarkers need to be identified for a better diagnosis and management. We employed a robust rank aggregation (RRA) algorithm to shortlist the key differentially expressed genes (DEGs) between Group I PAH patients and controls. An optimal diagnostic model was obtained by comparing seven machine learning algorithms and was verified in an independent dataset. The functional roles of key DEGs and biomarkers were analyzed using various in silico methods. Finally, the biomarkers and a set of key candidates were experimentally validated using patient samples and a cell line model. A total of 48 key DEGs with preferable diagnostic value were identified. A gradient boosting decision tree algorithm was utilized to build a diagnostic model with three biomarkers, PBRM1, CA1, and TXLNG. An immune-cell infiltration analysis revealed significant differences in the relative abundances of seven immune cells between controls and PAH patients and a correlation with the biomarkers. Experimental validation confirmed the upregulation of the three biomarkers in Group I PAH patients. In conclusion, machine learning and a bioinformatics analysis along with experimental techniques identified PBRM1, CA1, and TXLNG as potential biomarkers for Group I PAH.

Keywords: Group I pulmonary hypertension; biomarker; ferroptosis; immune infiltration; machine learning; pathway enrichment analyses; protein–protein interaction.

MeSH terms

  • Algorithms
  • Biomarkers
  • Computational Biology
  • Humans
  • Hypertension, Pulmonary* / diagnosis
  • Hypertension, Pulmonary* / genetics
  • Machine Learning

Substances

  • Biomarkers