Biomarkers prediction and immune landscape in ulcerative colitis: Findings based on bioinformatics and machine learning

Comput Biol Med. 2024 Jan:168:107778. doi: 10.1016/j.compbiomed.2023.107778. Epub 2023 Dec 2.

Abstract

Background: Ulcerative colitis (UC) presents diagnostic and therapeutic difficulties. The primary objective of this study is to identify efficacious biomarkers for diagnosis and treatment, as well as acquire a deeper understanding of the immuneological characteristics associated with the disease.

Methods: Datasets relating to UC were obtained from GEO database. Among these, three datasets were merged to create a metadata for bioinformatics analysis and machine learning. Additionally, one dataset specifically utilized for external validation. Least absolute shrinkage and selection operator (LASSO) and random forest (RF) were employed to screen signature genes. The artificial neural network (ANN) model and receiver operating characteristic (ROC) curve were used to assess the diagnostic performance of signature genes. The single sample gene set enrichment analysis (ssGSEA) was applied to reveal the immune landscape. Finally, the relationship between the signature genes, immune infiltration, and clinical characteristics was investigated through correlation analysis.

Result: By intersecting the result of LASSO, RF and WGCNA, 8 signature genes were identified, including S100A8, IL-1B, CXCL1, TCN1, MMP10, GREM1, DUOX2 and SLC6A14. The biological progress of this gene mostly encompasses acute inflammatory response, aggregation and chemotaxis of leukocyte, and response to lipopolysaccharide by mediating IL-17 signaling pathway, NF-kappa B signaling pathway, TNF signaling pathway, NOD-like receptor signaling pathway. Immune infiltration analysis shows 25 immune cells are significantly elevated in UC samples. Moreover, these signature genes exhibit a strong correlation with various immune cells and a mild to moderate correlation with the Mayo score.

Conclusion: S100A8, IL-1B, CXCL1, TCN1, MMP10, GREM1, DUOX2 and SLC6A14 have been identified as credible potential biomarkers for the diagnosis and therapy of UC. The immune response mediated by these signature biomarkers plays a crucial role in the occurrence and advancement of UC by means of the reciprocal interaction between the signature biomarkers and immune-infiltrated cells.

Keywords: Bioinformatics; Biomarkers; Deep learning model; Immune infiltration; Machine learning; Ulcerative colitis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers
  • Colitis, Ulcerative* / genetics
  • Computational Biology
  • Dual Oxidases
  • Humans
  • Machine Learning
  • Matrix Metalloproteinase 10

Substances

  • Dual Oxidases
  • Matrix Metalloproteinase 10
  • Biomarkers