Transcriptional Profiling and Deriving a Seven-Gene Signature That Discriminates Active and Latent Tuberculosis: An Integrative Bioinformatics Approach

Genes (Basel). 2022 Mar 29;13(4):616. doi: 10.3390/genes13040616.

Abstract

Tuberculosis (TB) is an infectious disease caused by Mycobacterium tuberculosis (M.tb.). Our integrative analysis aims to identify the transcriptional profile and gene expression signature that distinguish individuals with active TB (ATB) disease from those with latent tuberculosis infection (LTBI). In the present study, we reanalyzed a microarray dataset (GSE37250) from the GEO database and performed differential gene expression analysis between individuals with ATB and LTBI from Malawi and South African cohorts. We used BRB-ArrayTools to identify differentially expressed genes (DEGs) between ATB and LTBI. Pathway enrichment analysis of the DEGs was performed using the DAVID bioinformatics tool. The protein-protein interaction (PPI) network of the most upregulated genes was constructed using STRING. We identified 375 upregulated and 152 downregulated genes that were differentially expressed between ATB and LTBI samples in both the Malawi and South African cohorts. The constructed PPI network was significantly enriched, with 76 nodes connected by 151 edges. The enriched GO terms and pathways for these genes were mainly related to expression of IFN-stimulated genes, interleukin-1 production, and the NOD-like receptor signaling pathway. Downregulated genes were significantly enriched in the Wnt signaling, B cell development, and B cell receptor signaling pathways. The short-listed DEGs were validated in microarray data from an independent cohort (GSE19491). ROC curve analysis was performed to assess the diagnostic accuracy of the gene signature in discriminating active from latent tuberculosis. We thus derived a seven-gene signature, comprising five upregulated genes (FCGR1B, ANKRD22, CARD17, IFITM3, and TNFAIP6) and two downregulated genes (FCGBP and KLF12), as a biomarker for discriminating active from latent tuberculosis. The individual genes have a sensitivity of 80-100% and a specificity of 80-95%, and their area under the curve (AUC) values ranged from 0.84 to 1. This seven-gene signature has high diagnostic accuracy in discriminating active from latent tuberculosis.
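
The per-gene ROC evaluation summarized above can be illustrated with a minimal sketch. This is not the authors' pipeline: the abstract does not state how cutoffs were chosen, so the use of Youden's J to pick a threshold, the function names (auc_mann_whitney, best_cutoff, evaluate_gene), and the toy expression values are all assumptions made for illustration only. The AUC is computed as the probability that a randomly chosen ATB sample ranks above a randomly chosen LTBI sample; for a downregulated gene the values are negated so that a higher score always points toward ATB.

```python
from typing import List, Tuple


def auc_mann_whitney(cases: List[float], controls: List[float]) -> float:
    """AUC as the probability that a random case scores above a random control.

    Ties count as half a win, which matches the trapezoidal area under the ROC curve.
    """
    wins = 0.0
    for c in cases:
        for k in controls:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(cases) * len(controls))


def best_cutoff(cases: List[float], controls: List[float]) -> Tuple[float, float, float]:
    """Return (cutoff, sensitivity, specificity) maximizing Youden's J.

    A sample is called positive (ATB) when its value is >= cutoff.
    Youden's J is an assumed criterion, not one stated in the abstract.
    """
    best = (float("nan"), 0.0, 0.0)
    best_j = -1.0
    for cutoff in sorted(set(cases) | set(controls)):
        sens = sum(v >= cutoff for v in cases) / len(cases)
        spec = sum(v < cutoff for v in controls) / len(controls)
        j = sens + spec - 1.0
        if j > best_j:
            best_j = j
            best = (cutoff, sens, spec)
    return best


def evaluate_gene(atb: List[float], ltbi: List[float],
                  upregulated: bool = True) -> Tuple[float, float, float]:
    """Return (AUC, sensitivity, specificity) for one gene.

    For downregulated genes, values are negated so higher always means ATB.
    """
    if not upregulated:
        atb = [-v for v in atb]
        ltbi = [-v for v in ltbi]
    auc = auc_mann_whitney(atb, ltbi)
    _, sens, spec = best_cutoff(atb, ltbi)
    return auc, sens, spec


# Made-up log2 expression values, NOT data from GSE37250 or GSE19491.
toy_up_atb = [9.1, 8.7, 9.5, 8.9, 7.2]
toy_up_ltbi = [7.0, 7.4, 6.8, 7.9, 7.1]
auc, sens, spec = evaluate_gene(toy_up_atb, toy_up_ltbi, upregulated=True)
print(f"AUC={auc:.2f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

In practice each of the seven genes would be evaluated separately on the validation cohort, passing upregulated=False for FCGBP and KLF12.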

Keywords: active TB; bioinformatics; biomarkers; differentially expressed genes; latent TB infection; tuberculosis.

MeSH terms

  • Computational Biology
  • Humans
  • Kruppel-Like Transcription Factors / genetics
  • Latent Tuberculosis* / diagnosis
  • Latent Tuberculosis* / genetics
  • Latent Tuberculosis* / microbiology
  • Membrane Proteins / genetics
  • Mycobacterium tuberculosis* / genetics
  • RNA-Binding Proteins / genetics
  • Transcriptome / genetics
  • Tuberculosis* / diagnosis
  • Tuberculosis* / genetics
  • Tuberculosis* / microbiology

Substances

  • IFITM3 protein, human
  • KLF12 protein, human
  • Kruppel-Like Transcription Factors
  • Membrane Proteins
  • RNA-Binding Proteins