Cross-Platform Transcriptomic Data Integration, Profiling, and Mining in Vibrio cholerae

Microbiol Spectr. 2023 Jun 15;11(3):e0536922. doi: 10.1128/spectrum.05369-22. Epub 2023 May 16.

Abstract

A large number of transcriptome studies generate important data and information for the study of pathogenic mechanisms of pathogens, including Vibrio cholerae. V. cholerae transcriptome data include RNA-seq and microarray: microarray data mainly include clinical human and environmental samples, and RNA-seq data mainly focus on laboratory processing conditions, including different stresses and experimental animals in vivo. In this study, we integrated the data sets of both platforms using Rank-in and the Limma R package normalized Between Arrays function, achieving the first cross-platform transcriptome data integration of V. cholerae. By integrating the entire transcriptome data, we obtained the profiles of the most active or silent genes. By transferring the integrated expression profiles into the weighted correlation network analysis (WGCNA) pipeline, we identified the important functional modules of V. cholerae in vitro stress treatment, gene manipulation, and in vitro culture as DNA transposon, chemotaxis and signaling, signal transduction, and secondary metabolic pathways, respectively. The analysis of functional module hub genes revealed the uniqueness of clinical human samples; however, under specific expression patterning, the Δhns, ΔoxyR1 strains, and tobramycin treatment group showed high expression profile similarity with human samples. By constructing a protein-protein interaction (PPI) interaction network, we discovered several unreported novel protein interactions within transposon functional modules. IMPORTANCE We used two techniques to integrate RNA-seq data for laboratory studies with clinical microarray data for the first time. The interactions between V. cholerae genes were obtained from a global perspective, as well as comparing the similarity between clinical human samples and the current experimental conditions, and uncovering the functional modules that play a major role under different conditions. We believe that this data integration can provide us with some insight and basis for elucidating the pathogenesis and clinical control of V. cholerae.

Keywords: Vibrio cholerae; WGCNA; computational biology.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • DNA Transposable Elements
  • Gene Expression Profiling
  • Humans
  • RNA-Seq
  • Transcriptome
  • Vibrio cholerae* / genetics
  • Vibrio cholerae* / metabolism

Substances

  • DNA Transposable Elements