Reproducibility enhancement and differential expression of non predefined functional gene sets in human genome

BMC Genomics. 2014 Dec 24;15(1):1181. doi: 10.1186/1471-2164-15-1181.

Abstract

Background: Transcriptogram profiling is a method to present and analyze transcription data on a genome-wide scale that reduces noise and facilitates biological interpretation. An ordered gene list is produced such that the probability that two genes are functionally associated decays exponentially with their distance on the list. This list presents a biological logic, evinced by the selective enrichment of successive intervals with Gene Ontology terms or KEGG pathways. Transcriptograms are expression profiles obtained by averaging gene expression over neighboring genes on this list. Transcriptograms enhance reproducibility and precision for expression measurements of functionally correlated gene sets.
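
The averaging step described above can be illustrated with a minimal sketch. It assumes the expression values are already sorted according to the ordered gene list, and it uses a simple sliding window that is truncated at the ends of the list. The function name `transcriptogram`, the `radius` parameter, the edge handling, and the sample values are all hypothetical choices made for illustration; the abstract does not specify the window size or the boundary treatment used by the authors.

```python
from typing import List, Sequence


def transcriptogram(expression: Sequence[float], radius: int) -> List[float]:
    """Average expression over neighboring genes on an ordered list.

    Each position i is replaced by the mean of the values at positions
    i - radius .. i + radius. Windows are truncated at the list ends
    (an assumption; other boundary treatments are possible).
    """
    if radius < 0:
        raise ValueError("radius must be non-negative")
    n = len(expression)
    profile = []
    for i in range(n):
        lo = max(0, i - radius)          # first index inside the window
        hi = min(n, i + radius + 1)      # one past the last index inside the window
        window = expression[lo:hi]
        profile.append(sum(window) / len(window))
    return profile


# Hypothetical expression values, already arranged in ordered-list order.
ordered_expression = [1.0, 3.0, 2.0, 6.0, 4.0]
smoothed = transcriptogram(ordered_expression, radius=1)
print(smoothed)
```

Because neighboring genes on the list are likely to be functionally associated, averaging over a window pools measurements that are expected to vary together, which is the intuition behind the noise reduction claimed in the abstract.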

Results: Here we present an ordering list for Homo sapiens and apply the transcriptogram profiling method to different datasets. We show that this method improves experimental reproducibility and enhances the signal. We applied the method to a diabetes study by Hwang and collaborators, which focused on expression differences between cybrids produced by the hybridization of mitochondria from diabetes mellitus donors with osteosarcoma cell lines depleted of mitochondria. The transcriptogram method revealed significant differential expression in gene sets linked to blood coagulation and wound healing pathways, and also in gene sets that do not represent any metabolic pathway or Gene Ontology term. These gene sets are connected to ECM-receptor interaction and secreted proteins.

Conclusion: The transcriptogram profiling method provided an automatic way to define sets of genes with correlated expression, reduce noise in genome-wide transcription profiles, and enhance measurement reproducibility and sensitivity. These advantages enabled biological interpretation and pointed to differentially expressed gene sets in diabetes mellitus that had not previously been defined.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Computational Biology / methods
  • Gene Expression Profiling* / methods
  • Gene Expression Profiling* / standards
  • Gene Expression Regulation*
  • Genetic Association Studies / methods
  • Genetic Association Studies / standards
  • Genome, Human*
  • Genome-Wide Association Study / methods
  • Genome-Wide Association Study / standards
  • Humans
  • Molecular Sequence Annotation
  • Reproducibility of Results
  • Transcriptome