Integrative analysis of independent transcriptome data for rare diseases

Methods. 2014 Oct 1;69(3):315-25. doi: 10.1016/j.ymeth.2014.06.003. Epub 2014 Jun 27.

Abstract

High-throughput technologies used to interrogate transcriptomes have been generating a great amount of publicly available gene expression data. For rare diseases that lack of clinical samples and research funding, there is a practical benefit to jointly analyze existing data sets commonly related to a specific rare disease. In this study, we collected a number of independently generated transcriptome data sets from four species: human, fly, mouse and worm. All data sets included samples with both normal and abnormal mitochondrial function. We reprocessed each data set to standardize format, scale and gene annotation and used HomoloGene database to map genes between species. Standardized procedure was also applied to compare gene expression profiles of normal and abnormal mitochondrial function to identify differentially expressed genes. We further used meta-analysis and other integrative analyses to recognize patterns across data sets and species. Novel insights related to mitochondrial dysfunction was revealed via these analyses, such as a group of genes consistently dysregulated by impaired mitochondrial function in multiple species. This study created a template for the study of rare diseases using genomic technologies and advanced statistical methods. All data and results generated by this study are freely available and stored at http://goo.gl/nOGWC2, to support further data mining.

Keywords: Integrative analysis; Mitochondrial dysfunction; Transcriptome data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Data Mining
  • Databases, Genetic
  • Humans
  • Internet
  • Mice
  • Mitochondria / genetics*
  • Mitochondria / pathology
  • Molecular Sequence Annotation
  • Rare Diseases / genetics*
  • Rare Diseases / pathology
  • Transcriptome / genetics*