Functional enrichment analysis based on long noncoding RNA associations

BMC Syst Biol. 2018 Apr 24;12(Suppl 4):45. doi: 10.1186/s12918-018-0571-0.

Abstract

Background: Differential gene expression analysis using RNA-seq data is a popular approach for discovering specific regulation mechanisms under certain environmental settings. Both gene ontology (GO) and KEGG pathway enrichment analysis are major processes for investigating gene groups that participate in common biological responses or possess related functions. However, traditional approaches based on differentially expressed genes only detect a few significant GO terms and pathways, which are frequently insufficient to explain all-inclusive gene regulation mechanisms.

Methods: Transcriptomes of survivin (birc5) gene knock-down experimental and wild-type control zebrafish embryos were sequenced and assembled, and a differential expression (DE) gene list was obtained for traditional functional enrichment analysis. In addition to including DE genes with significant fold-change levels, we considered additional associated genes near or overlapped with differentially expressed long noncoding RNAs (DE lncRNAs), which may directly or indirectly activate or inhibit target genes and play important roles in regulation networks. Both the original DE gene list and the additional DE lncRNA-associated genes were combined to perform a comprehensive overrepresentation analysis.

Results: In this study, a total of 638 DE genes and 616 DE lncRNA-associated genes (lncGenes) were leveraged simultaneously in searching for significant GO terms and KEGG pathways. Compared to the traditional approach of only using a differential expression gene list, the proposed method of employing DE lncRNA-associated genes identified several additional important GO terms and KEGG pathways. In GO enrichment analysis, 60% more GO terms were obtained, and several neuron development functional terms were retrieved as complete annotations. We also observed that additional important pathways such as the FoxO and MAPK signaling pathways were retrieved, which were shown in previous reports to play important roles in apoptosis and neuron development functions regulated by the survivin gene.

Conclusions: We demonstrated that incorporating genes near or overlapped with DE lncRNAs into the DE gene list outperformed the traditional enrichment analysis method for effective biological functional interpretations. These hidden interactions between lncRNAs and target genes could facilitate more comprehensive analyses.

Keywords: Differential expression; Gene ontology (GO); KEGG; Neuron development; lncRNA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Apoptosis / genetics
  • Computational Biology*
  • Gene Expression Profiling
  • Gene Knockdown Techniques
  • Gene Ontology
  • RNA, Long Noncoding / genetics*
  • Signal Transduction / genetics
  • Survivin / deficiency
  • Survivin / genetics
  • Zebrafish / embryology
  • Zebrafish / genetics

Substances

  • RNA, Long Noncoding
  • Survivin