A network-based computational framework to predict and differentiate functions for gene isoforms using exon-level expression data

Methods. 2021 May:189:54-64. doi: 10.1016/j.ymeth.2020.06.005. Epub 2020 Jun 10.

Abstract

Motivation: Alternative splicing makes significant contributions to functional diversity of transcripts and proteins. Many alternatively spliced gene isoforms have been shown to perform specific biological functions under different contexts. In addition to gene-level expression, the advances of high-throughput sequencing offer a chance to estimate isoform-specific exon expression with a high resolution, which is informative for studying splice variants with network analysis.

Results: In this study, we propose a novel network-based analysis framework to predict isoform-specific functions from exon-level RNA-Seq data. In particular, based on exon-level expression data, we firstly propose a unified framework, referred to as Iso-Net, to integrate two new mathematical methods (named MINet and RVNet) that infer co-expression networks at different data scenarios. We demonstrate the superior prediction accuracy of Iso-Net over the existing methods for most simulation data, especially in two extreme cases: sample size is very small and exon numbers of two isoforms are quite different. Furthermore, by defining relevant quantitative measures (e.g., Jaccard correlation coefficient) and combining differential co-expression network analysis and GO functional enrichment analysis, a co-expression network analysis framework is developed to predict functions of isoforms and further, to discover their distinct functions within the same gene. We apply Iso-Net to study gene isoforms for several important transcription factors in human myeloid differentiation with the exon-level RNA-Seq data from three different cell lines.

Availability and implementation: Iso-Net is open source and freely available from https://github.com/Dingjie-Wang/Iso-Net.

Keywords: Alternative splicing; Co-expression networks; Exon-level RNA-Seq; Gene isoforms; Matrix Correlation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Computational Biology / methods*
  • Exons*
  • Gene Expression Regulation
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Neural Networks, Computer*
  • Protein Isoforms*
  • Sequence Analysis, RNA
  • Software*

Substances

  • Protein Isoforms