Elucidating Genome-Wide Protein-RNA Interactions Using Differential Evolution

IEEE/ACM Trans Comput Biol Bioinform. 2019 Jan-Feb;16(1):272-282. doi: 10.1109/TCBB.2017.2776224. Epub 2017 Nov 22.

Abstract

RNA-binding proteins (RBPs) play an important role in the post-transcriptional control of RNAs, including splicing, polyadenylation, mRNA stabilization, mRNA localization, and translation. Recently, non-negative matrix factorization (NMF) has been developed to combine multiple data sources and discover non-overlapping, class-specific RNA binding patterns. However, determining the number of latent dimensions in the factorization step remains challenging. The number of latent dimensions (or components) is usually assumed to be given, which in practice means choosing it by trial and error, a tedious procedure. To address this problem, a differential evolution algorithm is proposed as the model selection method for choosing a suitable factorization rank, so that the input protein-RNA data matrix can be adaptively decomposed into nonnegative components. Experimental results demonstrate that the proposed algorithms improve factorization quality over recent state-of-the-art methods. The effectiveness of the proposed algorithms is supported by comprehensive performance benchmarking on 31 genome-wide cross-linking immunoprecipitation (CLIP) coupled with high-throughput sequencing (CLIP-seq) datasets. In addition, time complexity analysis and parameter analysis are conducted to demonstrate the robustness of the proposed methods.
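
The idea of using differential evolution to pick the NMF rank can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the objective function, the NMF variant, or the DE settings, so everything below is an assumption made for illustration. The sketch uses plain Lee-Seung multiplicative updates on a single matrix (not the multi-source factorization), a hypothetical cost equal to the relative reconstruction error plus a per-rank `penalty`, and a DE/rand/1 search over one real-valued variable that is rounded to an integer rank. The names `nmf`, `cost`, and `de_select_rank` are hypothetical.

```python
import numpy as np


def nmf(V, k, iters=200, seed=0, eps=1e-9):
    """Plain NMF, V ~ W @ H with W, H >= 0, via multiplicative updates."""
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def cost(V, k, penalty=0.02):
    """Assumed objective: relative Frobenius error plus a penalty per rank."""
    W, H = nmf(V, k)
    rel_err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
    return float(rel_err + penalty * k)


def de_select_rank(V, k_max, pop_size=8, gens=10, F=0.7, seed=0):
    """DE/rand/1 over one real variable in [1, k_max], rounded to a rank.

    With a single decision variable, binomial crossover always takes the
    mutant component, so the crossover step is omitted here.
    """
    rng = np.random.default_rng(seed)
    cache = {}  # each rank is factorized only once

    def to_rank(x):
        return int(np.clip(np.rint(x), 1, k_max))

    def fitness(x):
        k = to_rank(x)
        if k not in cache:
            cache[k] = cost(V, k)
        return cache[k]

    pop = rng.uniform(1, k_max, size=pop_size)
    fit = np.array([fitness(x) for x in pop])
    history = [float(fit.min())]
    for _ in range(gens):
        for i in range(pop_size):
            others = [j for j in range(pop_size) if j != i]
            a, b, c = pop[rng.choice(others, size=3, replace=False)]
            trial = float(np.clip(a + F * (b - c), 1, k_max))  # mutation
            f_trial = fitness(trial)
            if f_trial <= fit[i]:  # greedy one-to-one selection
                pop[i], fit[i] = trial, f_trial
        history.append(float(fit.min()))
    best = int(np.argmin(fit))
    return to_rank(pop[best]), float(fit[best]), history


# Usage on a synthetic nonnegative matrix built from 3 latent components.
data_rng = np.random.default_rng(1)
V_demo = data_rng.random((30, 3)) @ data_rng.random((3, 20))
rank, score, trace = de_select_rank(V_demo, k_max=6)
print("selected rank:", rank, "cost:", round(score, 4))
```

Which rank is selected depends heavily on the assumed penalty weight. Without some complexity term the reconstruction error alone tends to favor the largest allowed rank, which is why a model selection criterion is needed in the first place.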

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computational Biology / methods*
  • Databases, Genetic
  • Gene Expression Regulation / genetics
  • Genome / genetics
  • High-Throughput Nucleotide Sequencing
  • Immunoprecipitation
  • RNA, Messenger / chemistry
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism*
  • RNA-Binding Proteins / chemistry
  • RNA-Binding Proteins / genetics
  • RNA-Binding Proteins / metabolism*
  • Sequence Analysis, RNA

Substances

  • RNA, Messenger
  • RNA-Binding Proteins