omniCLIP: probabilistic identification of protein-RNA interactions from CLIP-seq data

Genome Biol. 2018 Nov 1;19(1):183. doi: 10.1186/s13059-018-1521-2.

Abstract

CLIP-seq methods allow the generation of genome-wide maps of RNA binding protein - RNA interaction sites. However, due to differences between different CLIP-seq assays, existing computational approaches to analyze the data can only be applied to a subset of assays. Here, we present a probabilistic model called omniCLIP that can detect regulatory elements in RNAs from data of all CLIP-seq assays. omniCLIP jointly models data across replicates and can integrate background information. Therefore, omniCLIP greatly simplifies the data analysis, increases the reliability of results and paves the way for integrative studies based on data from different assays.

Keywords: Bioinformatics; CLIP-seq; Generalized linear models; HITS-CLIP; Machine learning; Mixture models; PAR-CLIP; Protein-RNA interactions; eCLIP; iCLIP.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites
  • Computational Biology / methods*
  • Computer Simulation
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Immunoprecipitation / methods*
  • RNA / genetics
  • RNA / metabolism*
  • RNA-Binding Proteins / genetics
  • RNA-Binding Proteins / metabolism*
  • Sequence Analysis, RNA
  • Software*

Substances

  • RNA-Binding Proteins
  • RNA