CLIPick: a sensitive peak caller for expression-based deconvolution of HITS-CLIP signals

Nucleic Acids Res. 2018 Nov 30;46(21):11153-11168. doi: 10.1093/nar/gky917.

Abstract

High-throughput sequencing of RNAs isolated by crosslinking immunoprecipitation (HITS-CLIP, also called CLIP-Seq) has been used to map global RNA-protein interactions. However, a critical caveat of HITS-CLIP results is that they contain non-linear background noise-different extent of non-specific interactions caused by individual transcript abundance-that has been inconsiderately normalized, resulting in sacrifice of sensitivity. To properly deconvolute RNA-protein interactions, we have implemented CLIPick, a flexible peak calling pipeline for analyzing HITS-CLIP data, which statistically determines the signal-to-noise ratio for each transcript based on the expression-dependent background simulation. Comprising of streamlined Python modules with an easy-to-use standalone graphical user interface, CLIPick robustly identifies significant peaks and quantitatively defines footprint regions within which RNA-protein interactions were occurred. CLIPick outperforms other peak callers in accuracy and sensitivity, selecting the largest number of peaks particularly in lowly expressed transcripts where such marginal signals are hard to discriminate. Specifically, the application of CLIPick to Argonaute (Ago) HITS-CLIP data were sensitive enough to uncover extended features of microRNA target sites, and these sites were experimentally validated. CLIPick enables to resolve critical interactions in a wide spectrum of transcript levels and extends the scope of HITS-CLIP analysis. CLIPick is available at: http://clip.korea.ac.kr/clipick/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Argonaute Proteins / genetics*
  • Argonaute Proteins / metabolism
  • Binding Sites
  • Computer Graphics
  • Frontal Lobe / chemistry
  • Frontal Lobe / metabolism
  • Genes, Reporter
  • Hep G2 Cells
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Immunoprecipitation / methods
  • K562 Cells
  • Luciferases / genetics
  • Luciferases / metabolism
  • MicroRNAs / genetics*
  • MicroRNAs / metabolism
  • Protein Binding
  • Protein Footprinting / methods*
  • Protein Interaction Domains and Motifs
  • RNA, Messenger / genetics*
  • RNA, Messenger / metabolism
  • Sequence Analysis, RNA / statistics & numerical data*
  • Signal-To-Noise Ratio
  • User-Computer Interface*

Substances

  • Argonaute Proteins
  • MicroRNAs
  • RNA, Messenger
  • Luciferases