A computational pipeline to infer alternative poly-adenylation from 3' sequencing data

Methods Enzymol. 2021:655:185-204. doi: 10.1016/bs.mie.2021.04.001. Epub 2021 Jun 5.

Abstract

An increasing number of investigations have established alternative polyadenylation (APA) as a key mechanism of gene regulation through altering the length of the 3' untranslated region (UTR) and generating distinct mRNA termini. Further, appreciation for the significance of APA in disease contexts has propelled the development of several 3' sequencing techniques. While these RNA sequencing technologies have advanced APA analysis, the intrinsic limitation of 3' read coverage and the lack of appropriate computational tools constrain precise mapping and quantification of polyadenylation sites. Notably, Poly(A)-ClickSeq (PAC-seq) uses click chemistry to overcome limiting factors such as poly(A) enrichment and 3' linker ligation steps. Here we provide an updated PolyA-miner protocol, a computational approach to analyze PAC-seq or other 3'-Seq datasets. Because sequencing depth is a key practical constraint, we also provide a detailed account of its impact on the number of detected polyadenylation sites and APA changes. This protocol is also updated to handle unique molecular identifiers, which are used to address the PCR duplication potentially observed in PAC-seq.
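
As an illustration of how unique molecular identifiers (UMIs) can address PCR duplication in 3' sequencing data, the following is a minimal sketch and not the PolyA-miner implementation. It assumes that each aligned read has already been reduced to a chromosome, a strand, the coordinate of its 3' end, and a UMI sequence; the `Read` type and both function names are hypothetical. Reads sharing all four values are treated as PCR duplicates and collapsed to one. Only exact UMI matches are collapsed here, whereas dedicated tools may also tolerate sequencing errors in the UMI.

```python
from collections import Counter
from typing import Iterable, List, NamedTuple


class Read(NamedTuple):
    """Hypothetical minimal record for one aligned 3'-seq read."""
    chrom: str
    strand: str
    three_prime_end: int  # genomic coordinate of the read's 3' end
    umi: str              # UMI sequence extracted from the read


def dedup_by_umi(reads: Iterable[Read]) -> List[Read]:
    """Keep the first read seen for each (chrom, strand, 3' end, UMI) key."""
    seen = set()
    kept = []
    for read in reads:
        key = (read.chrom, read.strand, read.three_prime_end, read.umi)
        if key not in seen:
            seen.add(key)
            kept.append(read)
    return kept


def count_sites(reads: Iterable[Read]) -> Counter:
    """Count reads supporting each candidate site (chrom, strand, 3' end)."""
    return Counter((r.chrom, r.strand, r.three_prime_end) for r in reads)


# Toy input: the first two reads share position and UMI (a PCR duplicate).
reads = [
    Read("chr1", "+", 1000, "ACGT"),
    Read("chr1", "+", 1000, "ACGT"),
    Read("chr1", "+", 1000, "TTGA"),
    Read("chr1", "+", 1500, "ACGT"),
]
unique_reads = dedup_by_umi(reads)
site_counts = count_sites(unique_reads)
```

In this toy input, the site at position 1000 is supported by two distinct molecules after deduplication rather than three reads, so the duplicate no longer inflates the count used for quantification.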

Keywords: 3′ UTR lengthening; 3′ UTR shortening; Alternative polyadenylation; PAC-seq; PolyA-miner.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • 3' Untranslated Regions
  • Poly A* / genetics
  • Poly A* / metabolism
  • Polyadenylation*
  • RNA, Messenger / metabolism
  • Sequence Analysis, RNA

Substances

  • 3' Untranslated Regions
  • RNA, Messenger
  • Poly A