Data Analysis Pipeline for Detection and Quantification of Pseudouridine (ψ) in RNA by HydraPsiSeq

Methods Mol Biol. 2023:2624:207-223. doi: 10.1007/978-1-0716-2962-8_14.

Abstract

Pseudouridine, a modified RNA residue formed by the isomerization of its parental U nucleotide, is prevalent in a majority of cellular RNAs; its presence was reported in tRNA, rRNA, and sn/snoRNA as well as in mRNA/lncRNA. Multiple analytical deep sequencing-based approaches have been proposed for pseudouridine detection and quantification, among which the most popular relies on the use of soluble carbodiimide (termed CMCT). Recently, we developed an alternative protocol for pseudouridine mapping and quantification. The principle is based on protection of pseudouridine against random RNA cleavage by hydrazine/aniline treatment (HydraPsiSeq protocol). This "negative" detection mode requires higher sequencing depth and provides a precise quantification of the pseudouridine content. All "wet-lab" technical details of the HydraPsiSeq protocol have been described in recent publications. Here, we describe all bioinformatics analysis steps required for data processing from raw reads to the pseudouridylation profile of known or unknown RNA.

Keywords: Computational biology; Data analysis; Deep sequencing; High-throughput sequencing; Pseudouridine; RNA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Pseudouridine / genetics
  • RNA Processing, Post-Transcriptional
  • RNA* / chemistry
  • RNA, Long Noncoding*
  • RNA, Messenger / genetics
  • RNA, Ribosomal / genetics
  • RNA, Transfer / genetics

Substances

  • RNA
  • Pseudouridine
  • RNA, Transfer
  • RNA, Messenger
  • RNA, Ribosomal
  • RNA, Long Noncoding