Massive Proteogenomic Reanalysis of Publicly Available Proteomic Datasets of Human Tissues in Search for Protein Recoding via Adenosine-to-Inosine RNA Editing

J Proteome Res. 2023 Jun 2;22(6):1695-1711. doi: 10.1021/acs.jproteome.2c00740. Epub 2023 May 9.

Abstract

The proteogenomic search pipeline developed in this work has been applied for reanalysis of 40 publicly available shotgun proteomic datasets from various human tissues comprising more than 8000 individual LC-MS/MS runs, of which 5442 .raw data files were processed in total. This reanalysis was focused on searching for ADAR-mediated RNA editing events, their clustering across samples of different origins, and classification. In total, 33 recoded protein sites were identified in 21 datasets. Of those, 18 sites were detected in at least two datasets, representing the core human protein editome. In agreement with prior artworks, neural and cancer tissues were found to be enriched with recoded proteins. Quantitative analysis indicated that recoding the rate of specific sites did not directly depend on the levels of ADAR enzymes or targeted proteins themselves, rather it was governed by differential and yet undescribed regulation of interaction of enzymes with mRNA. Nine recoding sites conservative between humans and rodents were validated by targeted proteomics using stable isotope standards in the murine brain cortex and cerebellum, and an additional one was validated in human cerebrospinal fluid. In addition to previous data of the same type from cancer proteomes, we provide a comprehensive catalog of recoding events caused by ADAR RNA editing in the human proteome.

Keywords: ADAR; IGFBP7; RNA editing; human proteome; proteogenomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenosine / metabolism
  • Animals
  • Chromatography, Liquid
  • Humans
  • Inosine / genetics
  • Inosine / metabolism
  • Mice
  • Proteogenomics*
  • Proteome / genetics
  • Proteome / metabolism
  • Proteomics*
  • RNA / metabolism
  • RNA Editing
  • Tandem Mass Spectrometry

Substances

  • RNA
  • Proteome
  • Adenosine
  • Inosine