Workability of mRNA Sequencing for Predicting Protein Abundance

Genes (Basel). 2023 Nov 11;14(11):2065. doi: 10.3390/genes14112065.

Abstract

Transcriptomics methods (RNA-Seq, PCR) today are more routine and reproducible than proteomics methods, i.e., both mass spectrometry and immunochemical analysis. For this reason, most scientific studies are limited to assessing the level of mRNA content. At the same time, protein content (and its post-translational status) largely determines the cell's state and behavior. Such a forced extrapolation of conclusions from the transcriptome to the proteome often seems unjustified. The ratios of "transcript-protein" pairs can vary by several orders of magnitude for different genes. As a rule, the correlation coefficient between transcriptome-proteome levels for different tissues does not exceed 0.3-0.5. Several characteristics determine the ratio between the content of mRNA and protein: among them, the rate of movement of the ribosome along the mRNA and the number of free ribosomes in the cell, the availability of tRNA, the secondary structure, and the localization of the transcript. The technical features of the experimental methods also significantly influence the levels of the transcript and protein of the corresponding gene on the outcome of the comparison. Given the above biological features and the performance of experimental and bioinformatic approaches, one may develop various models to predict proteomic profiles based on transcriptomic data. This review is devoted to the ability of RNA sequencing methods for protein abundance prediction.

Keywords: NGS; RNA-Seq; gene expression; mRNA-to-protein ratio; mass spectrometry; protein abundance; proteome; transcriptome.

Publication types

  • Review

MeSH terms

  • Gene Expression Profiling
  • Proteome* / genetics
  • Proteomics* / methods
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Transcriptome / genetics

Substances

  • Proteome
  • RNA, Messenger