A Database of Accurate Electrophoretic Migration Patterns for Human Proteins

J Mol Biol. 2023 Feb 28;435(4):167933. doi: 10.1016/j.jmb.2022.167933. Epub 2022 Dec 26.

Abstract

Native molecular weight (MW) is one of the defining features of proteins. Denaturing gel electrophoresis (SDS-PAGE) is a very popular technique for separating proteins and determining their MW. Coupled with antibody-based detection, SDS-PAGE is widely applied for protein identification and quantitation. Yet electrophoresis is poorly reproducible and the MWs obtained are often inaccurate. This hampers antibody validation and negatively impacts the reliability of western blot data, resulting worldwide in a considerable waste of reagents and labour. We argue that, to alleviate these problems, there is a need to establish a database of reference MWs measured by SDS-PAGE. Using mass spectrometry as an orthogonal detection method, we acquired electrophoretic migration patterns for approximately 10,000 human proteins in five commonly used cell lines. We applied a robust internal calibration of migration to determine accurate and reproducible molecular weights. This in turn allows replicates to be merged to increase accuracy and enables comparison between cell lines. Mining of the data obtained highlights structural factors that affect the migration of distinct classes of proteins. When combined with peptide coverage, the data produced recapitulate known post-translational modifications and differential splicing and can be used to formulate hypotheses on new or poorly known processing events. The full information is freely accessible as a web resource through a user-friendly graphical interface (https://pumba.dcsr.unil.ch/). We anticipate that this database will be useful to investigators worldwide for troubleshooting western blot experiments, and that it could also contribute to the characterization of human proteoforms.
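The abstract does not describe how the internal calibration of migration is performed. The following is only a minimal sketch of one way such a calibration could work, not the authors' procedure. It assumes that log10(MW) is roughly linear in gel slice position and uses a Theil-Sen (median of pairwise slopes) fit as a stand-in for "robust". The function names, slice indices, and MW values are all hypothetical, and the data are synthetic.

```python
import math
from itertools import combinations
from statistics import median


def fit_log_mw_calibration(slices, theoretical_mws):
    """Theil-Sen fit of log10(MW) against gel slice index.

    Assumption (not from the paper): migration is log-linear in MW.
    Returns (slope, intercept) of log10(MW) = intercept + slope * slice.
    """
    logs = [math.log10(mw) for mw in theoretical_mws]
    points = list(zip(slices, logs))
    # Median of all pairwise slopes tolerates a minority of proteins
    # that migrate anomalously.
    slopes = [
        (y2 - y1) / (x2 - x1)
        for (x1, y1), (x2, y2) in combinations(points, 2)
        if x2 != x1
    ]
    slope = median(slopes)
    intercept = median(y - slope * x for x, y in points)
    return slope, intercept


def apparent_mw(slice_index, slope, intercept):
    """Convert a gel slice index to an apparent MW (same unit as the input MWs)."""
    return 10 ** (intercept + slope * slice_index)


# Synthetic example: six hypothetical reference proteins (MW in kDa),
# one of which migrates as if it were 50% heavier than expected.
slices = [0, 1, 2, 3, 4, 5]
mws = [10 ** (2.4 - 0.1 * s) for s in slices]
mws[2] *= 1.5  # anomalous migrator, should not distort the fit

slope, intercept = fit_log_mw_calibration(slices, mws)
print(f"slope={slope:.3f}, intercept={intercept:.3f}")
print(f"apparent MW at slice 3: {apparent_mw(3, slope, intercept):.1f} kDa")
```

Because the fit relies on medians rather than least squares, the single anomalous protein in this toy data set leaves the calibration line essentially unchanged. That is the property a robust calibration needs when merging replicates or comparing cell lines.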

Keywords: differential splicing; electrophoresis; mass spectrometry; proteins molecular weight; proteoforms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Line
  • Databases, Protein*
  • Electrophoresis, Polyacrylamide Gel*
  • Humans
  • Mass Spectrometry
  • Molecular Weight
  • Proteins* / chemistry
  • Reproducibility of Results

Substances

  • Proteins