Maximizing Sequence Coverage in Top-Down Proteomics By Automated Multimodal Gas-Phase Protein Fragmentation

Anal Chem. 2018 Nov 6;90(21):12519-12526. doi: 10.1021/acs.analchem.8b02344. Epub 2018 Oct 10.

Abstract

Intact protein sequencing by tandem mass spectrometry (MS/MS), known as top-down protein sequencing, relies on efficient gas-phase fragmentation at multiple experimental conditions to achieve extensive amino acid sequence coverage. We developed the "topdownr" R-package for automated construction of multimodal (i.e., involving CID, HCD, ETD, ETciD, EThcD, and UVPD) MS/MS fragmentation methods on an orbitrap instrument platform and systematic analysis of the resultant spectra. We used topdownr to generate and analyze thousands of MS/MS spectra for five intact proteins of 10-30 kDa. We achieved 90-100% coverage for the proteins tested and derived guiding principles for efficient sequencing of intact proteins. The data analysis workflow and statistical models of topdownr software and multimodal MS/MS experiments provide a framework for optimizing MS/MS sequencing for any intact protein. Refined topdownr software will be suited for comprehensive characterization of protein pharmaceuticals and eventually also for de novo sequencing and detailed characterization of intact proteins.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Automation*
  • Gases / chemistry
  • Proteins / chemistry*
  • Proteomics*
  • Sequence Analysis, Protein
  • Software
  • Tandem Mass Spectrometry

Substances

  • Gases
  • Proteins