Averaging Strategy To Reduce Variability in Target-Decoy Estimates of False Discovery Rate

J Proteome Res. 2019 Feb 1;18(2):585-593. doi: 10.1021/acs.jproteome.8b00802. Epub 2019 Jan 3.

Abstract

Decoy database search with target-decoy competition (TDC) provides an intuitive, easy-to-implement method for estimating the false discovery rate (FDR) associated with spectrum identifications from shotgun proteomics data. However, the procedure can yield different results for a fixed data set analyzed with different decoy databases, and this decoy-induced variability is particularly problematic for smaller FDR thresholds, data sets, or databases. The average TDC (aTDC) protocol combats this problem by exploiting multiple independently shuffled decoy databases to provide an FDR estimate with reduced variability. We provide a tutorial introduction to aTDC, describe an improved variant of the protocol that offers increased statistical power, and discuss how to deploy aTDC in practice using the Crux software toolkit.

Keywords: false discovery rate; mass spectrometry; spectrum identification.

MeSH terms

  • Databases, Protein / standards*
  • Datasets as Topic
  • Humans
  • Models, Statistical
  • Proteomics / methods*
  • Reproducibility of Results
  • Software*