Improvement of quantitative structure-retention relationship models for chromatographic retention prediction of peptides applying individual local partial least squares models

Talanta. 2020 Nov 1:219:121266. doi: 10.1016/j.talanta.2020.121266. Epub 2020 Jun 20.

Abstract

In Reversed-Phase Liquid Chromatography, Quantitative Structure-Retention Relationship (QSRR) models for retention prediction of peptides can be built, starting from large sets of theoretical molecular descriptors. Good predictive QSRR models can be obtained after selecting the most informative descriptors. Reliable retention prediction may be an aid in the correct identification of proteins/peptides in proteomics and in chromatographic method development. Traditionally, global QSRR models are built, using a calibration set containing a representative range of analytes. In this study, a strategy is presented to build individual local Partial Least Squares (PLS) models for peptides, based on selected local calibration samples, most similar to the specific query peptide to be predicted. Similar local calibration peptides are selected from a possible calibration set. The calibration samples with the lowest Euclidian distances to the query peptide are considered as most similar. Two Euclidian distances are investigated as similarity parameter, (i) in the autoscaled descriptor space and, (ii) in the PLS factor space of the global calibration samples, both after variable selection by the Final Complexity Adapted Models (FCAM) method. The predictive abilities of individual local QSRR PLS models for peptides, developed with both Euclidian distances, are found significantly better than those of two global models, i.e. before and after FCAM variable selection. The predictive abilities of the local models, developed with distances calculated in the PLS factor space, were best.

Keywords: Final complexity adapted models (FCAM); Local models; Molecular descriptors; Partial least squares; Peptides; Quantitative Structure–Retention relationships (QSRR).

MeSH terms

  • Calibration
  • Chromatography, Reverse-Phase*
  • Least-Squares Analysis
  • Peptides*
  • Proteins
  • Quantitative Structure-Activity Relationship

Substances

  • Peptides
  • Proteins