Gradient liquid chromatographic retention time prediction for suspect screening applications: A critical assessment of a generalised artificial neural network-based approach across 10 multi-residue reversed-phase analytical methods

Talanta. 2016 Jan 15:147:261-70. doi: 10.1016/j.talanta.2015.09.065. Epub 2015 Sep 30.

Abstract

For the first time, the performance of a generalised artificial neural network (ANN) approach for the prediction of 2492 chromatographic retention times (tR) is presented for a total of 1117 chemically diverse compounds present in a range of complex matrices and across 10 gradient reversed-phase liquid chromatography-(high resolution) mass spectrometry methods. Probabilistic, generalised regression, radial basis function as well as 2- and 3-layer multilayer perceptron-type neural networks were investigated to determine the most robust and accurate model for this purpose. Multi-layer perceptrons most frequently yielded the best correlations in 8 out of 10 methods. Averaged correlations of predicted versus measured tR across all methods were R(2)=0.918, 0.924 and 0.898 for the training, verification and test sets respectively. Predictions of blind test compounds (n=8-84 cases) resulted in an average absolute accuracy of 1.02±0.54min for all methods. Within this variation, absolute accuracy was observed to marginally improve for shorter runtimes, but was found to be relatively consistent with respect to analyte retention ranges (~5%). Finally, optimised and replicated network dependency on molecular descriptor data is presented and critically discussed across all methods. Overall, ANNs were considered especially suitable for suspects screening applications and could potentially be utilised in bracketed-type analyses in combination with high resolution mass spectrometry.

Keywords: Emerging contaminants; High resolution mass spectrometry; Non-target analysis; Retention time prediction; Suspect screening.

MeSH terms

  • Chromatography, Reverse-Phase / methods*
  • Mass Spectrometry
  • Neural Networks, Computer*
  • Reproducibility of Results