Separation of Chromatographic Co-Eluted Compounds by Clustering and by Functional Data Analysis

Aneta Sawikowska; Anna Piasecka; Piotr Kachlicki; Paweł Krajewski

doi:10.3390/metabo11040214

Separation of Chromatographic Co-Eluted Compounds by Clustering and by Functional Data Analysis

Metabolites. 2021 Mar 31;11(4):214. doi: 10.3390/metabo11040214.

Authors

Aneta Sawikowska^{1

2}, Anna Piasecka², Piotr Kachlicki³, Paweł Krajewski³

Affiliations

¹ Department of Mathematical and Statistical Methods, Poznań University of Life Sciences, Wojska Polskiego 28, 60-637 Poznań, Poland.
² Institute of Bioorganic Chemistry, Polish Academy of Sciences, Z. Noskowskiego 12/14, 61-704 Poznań, Poland.
³ Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznań, Poland.

Abstract

Peak overlapping is a common problem in chromatography, mainly in the case of complex biological mixtures, i.e., metabolites. Due to the existence of the phenomenon of co-elution of different compounds with similar chromatographic properties, peak separation becomes challenging. In this paper, two computational methods of separating peaks, applied, for the first time, to large chromatographic datasets, are described, compared, and experimentally validated. The methods lead from raw observations to data that can form inputs for statistical analysis. First, in both methods, data are normalized by the mass of sample, the baseline is removed, retention time alignment is conducted, and detection of peaks is performed. Then, in the first method, clustering is used to separate overlapping peaks, whereas in the second method, functional principal component analysis (FPCA) is applied for the same purpose. Simulated data and experimental results are used as examples to present both methods and to compare them. Real data were obtained in a study of metabolomic changes in barley (Hordeum vulgare) leaves under drought stress. The results suggest that both methods are suitable for separation of overlapping peaks, but the additional advantage of the FPCA is the possibility to assess the variability of individual compounds present within the same peaks of different chromatograms.

Keywords: chemometrics of chromatographic data; chromatographic peak separation; computational peak deconvolution; functional principal component analysis; metabolomics; simulation.

Abstract

Grants and funding