Crude Oil Density Prediction Improved by Multiblock Analysis of Fourier Transform Ion Cyclotron Resonance Mass Spectrometry, Fourier Transform Infrared, and Near-Infrared Spectroscopy Data

Appl Spectrosc. 2023 Oct;77(10):1138-1152. doi: 10.1177/00037028231184273. Epub 2023 Aug 1.

Abstract

Crude oils are among the world's most complex organic mixtures containing a large number of unique components and many analytical techniques lack resolving power to characterize. Fourier transform ion cyclotron resonance mass spectrometry offers a high mass accuracy, making a detailed analysis of crude oils possible. Infrared (IR) spectroscopic methods such as Fourier transform IR spectroscopy (FT-IR) and near-IR, can also be used for crude oil characterization. The three methods measure different properties of the samples, and different data sources can often be combined to improve the prediction accuracy of models. In this study, partial least squares regression (PLSR) models for each of the three methods (single-block PLSR) were compared to multiblock PLSR and sequential and orthogonalized PLSR (SO-PLSR), with the aim of predicting the density of crude oils. Variable importance in projection was used to identify the important variables for each method, as spectroscopic data often contain irrelevant variation. The variables were interpreted to evaluate their underlying chemistry and to check whether consistency could be found between the variables selected from the spectroscopic data for the single-block and multiblock methods. Combining the different blocks of data increased the prediction abilities of the models both before and after variable selection, and SO-PLSR using a reduced data set resulted in the best-performing prediction model.

Keywords: Fourier transform ion cyclotron resonance mass spectrometry; Multiblock; PLSR; crude oil; infrared spectroscopy; partial least squares regression.