Big (Bio)Chemical Data Mining Using Chemometric Methods: A Need for Chemists

Angew Chem Int Ed Engl. 2022 Nov 2;61(44):e201801134. doi: 10.1002/anie.201801134. Epub 2022 Sep 29.

Abstract

This Review summarizes how big (bio)chemical data (BBCD) can be analyzed with multivariate chemometric methods and highlights some of the important challenges faced by modern analytical researches. Here, the potential of chemometric methods to solve BBCD problems that are being encountered in chromatographic, spectroscopic and hyperspectral imaging measurements will be discussed, with an emphasis on their applications to omics sciences. In addition, insights and perspectives on how to address the analysis of BBCD are provided along with a discussion of the procedures necessary to obtain more reliable qualitative and quantitative results. In this Review, the importance of "big data" and of their relevance to (bio)chemistry are first discussed. Thereafter, analytical tools which can produce BBCD are presented as well as the theoretical background of chemometric methods and their limitations when they are applied to BBCD. Finally, the importance of chemometric methods for the analysis of BBCD in different chemical disciplines is highlighted with some examples. In this work, we have tried to cover many of the current applications of big data analysis in the (bio)chemistry field.

Keywords: Big Data; Chemometrics; Chromatography; Mass Spectrometry; Omics Science.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Big Data
  • Chemometrics*
  • Chromatography
  • Data Mining*
  • Spectrum Analysis