Using clustering as pre-processing in the framework of signal unmixing for exhaustive exploration of archaeological artefacts in Raman imaging

Talanta. 2024 Mar 26:274:125955. doi: 10.1016/j.talanta.2024.125955. Online ahead of print.

Abstract

Analytical chemistry on archaeological material is an essential part of modern archaeological investigations and from year to year, instrumental improvement has made it possible to generate data at a high spatial and temporal frequency. In particular, Raman spectral imaging can be successfully applied in archaeological research by its simplicity of implementation to study past human societies through the analysis of their material remains. This technique makes it possible to simultaneously obtain spatial and spectral information by preserving sample integrity. However, because of the inherent complexity of the samples in Archaeology (e.g. seniority, fragility, lack or full absence of any information about its composition), chemical interpretation can be difficult at first glance. Indeed, specific problems of spectral selectivity related to unexpected chemical compounds could appear due to their state of conservation. Furthermore, detecting minor compounds becomes challenging as major components impose their contributions in the acquired spectra. Therefore, a relevant chemometric approach has been introduced in this context to characterize distinct spectral sources in a Raman imaging dataset of an archaeological specimen - a mosaic fragment. The fragment was unearthed during the Ruscino archaeological dig on the outskirts of Perpignan, France. It dates back to the oppidum period. The aim is to extract selective spectral information from pixel clustering analysis in order to enhance the initial optimisation step within the Multivariate Curve Resolution and Alternating Least-Squares (MCR-ALS) algorithm, a well-known signal unmixing technique. The underlying principle of the MCR-ALS is that the acquired spectra can be expressed as linear combinations of pure spectra of all individual components present in the chemical system under study. Sometimes it can be difficult to obtain the desired results through the algorithm, particularly if initial estimates of spectral or concentration profiles are inaccurate due to complex signals, noise or lack of selectivity, resulting in rank deficiency (i.e. a poor estimation of the total number of pure signals). For this reason, an innovative threshold-based clustering algorithm, combined with multiple Orthogonal Projection Approaches (OPA), has been developed to improve matrix rank investigation and thus the initialisation step of the MCR-ALS approach before optimisation. The effective analysis of Raman imaging data for an archaeological mosaic played a crucial role in uncovering significant chemical information about a particular biogenic material. This insight sheds light on the origins of mortar manufacture during the oppidum period.

Keywords: Archaeology; Biogenic materials; Chemometrics; MCR-ALS; OPA; Raman imaging; Signal unmixing; Threshold-based clustering algorithm.