Utilizing Deep Learning Algorithms for Signal Processing in Electrochemical Biosensors: From Data Augmentation to Detection and Quantification of Chemicals of Interest

Fatemeh Esmaeili; Erica Cassie; Hong Phan T Nguyen; Natalie O V Plank; Charles P Unsworth; Alan Wang

doi:10.3390/bioengineering10121348

Utilizing Deep Learning Algorithms for Signal Processing in Electrochemical Biosensors: From Data Augmentation to Detection and Quantification of Chemicals of Interest

Bioengineering (Basel). 2023 Nov 23;10(12):1348. doi: 10.3390/bioengineering10121348.

Authors

Fatemeh Esmaeili^{1

2}, Erica Cassie^{2

3}, Hong Phan T Nguyen^{2

3}, Natalie O V Plank^{2

3}, Charles P Unsworth^{1

2}, Alan Wang^{4

5

6}

Affiliations

¹ Department of Engineering Science, University of Auckland, Auckland 1010, New Zealand.
² The MacDiarmid Institute for Advanced Materials and Nanotechnology, Victoria University of Wellington, Wellington 6021, New Zealand.
³ School of Chemical and Physical Sciences, Victoria University of Wellington, Wellington 6021, New Zealand.
⁴ Auckland Bioengineering Institute, University of Auckland, Auckland 1010, New Zealand.
⁵ Center for Medical Imaging, Faculty of Medical and Health Sciences, University of Auckland, Auckland 1010, New Zealand.
⁶ Centre for Brain Research, University of Auckland, Auckland 1010, New Zealand.

Abstract

Nanomaterial-based aptasensors serve as useful instruments for detecting small biological entities. This work utilizes data gathered from three electrochemical aptamer-based sensors varying in receptors, analytes of interest, and lengths of signals. Our ultimate objective was the automatic detection and quantification of target analytes from a segment of the signal recorded by these sensors. Initially, we proposed a data augmentation method using conditional variational autoencoders to address data scarcity. Secondly, we employed recurrent-based networks for signal extrapolation, ensuring uniform signal lengths. In the third step, we developed seven deep learning classification models (GRU, unidirectional LSTM (ULSTM), bidirectional LSTM (BLSTM), ConvGRU, ConvULSTM, ConvBLSTM, and CNN) to identify and quantify specific analyte concentrations for six distinct classes, ranging from the absence of analyte to 10 μM. Finally, the second classification model was created to distinguish between abnormal and normal data segments, detect the presence or absence of analytes in the sample, and, if detected, identify the specific analyte and quantify its concentration. Evaluating the time series forecasting showed that the GRU-based network outperformed two other ULSTM and BLSTM networks. Regarding classification models, it turned out signal extrapolation was not effective in improving the classification performance. Comparing the role of the network architectures in classification performance, the result showed that hybrid networks, including both convolutional and recurrent layers and CNN networks, achieved 82% to 99% accuracy across all three datasets. Utilizing short-term Fourier transform (STFT) as the preprocessing technique improved the performance of all datasets with accuracies from 84% to 99%. These findings underscore the effectiveness of suitable data preprocessing methods in enhancing neural network performance, enabling automatic analyte identification and quantification from electrochemical aptasensor signals.

Keywords: conditional variational auto-encoder (CVAE); convolutional long short-term memory (ConvLSTM); convolutional neural network (CNN); data augmentation; deep learning classification; gated recurrent unit (GRU); long short-term memory (LSTM); signal extrapolation.

Grants and funding

This research was funded by the Marsden Fund, managed by the Royal Society Te Apārangi, grant number VUW1708.