Understanding the learning mechanism of convolutional neural networks in spectral analysis

Xiaolei Zhang; Jinfan Xu; Jie Yang; Li Chen; Haibo Zhou; Xiangjiang Liu; Haifeng Li; Tao Lin; Yibin Ying

doi:10.1016/j.aca.2020.03.055

Understanding the learning mechanism of convolutional neural networks in spectral analysis

Anal Chim Acta. 2020 Jul 4:1119:41-51. doi: 10.1016/j.aca.2020.03.055. Epub 2020 Apr 8.

Authors

Xiaolei Zhang¹, Jinfan Xu¹, Jie Yang¹, Li Chen², Haibo Zhou³, Xiangjiang Liu¹, Haifeng Li², Tao Lin⁴, Yibin Ying⁵

Affiliations

¹ College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, Zhejiang, 310058, China; Key Laboratory of on Site Processing Equipment for Agricultural Products, Ministry of Agriculture and Rural Affairs, China.
² School of Geosciences and Info-Physics, Central South University, South Lushan Road, Changsha, 410000, China.
³ Institute of Pharmaceutical Analysis and Guangdong Province Key Laboratory of Pharmacodynamic Constituents of Traditional Chinese Medicine & New Drug Research, College of Pharmacy, Jinan University, Guangzhou 510632, China.
⁴ College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, Zhejiang, 310058, China; Key Laboratory of on Site Processing Equipment for Agricultural Products, Ministry of Agriculture and Rural Affairs, China. Electronic address: lintao1@zju.edu.cn.
⁵ College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, Zhejiang, 310058, China; Key Laboratory of on Site Processing Equipment for Agricultural Products, Ministry of Agriculture and Rural Affairs, China; Faculty of Agricultural and Food Science, Zhejiang A&F University, Hangzhou, Zhejiang, 311300, China.

PMID: 32439053
DOI: 10.1016/j.aca.2020.03.055

Abstract

Deep learning approaches, especially convolutional neural network (CNN) models, have achieved excellent performances in vibrational spectral analysis. The critical drawback of the CNN approach is the lack of interpretation, and it is regarded as a black box. Interpreting the learning mechanism of chemometric models is critical for intuitive understanding and further application. In this study, an interpretable CNN model with a global average pooling layer is presented for Raman and mid-infrared spectral data analysis. A class activation mapping (CAM)-based approach is leveraged to visualize the active variables in the whole spectrum. The visualization of active variables shows a discriminative pattern in which the most contributed variables peaked around theoretical chemical characteristic bands. The visualization of the feature maps by three convolutional layers demonstrates the data transformation pipeline and how the CNN model hierarchically extracts informative spectral features. The first layer acts as a Savitzky-Golay filter and learns spectral shape characteristics, while the second layer learns enhanced patterns from typical spectral peaks on a few correlated variables. The third layer shows stable activations on critical spectral peaks. A partial least squares - linear discriminant analysis (PLS-LDA) model is presented for comparison on classification accuracy and model interpretation. The CNN model yields mean classification accuracies of 99.01 and 100% for E. coli and meat datasets on the test set, while the PLS-LDA models obtain accuracies of 98.83 and 100%. Both the CNN and PLS-LDA models demonstrate stable patterns on active variables while CNN models are more stable than PLS-LDA models on classification performances for various dataset partitions with Monte-Carlo cross-validation.

Keywords: Class activation mapping; Deep learning; Feature visualization; Interpretation; Reliability.

MeSH terms

Deep Learning*
Discriminant Analysis
Monte Carlo Method
Neural Networks, Computer*