Raman spectroscopy and machine learning for the classification of breast cancers

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Jan 5:264:120300. doi: 10.1016/j.saa.2021.120300. Epub 2021 Aug 21.

Abstract

Breast cancer is a major health threat for women. The drug responses associated with different breast cancer subtypes have obvious effects on therapeutic outcomes; therefore, the accurate classification of breast cancer subtypes is critical. Breast cancer subtype classification has recently been examined using various methods, and Raman spectroscopy has emerged as an effective technique that can be used for noninvasive breast cancer analysis. However, the accurate and rapid classification of breast cancer subtypes currently requires a great deal of effort and experience with the processing and analysis of Raman spectra data. Here, we adopted Raman spectroscopy and machine learning techniques to simplify and accelerate the process used to distinguish normal from breast cancer cells and classify breast cancer subtypes. Raman spectra were obtained from cultured breast cancer cell lines, and the data were analyzed by two machine learning algorithms: principal component analysis (PCA)-discriminant function analysis (DFA) and PCA-support vector machine (SVM). The accuracies with which these two algorithms were able to distinguish normal breast cells from breast cancer cells were both greater than 97%, and the accuracies of breast cancer subtype classification for both algorithms were both greater than 92%. Moreover, our results showed evidence to support the use of characteristic Raman spectral features as cancer cell biomarkers, such as the intensity of intrinsic Raman bands, which increased in cancer cells. Raman spectroscopy combined with machine learning techniques provides a rapid method for breast cancer analysis able to reveal differences in intracellular compositions and molecular structures among subtypes.

Keywords: Breast cancer; Cancer diagnosis; Cancer subtype classification; Machine learning; Raman spectroscopy.

MeSH terms

  • Algorithms
  • Breast Neoplasms* / diagnosis
  • Female
  • Humans
  • Machine Learning
  • Principal Component Analysis
  • Spectrum Analysis, Raman*
  • Support Vector Machine