Evaluation of classification ability of logistic regression model on SERS data of miRNAs

J Biophotonics. 2022 Dec;15(12):e202200108. doi: 10.1002/jbio.202200108. Epub 2022 Aug 13.

Abstract

Logistic regression (LR) is a supervised multiple linear regression model, which uses linear weighted calculation for input to obtain weight coefficients of model. The surface enhanced Raman spectroscopy (SERS) technology greatly enhances the Raman signal of analyte. LR model was used to analyze the data of seven types of pancreatic cancer-related miRNAs obtained from commercial SERS substrate. The classification ability of the model on such data was observed under the configurations of different key parameters (classification mode, regularization method and loss function optimization way), and the effect of the two types of data formats were also evaluated. The results showed that though LR model used to classify this data did not perform well as expected, miRNA-191 and miRNA-4306 still had high recalls (sensitivity), which laid a theoretical foundation for the purpose of using LR model to identify these two miRNAs to jointly diagnose of pancreatic cancer at miRNA level.

Keywords: logistic regression; loss function; principal component analysis; regularization; surface enhanced Raman spectroscopy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Linear Models
  • Logistic Models
  • MicroRNAs* / genetics
  • Multivariate Analysis
  • Spectrum Analysis, Raman / methods

Substances

  • MicroRNAs