Authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications

BMC Oral Health. 2021 May 29;21(1):281. doi: 10.1186/s12903-021-01642-9.

Abstract

Background: Recently, the possibility of tumour classification based on genetic data has been investigated. However, genetic datasets are difficult to handle because of their massive size and complexity of manipulation. In the present study, we examined the diagnostic performance of machine learning applications using imaging-based classifications of oral squamous cell carcinoma (OSCC) gene sets.

Methods: RNA sequencing data from SCC tissues from various sites, including oral, non-oral head and neck, oesophageal, and cervical regions, were downloaded from The Cancer Genome Atlas (TCGA). The feature genes were extracted through a convolutional neural network (CNN) and machine learning, and the performance of each analysis was compared.

Results: The ability of the machine learning analysis to classify OSCC tumours was excellent. However, the tool exhibited poorer performance in discriminating histopathologically dissimilar cancers derived from the same type of tissue than in differentiating cancers of the same histopathologic type with different tissue origins, revealing that the differential gene expression pattern is a more important factor than the histopathologic features for differentiating cancer types.

Conclusion: The CNN-based diagnostic model and the visualisation methods using RNA sequencing data were useful for correctly categorising OSCC. The analysis showed differentially expressed genes in multiwise comparisons of various types of SCCs, such as KCNA10, FOSL2, and PRDM16, and extracted leader genes from pairwise comparisons were FGF20, DLC1, and ZNF705D.

Keywords: Convolutional neural network; Diagnostic model; Oral squamous cell carcinoma; The cancer genome atlas; Tumour classification.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Carcinoma, Squamous Cell* / genetics
  • Gene Expression
  • Gene Expression Regulation, Neoplastic
  • Head and Neck Neoplasms*
  • Humans
  • Machine Learning
  • Mouth Neoplasms* / genetics
  • Squamous Cell Carcinoma of Head and Neck