Machine Learning Heuristics on Gingivobuccal Cancer Gene Datasets Reveals Key Candidate Attributes for Prognosis

Genes (Basel). 2022 Dec 16;13(12):2379. doi: 10.3390/genes13122379.

Abstract

Delayed cancer detection is one of the common causes of poor prognosis in the case of many cancers, including cancers of the oral cavity. Despite the improvement and development of new and efficient gene therapy treatments, very little has been carried out to algorithmically assess the impedance of these carcinomas. In this work, from attributes or NCBI's oral cancer datasets, viz. (i) name, (ii) gene(s), (iii) protein change, (iv) condition(s), clinical significance (last reviewed). We sought to train the number of instances emerging from them. Further, we attempt to annotate viable attributes in oral cancer gene datasets for the identification of gingivobuccal cancer (GBC). We further apply supervised and unsupervised machine learning methods to the gene datasets, revealing key candidate attributes for GBC prognosis. Our work highlights the importance of automated identification of key genes responsible for GBC that could perhaps be easily replicated in other forms of oral cancer detection.

Keywords: data mining; gene prioritization; genomic datasets; machine learning; oral cancer.

MeSH terms

  • Heuristics*
  • Humans
  • Machine Learning
  • Mouth Neoplasms* / diagnosis
  • Mouth Neoplasms* / genetics
  • Oncogenes
  • Prognosis

Grants and funding

This research received no external funding.