Classifying Breast Cancer Subtypes Using Multiple Kernel Learning Based on Omics Data

Genes (Basel). 2019 Mar 7;10(3):200. doi: 10.3390/genes10030200.

Abstract

It is very significant to explore the intrinsic differences in breast cancer subtypes. These intrinsic differences are closely related to clinical diagnosis and designation of treatment plans. With the accumulation of biological and medicine datasets, there are many different omics data that can be viewed in different aspects. Combining these multiple omics data can improve the accuracy of prediction. Meanwhile; there are also many different databases available for us to download different types of omics data. In this article, we use estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2) to define breast cancer subtypes and classify any two breast cancer subtypes using SMO-MKL algorithm. We collected mRNA data, methylation data and copy number variation (CNV) data from TCGA to classify breast cancer subtypes. Multiple Kernel Learning (MKL) is employed to use these omics data distinctly. The result of using three omics data with multiple kernels is better than that of using single omics data with multiple kernels. Furthermore; these significant genes and pathways discovered in the feature selection process are also analyzed. In experiments; the proposed method outperforms other state-of-the-art methods and has abundant biological interpretations.

Keywords: CNV; MKL; breast cancer subtypes; mRNA; methylation data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA Copy Number Variations
  • DNA Methylation
  • Female
  • Genomics / methods*
  • Humans
  • Machine Learning*
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Receptor, ErbB-2 / genetics
  • Receptor, ErbB-2 / metabolism
  • Receptors, Estrogen / genetics
  • Receptors, Estrogen / metabolism
  • Receptors, Progesterone / genetics
  • Receptors, Progesterone / metabolism
  • Triple Negative Breast Neoplasms / classification
  • Triple Negative Breast Neoplasms / genetics*

Substances

  • RNA, Messenger
  • Receptors, Estrogen
  • Receptors, Progesterone
  • ERBB2 protein, human
  • Receptor, ErbB-2